RESUMO
The Taiwanese people are composed of diverse indigenous populations and the Taiwanese Han. About 95% of the Taiwanese identify themselves as Taiwanese Han, but this may not be a homogeneous population because they migrated to the island from various regions of continental East Asia over a period of 400 years. Little is known about the underlying patterns of genetic ancestry, population admixture, and evolutionary adaptation in the Taiwanese Han people. Here, we analyzed the whole-genome single-nucleotide polymorphism genotyping data from 14,401 individuals of Taiwanese Han collected by the Taiwan Biobank and the whole-genome sequencing data for a subset of 772 people. We detected four major genetic ancestries with distinct geographic distributions (i.e., Northern, Southeastern, Japonic, and Island Southeast Asian ancestries) and signatures of population mixture contributing to the genomes of Taiwanese Han. We further scanned for signatures of positive natural selection that caused unusually long-range haplotypes and elevations of hitchhiked variants. As a result, we identified 16 candidate loci in which selection signals can be unambiguously localized at five single genes: CTNNA2, LRP1B, CSNK1G3, ASTN2, and NEO1. Statistical associations were examined in 16 metabolic-related traits to further elucidate the functional effects of each candidate gene. All five genes appear to have pleiotropic connections to various types of disease susceptibility and significant associations with at least one metabolic-related trait. Together, our results provide critical insights for understanding the evolutionary history and adaption of the Taiwanese Han population.
Assuntos
Povo Asiático , Genoma , Povo Asiático/genética , Estudo de Associação Genômica Ampla , Haplótipos , Humanos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The long-term persistence of a population which has suffered a bottleneck partly depends on how historical demographic dynamics impacted its genetic diversity and the accumulation of deleterious mutations. Here we provide genomic evidence for the genetic effect of a recent population bottleneck in the endangered black-faced spoonbill (Platalea minor) after its rapid population recovery. Our data suggest that the bird's effective population size, Ne , had been relatively stable (7500-9000) since 22,000 years ago; however, a recent brief yet severe bottleneck (Ne = 20) which we here estimated to occur around the 1940s wiped out >99% of its historical Ne in roughly three generations. Despite a >15-fold population recovery since 1988, we found that black-faced spoonbill population has higher levels of inbreeding (7.4 times more runs of homozygosity) than its sister species, the royal spoonbill (P. regia), which is not thought to have undergone a marked population contraction. Although the two spoonbills have similar levels of genome-wide genetic diversity, our results suggest that selection on more genes was relaxed in the black-faced spoonbill; moreover individual black-faced spoonbills carry more putatively deleterious mutations (Grantham's score > 50), and may therefore express more deleterious phenotypic effects than royal spoonbills. Here we demonstrate the value of using genomic indices to monitor levels of genetic erosion, inbreeding and mutation load in species with conservation concerns. To mitigate the prolonged negative genetic effect of a population bottleneck, we recommend that all possible measures should be employed to maintain population growth of a threatened species.
Assuntos
Aves , Espécies em Perigo de Extinção , Animais , Aves/genética , Variação Genética , Genoma , Endogamia , Densidade DemográficaRESUMO
Anatomically modern humans arose in Africa â¼300,000 years ago, but the demographic and adaptive histories of African populations are not well-characterized. Here, we have generated a genome-wide dataset from 840 Africans, residing in western, eastern, southern, and northern Africa, belonging to 50 ethnicities, and speaking languages belonging to four language families. In addition to agriculturalists and pastoralists, our study includes 16 populations that practice, or until recently have practiced, a hunting-gathering (HG) lifestyle. We observe that genetic structure in Africa is broadly correlated not only with geography, but to a lesser extent, with linguistic affiliation and subsistence strategy. Four East African HG (EHG) populations that are geographically distant from each other show evidence of common ancestry: the Hadza and Sandawe in Tanzania, who speak languages with clicks classified as Khoisan; the Dahalo in Kenya, whose language has remnant clicks; and the Sabue in Ethiopia, who speak an unclassified language. Additionally, we observed common ancestry between central African rainforest HGs and southern African San, the latter of whom speak languages with clicks classified as Khoisan. With the exception of the EHG, central African rainforest HGs, and San, other HG groups in Africa appear genetically similar to neighboring agriculturalist or pastoralist populations. We additionally demonstrate that infectious disease, immune response, and diet have played important roles in the adaptive landscape of African history. However, while the broad biological processes involved in recent human adaptation in Africa are often consistent across populations, the specific loci affected by selective pressures more often vary across populations.
Assuntos
População Negra/genética , Etnicidade/genética , Variação Genética , Genoma Humano , Idioma , Filogenia , Feminino , Humanos , MasculinoRESUMO
Insular flying foxes are keystone species in island ecosystems due to their critical roles in plant pollination and seed dispersal. These species are vulnerable to population decline because of their small populations and low reproductive rates. The Formosan flying fox (Pteropus dasymallus formosus) is one of the 5 subspecies of the Ryukyu flying fox. Pteropus dasymallus formosus has suffered from a severe decline and is currently recognized as a critically endangered population in Taiwan. On the contrary, the Orii's flying fox (Pteropus dasymallus inopinatus) is a relatively stable population inhabiting Okinawa Island. Here, we applied a genomic approach called double digest restriction-site associated DNA sequencing to study these 2 subspecies for a total of 7 individuals. We detected significant genetic structure between the 2 populations. Despite their contrasting contemporary population sizes, both populations harbor very low degrees of genetic diversity. We further inferred their demographic history based on the joint folded site frequency spectrum and revealed that both P. d. formosus and P. d. inopinatus had maintained small population sizes for a long period of time after their divergence. Recently, these populations experienced distinct trajectories of demographic changes. While P. d. formosus suffered from a drastic ~10-fold population decline not long ago, P. d. inopinatus underwent a ~4.5-fold population expansion. Our results suggest separate conservation management for the 2 populations-population recovery is urgently needed for P. d. formosus while long-term monitoring for adverse genetic effects should be considered for P. d. inopinatus.
Assuntos
Quirópteros/genética , Variação Genética , Genética Populacional , Animais , Conservação dos Recursos Naturais , Espécies em Perigo de Extinção , Endogamia , Polimorfismo de Nucleotídeo Único , Densidade Demográfica , Dinâmica Populacional , Análise de Sequência de DNA , TaiwanRESUMO
In humans, the ability to digest lactose, the sugar in milk, declines after weaning because of decreasing levels of the enzyme lactase-phlorizin hydrolase, encoded by LCT. However, some individuals maintain high enzyme amounts and are able to digest lactose into adulthood (i.e., they have the lactase-persistence [LP] trait). It is thought that selection has played a major role in maintaining this genetically determined phenotypic trait in different human populations that practice pastoralism. To identify variants associated with the LP trait and to study its evolutionary history in Africa, we sequenced MCM6 introns 9 and 13 and ~2 kb of the LCT promoter region in 819 individuals from 63 African populations and in 154 non-Africans from nine populations. We also genotyped four microsatellites in an ~198 kb region in a subset of 252 individuals to reconstruct the origin and spread of LP-associated variants in Africa. Additionally, we examined the association between LP and genetic variability at candidate regulatory regions in 513 individuals from eastern Africa. Our analyses confirmed the association between the LP trait and three common variants in intron 13 (C-14010, G-13907, and G-13915). Furthermore, we identified two additional LP-associated SNPs in intron 13 and the promoter region (G-12962 and T-956, respectively). Using neutrality tests based on the allele frequency spectrum and long-range linkage disequilibrium, we detected strong signatures of recent positive selection in eastern African populations and the Fulani from central Africa. In addition, haplotype analysis supported an eastern African origin of the C-14010 LP-associated mutation in southern Africa.
Assuntos
Lactase/metabolismo , África , Humanos , Íntrons , Lactase-Florizina Hidrolase/genética , Lactase-Florizina Hidrolase/metabolismo , Repetições de Microssatélites/genética , Componente 6 do Complexo de Manutenção de Minicromossomo/genética , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo Único , Regiões Promotoras GenéticasRESUMO
Disease susceptibility can arise as a consequence of adaptation to infectious disease. Recent findings have suggested that higher rates of chronic kidney disease (CKD) in individuals with recent African ancestry might be attributed to two risk alleles (G1 and G2) at the serum-resistance-associated (SRA)-interacting-domain-encoding region of APOL1. These two alleles appear to have arisen adaptively, possibly as a result of their protective effects against human African trypanosomiasis (HAT), or African sleeping sickness. In order to explore the distribution of potential functional variation at APOL1, we studied nucleotide variation in 187 individuals across ten geographically and genetically diverse African ethnic groups with exposure to two Trypanosoma brucei subspecies that cause HAT. We observed unusually high levels of nonsynonymous polymorphism in the regions encoding the functional domains that are required for lysing parasites. Whereas allele frequencies of G2 were similar across all populations (3%-8%), the G1 allele was only common in the Yoruba (39%). Additionally, we identified a haplotype (termed G3) that contains a nonsynonymous change at the membrane-addressing-domain-encoding region of APOL1 and is present in all populations except for the Yoruba. Analyses of long-range patterns of linkage disequilibrium indicate evidence of recent selection acting on the G3 haplotype in Fulani from Cameroon. Our results indicate that the G1 and G2 variants in APOL1 are geographically restricted and that there might be other functional variants that could play a role in HAT resistance and CKD risk in African populations.
Assuntos
Apolipoproteínas/genética , População Negra/genética , Lipoproteínas HDL/genética , Polimorfismo de Nucleotídeo Único , Seleção Genética , Adaptação Biológica , África , Alelos , Apolipoproteína L1 , Resistência à Doença/genética , Evolução Molecular , Éxons , Frequência do Gene , Predisposição Genética para Doença , Genética Populacional/métodos , Haplótipos , Humanos , Desequilíbrio de Ligação , Dados de Sequência Molecular , Insuficiência Renal Crônica/etnologia , Insuficiência Renal Crônica/genética , Fatores de Risco , Tripanossomíase Africana/etnologia , Tripanossomíase Africana/genéticaRESUMO
Whole-genome and -exome resequencing using next-generation sequencers is a powerful approach for identifying genomic variations that are associated with diseases. However, systematic strategies for prioritizing causative variants from many candidates to explain the disease phenotype are still far from being established, because the population-specific frequency spectrum of genetic variation has not been characterized. Here, we have collected exomic genetic variation from 1208 Japanese individuals through a collaborative effort, and aggregated the data into a prevailing catalog. In total, we identified 156 622 previously unreported variants. The allele frequencies for the majority (88.8%) were lower than 0.5% in allele frequency and predicted to be functionally deleterious. In addition, we have constructed a Japanese-specific major allele reference genome by which the number of unique mapping of the short reads in our data has increased 0.045% on average. Our results illustrate the importance of constructing an ethnicity-specific reference genome for identifying rare variants. All the collected data were centralized to a newly developed database to serve as useful resources for exploring pathogenic variations. Public access to the database is available at http://www.genome.med.kyoto-u.ac.jp/SnpDB/.
Assuntos
Bases de Dados Genéticas , Variação Genética , Genética Populacional , Alelos , Exoma , Frequência do Gene , Genoma Humano , Genômica/métodos , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Japão , Controle de Qualidade , Seleção Genética , NavegadorRESUMO
Malaria has been a very strong selection pressure in recent human evolution, particularly in Africa. Of the one million deaths per year due to malaria, more than 90% are in sub-Saharan Africa, a region with high levels of genetic variation and population substructure. However, there have been few studies of nucleotide variation at genetic loci that are relevant to malaria susceptibility across geographically and genetically diverse ethnic groups in Africa. Invasion of erythrocytes by Plasmodium falciparum parasites is central to the pathology of malaria. Glycophorin A (GYPA) and B (GYPB), which determine MN and Ss blood types, are two major receptors that are expressed on erythrocyte surfaces and interact with parasite ligands. We analyzed nucleotide diversity of the glycophorin gene family in 15 African populations with different levels of malaria exposure. High levels of nucleotide diversity and gene conversion were found at these genes. We observed divergent patterns of genetic variation between these duplicated genes and between different extracellular domains of GYPA. Specifically, we identified fixed adaptive changes at exons 3-4 of GYPA. By contrast, we observed an allele frequency spectrum skewed toward a significant excess of intermediate-frequency alleles at GYPA exon 2 in many populations; the degree of spectrum distortion is correlated with malaria exposure, possibly because of the joint effects of gene conversion and balancing selection. We also identified a haplotype causing three amino acid changes in the extracellular domain of glycophorin B. This haplotype might have evolved adaptively in five populations with high exposure to malaria.
Assuntos
Doenças Endêmicas , Predisposição Genética para Doença , Glicoforinas/genética , Sistema do Grupo Sanguíneo MNSs/genética , Malária Falciparum/genética , Seleção Genética , África Subsaariana , Substituição de Aminoácidos , Animais , Sequência de Bases , Eritrócitos/metabolismo , Eritrócitos/parasitologia , Etnicidade/genética , Éxons , Loci Gênicos , Glicoforinas/química , Glicoforinas/classificação , Humanos , Malária Falciparum/sangue , Malária Falciparum/epidemiologia , Dados de Sequência Molecular , Filogenia , Plasmodium falciparum , Polimorfismo de Nucleotídeo Único , Estrutura Terciária de ProteínaRESUMO
Taiwan harbors five endemic species of salamanders (Hynobius spp.) that inhabit distinct alpine regions, contributing to population fragmentation across isolated "sky islands". With an evolutionary history spanning multiple glacial-interglacial cycles, these species represent an exceptional paradigm for exploring biogeography and speciation. However, a lack of suitable genetic markers applicable across species has limited research efforts. Thus, developing cross-amplifying markers is imperative. Expressed sequence-tag simple-sequence repeats (EST-SSRs) that amplify across divergent lineages are ideal for species identification in instances where phenotypic differentiation is challenging. Here, we report a suite of cross-amplifying EST-SSRs from the transcriptomes of the five Hynobius species that exhibit an interspecies transferability rate of 67.67%. To identify individual markers exhibiting cross-species polymorphism and to assess interspecies genetic diversity, we assayed 140 individuals from the five species across 84 sampling sites. A set of EST-SSRs with a high interspecies polymorphic information content (PIC = 0.63) effectively classified these individuals into five distinct clusters, as supported by discriminant analysis of principal components (DAPC), STRUCTURE assignment tests, and Neighbor-joining trees. Moreover, pair-wise FST values > 0.15 indicate notable between-cluster genetic divergence. Our set of 20 polymorphic EST-SSRs is suitable for assessing population structure within and among Hynobius species, as well as for long-term monitoring of their genetic composition.
Assuntos
Etiquetas de Sequências Expressas , Repetições de Microssatélites , Animais , Repetições de Microssatélites/genética , Taiwan , Urodelos/genética , Urodelos/classificação , Variação Genética , Polimorfismo Genético , Filogenia , Transcriptoma/genéticaRESUMO
Malaria is one of the strongest selective pressures in recent human evolution. African populations have been and continue to be at risk for malarial infections. However, few studies have re-sequenced malaria susceptibility loci across geographically and genetically diverse groups in Africa. We examined nucleotide diversity at Intercellular adhesion molecule-1 (ICAM-1), a malaria susceptibility candidate locus, in a number of human populations with a specific focus on diverse African ethnic groups. We used tests of neutrality to assess whether natural selection has impacted this locus and tested whether SNP variation at ICAM-1 is correlated with malaria endemicity. We observe differing patterns of nucleotide and haplotype variation in global populations and higher levels of diversity in Africa. Although we do not observe a deviation from neutrality based on the allele frequency distribution, we do observe several alleles at ICAM-1, including the ICAM-1 (Kilifi) allele, that are correlated with malaria endemicity. We show that the ICAM-1 (Kilifi) allele, which is common in Africa and Asia, exists on distinct haplotype backgrounds and is likely to have arisen more recently in Asia. Our results suggest that correlation analyses of allele frequencies and malaria endemicity may be useful for identifying candidate functional variants that play a role in malaria resistance and susceptibility.
Assuntos
Etnicidade/genética , Predisposição Genética para Doença/genética , Variação Genética , Molécula 1 de Adesão Intercelular/genética , Malária/genética , Sequência de Bases , População Negra/genética , Primers do DNA/genética , Frequência do Gene , Genética Populacional , Haplótipos/genética , Humanos , Desequilíbrio de Ligação , Malária/etnologia , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
BACKGROUND: Chronic hepatitis B virus (HBV) infection is a great health burden with geographical variations. AIMS: To explore genetic variants associated with chronic HBV infection. METHODS: The study included 15 352 participants seropositive for HBV core antibodies in Taiwan Biobank. Among them, 2591 (16.9%) seropositive for HBV surface antigen (HBsAg) were defined as having chronic HBV infection. All participants were examined for whole-genome genotyping by Axiom-Taiwan Biobank Array. The human leucocyte antigen (HLA) imputation was performed after identification of the variants within the region. Logistic regressions were used to estimate odds ratios (ORs) with 95% confidence intervals. Correlations of different HLA allele frequencies with HBsAg seroprevalence were evaluated across worldwide populations by Pearson correlation coefficients. Epitope prediction was performed for HLA alleles using NetMHCIIpan method. RESULTS: Located within a cluster of 450 single nucleotide polymorphisms in HLA class II, rs7770370 (P = 2.73 × 10-35 ) was significantly associated with HBV chronicity (Pcorrected < 8.6 × 10-8 ). Imputation analyses showed that HLA-DPA1*02:02 and HLA-DPB1*05:01 were associated with chronic HBV, with adjusted ORs of 1.43 (1.09-1.89) and 1.61 (1.29-2.01). These allele frequencies were positively correlated with global HBsAg seroprevalence, with R of 0.75 and 0.62 respectively (P < 0.05). HLA-DRB1*13:02, HLA-DQA1* 01:02 and HLA-DQB1*06:09 associated with HBV chronicity negatively, with adjusted ORs of 0.31 (0.17-0.58), 0.70 (0.56-0.87) and 0.33 (0.18-0.63). These HLA alleles had various binding affinities to the predicted epitopes derived from HBV nucleocapsid protein. CONCLUSIONS: HLA class II variants are relevant for chronicity after HBV acquisition.
Assuntos
Genes MHC da Classe II/genética , Estudo de Associação Genômica Ampla , Hepatite B Crônica/epidemiologia , Hepatite B Crônica/genética , Polimorfismo de Nucleotídeo Único , Adulto , Idoso , Alelos , Bancos de Espécimes Biológicos/estatística & dados numéricos , Feminino , Frequência do Gene , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Anticorpos Anti-Hepatite B/sangue , Antígenos de Superfície da Hepatite B/sangue , Antígenos de Superfície da Hepatite B/imunologia , Hepatite B Crônica/sangue , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Soroepidemiológicos , Taiwan/epidemiologiaRESUMO
BACKGROUND: The GMC oxidoreductases comprise a large family of diverse FAD enzymes that share a homologous backbone. The relationship and origin of the GMC oxidoreductase genes, however, was unknown. Recent sequencing of entire genomes has allowed for the evolutionary analysis of the GMC oxidoreductase family. RESULTS: Although genes that encode enzyme families are rarely linked in higher eukaryotes, we discovered that the majority of the GMC oxidoreductase genes in the fruit fly (D. melanogaster), mosquito (A. gambiae), honeybee (A. mellifera), and flour beetle (T. castaneum) are located in a highly conserved cluster contained within a large intron of the flotillin-2 (Flo-2) gene. In contrast, the genomes of vertebrates and the nematode C. elegans contain few GMC genes and lack a GMC cluster, suggesting that the GMC cluster and the function of its resident genes are unique to insects or arthropods. We found that the development patterns of expression of the GMC cluster genes are highly complex. Among the GMC oxidoreductases located outside of the GMC gene cluster, the identities of two related enzymes, glucose dehydrogenase (GLD) and glucose oxidase (GOX), are known, and they play major roles in development and immunity. We have discovered that several additional GLD and GOX homologues exist in insects but are remotely similar to fungal GOX. CONCLUSION: We speculate that the GMC oxidoreductase cluster has been conserved to coordinately regulate these genes for a common developmental or physiological function related to ecdysteroid metabolism. Furthermore, we propose that the GMC gene cluster may be the birthplace of the insect GMC oxidoreductase genes. Through tandem duplication and divergence within the cluster, new GMC genes evolved. Some of the GMC genes have been retained in the cluster for hundreds of millions of years while others might have transposed to other regions of the genome. Consistent with this hypothesis, our analysis indicates that insect GOX and GLD arose from a different ancestral GMC gene than that of fungal GOX.
Assuntos
Oxirredutases do Álcool/genética , Insetos/genética , Animais , Anopheles/genética , Drosophila melanogaster/genética , Evolução Molecular , Éxons , Flavina-Adenina Dinucleotídeo/metabolismo , Duplicação Gênica , Expressão Gênica , Genes de Insetos , Genoma de Inseto , Glucose 1-Desidrogenase/genética , Glucose Oxidase/genética , Insetos/enzimologia , Família Multigênica , Filogenia , Homologia de Sequência de Aminoácidos , Tribolium/genéticaRESUMO
Fluctuations in base composition appear to be prevalent in Drosophila and mammal genome evolution, but their timescale, genomic breadth, and causes remain obscure. Here, we study base composition evolution within the X chromosomes of Drosophila melanogaster and five of its close relatives. Substitutions were inferred on six extant and two ancestral lineages for 14 near-telomeric and 9 nontelomeric genes. GC content evolution is highly variable both within the genome and within the phylogenetic tree. In the lineages leading to D. yakuba and D. orena, GC content at silent sites has increased rapidly near telomeres, but has decreased in more proximal (nontelomeric) regions. D. orena shows a 17-fold excess of GC-increasing vs. AT-increasing synonymous changes within a small (approximately 130-kb) region close to the telomeric end. Base composition changes within introns are consistent with changes in mutation patterns, but stronger GC elevation at synonymous sites suggests contributions of natural selection or biased gene conversion. The Drosophila yakuba lineage shows a less extreme elevation of GC content distributed over a wider genetic region (approximately 1.2 Mb). A lack of change in GC content for most introns within this region suggests a role of natural selection in localized base composition fluctuations.
Assuntos
Drosophila/genética , Evolução Molecular , Variação Genética , Cromossomo X/química , Animais , Composição de Bases , Linhagem da Célula/genética , Códon , Íntrons , Dados de Sequência Molecular , Fases de Leitura Aberta , Filogenia , TelômeroRESUMO
Although mutation, genetic drift, and natural selection are well established as determinants of genome evolution, the importance (frequency and magnitude) of parameter fluctuations in molecular evolution is less understood. DNA sequence comparisons among closely related species allow specific substitutions to be assigned to lineages on a phylogenetic tree. In this study, we compare patterns of codon usage and protein evolution in 22 genes (>11,000 codons) among Drosophila melanogaster and five relatives within the D. melanogaster subgroup. We assign changes to eight lineages using a maximum-likelihood approach to infer ancestral states. Uncertainty in ancestral reconstructions is taken into account, at least to some extent, by weighting reconstructions by their posterior probabilities. Four of the eight lineages show potentially genomewide departures from equilibrium synonymous codon usage; three are decreasing and one is increasing in major codon usage. Several of these departures are consistent with lineage-specific changes in selection intensity (selection coefficients scaled to effective population size) at silent sites. Intron base composition and rates and patterns of protein evolution are also heterogeneous among these lineages. The magnitude of forces governing silent, intron, and protein evolution appears to have varied frequently, and in a lineage-specific manner, within the D. melanogaster subgroup.
Assuntos
Drosophila melanogaster/genética , Evolução Molecular , Filogenia , Substituição de Aminoácidos/genética , Animais , Composição de Bases , Códon , Frequência do Gene , Íntrons , Dados de Sequência Molecular , Polimorfismo Genético , Especificidade da EspécieRESUMO
Although molecular and phenotypic evolution have been studied extensively in Drosophila melanogaster and its close relatives, phylogenetic relationships within the D. melanogaster species subgroup remain unresolved. In particular, recent molecular studies have not converged on the branching orders of the D. yakuba-D. teissieri and D. erecta-D. orena species pairs relative to the D. melanogaster-D. simulans-D. mauritiana-D. sechellia species complex. Here, we reconstruct the phylogeny of the melanogaster species subgroup using DNA sequence data from four nuclear genes. We have employed "vectorette PCR" to obtain sequence data for orthologous regions of the Alcohol dehydrogenase (Adh), Alcohol dehydrogenase related (Adhr), Glucose dehydrogenase (Gld), and rosy (ry) genes (totaling 7164 bp) from six melanogaster subgroup species (D. melanogaster, D. simulans, D. teissieri, D. yakuba, D. erecta, and D. orena) and three species from subgroups outside the melanogaster species subgroup [D. eugracilis (eugracilis subgroup), D. mimetica (suzukii subgroup), and D. lutescens (takahashii subgroup)]. Relationships within the D. simulans complex are not addressed. Phylogenetic analyses employing maximum parsimony, neighbor-joining, and maximum likelihood methods strongly support a D. yakuba-D. teissieri and D. erecta-D. orena clade within the melanogaster species subgroup. D. eugracilis is grouped closer to the melanogaster subgroup than a D. mimetica-D. lutescens clade. This tree topology is supported by reconstructions employing simple (single parameter) and more complex (nonreversible) substitution models.