RESUMO
BACKGROUND: The common deletion of the glutathione S-transferase Mu 1 (GSTM1) gene in humans has been shown to be involved in xenobiotic metabolism and associated with bladder cancer. However, the evolution of this deletion has not been investigated. RESULTS: In this study, we conducted comparative analyses of primate genomes. We demonstrated that the GSTM gene family has evolved through multiple structural variations, involving gene duplications, losses, large inversions and gene conversions. We further showed experimentally that the GSTM1 was polymorphically deleted in both humans and also in chimpanzees, through independent deletion events. To generalize our results, we searched for genic deletions that are polymorphic in both humans and chimpanzees. Consequently, we found only two such deletions among the thousands that we have searched, one of them being the GSTM1 deletion and the other surprisingly being another metabolizing gene, the UGT2B17. CONCLUSIONS: Overall, our results support the emerging notion that metabolizing gene families, such as the GSTM, NAT, UGT and CYP, have been evolving rapidly through gene duplication and deletion events in primates, leading to complex structural variation within and among species with unknown evolutionary consequences.
Assuntos
Evolução Molecular , Glutationa Transferase/genética , Pan troglodytes/genética , Animais , Hibridização Genômica Comparativa , Variações do Número de Cópias de DNA , Deleção de Genes , Duplicação Gênica , Genoma , Glucuronosiltransferase/genética , Glutationa Transferase/classificação , Humanos , Filogenia , Polimorfismo GenéticoRESUMO
Genetic diversity in human leukocyte antigen (HLA) molecules is thought to have arisen from the co-evolution between host and pathogen and maintained by balancing selection. Heterozygote advantage is a common proposed scenario for maintaining high levels of diversity in HLA genes, and extending from this, the divergent allele advantage (DAA) model suggests that individuals with more divergent HLA alleles bind and recognize a wider array of antigens. While the DAA model seems biologically suitable for driving HLA diversity, there is likely an upper threshold to the amount of sequence divergence. We used peptide-binding and pathogen-recognition capacity of DRB1 alleles as a model to further explore the DAA model; within the DRB1 locus, we examined binding predictions based on two distinct phylogenetic groups (denoted group A and B) previously identified based on non-peptide-binding region (PBR) nucleotide sequences. Predictions in this study support that group A allele and group B allele lineages have contrasting binding/recognition capacity, with only the latter supporting the DAA model. Furthermore, computer simulations revealed an inconsistency in the DAA model alone with observed extent of polymorphisms, supporting that the DAA model could only work effectively in combination with other mechanisms. Overall, we support that the mechanisms driving HLA diversity are non-exclusive. By investigating the relationships among HLA alleles, and pathogens recognized, we can provide further insights into the mechanisms on how humans have adapted to infectious diseases over time.
Assuntos
Alelos , Apresentação de Antígeno/genética , Simulação por Computador , Loci Gênicos/imunologia , Cadeias HLA-DRB1 , Modelos Genéticos , Modelos Imunológicos , Feminino , Cadeias HLA-DRB1/genética , Cadeias HLA-DRB1/imunologia , Humanos , MasculinoRESUMO
The deletion of the metabolizing Glutathione S-transferase Mu 1 (GSTM1) gene has been associated with multiple cancers, metabolic and autoimmune disorders, as well as drug response. It is unusually common, with allele frequency reaching up to 75% in some human populations. Such high allele frequency of a derived allele with apparent impact on an otherwise conserved gene is a rare phenomenon. To investigate the evolutionary history of this locus, we analyzed 310 genomes using population genetics tools. Our analysis revealed a surprising lack of linkage disequilibrium between the deletion and the flanking single nucleotide variants in this locus. Tests that measure extended homozygosity and rapid change in allele frequency revealed signatures of an incomplete sweep in the locus. Using empirical approaches, we identified the Tanuki haplogroup, which carries the GSTM1 deletion and is found in approximately 70% of East Asian chromosomes. This haplogroup has rapidly increased in frequency in East Asian populations, contributing to a high population differentiation among continental human groups. We showed that extended homozygosity and population differentiation for this haplogroup is incompatible with simulated neutral expectations in East Asian populations. In parallel, we found that the Tanuki haplogroup is significantly associated with the expression levels of other GSTM genes. Collectively, our results suggest that standing variation in this locus has likely undergone an incomplete sweep in East Asia with regulatory impact on multiple GSTM genes. Our study provides the necessary framework for further studies to elucidate the evolutionary reasons that maintain disease-susceptibility variants in the GSTM1 locus.
Assuntos
Povo Asiático/genética , Cromossomos Humanos/genética , Deleção de Genes , Loci Gênicos , Glutationa Transferase/genética , Haplótipos , Alelos , Ásia Oriental , Feminino , Regulação Enzimológica da Expressão Gênica/genética , Frequência do Gene , Predisposição Genética para Doença , Glutationa Transferase/metabolismo , Humanos , MasculinoRESUMO
To study rapidly evolving male specific Y (MSY) genes we retrieved and analyzed nine such genes. VCY, HSFY and RBMY were found to have functional X gametologs, but the rest did not. Using chimpanzee orthologs for XKRY, CDY, HSFY, PRY, and TSPY, the average silent substitution is estimated as 0.017 +/- 0.006/site and the substitution rate is 1.42 x 10(-9)/site/year. Except for VCY, all other loci possess two or more pseudogenes on the Y chromosome. Sequence differences from functional genes show that BPY2, DAZ, XKRY, and RBMY each have one pseudogene for each one that is human specific, while others were generated well before the human-chimpanzee split, by means of duplication, retro-transposition or translocation. Some functional MSY gene duplication of VCY, CDY and HSFY, as well as X-linked VCX and HSFX duplication, occurred in the lineage leading to humans; these duplicates have accumulated nucleotide substitutions that permit their identification.
Assuntos
Evolução Molecular , Pseudogenes/genética , Caracteres Sexuais , Cromossomo Y/genética , Animais , Proteínas de Ligação a DNA/genética , Fatores de Transcrição de Choque Térmico , Humanos , Masculino , Proteínas Nucleares/genética , Pan troglodytes , Proteínas de Ligação a RNA/genética , Fatores de Transcrição/genéticaRESUMO
Amino acid replacements in the peptide-binding region (PBR) of the functional major histocompatibility complex (Mhc) genes appear to be driven by balancing selection. Of the various types of balancing selection, we have examined a model equivalent to overdominance that confers heterozygote advantage. As discussed by A. Robertson, overdominance selection tends to maintain alleles that have more or less the same degree of heterozygote advantage. Because of this symmetry, the model makes various testable predictions about the genealogical relationships among different alleles and provides ways of analyzing DNA sequences of Mhc alleles. In this paper, we analyze DNA sequences of 85 alleles at the HLA-A, -B, -C, -DRB1 and -DQB1 loci with respect to the number of alleles and extent of nucleotide differences at the PBR, as well as at the synonymous (presumably neutral) sites. Theory suggests that the number of alleles that differ at the sites targeted by selection (presumably the nonsynonymous sites in the PBR) should be equal to the mean number of nucleotide substitutions among pairs of alleles. We also demonstrate that the nucleotide substitution rate at the targeted sites relative to that of neutral sites may be much larger than 1. The predictions of the presented model are in surprisingly good agreement with the actual data and thus provide means for inferring certain population parameters. For overdominance selection in a finite population at equilibrium, the product of selection intensity (s) against homozygotes and the effective population size (N) is estimated to be 350-3000, being largest at the B locus and smallest at the C locus. We argue that N is of the order of 10(5) and s is several percent at most, if the mutation rate per site per generation is 10(-8).
Assuntos
Complexo Principal de Histocompatibilidade/genética , Polimorfismo Genético/genética , Seleção Genética , Alelos , Mapeamento Cromossômico , Simulação por Computador , Genes MHC Classe I/genética , Genes MHC da Classe II/genética , Genética Populacional , Modelos Genéticos , Modelos Estatísticos , Ligação Proteica/genéticaRESUMO
Self-incompatibility (SI) is a mechanism for preventing self-fertilization in flowering plants. In Brassica, it is controlled by a single multi-allelic locus, S, and it is believed that two highly polymorphic genes in the S locus, SLG and SRK, play central roles in self-recognition in stigmas. SRK is a putative receptor protein kinase, whose extracellular domain exhibits high similarity to SLG. We analyzed two pairs of lines showing cross-incompatibility (S(2) and S(2-b); S(13) and S(13-b)). In S(2) and S(2-b), SRKs were more highly conserved than SLGs. This was also the case with S(13) and S(13-b). This suggests that the SRKs of different lines must be conserved for the lines to have the same self-recognition specificity. In particular, SLG(2-b) showed only 88. 5% identity to SLG(2), which is comparable to that between the SLGs of different S haplotypes, while SRK(2-b) showed 97.3% identity to SRK(2) in the S domain. These findings suggest that the SLGs in these S haplotypes are not important for self-recognition in SI.
Assuntos
Brassica/genética , Glicoproteínas/genética , Proteínas de Plantas/genética , Sequência de Aminoácidos , Sequência de Bases , Clonagem Molecular , Primers do DNA , DNA Complementar , Glicoproteínas/química , Haplótipos , Dados de Sequência Molecular , Proteínas de Plantas/química , Homologia de Sequência de AminoácidosRESUMO
To investigate the evolutionary dynamics at Mhc class II DR genes of mice (genus Mus), we sequenced the peptide binding regions (PBRs) of 41 DRB (= E beta) genes and eight DRA (= E alpha) genes from 15 strains representing eight species. As expected trees of these PBR sequences imply extensive maintenance of ancestral DRB alleles across species. We use a coalescent simulation model to show that the number of interspecific coalescent events (c) observed on these trees was higher than the number expected for neutral genealogies and similar sample sizes and is more consistent with balancing selection that with neutrality. Patterns of ancestral polymorphism in mouse DRB alleles were also used to examine the tempo of synonymous substitution in the PBR of mouse class II genes. Both absolute and relative rate tests on DRA and DRB genes imply increased substitution rates at two- and fourfold degenerate sites of mice and rats relative to primates, and decreased rates for the DRB genes of primates relative to ungulate and carnivore relatives. Thus rates of synonymous substitution at Mhc DR genes in mammals appear to be subject to generation time effects in ways similar to those found at other mammalian genes.
Assuntos
Genes MHC da Classe II/genética , Polimorfismo Genético , Seleção Genética , Alelos , Animais , Sítios de Ligação , Evolução Molecular , Éxons/genética , Camundongos , Dados de Sequência Molecular , Filogenia , Estrutura Secundária de ProteínaRESUMO
In the Brassicaceae, self-vs. nonself-recognition in self-incompatibility is controlled by sporophytic S-alleles. Haplotypes specifying both SRK (S-receptor kinase) and SLG (S-locus glycoprotein) are considered to play an important role in the recognition reactions. We compared the nucleotide sequences of SRK9(Bc) and SRK6(Bo). The number of nonsynonymous substitutions per site (Pn) was lower, constrained, in the kinase than the receptor domain, while the numbers of synonymous substitutions (Ps) in the two domains were largely comparable. Pairwise values for Ps and Pn were calculated among 17 operational taxonomic units, including eight SLGs, the receptor domains of two SRKs, four SRAs (S-related A) and three SRBs (S-related B), which have high homologies with each other. The values of Ps and Pn of SLG were mostly comparable to those of the receptor domain of SRK. Dendrograms constructed on the basis of Pn and Ps indicated that SRA differentiated first, followed by SRB. The differentiation of SLG alleles is one of prerequisite factors for the establishment of self-incompatibility, and the allelic differentiation has occurred more than tens of million years ago.
Assuntos
Evolução Biológica , Brassica/genética , Genes de Plantas , Alelos , DNA de Plantas/química , DNA de Plantas/genética , Variação Genética , FilogeniaRESUMO
A systematic survey of six intergenic regions flanking the human HLA-B locus in eight haplotypes reveals the regions to be up to 20 times more polymorphic than the reported average degree of human neutral polymorphism. Furthermore, the extent of polymorphism is directly related to the proximity to the HLA-B locus. Apparently linkage to HLA-B locus alleles, which are under balancing selection, maintains the neutral polymorphism of adjacent regions. For these linked polymorphisms to persist, recombination in the 200-kb interval from HLA-B to TNF must occur at a low frequency. The high degree of polymorphism found distal to HLA-B suggests that recombination is uncommon on both sides of the HLA-B locus. The least-squares estimate is 0.15% per megabase with an estimated range from 0.02 to 0.54%. These findings place strong restrictions on possible recombinational mechanisms for the generation of diversity at the HLA-B.
Assuntos
Mapeamento Cromossômico/métodos , Evolução Molecular , Genes MHC Classe I/genética , Antígenos HLA-B/genética , Íntrons , Complexo Principal de Histocompatibilidade , Polimorfismo Genético , Animais , Sequência de Bases , Linhagem Celular , Sequência Consenso , Primers do DNA , Ligação Genética , Gorilla gorilla , Haplótipos , Homozigoto , Humanos , Análise dos Mínimos Quadrados , Dados de Sequência Molecular , Pan troglodytes , Reação em Cadeia da Polimerase , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Tetra-Hidrofolato Desidrogenase/genética , Fator de Necrose Tumoral alfa/genéticaRESUMO
The possibility of incomplete maternal transmission of mitochondrial DNA (mtDNA) in Drosophila, previously suggested by the presence of heteroplasmy, was examined by intra- and interspecific backcrosses of Drosophila simulans and its closest relative, Drosophila mauritiana. mtDNAs of offspring in these crosses were characterized by Southern hybridization with two alpha-32P-labeled probes that are specific to paternal mtDNAs. This method could detect as little as 0.03% paternal mtDNA, if present, in a sample. Among 331 lines that had been backcrossed for ten generations, four lines from the interspecific cross D. simulans (female) x D. mauritiana (male) showed clear evidence for paternal leakage of mtDNA. In three of these the maternal type was completely replaced while the fourth was heteroplasmic. Since in this experiment the total number of fertilization is known to be 331 x 10 = 3310, the proportion of paternal mtDNA per fertilization was estimated as about 0.1%. The mechanisms and evolutionary significance for paternal leakage are discussed in light of this finding.
Assuntos
DNA Mitocondrial/genética , Drosophila/genética , Animais , Evolução Biológica , Southern Blotting , Cruzamentos Genéticos , Mapeamento por RestriçãoRESUMO
Self-incompatibility (SI) enables flowering plants to discriminate between self- and non-self-pollen. In Brassica, SI is controlled by the highly polymorphic S locus. The recently identified male determinant, termed SP11 or SCR, is thought to be the ligand of S receptor kinase, the female determinant. To examine functional and evolutionary properties of SP11, we cloned 14 alleles from class-I S haplotypes of Brassica campestris and carried out sequence analyses. The sequences of mature SP11 proteins are highly divergent, except for the presence of conserved cysteines. The phylogenetic trees suggest possible co-evolution of the genes encoding the male and female determinants.
Assuntos
Brassica/genética , Glicoproteínas/genética , Proteínas de Plantas/genética , Pólen/genética , Alelos , Sequência de Aminoácidos , Mapeamento Cromossômico , Clonagem Molecular , DNA Complementar/química , DNA Complementar/genética , DNA de Plantas/genética , Eletroforese em Gel de Campo Pulsado , Evolução Molecular , Variação Genética , Haplótipos , Dados de Sequência Molecular , Filogenia , Alinhamento de Sequência , Análise de Sequência de DNA , Homologia de Sequência de AminoácidosRESUMO
Based on available DNA sequence data in the HLA region of 4 Mb, we review the degree of polymorphism at 39 loci of which most are involved in the immune system. The extent of nucleotide differences per silent site differs greatly from locus to locus. It is exceptionally high at classical MHC loci, intermediate at six MHC-related pseudogenes as well as at some loci in class I and II regions, and low in the class III region. Different exons of individual MHC loci show also different degrees of silent polymorphism; high in the exons encoding for the peptide binding region (PBR) and low in the exons encoding for trans-membranes and cytoplasmic tails. The degree of polymorphism within MHC allelic lineages is not much smaller than that between allelic lineages, contrary to the expectation where intra-allelic sequence exchanges are restricted. The observation that many allelic lineages at the HLA-DRB1 locus are combinations of distinct motifs in the beta pleated sheet and alpha helix of PBR indicates that sequence exchanges occur even within exon 2. Semi-quantitative analysis is presented about the rate of sequence exchanges between selected and linked neutral regions, although more sequence information is necessary to make definite conclusions. The extraordinary MHC polymorphism is viewed from the dual function of MHC molecules that controls the acquired immune system.
Assuntos
Antígenos HLA/genética , Complexo Principal de Histocompatibilidade , Polimorfismo Genético , Alelos , Evolução Molecular , Conversão Gênica , Predisposição Genética para Doença , Antígenos HLA-DR/genética , Cadeias HLA-DRB1 , Humanos , Modelos Genéticos , Recombinação Genética , Seleção GenéticaRESUMO
Human DRB genes encode beta chains of the major histocompatibility complex (MHC) class II molecules. Although nine DRB loci have been mapped to the short arm of chromosome 6, an individual chromosome contains only one to five loci and is classified into one of five major haplotypes. To elucidate the origin of human DRB loci and haplotypes, intron 1 sequences approximately 5000 bp in length were determined for three DRB1 alleles (DRB1*03, DRB1*04, and DRB1*15) and five DRB genes (DRB2, DRB3, DRB4, DRB5, and DRB7). The sequences were subjected to phylogenetic analyses together with previously determined intron 4 and 5 sequences. The sequences provided two sources of information: Nucleotide substitutions that could be used to construct phylogenetic trees and to estimate divergence times and a set of insertions (mostly Alu elements) that reveal the order of splitting of duplicated genes. The combined data indicate that the ancestor of the human DRB genes was HLA-DRB1*04-like and that the DRB2, DRB7, DRB5, and DRB3 genes arose from this ancestor by four rounds of duplication 58, 56, 53, and 36 million years (MY) ago, respectively. The DRB4 gene may have arisen 46 MY ago by a deletion from the DRB1 and DRB2 genes and the DRB6 gene is probably an allele at the DRB2 locus. During the course of its evolution, the DRB1*04 gene acquired an intron 1 segment (including two Alu elements) from a gene that became the ancestor of DRB1*03. The present-day HLA-DR haplotypes were derived from three principal ancestral haplotypes: DRB1-DRB2, DRB1-DRB5, and DRB1-DRB7.
Assuntos
Antígenos HLA-DR/genética , Haplótipos , Íntrons , Sequência de Bases , Marcadores Genéticos , Humanos , Dados de Sequência Molecular , FilogeniaRESUMO
To examine the nucleotide diversity at silent (synonymous + intron + untranslated) and non-silent (nonsynonymous) sites in chimpanzees and humans, genes at six nuclear loci from two chimpanzees were sequenced. The average silent diversity was 0.19%, which was significantly higher than that in humans (0.05%). This observation suggests a significantly larger effective population size and a higher extent of neutral polymorphism in chimpanzees than in humans. On the other hand, the non-silent nucleotide diversity is similar in both species, resulting in a larger fraction of neutral mutations at non-silent sites in humans than in chimpanzees. Other types of polymorphism data were collected from the literature or databases to examine whether or not they are consistent with the nuclear DNA sequence polymorphism observed here. The nucleotide diversity at both silent and non-silent sites in mitochondrial (mt) DNA genes was compatible with that of the nuclear genes. Microsatellite loci showed a similar high extent of heterozygosity in both species, perhaps due to the combined effect of a high mutation rate and a recent population expansion in humans. At protein loci, humans are more heterozygous than chimpanzees, and the estimated fraction of neutral alleles in humans (0.84) is much larger than that in chimpanzees (0.26). These data show that the neutral fraction in non-silent changes is relatively large in the human population. This difference may be due to a relaxation of the functional constraint against proteins in the human lineage. To evaluate this possibility, it will be necessary to examine nucleotide sequences in relation to the physiological or biochemical properties of proteins.
Assuntos
DNA/genética , Pan troglodytes/genética , Polimorfismo Genético , Proteínas/genética , Aldeído Redutase/genética , Angiotensinogênio/genética , Animais , Sequência de Bases , Proteínas Inativadoras do Complemento 1/genética , Primers do DNA/genética , DNA Mitocondrial/genética , Fator IX/genética , Variação Genética , Genética Populacional , Humanos , Isoenzimas/genética , Repetições de Microssatélites , Receptores Androgênicos/genética , Ribonuclease Pancreático/genética , Especificidade da EspécieRESUMO
The species divergence times and demographic histories of Drosophila melanogaster and its three sibling species, D. mauritiana, D. simulans, and D. yakuba, were investigated using a maximum likelihood (ML) method. Thirty-nine orthologous loci for these four species were retrieved from DDBJ/EMBL/GenBank database. Both autosomal and X-linked loci were used in this study. A significant degree of rate heterogeneity across loci was observed for each pair of species. Most loci have the GC content greater than 50% at the third codon position. The codon usage bias in Drosophila loci is considered to result in the high GC content and the heterogenous rates across loci. The chi-square, G, and Fisher's exact tests indicated that data sets with 11, 23, and 9 pairs of DNA sequences for the comparison of D. melanogaster with D. mauritiana, D. simulans, and D. yakuba, respectively, retain homogeneous rates across loci. We applied the ML method to these data sets to estimate the DNA sequence divergences before and after speciation of each species pair along with their standard deviations. Using 1.6 x 10(-8) as the rate of nucleotide substitutions per silent site per year, our results indicate that the D. melanogaster lineage split from D. yakuba approximately 5.1 +/- 0.8 million years ago (mya), D. mauritiana 2.7 +/- 0.4 mya, and D. simulans 2.3 +/- 0.3 mya. It implies that D. melanogaster became distinct from D. mauritiana and D. simulans at approximately the same time and from D. yakuba no earlier than 10 mya. The effective ancestral population size of D. melanogaster appears to be stable over evolutionary time. Assuming 10 generations per year for Drosophila, the effective population size in the ancestral lineage immediately prior to the time of species divergence is approximately 3 x 10(6), which is close to that estimated for the extant D. melanogaster population. The D. melanogaster did not encounter any obvious bottleneck during the past 10 million years.