RESUMEN
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
Asunto(s)
Cromosomas Humanos Par 16/genética , Variaciones en el Número de Copia de ADN/genética , Evolución Molecular , Predisposición Genética a la Enfermedad , Proteínas/genética , Animales , Trastorno Autístico/genética , Rotura Cromosómica , Duplicación de Gen , Homeostasis/genética , Humanos , Hierro/metabolismo , Pan troglodytes/genética , Pongo/genética , Proteínas/análisis , Recombinación Genética , Especificidad de la Especie , Factores de TiempoRESUMEN
In the Cercopithecini ancestor two chromosomes, homologous to human chromosomes 20 and 21, fused to form the Cercopithecini specific 20/21 association. In some individuals from the genus Cercopithecus, this association was shown to be polymorphic for the position of the centromere, suggesting centromere repositioning events. We set out to test this hypothesis by defining the evolutionary history of the 20/21 association in four Cercopithecini species from three different genera. The marker order of the various 20/21 associations was established using molecular cytogenetic techniques, including an array of more than 100 BACs. We discovered that five different forms of the 20/21 association were present in the four studied Cercopithecini species. Remarkably, in the two Cercopithecus species, we found individuals in which one homolog conserved the ancestral condition, but the other homolog was highly rearranged. The phylogenetic analysis showed that the heterozygosity in these two species originated about 8 million years ago and was maintained for this entire arc of time, surviving multiple speciation events. Our report is a remarkable extension of Dobzhansky's pioneering observation in Drosophila concerning the maintenance of chromosomal heterozygosity due to selective advantage. Dobzhansky's hypothesis recently received strong support in a series of detailed reports on the fruit fly genome. Our findings are first extension to primates, indeed to Old World monkeys phylogenetically close to humans of an analogous situation. Our results have important implications for hypotheses on how chromosome rearrangements, selection, and speciation are related.
Asunto(s)
Cromosomas de los Mamíferos , Evolución Molecular , Haplorrinos/genética , Heterocigoto , Animales , Evolución Biológica , Centrómero , Duplicación Cromosómica , Pintura Cromosómica , Cromosomas Artificiales Bacterianos , Humanos , Hibridación Fluorescente in Situ , CariotipificaciónRESUMEN
Gibbons are small arboreal apes that display an accelerated rate of evolutionary chromosomal rearrangement and occupy a key node in the primate phylogeny between Old World monkeys and great apes. Here we present the assembly and analysis of a northern white-cheeked gibbon (Nomascus leucogenys) genome. We describe the propensity for a gibbon-specific retrotransposon (LAVA) to insert into chromosome segregation genes and alter transcription by providing a premature termination site, suggesting a possible molecular mechanism for the genome plasticity of the gibbon lineage. We further show that the gibbon genera (Nomascus, Hylobates, Hoolock and Symphalangus) experienced a near-instantaneous radiation â¼5 million years ago, coincident with major geographical changes in southeast Asia that caused cycles of habitat compression and expansion. Finally, we identify signatures of positive selection in genes important for forelimb development (TBX5) and connective tissues (COL1A1) that may have been involved in the adaptation of gibbons to their arboreal habitat.
Asunto(s)
Genoma/genética , Hylobates/clasificación , Hylobates/genética , Cariotipo , Filogenia , Animales , Evolución Molecular , Hominidae/clasificación , Hominidae/genética , Humanos , Datos de Secuencia Molecular , Retroelementos/genética , Selección Genética , Terminación de la Transcripción GenéticaRESUMEN
Cytogenetics has historically played a key role in research on squirrel monkey (genus Saimiri) evolutionary biology. Squirrel monkeys have a diploid number of 2n = 44, but vary in fundamental number (FN). Apparently, differences in FN have phylogenetic implications and are correlated with geographic regions. A number of hypothetical mechanisms were proposed to explain difference in FN: translocations, heterochromatin, or, most commonly, pericentric inversions. Recently, an additional mechanism, centromere repositioning, was discovered, which can alter chromosome morphology and FN. Here, we used chromosome banding, chromosome painting, and BAC-FISH to test these hypotheses. We demonstrate that centromere repositioning on chromosomes 5 and 15 is the mechanism that accounts for differences in FN. Current phylogenomic trees of platyrrhines provide a temporal framework for evolutionary new centromeres (ENC) in Saimiri. The X-chromosome ENC could be up to 15 million years (my) old that on chromosome 5 as recent as 0.3 my. The chromosome 15 ENC is intermediate, as young as 2.24 my. All ENC have abundant satellite DNAs indicating that the maturation process was fairly rapid. Callithrix jacchus was used as an outgroup for the BAC-FISH data analysis. Comparison with scaffolds from the S. boliviensis genome revealed an error in the last marmoset genome release. Future research including at the sequence level will provide better understanding of chromosome evolution in Saimiri and other platyrrhines. Probably other cases of differences in chromosome morphology and FN, both within and between taxa, will be shown to be due to centromere repositioning and not pericentric inversions.
Asunto(s)
Centrómero/genética , Cariotipo , Saimiri/genética , Animales , Centrómero/fisiología , Inversión Cromosómica , Pintura Cromosómica , Análisis Citogenético , Evolución Molecular , Filogenia , Translocación GenéticaRESUMEN
Dicentric chromosomes are products of genomic rearrangements that place two centromeres on the same chromosome. Due to the presence of two primary constrictions, they are inherently unstable and overcome their instability by epigenetically inactivating and/or deleting one of the two centromeres, thus resulting in functionally monocentric chromosomes that segregate normally during cell division. Our understanding to date of dicentric chromosome formation, behavior and fate has been largely inferred from observational studies in plants and humans as well as artificially produced de novo dicentrics in yeast and in human cells. We investigate the most recent product of a chromosome fusion event fixed in the human lineage, human chromosome 2, whose stability was acquired by the suppression of one centromere, resulting in a unique difference in chromosome number between humans (46 chromosomes) and our most closely related ape relatives (48 chromosomes). Using molecular cytogenetics, sequencing, and comparative sequence data, we deeply characterize the relicts of the chromosome 2q ancestral centromere and its flanking regions, gaining insight into the ancestral organization that can be easily broadened to all acrocentric chromosome centromeres. Moreover, our analyses offered the opportunity to trace the evolutionary history of rDNA and satellite III sequences among great apes, thus suggesting a new hypothesis for the preferential inactivation of some human centromeres, including IIq. Our results suggest two possible centromere inactivation models to explain the evolutionarily stabilization of human chromosome 2 over the last 5-6 million years. Our results strongly favor centromere excision through a one-step process.
Asunto(s)
Centrómero/genética , Cromosomas Humanos Par 2 , Centrómero/fisiología , ADN Antiguo , Evolución Molecular , Humanos , Translocación GenéticaRESUMEN
Grapevine (Vitis vinifera L.) is one of the world's most important crop plants, which is of large economic value for fruit and wine production. There is much interest in identifying genomic variations and their functional effects on inter-varietal, phenotypic differences. Using an approach developed for the analysis of human and mammalian genomes, which combines high-throughput sequencing, array comparative genomic hybridization, fluorescent in situ hybridization and quantitative PCR, we created an inter-varietal atlas of structural variations and single nucleotide variants (SNVs) for the grapevine genome analyzing four economically and genetically relevant table grapevine varieties. We found 4.8 million SNVs and detected 8% of the grapevine genome to be affected by genomic variations. We identified more than 700 copy number variation (CNV) regions and more than 2000 genes subjected to CNV as potential candidates for phenotypic differences between varieties.
Asunto(s)
Genoma de Planta/genética , Vitis/genética , Hibridación Genómica Comparativa/métodos , Variaciones en el Número de Copia de ADN/genética , Hibridación Fluorescente in Situ , Reacción en Cadena de la PolimerasaRESUMEN
BACKGROUND: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. RESULTS: Here we propose a novel algorithm, VALOR, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of VALOR using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of VALOR against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. CONCLUSIONS: In this paper, we show that VALOR is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using VALOR, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. VALOR is available at https://github.com/BilkentCompGen/VALOR.
Asunto(s)
Genómica/métodos , Inversión de Secuencia/genética , Algoritmos , Genoma Humano/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Secuenciación Completa del GenomaRESUMEN
We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r(2) = 0.65) in contrast to Alu repeats, which show little correlation (r(2) = 0.07). We estimate that the "rate" of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation--the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.
Asunto(s)
Variación Genética , Genoma/genética , Hominidae/genética , Filogenia , Elementos Alu/genética , Animales , Análisis por Conglomerados , Cartilla de ADN/genética , Genómica , Hominidae/clasificación , Humanos , Funciones de Verosimilitud , Elementos de Nucleótido Esparcido Largo/genética , Reacción en Cadena de la Polimerasa , Polimorfismo de Nucleótido Simple , Análisis de Componente Principal , Especificidad de la EspecieRESUMEN
Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.