Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
1.
Nature ; 593(7857): 101-107, 2021 05.
Article in English | MEDLINE | ID: mdl-33828295

ABSTRACT

The complete assembly of each human chromosome is essential for understanding human biology and evolution1,2. Here we use complementary long-read sequencing technologies to complete the linear assembly of human chromosome 8. Our assembly resolves the sequence of five previously long-standing gaps, including a 2.08-Mb centromeric α-satellite array, a 644-kb copy number polymorphism in the ß-defensin gene cluster that is important for disease risk, and an 863-kb variable number tandem repeat at chromosome 8q21.2 that can function as a neocentromere. We show that the centromeric α-satellite array is generally methylated except for a 73-kb hypomethylated region of diverse higher-order α-satellites enriched with CENP-A nucleosomes, consistent with the location of the kinetochore. In addition, we confirm the overall organization and methylation pattern of the centromere in a diploid human genome. Using a dual long-read sequencing approach, we complete high-quality draft assemblies of the orthologous centromere from chromosome 8 in chimpanzee, orangutan and macaque to reconstruct its evolutionary history. Comparative and phylogenetic analyses show that the higher-order α-satellite structure evolved in the great ape ancestor with a layered symmetry, in which more ancient higher-order repeats locate peripherally to monomeric α-satellites. We estimate that the mutation rate of centromeric satellite DNA is accelerated by more than 2.2-fold compared to the unique portions of the genome, and this acceleration extends into the flanking sequence.


Subject(s)
Chromosomes, Human, Pair 8/chemistry , Chromosomes, Human, Pair 8/genetics , Evolution, Molecular , Animals , Cell Line , Centromere/chemistry , Centromere/genetics , Centromere/metabolism , Chromosomes, Human, Pair 8/physiology , DNA Methylation , DNA, Satellite/genetics , Epigenesis, Genetic , Female , Humans , Macaca mulatta/genetics , Male , Minisatellite Repeats/genetics , Pan troglodytes/genetics , Phylogeny , Pongo abelii/genetics , Telomere/chemistry , Telomere/genetics , Telomere/metabolism
2.
Nature ; 594(7861): 77-81, 2021 06.
Article in English | MEDLINE | ID: mdl-33953399

ABSTRACT

The divergence of chimpanzee and bonobo provides one of the few examples of recent hominid speciation1,2. Here we describe a fully annotated, high-quality bonobo genome assembly, which was constructed without guidance from reference genomes by applying a multiplatform genomics approach. We generate a bonobo genome assembly in which more than 98% of genes are completely annotated and 99% of the gaps are closed, including the resolution of about half of the segmental duplications and almost all of the full-length mobile elements. We compare the bonobo genome to those of other great apes1,3-5 and identify more than 5,569 fixed structural variants that specifically distinguish the bonobo and chimpanzee lineages. We focus on genes that have been lost, changed in structure or expanded in the last few million years of bonobo evolution. We produce a high-resolution map of incomplete lineage sorting and estimate that around 5.1% of the human genome is genetically closer to chimpanzee or bonobo and that more than 36.5% of the genome shows incomplete lineage sorting if we consider a deeper phylogeny including gorilla and orangutan. We also show that 26% of the segments of incomplete lineage sorting between human and chimpanzee or human and bonobo are non-randomly distributed and that genes within these clustered segments show significant excess of amino acid replacement compared to the rest of the genome.


Subject(s)
Evolution, Molecular , Genome/genetics , Genomics , Pan paniscus/genetics , Phylogeny , Animals , Eukaryotic Initiation Factor-4A/genetics , Female , Genes , Gorilla gorilla/genetics , Molecular Sequence Annotation/standards , Pan troglodytes/genetics , Pongo/genetics , Segmental Duplications, Genomic , Sequence Analysis, DNA
3.
Genome Res ; 32(10): 1941-1951, 2022 10.
Article in English | MEDLINE | ID: mdl-36180231

ABSTRACT

Gibbons are the most speciose family of living apes, characterized by a diverse chromosome number and rapid rate of large-scale rearrangements. Here we performed single-cell template strand sequencing (Strand-seq), molecular cytogenetics, and deep in silico analysis of a southern white-cheeked gibbon genome, providing the first comprehensive map of 238 previously hidden small-scale inversions. We determined that more than half are gibbon specific, at least fivefold higher than shown for other primate lineage-specific inversions, with a significantly high number of small heterozygous inversions, suggesting that accelerated evolution of inversions may have played a role in the high sympatric diversity of gibbons. Although the precise mechanisms underlying these inversions are not yet understood, it is clear that segmental duplication-mediated NAHR only accounts for a small fraction of events. Several genomic features, including gene density and repeat (e.g., LINE-1) content, might render these regions more break-prone and susceptible to inversion formation. In the attempt to characterize interspecific variation between southern and northern white-cheeked gibbons, we identify several large assembly errors in the current GGSC Nleu3.0/nomLeu3 reference genome comprising more than 49 megabases of DNA. Finally, we provide a list of 182 candidate genes potentially involved in gibbon diversification and speciation.


Subject(s)
Hominidae , Hylobates , Animals , Hylobates/genetics , Genome , Primates/genetics , Chromosome Inversion/genetics , Chromosomes , Hominidae/genetics
4.
Genome Res ; 30(11): 1680-1693, 2020 11.
Article in English | MEDLINE | ID: mdl-33093070

ABSTRACT

Rhesus macaque is an Old World monkey that shared a common ancestor with human ∼25 Myr ago and is an important animal model for human disease studies. A deep understanding of its genetics is therefore required for both biomedical and evolutionary studies. Among structural variants, inversions represent a driving force in speciation and play an important role in disease predisposition. Here we generated a genome-wide map of inversions between human and macaque, combining single-cell strand sequencing with cytogenetics. We identified 375 total inversions between 859 bp and 92 Mbp, increasing by eightfold the number of previously reported inversions. Among these, 19 inversions flanked by segmental duplications overlap with recurrent copy number variants associated with neurocognitive disorders. Evolutionary analyses show that in 17 out of 19 cases, the Hominidae orientation of these disease-associated regions is always derived. This suggests that duplicated sequences likely played a fundamental role in generating inversions in humans and great apes, creating architectures that nowadays predispose these regions to disease-associated genetic instability. Finally, we identified 861 genes mapping at 156 inversions breakpoints, with some showing evidence of differential expression in human and macaque cell lines, thus highlighting candidates that might have contributed to the evolution of species-specific features. This study depicts the most accurate fine-scale map of inversions between human and macaque using a two-pronged integrative approach, such as single-cell strand sequencing and cytogenetics, and represents a valuable resource toward understanding of the biology and evolution of primate species.


Subject(s)
Chromosome Breakpoints , Chromosome Inversion , Evolution, Molecular , Macaca mulatta/genetics , Animals , Disease/genetics , Gene Expression Regulation , Genome , Genomics , Heterozygote , Humans , In Situ Hybridization, Fluorescence , Recombination, Genetic , Sequence Analysis, DNA , Single-Cell Analysis
5.
Science ; 376(6588): eabj6965, 2022 04.
Article in English | MEDLINE | ID: mdl-35357917

ABSTRACT

Despite their importance in disease and evolution, highly identical segmental duplications (SDs) are among the last regions of the human reference genome (GRCh38) to be fully sequenced. Using a complete telomere-to-telomere human genome (T2T-CHM13), we present a comprehensive view of human SD organization. SDs account for nearly one-third of the additional sequence, increasing the genome-wide estimate from 5.4 to 7.0% [218 million base pairs (Mbp)]. An analysis of 268 human genomes shows that 91% of the previously unresolved T2T-CHM13 SD sequence (68.3 Mbp) better represents human copy number variation. Comparing long-read assemblies from human (n = 12) and nonhuman primate (n = 5) genomes, we systematically reconstruct the evolution and structural haplotype diversity of biomedically relevant and duplicated genes. This analysis reveals patterns of structural heterozygosity and evolutionary differences in SD organization between humans and other primates.


Subject(s)
DNA Copy Number Variations , Gene Duplication , Genome, Human , Segmental Duplications, Genomic , Evolution, Molecular , GTPase-Activating Proteins/genetics , Humans , Polymorphism, Single Nucleotide , Proto-Oncogene Proteins/genetics
6.
J Med Case Rep ; 15(1): 208, 2021 Apr 21.
Article in English | MEDLINE | ID: mdl-33883018

ABSTRACT

BACKGROUND: Mayer-Rokitansky-Küster-Hauser (MRKH) syndrome (Online Mendelian Inheritance in Man [OMIM] #277000) is a congenital condition characterized by the total or partial agenesis of vagina and uterus. Agenesis can be isolated (MRKH 1) or associated with other renal, vertebral or cardiac defects (MRKH 2). CASE PRESENTATION: In this paper, we report a case of a Caucasian patient showing the clinical signs associated with MRKH. Array-based comparative genomic hybridization (a-CGH) analysis revealed a microduplication of approximately 3.01 megabases (Mb) located on the long arm of chromosome 22 (22q11.21). Microduplications affecting the 22q11.21 region have been shown to be associated with MRKH syndrome and Müllerian aplasia. The phenotype of patients with 22q11.2 duplication (OMIM #608363) appears extremely variable, ranging from apparently normal to mild learning difficulties or with multiple defects, sharing features with DiGeorge/velocardiofacial (DGS/VCFS) syndrome. CONCLUSIONS: The altered gene expression together with other genetic, nongenetic, epigenetic or environmental factors can cause the extremely variable phenotype in patients carrying such duplication. Therefore, we can consider MRKH syndrome to be one of the clinical features of DGS/VCFS syndrome.


Subject(s)
46, XX Disorders of Sex Development , Congenital Abnormalities , 46, XX Disorders of Sex Development/genetics , Comparative Genomic Hybridization , Congenital Abnormalities/genetics , Female , Humans , Mullerian Ducts/abnormalities , Vagina
7.
Genes (Basel) ; 11(2)2020 02 18.
Article in English | MEDLINE | ID: mdl-32085667

ABSTRACT

POTE (prostate, ovary, testis, and placenta expressed) genes belong to a primate-specific gene family expressed in prostate, ovary, and testis as well as in several cancers including breast, prostate, and lung cancers. Due to their tumor-specific expression, POTEs are potential oncogenes, therapeutic targets, and biomarkers for these malignancies. This gene family maps within human and primate segmental duplications with a copy number ranging from two to 14 in different species. Due to the high sequence identity among the gene copies, specific efforts are needed to assemble these loci in order to correctly define the organization and evolution of the gene family. Using single-molecule, real-time (SMRT) sequencing, in silico analyses, and molecular cytogenetics, we characterized the structure, copy number, and chromosomal distribution of the POTE genes, as well as their expression in normal and disease tissues, and provided a comparative analysis of the POTE organization and gene structure in primate genomes. We were able, for the first time, to de novo sequence and assemble a POTE tandem duplication in marmoset that is misassembled and collapsed in the reference genome, thus revealing the presence of a second POTE copy. Taken together, our findings provide comprehensive insights into the evolutionary dynamics of the primate-specific POTE gene family, involving gene duplications, deletions, and long interspersed nuclear element (LINE) transpositions to explain the actual repertoire of these genes in human and primate genomes.


Subject(s)
Multigene Family , Ovary/chemistry , Placenta/chemistry , Primates/genetics , Prostate/chemistry , Testis/chemistry , Animals , Chromosome Mapping , Computer Simulation , Evolution, Molecular , Female , Gene Expression Profiling , Gene Expression Regulation , Humans , Male , Pregnancy , Single Molecule Imaging , Tissue Distribution
8.
Nat Genet ; 52(8): 849-858, 2020 08.
Article in English | MEDLINE | ID: mdl-32541924

ABSTRACT

Inversions play an important role in disease and evolution but are difficult to characterize because their breakpoints map to large repeats. We increased by sixfold the number (n = 1,069) of previously reported great ape inversions by using single-cell DNA template strand and long-read sequencing. We find that the X chromosome is most enriched (2.5-fold) for inversions, on the basis of its size and duplication content. There is an excess of differentially expressed primate genes near the breakpoints of large (>100 kilobases (kb)) inversions but not smaller events. We show that when great ape lineage-specific duplications emerge, they preferentially (approximately 75%) occur in an inverted orientation compared to that at their ancestral locus. We construct megabase-pair scale haplotypes for individual chromosomes and identify 23 genomic regions that have recurrently toggled between a direct and an inverted state over 15 million years. The direct orientation is most frequently the derived state for human polymorphisms that predispose to recurrent copy number variants associated with neurodevelopmental disease.


Subject(s)
Chromosome Inversion/genetics , Genome/genetics , Hominidae/genetics , Animals , Chromosomes/genetics , DNA Copy Number Variations/genetics , Evolution, Molecular , Female , Haplotypes/genetics , Humans , Male
9.
Science ; 370(6523)2020 12 18.
Article in English | MEDLINE | ID: mdl-33335035

ABSTRACT

The rhesus macaque (Macaca mulatta) is the most widely studied nonhuman primate (NHP) in biomedical research. We present an updated reference genome assembly (Mmul_10, contig N50 = 46 Mbp) that increases the sequence contiguity 120-fold and annotate it using 6.5 million full-length transcripts, thus improving our understanding of gene content, isoform diversity, and repeat organization. With the improved assembly of segmental duplications, we discovered new lineage-specific genes and expanded gene families that are potentially informative in studies of evolution and disease susceptibility. Whole-genome sequencing (WGS) data from 853 rhesus macaques identified 85.7 million single-nucleotide variants (SNVs) and 10.5 million indel variants, including potentially damaging variants in genes associated with human autism and developmental delay, providing a framework for developing noninvasive NHP models of human disease.


Subject(s)
Genetic Predisposition to Disease , Genome , Macaca mulatta/genetics , Polymorphism, Single Nucleotide , Animals , Genetic Variation , Humans , Molecular Sequence Annotation , Whole Genome Sequencing
SELECTION OF CITATIONS
SEARCH DETAIL