Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
1.
Nature ; 585(7823): 79-84, 2020 09.
Article in English | MEDLINE | ID: mdl-32663838

ABSTRACT

After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist1,2. Here we present a human genome assembly that surpasses the continuity of GRCh382, along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome3, we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.


Subject(s)
Chromosomes, Human, X/genetics , Genome, Human/genetics , Telomere/genetics , Centromere/genetics , CpG Islands/genetics , DNA Methylation , DNA, Satellite/genetics , Female , Humans , Hydatidiform Mole/genetics , Male , Pregnancy , Reproducibility of Results , Testis/metabolism
2.
PLoS Genet ; 13(2): e1006481, 2017 02.
Article in English | MEDLINE | ID: mdl-28158191

ABSTRACT

ATP6V1H is a component of a large protein complex with vacuolar ATPase (V-ATPase) activity. We identified two generations of individuals in which short stature and osteoporosis co-segregated with a mutation in ATP6V1H. Since V-ATPases are highly conserved between human and zebrafish, we generated loss-of-function mutants in atp6v1h in zebrafish through CRISPR/Cas9-mediated gene knockout. Homozygous mutant atp6v1h zebrafish exhibited a severe reduction in the number of mature calcified bone cells and a dramatic increase in the expression of mmp9 and mmp13. Heterozygous adults showed curved vertebra that lack calcified centrum structure and reduced bone mass and density. Treatment of mutant embryos with small molecule inhibitors of MMP9 and MMP13 significantly restored bone mass in the atp6v1h mutants. These studies have uncovered a new, ATP6V1H-mediated pathway that regulates bone formation, and defines a new mechanism of disease that leads to bone loss. We propose that MMP9/MMP13 could be therapeutic targets for patients with this rare genetic disease.


Subject(s)
Bone Development/genetics , Matrix Metalloproteinase 13/genetics , Matrix Metalloproteinase 9/genetics , Osteoporosis/genetics , Vacuolar Proton-Translocating ATPases/genetics , Adult , Animals , Bone Density/genetics , CRISPR-Cas Systems , Chondrocytes/metabolism , Chondrocytes/pathology , Humans , Matrix Metalloproteinase 13/biosynthesis , Matrix Metalloproteinase 9/biosynthesis , Mice , Mutation , Osteoporosis/metabolism , Osteoporosis/pathology , Signal Transduction/genetics , Vacuolar Proton-Translocating ATPases/deficiency , Zebrafish/genetics , Zebrafish/growth & development
4.
Am J Hum Genet ; 97(1): 99-110, 2015 Jul 02.
Article in English | MEDLINE | ID: mdl-26119818

ABSTRACT

Ablepharon macrostomia syndrome (AMS) and Barber-Say syndrome (BSS) are rare congenital ectodermal dysplasias characterized by similar clinical features. To establish the genetic basis of AMS and BSS, we performed extensive clinical phenotyping, whole exome and candidate gene sequencing, and functional validations. We identified a recurrent de novo mutation in TWIST2 in seven independent AMS-affected families, as well as another recurrent de novo mutation affecting the same amino acid in ten independent BSS-affected families. Moreover, a genotype-phenotype correlation was observed, because the two syndromes differed based solely upon the nature of the substituting amino acid: a lysine at TWIST2 residue 75 resulted in AMS, whereas a glutamine or alanine yielded BSS. TWIST2 encodes a basic helix-loop-helix transcription factor that regulates the development of mesenchymal tissues. All identified mutations fell in the basic domain of TWIST2 and altered the DNA-binding pattern of Flag-TWIST2 in HeLa cells. Comparison of wild-type and mutant TWIST2 expressed in zebrafish identified abnormal developmental phenotypes and widespread transcriptome changes. Our results suggest that autosomal-dominant TWIST2 mutations cause AMS or BSS by inducing protean effects on the transcription factor's DNA binding.


Subject(s)
Abnormalities, Multiple/genetics , Eye Abnormalities/genetics , Eyelid Diseases/genetics , Hirsutism/genetics , Hypertelorism/genetics , Hypertrichosis/genetics , Macrostomia/genetics , Models, Molecular , Phenotype , Repressor Proteins/genetics , Skin Abnormalities/genetics , Twist-Related Protein 1/genetics , Abnormalities, Multiple/pathology , Amino Acid Sequence , Animals , Base Sequence , Chromatin Immunoprecipitation , Exome/genetics , Eye Abnormalities/pathology , Eyelid Diseases/pathology , HeLa Cells , Hirsutism/pathology , Humans , Hypertelorism/pathology , Hypertrichosis/pathology , Macrostomia/pathology , Microscopy, Electron , Molecular Sequence Data , Mutation, Missense/genetics , Protein Conformation , Repressor Proteins/chemistry , Sequence Analysis, DNA , Skin Abnormalities/pathology , Twist-Related Protein 1/chemistry , Zebrafish
5.
Am J Med Genet A ; 170(12): 3106-3114, 2016 12.
Article in English | MEDLINE | ID: mdl-27513830

ABSTRACT

Failure to thrive arises as a complication of a heterogeneous group of disorders. We describe two female siblings with spastic paraplegia and global developmental delay but also, atypically for the HSPs, poor weight gain classified as failure to thrive. After extensive clinical and biochemical investigations failed to identify the etiology, we used exome sequencing to identify biallelic UNC80 mutations (NM_032504.1:c.[3983-3_3994delinsA];[2431C>T]. The paternally inherited NM_032504.1:c.3983-3_3994delinsA is predicted to encode p.Ser1328Argfs*19 and the maternally inherited NM_032504.1:c.2431C>T is predicted to encode p.Arg811*. No UNC80 mRNA was detectable in patient cultured skin fibroblasts, suggesting UNC80 loss of function by nonsense mediated mRNA decay. Further supporting the UNC80 mutations as causative of these siblings' disorder, biallelic mutations in UNC80 have recently been described among individuals with an overlapping phenotype. This report expands the disease spectrum associated with UNC80 mutations. © 2016 Wiley Periodicals, Inc.


Subject(s)
Carrier Proteins/genetics , Developmental Disabilities/genetics , Failure to Thrive/genetics , Membrane Proteins/genetics , Paraplegia/genetics , Child , Child, Preschool , Developmental Disabilities/complications , Developmental Disabilities/physiopathology , Exome/genetics , Failure to Thrive/complications , Failure to Thrive/physiopathology , Female , Genetic Predisposition to Disease , High-Throughput Nucleotide Sequencing , Humans , Paraplegia/complications , Paraplegia/physiopathology , RNA Stability/genetics , Siblings
6.
BMC Genomics ; 16: 998, 2015 Nov 25.
Article in English | MEDLINE | ID: mdl-26602380

ABSTRACT

BACKGROUND: Whole-exome sequencing (WES) is rapidly evolving into a tool of choice for rapid, and inexpensive identification of molecular genetic lesions within targeted regions of the human genome. While biases in WES coverage of nucleotides in targeted regions are recognized, it is not well understood how repetition of WES improves the interpretation of sequencing results in a clinical diagnostic setting. METHOD: To address this, we compared independently generated exome-capture of six individuals from three-generations sequenced in triplicate. This generated between 48x-86x mean target depth of high-quality mapped bases (>Q20) for each technical replicate library. Cumulatively, we achieved 179 - 208x average target coverage for each individual in the pedigree. Using this experimental design, we evaluated stochastics in WES interpretation, genotyping sensitivity, and accuracy to detect de novo variants. RESULTS: In this study, we show that repetition of WES improved the interpretation of the capture target regions after aggregating the data (93.5 - 93.9 %). Compared to 81.2 - 89.6 % (50.2-55.4 Mb of 61.7 M) coverage of targeted bases at ≥20x in the individual technical replicates, the aggregated data covered 93.5 - 93.9 % of targeted bases (57.7 - 58.0 of 61.7 M) at ≥20x threshold, suggesting a 4.3 - 12.7 % improvement in coverage. Each individual's aggregate dataset recovered 3.4 - 6.4 million bases within variable targeted regions. We uncovered technical variability (2-5 %) inherent to WES technique. We also show improved interpretation in assessing clinically important regions that lack interpretation under current conditions, affecting 12-16 of the 56 genes recommended for secondary analysis by American College of Medical Genetics (ACMG). We demonstrate that comparing technical replicate WES datasets and their derived aggregate data can effectively address overall WES genotyping discrepancies. CONCLUSION: We describe a method to evaluate the reproducibility and stochastics in exome library preparation, and delineate the advantages of aggregating the data derived from technical replicates. The implications of this study are directly applicable to improved experimental design and provide an opportunity to rapidly, efficiently, and accurately arrive at reliable candidate nucleotide variants.


Subject(s)
Exome , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Gene Library , Humans , Pedigree
7.
Am J Med Genet A ; 167(6): 1374-80, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25845469

ABSTRACT

Intellectual disability (ID) is a heterogeneous condition arising from a variety of environmental and genetic factors. Among these causes are defects in transcriptional regulators. Herein, we report on two brothers in a nonconsanguineous family with novel compound heterozygous, disease-segregating mutations (NM_015979.3: [3656A > G];[4006C > T], NP_057063.2: [H1219R];[R1336X]) in MED23. This gene encodes a subunit of the Mediator complex that modulates the expression of RNA polymerase II-dependent genes. These brothers, who had profound ID, spasticity, congenital heart disease, brain abnormalities, and atypical electroencephalography, represent the first case of MED23-associated ID in a non-consanguineous family. They also expand upon the clinical features previously reported for mutations in this gene.


Subject(s)
Abnormalities, Multiple/genetics , Heart Defects, Congenital/genetics , Intellectual Disability/genetics , Mediator Complex/genetics , Mutation, Missense , Abnormalities, Multiple/diagnosis , Abnormalities, Multiple/pathology , Child , Child, Preschool , Exome , Gene Expression , Heart Defects, Congenital/diagnosis , Heart Defects, Congenital/pathology , Heterozygote , High-Throughput Nucleotide Sequencing , Humans , Intellectual Disability/diagnosis , Intellectual Disability/pathology , Male , Proto-Oncogene Proteins c-fos/genetics , Proto-Oncogene Proteins c-fos/metabolism , Proto-Oncogene Proteins c-jun/genetics , Proto-Oncogene Proteins c-jun/metabolism , Siblings
8.
Front Med (Lausanne) ; 4: 62, 2017.
Article in English | MEDLINE | ID: mdl-28603714

ABSTRACT

Traditionally, the use of genomic information for personalized medical decisions relies on prior discovery and validation of genotype-phenotype associations. This approach constrains care for patients presenting with undescribed problems. The National Institutes of Health (NIH) Undiagnosed Diseases Program (UDP) hypothesized that defining disease as maladaptation to an ecological niche allows delineation of a logical framework to diagnose and evaluate such patients. Herein, we present the philosophical bases, methodologies, and processes implemented by the NIH UDP. The NIH UDP incorporated use of the Human Phenotype Ontology, developed a genomic alignment strategy cognizant of parental genotypes, pursued agnostic biochemical analyses, implemented functional validation, and established virtual villages of global experts. This systematic approach provided a foundation for the diagnostic or non-diagnostic answers provided to patients and serves as a paradigm for scalable translational research.

9.
Orphanet J Rare Dis ; 11(1): 62, 2016 05 14.
Article in English | MEDLINE | ID: mdl-27179618

ABSTRACT

BACKGROUND: Mutations of TCF4, which encodes a basic helix-loop-helix transcription factor, cause Pitt-Hopkins syndrome (PTHS) via multiple genetic mechanisms. TCF4 is a complex locus expressing multiple transcripts by alternative splicing and use of multiple promoters. To address the relationship between mutation of these transcripts and phenotype, we report a three-generation family segregating mild intellectual disability with a chromosomal translocation disrupting TCF4. RESULTS: Using whole genome sequencing, we detected a complex unbalanced karyotype disrupting TCF4 (46,XY,del(14)(q23.3q23.3)del(18)(q21.2q21.2)del(18)(q21.2q21.2)inv(18)(q21.2q21.2)t(14;18)(q23.3;q21.2)(14pter®14q23.3::18q21.2®18q21.2::18q21.1®18qter;18pter®18q21.2::14q23.3®14qter). Subsequent transcriptome sequencing, qRT-PCR and nCounter analyses revealed that cultured skin fibroblasts and peripheral blood had normal expression of genes along chromosomes 14 or 18 and no marked changes in expression of genes other than TCF4. Affected individuals had 12-33 fold higher mRNA levels of TCF4 than did unaffected controls or individuals with PTHS. Although the derivative chromosome generated a PLEKHG3-TCF4 fusion transcript, the increased levels of TCF4 mRNA arose from transcript variants originating distal to the translocation breakpoint, not from the fusion transcript. CONCLUSIONS: Although validation in additional patients is required, our findings suggest that the dysmorphic features and severe intellectual disability characteristic of PTHS are partially rescued by overexpression of those short TCF4 transcripts encoding a nuclear localization signal, a transcription activation domain, and the basic helix-loop-helix domain.


Subject(s)
Basic Helix-Loop-Helix Leucine Zipper Transcription Factors/genetics , Intellectual Disability/genetics , Protein Isoforms/genetics , Transcription Factors/genetics , Translocation, Genetic/genetics , Alternative Splicing/genetics , Child , Facies , Female , Humans , Hyperventilation/genetics , Mutation/genetics , Polymerase Chain Reaction , Polymorphism, Single Nucleotide/genetics , Promoter Regions, Genetic/genetics , Transcription Factor 4
10.
Front Med (Lausanne) ; 3: 39, 2016.
Article in English | MEDLINE | ID: mdl-27785453

ABSTRACT

The National Institutes of Health Undiagnosed Diseases Program (NIH UDP) applies translational research systematically to diagnose patients with undiagnosed diseases. The challenge is to implement an information system enabling scalable translational research. The authors hypothesized that similar complex problems are resolvable through process management and the distributed cognition of communities. The team, therefore, built the NIH UDP integrated collaboration system (UDPICS) to form virtual collaborative multidisciplinary research networks or communities. UDPICS supports these communities through integrated process management, ontology-based phenotyping, biospecimen management, cloud-based genomic analysis, and an electronic laboratory notebook. UDPICS provided a mechanism for efficient, transparent, and scalable translational research and thereby addressed many of the complex and diverse research and logistical problems of the NIH UDP. Full definition of the strengths and deficiencies of UDPICS will require formal qualitative and quantitative usability and process improvement measurement.

11.
Orphanet J Rare Dis ; 10: 27, 2015 Mar 07.
Article in English | MEDLINE | ID: mdl-25888122

ABSTRACT

BACKGROUND: Snyder-Robinson Syndrome (SRS) is an X-linked intellectual disability disorder also characterized by osteoporosis, scoliosis, and dysmorphic facial features. It is caused by mutations in SMS, a ubiquitously expressed gene encoding the polyamine biosynthetic enzyme spermine synthase. We hypothesized that the tissue specificity of SRS arises from differential sensitivity to spermidine toxicity or spermine deficiency. METHODS: We performed detailed clinical, endocrine, histopathologic, and morphometric studies on two affected brothers with a spermine synthase loss of function mutation (NM_004595.4:c.443A > G, p.Gln148Arg). We also measured spermine and spermidine levels in cultured human bone marrow stromal cells (hBMSCs) and fibroblasts using the Biochrom 30 polyamine protocol and assessed the osteogenic potential of hBMSCs. RESULTS: In addition to the known tissue-specific features of SRS, the propositi manifested retinal pigmentary changes, recurrent episodes of hyper- and hypoglycemia, nephrocalcinosis, renal cysts, and frequent respiratory infections. Bone histopathology and morphometry identified a profound depletion of osteoblasts and osteoclasts, absence of a trabecular meshwork, a low bone volume and a thin cortex. Comparison of cultured fibroblasts from affected and unaffected individuals showed relatively small changes in polyamine content, whereas comparison of cultured osteoblasts identified marked differences in spermidine and spermine content. Osteogenic differentiation of the SRS-derived hBMSCs identified a severe deficiency of calcium phosphate mineralization. CONCLUSIONS: Our findings support the hypothesis that cell specific alterations in polyamine metabolism contribute to the tissue specificity of SRS features, and that the low bone density arises from a failure of mineralization.


Subject(s)
Mental Retardation, X-Linked/pathology , Osteoblasts/pathology , Osteoclasts/pathology , Osteoporosis/pathology , Fibroblasts/metabolism , Humans , Male , Mental Retardation, X-Linked/metabolism , Mesenchymal Stem Cells/metabolism , Mutation , Osteoblasts/metabolism , Osteoclasts/metabolism , Osteoporosis/metabolism , Spermidine/metabolism , Spermine/metabolism , Spermine Synthase/genetics , Spermine Synthase/metabolism
12.
PLoS One ; 5(12): e14346, 2010 Dec 16.
Article in English | MEDLINE | ID: mdl-21179557

ABSTRACT

BACKGROUND: Myelin protein zero (MPZ) is a critical structural component of myelin in the peripheral nervous system. The MPZ gene is regulated, in part, by the transcription factors SOX10 and EGR2. Mutations in MPZ, SOX10, and EGR2 have been implicated in demyelinating peripheral neuropathies, suggesting that components of this transcriptional network are candidates for harboring disease-causing mutations (or otherwise functional variants) that affect MPZ expression. METHODOLOGY: We utilized a combination of multi-species sequence comparisons, transcription factor-binding site predictions, targeted human DNA re-sequencing, and in vitro and in vivo enhancer assays to study human non-coding MPZ variants. PRINCIPAL FINDINGS: Our efforts revealed a variant within the first intron of MPZ that resides within a previously described SOX10 binding site is associated with decreased enhancer activity, and alters binding of nuclear proteins. Additionally, the genomic segment harboring this variant directs tissue-relevant reporter gene expression in zebrafish. CONCLUSIONS: This is the first reported MPZ variant within a cis-acting transcriptional regulatory element. While we were unable to implicate this variant in disease onset, our data suggests that similar non-coding sequences should be screened for mutations in patients with neurological disease. Furthermore, our multi-faceted approach for examining the functional significance of non-coding variants can be readily generalized to study other loci important for myelin structure and function.


Subject(s)
Enhancer Elements, Genetic , Myelin P0 Protein/genetics , Animals , Binding Sites , Gene Expression Regulation , Genetic Variation , Humans , In Vitro Techniques , Mice , Mutation , Myelin P0 Protein/metabolism , Rats , SOXE Transcription Factors/metabolism , Sequence Analysis, DNA , Species Specificity , Transcription Factors/metabolism , Zebrafish
13.
Genome Res ; 17(6): 760-74, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17567995

ABSTRACT

A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization.


Subject(s)
Evolution, Molecular , Genome, Human , Mammals/genetics , Open Reading Frames , Phylogeny , Sequence Alignment , Animals , Human Genome Project , Humans
14.
Proc Natl Acad Sci U S A ; 102(9): 3354-9, 2005 Mar 01.
Article in English | MEDLINE | ID: mdl-15718282

ABSTRACT

Sequencing and comparative analyses of genomes from multiple vertebrates are providing insights about the genetic basis for biological diversity. To date, these efforts largely have focused on eutherian mammals, chicken, and fish. In this article, we describe the generation and study of genomic sequences from noneutherian mammals, a group of species occupying unusual phylogenetic positions. A large sequence data set (totaling >5 Mb) was generated for the same orthologous region in three marsupial (North American opossum, South American opossum, and Australian tammar wallaby) and one monotreme (platypus) genomes. These ancient mammalian genomes are characterized by unusual architectural features with respect to G + C and repeat content, as well as compression relative to human. Approximately 14% and 34% of the human sequence forms alignments with the orthologous sequence from platypus and the marsupials, respectively; these numbers are distinctly lower than that observed with nonprimate eutherian mammals (45-70%). The alignable sequences between human and each marsupial species are not completely overlapping (only 80% common to all three species) nor are the platypus-alignable sequences completely contained within the marsupial-alignable sequences. Phylogenetic analysis of synonymous coding positions reveals that platypus has a notably long branch length, with the human-platypus substitution rate being on average 55% greater than that seen with human-marsupial pairs. Finally, analyses of the major mammalian lineages reveal distinct patterns with respect to the common presence of evolutionarily conserved vertebrate sequences. Our results confirm that genomic sequence from noneutherian mammals can contribute uniquely to unraveling the functional and evolutionary histories of the mammalian genome.


Subject(s)
Genome , Marsupialia/genetics , Monotremata/genetics , Animals , Biological Evolution , Humans , Molecular Sequence Data , Phylogeny
15.
Genome Res ; 12(8): 1277-85, 2002 Aug.
Article in English | MEDLINE | ID: mdl-12176935

ABSTRACT

Comparison is a fundamental tool for analyzing DNA sequence. Interspecies sequence comparison is particularly powerful for inferring genome function and is based on the simple premise that conserved sequences are likely to be important. Thus, the comparison of a genomic sequence with its orthologous counterpart from another species is increasingly becoming an integral component of genome analysis. In ideal situations, such comparisons are performed with orthologous sequences from multiple species. To facilitate multispecies comparative sequence analysis, a robust and scalable strategy for simultaneously constructing sequence-ready bacterial artificial chromosome (BAC) contig maps from targeted genomic regions has been developed. Central to this approach is the generation and utilization of "universal" oligonucleotide-based hybridization probes ("overgo" probes), which are designed from sequences that are highly conserved between distantly related species. Large collections of these probes are used en masse to screen BAC libraries from multiple species in parallel, with the isolated clones assembled into physical contig maps. To validate the effectiveness of this strategy, efforts were focused on the construction of BAC-based physical maps from multiple mammalian species (chimpanzee, baboon, cat, dog, cow, and pig). Using available human and mouse genomic sequence and a newly developed computer program to design the requisite probes, sequence-ready maps were constructed in all species for a series of targeted regions totaling approximately 16 Mb in the human genome. The described approach can be used to facilitate the multispecies comparative sequencing of targeted genomic regions and can be adapted for constructing BAC contig maps in other vertebrates.


Subject(s)
Cloning, Molecular/methods , Contig Mapping/methods , DNA/genetics , Sequence Homology, Nucleic Acid , Animals , Cats , Cattle , Chromosomes, Artificial, Bacterial/genetics , DNA Probes/genetics , Dogs , Genome , Genome, Human , Genomic Library , Humans , Internet , Mice , Nucleic Acid Hybridization/methods , Pan troglodytes , Papio , Species Specificity , Swine
16.
Genome Res ; 13(1): 55-63, 2003 Jan.
Article in English | MEDLINE | ID: mdl-12529306

ABSTRACT

Duplications have long been postulated to be an important mechanism by which genomes evolve. Interspecies genomic comparisons are one method by which the origin and molecular mechanism of duplications can be inferred. By comparative mapping in human, mouse, and rat, we previously found evidence for a recent chromosome-fission event that occurred in the mouse lineage. Cytogenetic mapping revealed that the genomic segments flanking the fission site appeared to be duplicated, with copies residing near the centromere of multiple mouse chromosomes. Here we report the mapping and sequencing of the regions of mouse chromosomes 5 and 6 involved in this chromosome-fission event as well as the results of comparative sequence analysis with the orthologous human and rat genomic regions. Our data indicate that the duplications associated with mouse chromosomes 5 and 6 are recent and that the resulting duplicated segments share significant sequence similarity with a series of regions near the centromeres of the mouse chromosomes previously identified by cytogenetic mapping. We also identified pericentromeric duplicated segments shared between mouse chromosomes 5 and 1. Finally, novel mouse satellite sequences as well as putative chimeric transcripts were found to be associated with the duplicated segments. Together, these findings demonstrate that pericentromeric duplications are not restricted to primates and may be a common mechanism for genome evolution in mammals.


Subject(s)
Centromere/genetics , Gene Duplication , Animals , Chimera/genetics , Chromosomes/genetics , Chromosomes, Human/genetics , Conserved Sequence/genetics , DNA, Satellite/genetics , Evolution, Molecular , Genetic Markers/genetics , Humans , Mice , Physical Chromosome Mapping/methods , Rats
SELECTION OF CITATIONS
SEARCH DETAIL