1.
Mol Biol Evol ; 40(3), 2023 03 04.
Article in English | MEDLINE | ID: mdl-36869750

ABSTRACT

As the accuracy and throughput of nanopore sequencing improve, it is increasingly common to perform long-read first de novo genome assemblies followed by polishing with accurate short reads. We briefly introduce FMLRC2, the successor to the original FM-index Long Read Corrector (FMLRC), and illustrate its performance as a fast and accurate de novo assembly polisher for both bacterial and eukaryotic genomes.


Subject(s)
Eukaryota , Nanopores , DNA Sequence Analysis , Eukaryota/genetics , Bacteria/genetics , Bacterial Genome , High-Throughput Nucleotide Sequencing
2.
Proc Natl Acad Sci U S A ; 118(20), 2021 05 18.
Article in English | MEDLINE | ID: mdl-33990463

ABSTRACT

To investigate the origins and stages of vertebrate adaptive radiation, we reconstructed the spatial and temporal histories of adaptive alleles underlying major phenotypic axes of diversification from the genomes of 202 Caribbean pupfishes. On a single Bahamian island, ancient standing variation from disjunct geographic sources was reassembled into new combinations under strong directional selection for adaptation to the novel trophic niches of scale-eating and molluscivory. We found evidence for two longstanding hypotheses of adaptive radiation: hybrid swarm origins and temporal stages of adaptation. Using a combination of population genomics, transcriptomics, and genome-wide association mapping, we demonstrate that this microendemic adaptive radiation of novel trophic specialists on San Salvador Island, Bahamas experienced twice as much adaptive introgression as generalist populations on neighboring islands and that adaptive divergence occurred in stages. First, standing regulatory variation in genes associated with feeding behavior (prlh, cfap20, and rmi1) was swept to fixation by selection, then standing regulatory variation in genes associated with craniofacial and muscular development (itga5, ext1, cyp26b1, and galr2), and finally the only de novo nonsynonymous substitution in an osteogenic transcription factor and oncogene (twist1) swept to fixation most recently. Our results demonstrate how ancient alleles maintained in distinct environmental refugia can be assembled into new adaptive combinations and provide a framework for reconstructing the spatiotemporal landscape of adaptation and speciation.


Subject(s)
Physiological Adaptation/genetics , Genetic Speciation , Killifishes/genetics , Phylogeny , Spatio-Temporal Analysis , Vertebrates/genetics , Animals , Bahamas , Caribbean Region , Fish Proteins/genetics , Gene Expression Profiling/methods , Genome-Wide Association Study/methods , Genomics/methods , Genotype , Geography , Killifishes/anatomy & histology , Killifishes/classification , Single Nucleotide Polymorphism , Vertebrates/anatomy & histology , Vertebrates/classification
3.
RNA ; 25(8): 1004-1019, 2019 08.
Article in English | MEDLINE | ID: mdl-31097619

ABSTRACT

The marsupial inactive X chromosome expresses a long noncoding RNA (lncRNA) called Rsx that has been proposed to be the functional analog of eutherian Xist. Despite the possibility that Xist and Rsx encode related functions, the two lncRNAs harbor no linear sequence similarity. However, both lncRNAs harbor domains of tandemly repeated sequence. In Xist, these repeat domains are known to be critical for function. Using k-mer based comparison, we show that the repeat domains of Xist and Rsx unexpectedly partition into two major clusters that each harbor substantial levels of nonlinear sequence similarity. Xist Repeats B, C, and D were most similar to each other and to Rsx Repeat 1, whereas Xist Repeats A and E were most similar to each other and to Rsx Repeats 2, 3, and 4. Similarities at the level of k-mers corresponded to domain-specific enrichment of protein-binding motifs. Within individual domains, protein-binding motifs were often enriched to extreme levels. Our data support the hypothesis that Xist and Rsx encode similar functions through different spatial arrangements of functionally analogous protein-binding domains. We propose that the two clusters of repeat domains in Xist and Rsx function in part to cooperatively recruit PRC1 and PRC2 to chromatin. The physical manner in which these domains engage with protein cofactors may be just as critical to the function of the domains as the protein cofactors themselves. The general approaches we outline in this report should prove useful in the study of any set of RNAs.


Subject(s)
Marsupialia/genetics , Long Noncoding RNA/chemistry , Long Noncoding RNA/genetics , Animals , Cluster Analysis , Humans , Marsupialia/metabolism , Polycomb-Group Proteins/metabolism , Protein Domains , Nucleic Acid Sequence Homology , Tandem Repeat Sequences , X Chromosome Inactivation
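
Illustrative note on the k-mer based comparison mentioned in the abstract above. The abstract does not spell out the procedure, so the following is only a minimal sketch of one common way to compare sequences without alignment: count every k-mer in each sequence and correlate the two count vectors. The function names, the DNA alphabet, the default k, and the use of Pearson correlation are all assumptions made for illustration, not the authors' published method.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_profile(seq, k):
    """Counts of every possible DNA k-mer, in a fixed lexicographic order.

    Hypothetical helper: k-mers containing symbols outside ACGT are ignored.
    """
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product("ACGT", repeat=k)]


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0


def kmer_similarity(seq_a, seq_b, k=3):
    """Alignment-free similarity: correlation of the two k-mer count profiles."""
    return pearson(kmer_profile(seq_a, k), kmer_profile(seq_b, k))
```

Two sequences with no alignable stretch can still score highly here if they are built from the same short motifs, which is the sense in which such a comparison is "nonlinear".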
4.
BMC Bioinformatics ; 19(1): 50, 2018 02 09.
Article in English | MEDLINE | ID: mdl-29426289

ABSTRACT

BACKGROUND: Long read sequencing is changing the landscape of genomic research, especially de novo assembly. Despite the high error rate inherent to long read technologies, increased read lengths dramatically improve the continuity and accuracy of genome assemblies. However, the cost and throughput of these technologies limit their application to complex genomes. One solution is to decrease the cost and time to assemble novel genomes by leveraging "hybrid" assemblies that use long reads for scaffolding and short reads for accuracy. RESULTS: We describe a novel method leveraging a multi-string Burrows-Wheeler Transform with auxiliary FM-index to correct errors in long read sequences using a set of complementary short reads. We demonstrate that our method efficiently produces significantly more high quality corrected sequence than existing hybrid error-correction methods. We also show that our method produces more contiguous assemblies, in many cases, than existing state-of-the-art hybrid and long-read only de novo assembly methods. CONCLUSION: Our method accurately corrects long read sequence data using complementary short reads. We demonstrate higher total throughput of corrected long reads and a corresponding increase in contiguity of the resulting de novo assemblies. Improved throughput and computational efficiency relative to existing methods will help make more economical use of emerging long read sequencing technologies.


Subject(s)
Algorithms , Genetic Databases , Fungal Genome , Saccharomyces cerevisiae/genetics , DNA Sequence Analysis
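
Illustrative note on the Burrows-Wheeler Transform and FM-index named in the abstract above. The sketch below is a textbook, single-string version for tiny inputs: it builds the transform from sorted rotations and counts pattern occurrences by backward search. It is not the multi-string structure or the correction algorithm described in the paper, and all function names are hypothetical.

```python
def bwt(text):
    """Burrows-Wheeler transform via sorted rotations (small inputs only)."""
    text += "$"  # sentinel; sorts before A, C, G, T in ASCII
    rotations = sorted(text[i:] + text[:i] for i in range(len(text)))
    return "".join(rotation[-1] for rotation in rotations)


def build_fm_index(bwt_str):
    """Return (c_table, occ) for backward search.

    c_table[s]: number of symbols in the text strictly smaller than s.
    occ[s][i]:  number of occurrences of s in bwt_str[:i].
    """
    symbols = sorted(set(bwt_str))
    c_table, total = {}, 0
    for s in symbols:
        c_table[s] = total
        total += bwt_str.count(s)
    occ = {s: [0] * (len(bwt_str) + 1) for s in symbols}
    for i, ch in enumerate(bwt_str):
        for s in symbols:
            occ[s][i + 1] = occ[s][i] + (1 if s == ch else 0)
    return c_table, occ


def count_occurrences(pattern, c_table, occ, n):
    """Count occurrences of pattern in the indexed text; n is len(bwt_str)."""
    lo, hi = 0, n  # current suffix-array interval [lo, hi)
    for ch in reversed(pattern):
        if ch not in c_table:
            return 0
        lo = c_table[ch] + occ[ch][lo]
        hi = c_table[ch] + occ[ch][hi]
        if lo >= hi:
            return 0
    return hi - lo
```

A count query of this kind takes time proportional to the pattern length rather than the text length, which is the property that makes such an index attractive for looking up how often a short substring occurs in a large read set.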
5.
Proc Natl Acad Sci U S A ; 112(52): 15976-81, 2015 Dec 29.
Article in English | MEDLINE | ID: mdl-26598659

ABSTRACT

Horizontal gene transfer (HGT), or the transfer of genes between species, has been recognized recently as more pervasive than previously suspected. Here, we report evidence for an unprecedented degree of HGT into an animal genome, based on a draft genome of a tardigrade, Hypsibius dujardini. Tardigrades are microscopic eight-legged animals that are famous for their ability to survive extreme conditions. Genome sequencing, direct confirmation of physical linkage, and phylogenetic analysis revealed that a large fraction of the H. dujardini genome is derived from diverse bacteria as well as plants, fungi, and Archaea. We estimate that approximately one-sixth of tardigrade genes entered by HGT, nearly double the fraction found in the most extreme cases of HGT into animals known to date. Foreign genes have supplemented, expanded, and even replaced some metazoan gene families within the tardigrade genome. Our results demonstrate that an unexpectedly large fraction of an animal genome can be derived from foreign sources. We speculate that animals that can survive extremes may be particularly prone to acquiring foreign genes.


Subject(s)
Horizontal Gene Transfer , Genome/genetics , Genomic Library , DNA Sequence Analysis/methods , Tardigrada/genetics , Animals , Archaeal DNA/chemistry , Archaeal DNA/genetics , Bacterial DNA/chemistry , Bacterial DNA/genetics , Fungal DNA/chemistry , Fungal DNA/genetics , Plant DNA/chemistry , Plant DNA/genetics , Viral DNA/chemistry , Viral DNA/genetics , Phylogeny , Tardigrada/classification
6.
BMC Bioinformatics ; 18(1): 357, 2017 Aug 01.
Article in English | MEDLINE | ID: mdl-28764645

ABSTRACT

BACKGROUND: High-throughput sequence (HTS) data exhibit position-specific nucleotide biases that obscure the intended signal and reduce the effectiveness of these data for downstream analyses. These biases are particularly evident in HTS assays for identifying regulatory regions in DNA (DNase-seq, ChIP-seq, FAIRE-seq, ATAC-seq). Biases may result from many experiment-specific factors, including selectivity of DNA restriction enzymes and fragmentation method, as well as sequencing technology-specific factors, such as choice of adapters/primers and sample amplification methods. RESULTS: We present a novel method to detect and correct position-specific nucleotide biases in HTS short read data. Our method calculates read-specific weights based on aligned reads to correct the over- or underrepresentation of position-specific nucleotide subsequences, both within and adjacent to the aligned read, relative to a baseline calculated in assay-specific enriched regions. Using HTS data from a variety of ChIP-seq, DNase-seq, FAIRE-seq, and ATAC-seq experiments, we show that our weight-adjusted reads reduce the position-specific nucleotide imbalance across reads and improve the utility of these data for downstream analyses, including identification and characterization of open chromatin peaks and transcription-factor binding sites. CONCLUSIONS: A general-purpose method to characterize and correct position-specific nucleotide sequence biases fills the need to recognize and deal with, in a systematic manner, binding-site preference for the growing number of HTS-based epigenetic assays. As the breadth and impact of these biases are better understood, the availability of a standard toolkit to correct them will be important.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Nucleotides/genetics , Base Sequence , Bias , Binding Sites , Computational Biology , DNA/metabolism , Deoxyribonucleases/metabolism , Principal Component Analysis , Protein Binding , DNA Sequence Analysis
7.
PLoS Genet ; 9(10): e1003853, 2013.
Article in English | MEDLINE | ID: mdl-24098153

ABSTRACT

X chromosome inactivation (XCI) is the mammalian mechanism of dosage compensation that balances X-linked gene expression between the sexes. Early during female development, each cell of the embryo proper independently inactivates one of its two parental X-chromosomes. In mice, the choice of which X chromosome is inactivated is affected by the genotype of a cis-acting locus, the X-chromosome controlling element (Xce). Xce has been localized to a 1.9 Mb interval within the X-inactivation center (Xic), yet its molecular identity and mechanism of action remain unknown. We combined genotype and sequence data for mouse stocks with detailed phenotyping of ten inbred strains and with the development of a statistical model that incorporates phenotyping data from multiple sources to disentangle sources of XCI phenotypic variance in natural female populations on X inactivation. We have reduced the Xce candidate 10-fold to a 176 kb region located approximately 500 kb proximal to Xist. We propose that structural variation in this interval explains the presence of multiple functional Xce alleles in the genus Mus. We have identified a new allele, Xce(e), present in Mus musculus and a possible sixth functional allele in Mus spicilegus. We have also confirmed a parent-of-origin effect on X inactivation choice and provide evidence that maternal inheritance magnifies the skewing associated with strong Xce alleles. Based on the phylogenetic analysis of 155 laboratory strains and wild mice we conclude that Xce(a) is either a derived allele that arose concurrently with the domestication of fancy mice but prior to the derivation of most classical inbred strains or a rare allele in the wild. Furthermore, we have found that despite the presence of multiple haplotypes in the wild, Mus musculus domesticus has only one functional Xce allele, Xce(b). Lastly, we conclude that each mouse taxon examined has a different functional Xce allele.


Subject(s)
Dosage Compensation (Genetic) , X-Linked Genes , Long Noncoding RNA/genetics , X Chromosome Inactivation/genetics , Alleles , Animals , Chromosome Mapping , Female , Genetic Loci , Haplotypes , Mice , Phylogeny
8.
PLoS One ; 19(3): e0301016, 2024.
Article in English | MEDLINE | ID: mdl-38547181

ABSTRACT

Saliva is a readily accessible and inexpensive biological specimen that enables investigation of the oral microbiome, which can serve as a biomarker of oral and systemic health. There are two routine approaches to collect saliva, stimulated and unstimulated; however, there is no consensus on how sampling method influences oral microbiome metrics. In this study, we analyzed paired saliva samples (unstimulated and stimulated) from 88 individuals, aged 7-18 years. Using 16S rRNA gene sequencing, we investigated the differences in bacterial microbiome composition between sample types and determined how sampling method affects the distribution of taxa associated with untreated dental caries and gingivitis. Our analyses indicated significant differences in microbiome composition between the sample types. Both sampling methods were able to detect significant differences in microbiome composition between healthy subjects and subjects with untreated caries. However, only stimulated saliva revealed a significant association between microbiome diversity and composition in individuals with diagnosed gingivitis. Furthermore, taxa previously associated with dental caries and gingivitis were preferentially enriched in individuals with each respective disease only in stimulated saliva. Our study suggests that stimulated saliva provides a more nuanced readout of microbiome composition and taxa distribution associated with untreated dental caries and gingivitis compared to unstimulated saliva.


Subject(s)
Dental Caries , Gingivitis , Microbiota , Humans , Saliva/microbiology , 16S Ribosomal RNA/genetics , Microbiota/genetics
9.
Cell Rep ; 43(4): 114076, 2024 Apr 23.
Article in English | MEDLINE | ID: mdl-38607917

ABSTRACT

The severe acute respiratory syndrome coronavirus 2 pandemic is characterized by the emergence of novel variants of concern (VOCs) that replace ancestral strains. Here, we dissect the complex selective pressures by evaluating variant fitness and adaptation in human respiratory tissues. We evaluate viral properties and host responses to reconstruct forces behind D614G through Omicron (BA.1) emergence. We observe differential replication in airway epithelia, differences in cellular tropism, and virus-induced cytotoxicity. D614G accumulates the most mutations after infection, supporting zoonosis and adaptation to the human airway. We perform head-to-head competitions and observe the highest fitness for Gamma and Delta. Under these conditions, RNA recombination favors variants encoding the B.1.617.1 lineage 3' end. Based on viral growth kinetics, Alpha, Gamma, and Delta exhibit increased fitness compared to D614G. In contrast, the global success of Omicron likely derives from increased transmission and antigenic variation. Our data provide molecular evidence to support epidemiological observations of VOC emergence.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/physiology , SARS-CoV-2/genetics , COVID-19/virology , COVID-19/transmission , Virus Replication , Mutation/genetics , Respiratory Mucosa/virology , Genetic Fitness , Animals , Epithelial Cells/virology , Chlorocebus aethiops , Physiological Adaptation/genetics , Vero Cells
10.
Sci Rep ; 14(1): 9785, 2024 04 29.
Article in English | MEDLINE | ID: mdl-38684791

ABSTRACT

Several studies have documented the significant impact of methodological choices in microbiome analyses. The myriad of methodological options available complicate the replication of results and generally limit the comparability of findings between independent studies that use differing techniques and measurement pipelines. Here we describe the Mosaic Standards Challenge (MSC), an international interlaboratory study designed to assess the impact of methodological variables on the results. The MSC did not prescribe methods but rather asked participating labs to analyze 7 shared reference samples (5 × human stool samples and 2 × mock communities) using their standard laboratory methods. To capture the array of methodological variables, each participating lab completed a metadata reporting sheet that included 100 different questions regarding the details of their protocol. The goal of this study was to survey the methodological landscape for microbiome metagenomic sequencing (MGS) analyses and the impact of methodological decisions on metagenomic sequencing results. A total of 44 labs participated in the MSC by submitting results (16S or WGS) along with accompanying metadata; thirty 16S rRNA gene amplicon datasets and 14 WGS datasets were collected. The inclusion of two types of reference materials (human stool and mock communities) enabled analysis of both MGS measurement variability between different protocols using the biologically-relevant stool samples, and MGS bias with respect to ground truth values using the DNA mixtures. Owing to the compositional nature of MGS measurements, analyses were conducted on the ratio of Firmicutes: Bacteroidetes allowing us to directly apply common statistical methods. 
The resulting analysis demonstrated that protocol choices have significant effects, including both bias of the MGS measurement associated with particular methodological choices, as well as effects on measurement robustness as observed through the spread of results between labs making similar methodological choices. In the analysis of the DNA mock communities, MGS measurement bias was observed even when there was general consensus among the participating laboratories. This study was the result of a collaborative effort that included academic, commercial, and government labs. In addition to highlighting the impact of different methodological decisions on MGS result comparability, this work also provides insights for consideration in future microbiome measurement study design.


Subject(s)
Feces , Metagenomics , Microbiota , 16S Ribosomal RNA , Humans , Metagenomics/methods , Metagenomics/standards , 16S Ribosomal RNA/genetics , Feces/microbiology , Microbiota/genetics , Bias , Metagenome , Gastrointestinal Microbiome/genetics , DNA Sequence Analysis/methods , Bacteria/genetics , Bacteria/classification , Bacteria/isolation & purification , High-Throughput Nucleotide Sequencing/methods
11.
BMC Bioinformatics ; 13 Suppl 3: S13, 2012 Mar 21.
Article in English | MEDLINE | ID: mdl-22536897

ABSTRACT

BACKGROUND: Genome browsers are a common tool used by biologists to visualize genomic features including genes, polymorphisms, and many others. However, existing genome browsers and visualization tools are not well-suited to perform meaningful comparative analysis among a large number of genomes. With the increasing quantity and availability of genomic data, there is an increased burden to provide useful visualization and analysis tools for comparison of multiple collinear genomes such as the large panels of model organisms which are the basis for much of the current genetic research. RESULTS: We have developed a novel web-based tool for visualizing and analyzing multiple collinear genomes. Our tool illustrates genome-sequence similarity through a mosaic of intervals representing local phylogeny, subspecific origin, and haplotype identity. Comparative analysis is facilitated through reordering and clustering of tracks, which can vary throughout the genome. In addition, we provide local phylogenetic trees as an alternate visualization to assess local variations. CONCLUSIONS: Unlike previous genome browsers and viewers, ours allows for simultaneous and comparative analysis. Our browser provides intuitive selection and interactive navigation about features of interest. Dynamic visualizations adjust to scale and data content making analysis at variable resolutions and of multiple data sets more informative. We demonstrate our genome browser for an extensive set of genomic data sets composed of almost 200 distinct mouse laboratory strains.


Subject(s)
Genome , Internet , Mice/genetics , Software , Animals , Cluster Analysis , Mice/classification , Inbred Mice , Phylogeny , Single Nucleotide Polymorphism
12.
Elife ; 10, 2021 07 19.
Article in English | MEDLINE | ID: mdl-34279216

ABSTRACT

Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.


Subject(s)
Drosophila melanogaster/genetics , Genome Size , Genomics/methods , Animals , Cell Line , Chromosomes , Computational Biology/methods , Female , Genome , High-Throughput Nucleotide Sequencing/methods , Nanopores
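
Illustrative note on the contig N50 statistic quoted in the abstract above. N50 is conventionally the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. The sketch below follows that standard definition; the function name and the example lengths are hypothetical and not taken from the paper.

```python
def n50(contig_lengths):
    """Contig N50: length at which the running sum of the longest contigs
    first reaches half of the total assembly length (0 for an empty list)."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:  # integer-safe test for "at least half"
            return length
    return 0
```

For contigs of lengths 2, 3, 4, 5 and 6 (total 20), the two longest sum to 11, which passes the halfway mark of 10, so the N50 is 5.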
13.
Inflamm Bowel Dis ; 26(12): 1843-1855, 2020 11 19.
Article in English | MEDLINE | ID: mdl-32469069

ABSTRACT

BACKGROUND: The intestinal microbiota play a key role in the onset, progression, and recurrence of Crohn disease (CD). Most microbiome studies assay fecal material, which does not provide region-specific information on mucosally adherent bacteria that directly interact with host systems. Changes in luminal oxygen have been proposed as a contributor to CD dysbiosis. METHODS: We generated 16S rRNA data using colonic and ileal mucosal bacteria from patients with CD and without inflammatory bowel disease. We developed profiles reflecting bacterial abundance within defined aerotolerance categories. Bacterial diversity, composition, and aerotolerance profiles were compared across intestinal regions and disease phenotypes. RESULTS: Bacterial diversity decreased in CD in both the ileum and the colon. Aerotolerance profiles significantly differed between intestinal segments in patients without inflammatory bowel disease, although both were dominated by obligate anaerobes, as expected. In CD, high relative levels of obligate anaerobes were maintained in the colon and increased in the ileum. Relative abundances of similar and distinct taxa were altered in colon and ileum. Notably, several obligate anaerobes, such as Bacteroides fragilis, dramatically increased in CD in one or both intestinal segments, although specific increasing taxa varied across patients. Increased abundance of taxa from the Proteobacteria phylum was found only in the ileum. Bacterial diversity was significantly reduced in resected tissues of patients who developed postoperative disease recurrence across 2 independent cohorts, with common lower abundance of bacteria from the Bacteroides, Streptococcus, and Blautia genera. CONCLUSIONS: Mucosally adherent bacteria in the colon and ileum show distinct alterations in CD that provide additional insights not revealed in fecal material.


Subject(s)
Colon/microbiology , Crohn Disease/microbiology , Gastrointestinal Microbiome/genetics , Ileum/microbiology , Intestinal Mucosa/microbiology , Aerobiosis , Case-Control Studies , Female , Humans , Male , Middle Aged , Phenotype , 16S Ribosomal RNA/metabolism
15.
Nat Genet ; 47(4): 353-60, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25730764

ABSTRACT

Complex human traits are influenced by variation in regulatory DNA through mechanisms that are not fully understood. Because regulatory elements are conserved between humans and mice, a thorough annotation of cis regulatory variants in mice could aid in further characterizing these mechanisms. Here we provide a detailed portrait of mouse gene expression across multiple tissues in a three-way diallel. Greater than 80% of mouse genes have cis regulatory variation. Effects from these variants influence complex traits and usually extend to the human ortholog. Further, we estimate that at least one in every thousand SNPs creates a cis regulatory effect. We also observe two types of parent-of-origin effects, including classical imprinting and a new global allelic imbalance in expression favoring the paternal allele. We conclude that, as with humans, pervasive regulatory variation influences complex genetic traits in mice and provide a new resource toward understanding the genetic control of transcription in mammals.


Subject(s)
Alleles , Allelic Imbalance/genetics , Genetic Crosses , Gene Expression , Genetic Speciation , Mice/genetics , Animals , Dosage Compensation (Genetic) , Female , Humans , Male , Knockout Mice , Phylogeny , Single Nucleotide Polymorphism
16.
Genetics ; 190(2): 449-58, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22345612

ABSTRACT

We present full-genome genotype imputations for 100 classical laboratory mouse strains, using a novel method. Using genotypes at 549,683 SNP loci obtained with the Mouse Diversity Array, we partitioned the genome of 100 mouse strains into 40,647 intervals that exhibit no evidence of historical recombination. For each of these intervals we inferred a local phylogenetic tree. We combined these data with 12 million loci with sequence variations recently discovered by whole-genome sequencing in a common subset of 12 classical laboratory strains. For each phylogenetic tree we identified strains sharing a leaf node with one or more of the sequenced strains. We then imputed high- and medium-confidence genotypes for each of 88 nonsequenced genomes. Among inbred strains, we imputed 92% of SNPs genome-wide, with 71% in high-confidence regions. Our method produced 977 million new genotypes with an estimated per-SNP error rate of 0.083% in high-confidence regions and 0.37% genome-wide. Our analysis identified which of the 88 nonsequenced strains would be the most informative for improving full-genome imputation, as well as which additional strain sequences will reveal more new genetic variants. Imputed sequences and quality scores can be downloaded and visualized online.


Subject(s)
Inbred Mice/genetics , Phylogeny , Single Nucleotide Polymorphism , Amino Acid Sequence , Animals , Mammalian Chromosomes , Female , Genome , Genotype , Male , Mice , Molecular Sequence Data , Reproducibility of Results , Sequence Alignment
17.
Nat Genet ; 43(7): 648-55, 2011 May 29.
Article in English | MEDLINE | ID: mdl-21623374

ABSTRACT

Here we provide a genome-wide, high-resolution map of the phylogenetic origin of the genome of most extant laboratory mouse inbred strains. Our analysis is based on the genotypes of wild-caught mice from three subspecies of Mus musculus. We show that classical laboratory strains are derived from a few fancy mice with limited haplotype diversity. Their genomes are overwhelmingly Mus musculus domesticus in origin, and the remainder is mostly of Japanese origin. We generated genome-wide haplotype maps based on identity by descent from fancy mice and show that classical inbred strains have limited and non-randomly distributed genetic diversity. In contrast, wild-derived laboratory strains represent a broad sampling of diversity within M. musculus. Intersubspecific introgression is pervasive in these strains, and contamination by laboratory stocks has played a role in this process. The subspecific origin, haplotype diversity and identity by descent maps can be visualized using the Mouse Phylogeny Viewer (see URLs).


Subject(s)
Mammalian Chromosomes/genetics , Genetic Variation , Haplotypes/genetics , Inbred Mice/classification , Inbred Mice/genetics , Animals , Chromosome Mapping , Genetic Speciation , Genotype , Mice , Molecular Sequence Data , Phylogeny , Single Nucleotide Polymorphism/genetics , Species Specificity