Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
1.
Nature ; 563(7732): 501-507, 2018 11.
Artículo en Inglés | MEDLINE | ID: mdl-30429615

RESUMEN

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.


Asunto(s)
Aedes/genética , Infecciones por Arbovirus/virología , Arbovirus , Genoma de los Insectos/genética , Genómica/normas , Control de Insectos , Mosquitos Vectores/genética , Mosquitos Vectores/virología , Aedes/virología , Animales , Infecciones por Arbovirus/transmisión , Arbovirus/aislamiento & purificación , Variaciones en el Número de Copia de ADN/genética , Virus del Dengue/aislamiento & purificación , Femenino , Variación Genética/genética , Genética de Población , Glutatión Transferasa/genética , Resistencia a los Insecticidas/efectos de los fármacos , Masculino , Anotación de Secuencia Molecular , Familia de Multigenes/genética , Piretrinas/farmacología , Estándares de Referencia , Procesos de Determinación del Sexo/genética
2.
Genome Res ; 22(8): 1499-511, 2012 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-22534282

RESUMEN

The three species of the Drosophila simulans clade--the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia--are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.


Asunto(s)
Drosophila/clasificación , Especiación Genética , Genoma de los Insectos , Animales , Secuencia de Bases , Cromosomas de Insectos/genética , Drosophila/genética , Evolución Molecular , Flujo Génico , Haplotipos , Filogenia , Densidad de Población , Aislamiento Reproductivo , Selección Genética , Alineación de Secuencia , Análisis de Secuencia de ADN
3.
Mol Biol Evol ; 30(9): 2177-86, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23827876

RESUMEN

Adaptive mutations that accumulate during species divergence are likely to contribute to reproductive incompatibilities and hinder gene flow; however, there may also be a class of mutations that are generally advantageous and can spread across species boundaries. In this study, we characterize a 15 kb region on chromosome 3R that has introgressed from the cosmopolitan generalist species Drosophila simulans into the island endemic D. sechellia, which is an ecological specialist. The introgressed haplotype is fixed in D. sechellia over almost the entirety of the resequenced region, whereas a core region of the introgressed haplotype occurs at high frequency in D. simulans. The observed patterns of nucleotide variation and linkage disequilibrium are consistent with a recently completed selective sweep in D. sechellia and an incomplete sweep in D. simulans. Independent estimates of both the time to the introgression and sweep events are all close to 10,000 years before the present. Interestingly, the most likely target of selection is a highly occupied transcription factor binding region. This work confirms that it is possible for mutations to be globally advantageous, despite their occurrence in divergent genomic and ecological backgrounds.


Asunto(s)
Drosophila/genética , Evolución Molecular , Flujo Génico , Especiación Genética , Animales , Mapeo Cromosómico , Cromosomas de Insectos , Drosophila/clasificación , Femenino , Variación Genética , Haplotipos , Desequilibrio de Ligamiento , Masculino , Mutación , Filogenia , Selección Genética , Especificidad de la Especie
4.
PLoS Biol ; 9(8): e1001126, 2011 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-21857805

RESUMEN

The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation--the equalization of X chromosome gene expression in males and females--and meiotic sex chromosome inactivation (MSCI)--the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila.


Asunto(s)
Compensación de Dosificación (Genética)/genética , Drosophila/genética , Meiosis/genética , Cromosomas Sexuales/genética , Animales , Femenino , Células Germinativas/metabolismo , Masculino , Espermatogénesis/genética , Testículo/metabolismo , Inactivación del Cromosoma X/genética
5.
bioRxiv ; 2024 Oct 18.
Artículo en Inglés | MEDLINE | ID: mdl-39345378

RESUMEN

The Genome in a Bottle Consortium (GIAB), hosted by the National Institute of Standards and Technology (NIST), is developing new matched tumor-normal samples, the first to be explicitly consented for public dissemination of genomic data and cell lines. Here, we describe a comprehensive genomic dataset from the first individual, HG008, including DNA from an adherent, epithelial-like pancreatic ductal adenocarcinoma (PDAC) tumor cell line and matched normal cells from duodenal and pancreatic tissues. Data for the tumor-normal matched samples comes from thirteen distinct state-of-the-art whole genome measurement technologies, including high depth short and long-read bulk whole genome sequencing (WGS), single cell WGS, and Hi-C, and karyotyping. These data will be used by the GIAB Consortium to develop matched tumor-normal benchmarks for somatic variant detection. We expect these data to facilitate innovation for whole genome measurement technologies, de novo assembly of tumor and normal genomes, and bioinformatic tools to identify small and structural somatic mutations. This first-of-its-kind broadly consented open-access resource will facilitate further understanding of sequencing methods used for cancer biology.

6.
Nat Genet ; 36(10): 1122-5, 2004 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-15378061

RESUMEN

Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (Phi(ST)) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.


Asunto(s)
Cromosomas Humanos Y/genética , ADN Mitocondrial/genética , Elementos Alu , Emigración e Inmigración , Composición Familiar , Femenino , Variación Genética , Genética de Población , Humanos , Masculino , Modelos Genéticos , Datos de Secuencia Molecular , Dinámica Poblacional , Caracteres Sexuales
7.
Nat Genet ; 55(2): 301-311, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36658436

RESUMEN

Ixodes spp. and related ticks transmit prevalent infections, although knowledge of their biology and development of anti-tick measures have been hindered by the lack of a high-quality genome. In the present study, we present the assembly of a 2.23-Gb Ixodes scapularis genome by sequencing two haplotypes within one individual, complemented by chromosome-level scaffolding and full-length RNA isoform sequencing, yielding a fully reannotated genome featuring thousands of new protein-coding genes and various RNA species. Analyses of the repetitive DNA identified transposable elements, whereas the examination of tick-associated bacterial sequences yielded an improved Rickettsia buchneri genome. We demonstrate how the Ixodes genome advances tick science by contributing to new annotations, gene models and epigenetic functions, expansion of gene families, development of in-depth proteome catalogs and deciphering of genetic variations in wild ticks. Overall, we report critical genetic resources and biological insights impacting our understanding of tick biology and future interventions against tick-transmitted infections.


Asunto(s)
Ixodes , Animales , Ixodes/genética , Ixodes/microbiología , Genoma/genética , Bacterias/genética , Secuencia de Bases , ARN
8.
Plant Genome ; 14(1): e20072, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33605092

RESUMEN

Hop (Humulus lupulus L. var Lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing and contain compounds of therapeutic interest including xanthohumol. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large (2.8 Gb), repetitive, and heterozygous genome of hop. We present a draft haplotype-phased assembly of the Cascade cultivar genome. Our draft assembly and annotation of the Cascade genome is the most extensive representation of the hop genome to date. PacBio long-read sequences from hop were assembled with FALCON and partially phased with FALCON-Unzip. Comparative analysis of haplotype sequences provides insight into selective pressures that have driven evolution in hop. We discovered genes with greater sequence divergence enriched for stress-response, growth, and flowering functions in the draft phased assembly. With improved resolution of long terminal retrotransposons (LTRs) due to long-read sequencing, we found that hop is over 70% repetitive. We identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze the draft phased assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.


Asunto(s)
Humulus , Diploidia , Genoma de Planta , Genómica , Haplotipos , Humulus/genética
9.
Front Plant Sci ; 12: 720670, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34567033

RESUMEN

A defining component of agroforestry parklands across Sahelo-Sudanian Africa (SSA), the shea tree (Vitellaria paradoxa) is central to sustaining local livelihoods and the farming environments of rural communities. Despite its economic and cultural value, however, not to mention the ecological roles it plays as a dominant parkland species, shea remains semi-domesticated with virtually no history of systematic genetic improvement. In truth, shea's extended juvenile period makes traditional breeding approaches untenable; but the opportunity for genome-assisted breeding is immense, provided the foundational resources are available. Here we report the development and public release of such resources. Using the FALCON-Phase workflow, 162.6 Gb of long-read PacBio sequence data were assembled into a 658.7 Mbp, chromosome-scale reference genome annotated with 38,505 coding genes. Whole genome duplication (WGD) analysis based on this gene space revealed clear signatures of two ancient WGD events in shea's evolutionary past, one prior to the Astrid-Rosid divergence (116-126 Mya) and the other at the root of the order Ericales (65-90 Mya). In a first genome-wide look at the suite of fatty acid (FA) biosynthesis genes that likely govern stearin content, the primary determinant of shea butter quality, relatively high copy numbers of six key enzymes were found (KASI, KASIII, FATB, FAD2, FAD3, and FAX2), some likely originating in shea's more recent WGD event. To help translate these findings into practical tools for characterization, selection, and genome-wide association studies (GWAS), resequencing data from a shea diversity panel was used to develop a database of more than 3.5 million functionally annotated, physically anchored SNPs. Two smaller, more curated sets of suggested SNPs, one for GWAS (104,211 SNPs) and the other targeting FA biosynthesis genes (90 SNPs), are also presented. With these resources, the hope is to support national programs across the shea belt in the strategic, genome-enabled conservation and long-term improvement of the shea tree for SSA.

10.
Nat Commun ; 12(1): 1935, 2021 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-33911078

RESUMEN

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially-phased diploid assembles to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without having parental data and performance is better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97% compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.


Asunto(s)
Mapeo Contig/métodos , Genoma Humano/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Algoritmos , Animales , Bovinos , Haplotipos/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , Pez Cebra/genética
11.
PLoS Biol ; 5(11): e293, 2007 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-17988173

RESUMEN

The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de nova. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.


Asunto(s)
Drosophila/genética , Meiosis , Razón de Masculinidad , Cromosoma X/genética , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Mapeo Cromosómico , Proteínas de Drosophila/genética , Evolución Molecular , Femenino , Genes Supresores , Masculino , Modelos Genéticos , Datos de Secuencia Molecular , Filogenia , Interferencia de ARN , Cromosoma Y/genética
12.
Nat Commun ; 11(1): 2071, 2020 04 29.
Artículo en Inglés | MEDLINE | ID: mdl-32350247

RESUMEN

Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and that variant detection is sensitive to the specific reference genome chosen. Six genes with immune related functions have additional copies in the indicine compared with taurine lineage and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations.


Asunto(s)
Bovinos/genética , Variación Genética , Genoma , Haplotipos/genética , Alelos , Desequilibrio Alélico , Animales , Secuencia de Bases , Cromosomas de los Mamíferos/genética , Femenino , Sitios Genéticos , Mutación INDEL/genética , Masculino , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , Secuencias Repetitivas de Ácidos Nucleicos/genética
13.
Genetics ; 178(1): 427-37, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18202385

RESUMEN

A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of approximately 2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.


Asunto(s)
Cromosomas Humanos X/genética , Genealogía y Heráldica , Modelos Genéticos , ADN Intergénico/genética , Demografía , Variación Genética , Humanos , Funciones de Verosimilitud , Filogenia , Análisis de Secuencia de ADN
14.
Genes (Basel) ; 10(1)2019 01 18.
Artículo en Inglés | MEDLINE | ID: mdl-30669388

RESUMEN

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.


Asunto(s)
Anopheles/genética , Genoma de los Insectos , Análisis de Secuencia de ADN/métodos , Animales , Mapeo Contig/métodos , Mapeo Contig/normas , Ploidias , Polimorfismo Genético , Análisis de Secuencia de ADN/normas
15.
Gigascience ; 8(10)2019 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-31609423

RESUMEN

BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.


Asunto(s)
Dípteros/genética , Genoma de los Insectos , Genómica/métodos , Animales , Femenino , Biblioteca de Genes , Especies Introducidas , Análisis de Secuencia de ADN
16.
Nat Commun ; 10(1): 260, 2019 01 16.
Artículo en Inglés | MEDLINE | ID: mdl-30651564

RESUMEN

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).


Asunto(s)
Búfalos/genética , Cromosomas de los Mamíferos/genética , Mapeo Contig/métodos , Genoma/genética , Cabras/genética , Animales , Cromatina/química , Cromatina/genética , Femenino , Genómica/métodos , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Complejo Mayor de Histocompatibilidad/genética , Anotación de Secuencia Molecular/métodos , Familia de Multigenes/genética , Secuencias Repetitivas de Ácidos Nucleicos/genética , Secuenciación Completa del Genoma
17.
Genetics ; 177(4): 2195-207, 2007 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-18073427

RESUMEN

We estimate parameters of a general isolation-with-migration model using resequence data from mitochondrial DNA (mtDNA), the Y chromosome, and two loci on the X chromosome in samples of 25-50 individuals from each of 10 human populations. Application of a coalescent-based Markov chain Monte Carlo technique allows simultaneous inference of divergence times, rates of gene flow, as well as changes in effective population size. Results from comparisons between sub-Saharan African and Eurasian populations estimate that 1500 individuals founded the ancestral Eurasian population approximately 40 thousand years ago (KYA). Furthermore, these small Eurasian founding populations appear to have grown much more dramatically than either African or Oceanian populations. Analyses of sub-Saharan African populations provide little evidence for a history of population bottlenecks and suggest that they began diverging from one another upward of 50 KYA. We surmise that ancestral African populations had already been geographically structured prior to the founding of ancestral Eurasian populations. African populations are shown to experience low levels of mitochondrial DNA gene flow, but high levels of Y chromosome gene flow. In particular, Y chromosome gene flow appears to be asymmetric, i.e., from the Bantu-speaking population into other African populations. Conversely, mitochondrial gene flow is more extensive between non-African populations, but appears to be absent between European and Asian populations.


Asunto(s)
Cromosomas Humanos X , Cromosomas Humanos Y , ADN Mitocondrial , Flujo Génico , Genética de Población , Dinámica Poblacional , Secuencia de Bases , Humanos , Cadenas de Markov , Densidad de Población , Grupos Raciales/genética
18.
Curr Biol ; 28(8): 1289-1295.e4, 2018 04 23.
Artículo en Inglés | MEDLINE | ID: mdl-29606420

RESUMEN

Crossing over between homologous chromosomes during meiosis repairs programmed DNA double-strand breaks, ensures proper segregation at meiosis I [1], shapes the genomic distribution of nucleotide variability in populations, and enhances the efficacy of natural selection among genetically linked sites [2]. Between closely related Drosophila species, large differences exist in the rate and chromosomal distribution of crossing over. Little, however, is known about the molecular genetic changes or population genetic forces that mediate evolved differences in recombination between species [3, 4]. Here, we show that a meiosis gene with a history of rapid evolution acts as a trans-acting modifier of species differences in crossing over. In transgenic flies, the dicistronic gene, mei-217/mei-218, recapitulates a large part of the species differences in the rate and chromosomal distribution of crossing over. These phenotypic differences appear to result from changes in protein sequence not gene expression. Our population genetics analyses show that the protein-coding sequence of mei-218, but not mei-217, has a history of recurrent positive natural selection. By modulating the intensity of centromeric and telomeric suppression of crossing over, evolution at mei-217/-218 has incidentally shaped gross differences in the chromosomal distribution of nucleotide variability between species. We speculate that recurrent bouts of adaptive evolution at mei-217/-218 might reflect a history of coevolution with selfish genetic elements.


Asunto(s)
Proteínas de Ciclo Celular/genética , Intercambio Genético/genética , Proteínas de Drosophila/genética , Meiosis/genética , Secuencia de Aminoácidos , Animales , Animales Modificados Genéticamente/genética , Proteínas de Ciclo Celular/metabolismo , Proteínas de Ciclo Celular/fisiología , Centrómero/genética , Centrómero/fisiología , Roturas del ADN de Doble Cadena , Drosophila/genética , Proteínas de Drosophila/metabolismo , Proteínas de Drosophila/fisiología , Drosophila melanogaster/genética , Evolución Molecular , Expresión Génica/genética , Recombinación Genética/genética , Selección Genética , Especificidad de la Especie
19.
Nat Biotechnol ; 2018 Oct 22.
Artículo en Inglés | MEDLINE | ID: mdl-30346939

RESUMEN

Complex allelic variation hampers the assembly of haplotype-resolved sequences from diploid genomes. We developed trio binning, an approach that simplifies haplotype assembly by resolving allelic variation before assembly. In contrast with prior approaches, the effectiveness of our method improved with increasing heterozygosity. Trio binning uses short reads from two parental genomes to first partition long reads from an offspring into haplotype-specific sets. Each haplotype is then assembled independently, resulting in a complete diploid reconstruction. We used trio binning to recover both haplotypes of a diploid human genome and identified complex structural variants missed by alternative approaches. We sequenced an F1 cross between the cattle subspecies Bos taurus taurus and Bos taurus indicus and completely assembled both parental haplotypes with NG50 haplotig sizes of >20 Mb and 99.998% accuracy, surpassing the quality of current cattle reference genomes. We suggest that trio binning improves diploid genome assembly and will facilitate new studies of haplotype variation and inheritance.

20.
Elife ; 72018 12 13.
Artículo en Inglés | MEDLINE | ID: mdl-30543325

RESUMEN

During speciation, sex chromosomes often accumulate interspecific genetic incompatibilities faster than the rest of the genome. The drive theory posits that sex chromosomes are susceptible to recurrent bouts of meiotic drive and suppression, causing the evolutionary build-up of divergent cryptic sex-linked drive systems and, incidentally, genetic incompatibilities. To assess the role of drive during speciation, we combine high-resolution genetic mapping of X-linked hybrid male sterility with population genomics analyses of divergence and recent gene flow between the fruitfly species, Drosophila mauritiana and D. simulans. Our findings reveal a high density of genetic incompatibilities and a corresponding dearth of gene flow on the X chromosome. Surprisingly, we find that a known drive element recently migrated between species and, rather than contributing to interspecific divergence, caused a strong reduction in local sequence divergence, undermining the evolution of hybrid sterility. Gene flow can therefore mediate the effects of selfish genetic elements during speciation.


Asunto(s)
Evolución Biológica , Especiación Genética , Cromosoma X/genética , Cromosoma Y/genética , Animales , Drosophila/genética , Drosophila simulans/genética , Flujo Génico , Infertilidad Masculina/genética , Masculino , Meiosis/genética , Especificidad de la Especie
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA