Results 1 - 20 of 32
1.
Nature ; 563(7732): 501-507, 2018 11.
Article in English | MEDLINE | ID: mdl-30429615

ABSTRACT

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.


Subjects
Aedes/genetics , Arbovirus Infections/virology , Arboviruses , Genome, Insect/genetics , Genomics/standards , Insect Control , Mosquito Vectors/genetics , Mosquito Vectors/virology , Aedes/virology , Animals , Arbovirus Infections/transmission , Arboviruses/isolation & purification , DNA Copy Number Variations/genetics , Dengue Virus/isolation & purification , Female , Genetic Variation/genetics , Genetics, Population , Glutathione Transferase/genetics , Insecticide Resistance/drug effects , Male , Molecular Sequence Annotation , Multigene Family/genetics , Pyrethrins/pharmacology , Reference Standards , Sex Determination Processes/genetics
2.
BMC Plant Biol ; 19(1): 319, 2019 Jul 16.
Article in English | MEDLINE | ID: mdl-31311507

ABSTRACT

BACKGROUND: Non-host resistance (NHR) presents a compelling long-term plant protection strategy for global food security, yet the genetic basis of NHR remains poorly understood. For many diseases, including stem rust of wheat [causal organism Puccinia graminis (Pg)], NHR is largely unexplored due to the inherent challenge of developing a genetically tractable system within which the resistance segregates. The present study turns to the pathogen's alternate host, barberry (Berberis spp.), to overcome this challenge. RESULTS: In this study, an interspecific mapping population derived from a cross between Pg-resistant Berberis thunbergii (Bt) and Pg-susceptible B. vulgaris was developed to investigate the Pg-NHR exhibited by Bt. To facilitate QTL analysis and subsequent trait dissection, the first genetic linkage maps for the two parental species were constructed and a chromosome-scale reference genome for Bt was assembled (PacBio + Hi-C). QTL analysis resulted in the identification of a single 13 cM region (~ 5.1 Mbp spanning 13 physical contigs) on the short arm of Bt chromosome 3. Differential gene expression analysis, combined with sequence variation analysis between the two parental species, led to the prioritization of several candidate genes within the QTL region, some of which belong to gene families previously implicated in disease resistance. CONCLUSIONS: Foundational genetic and genomic resources developed for Berberis spp. enabled the identification and annotation of a QTL associated with Pg-NHR. Although subsequent validation and fine mapping studies are needed, this study demonstrates the feasibility of and lays the groundwork for dissecting Pg-NHR in the alternate host of one of agriculture's most devastating pathogens.


Subjects
Basidiomycota/physiology , Berberis/genetics , Berberis/microbiology , Plant Diseases/genetics , Chromosome Mapping , Chromosomes, Plant , Disease Resistance/genetics , Gene Expression Profiling , Genome, Plant , Hybridization, Genetic , Inheritance Patterns , Phenotype , Plant Diseases/microbiology , Plant Stems/microbiology , Quantitative Trait Loci
3.
Genome Res ; 22(8): 1499-511, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22534282

ABSTRACT

The three species of the Drosophila simulans clade--the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia--are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.


Subjects
Drosophila/classification , Genetic Speciation , Genome, Insect , Animals , Base Sequence , Chromosomes, Insect/genetics , Drosophila/genetics , Evolution, Molecular , Gene Flow , Haplotypes , Phylogeny , Population Density , Reproductive Isolation , Selection, Genetic , Sequence Alignment , Sequence Analysis, DNA
4.
Mol Biol Evol ; 30(9): 2177-86, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23827876

ABSTRACT

Adaptive mutations that accumulate during species divergence are likely to contribute to reproductive incompatibilities and hinder gene flow; however, there may also be a class of mutations that are generally advantageous and can spread across species boundaries. In this study, we characterize a 15 kb region on chromosome 3R that has introgressed from the cosmopolitan generalist species Drosophila simulans into the island endemic D. sechellia, which is an ecological specialist. The introgressed haplotype is fixed in D. sechellia over almost the entirety of the resequenced region, whereas a core region of the introgressed haplotype occurs at high frequency in D. simulans. The observed patterns of nucleotide variation and linkage disequilibrium are consistent with a recently completed selective sweep in D. sechellia and an incomplete sweep in D. simulans. Independent estimates of both the time to the introgression and sweep events are all close to 10,000 years before the present. Interestingly, the most likely target of selection is a highly occupied transcription factor binding region. This work confirms that it is possible for mutations to be globally advantageous, despite their occurrence in divergent genomic and ecological backgrounds.


Subjects
Drosophila/genetics , Evolution, Molecular , Gene Flow , Genetic Speciation , Animals , Chromosome Mapping , Chromosomes, Insect , Drosophila/classification , Female , Genetic Variation , Haplotypes , Linkage Disequilibrium , Male , Mutation , Phylogeny , Selection, Genetic , Species Specificity
5.
PLoS Biol ; 9(8): e1001126, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21857805

ABSTRACT

The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation--the equalization of X chromosome gene expression in males and females--and meiotic sex chromosome inactivation (MSCI)--the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila.


Subjects
Dosage Compensation, Genetic/genetics , Drosophila/genetics , Meiosis/genetics , Sex Chromosomes/genetics , Animals , Female , Germ Cells/metabolism , Male , Spermatogenesis/genetics , Testis/metabolism , X Chromosome Inactivation/genetics
6.
medRxiv ; 2024 Mar 18.
Article in English | MEDLINE | ID: mdl-38562723

ABSTRACT

Comprehending the mechanism behind human diseases with an established heritable component represents the forefront of personalized medicine. Nevertheless, numerous medically important genes are inaccurately represented in short-read sequencing data analysis due to their complexity and repetitiveness or the so-called 'dark regions' of the human genome. The advent of PacBio as a long-read platform has provided new insights, yet HiFi whole-genome sequencing (WGS) cost remains frequently prohibitive. We introduce a targeted sequencing and analysis framework, Twist Alliance Dark Genes Panel (TADGP), designed to offer phased variants across 389 medically important yet complex autosomal genes. We highlight TADGP accuracy across eleven control samples and compare it to WGS. This comparison demonstrates that TADGP achieves variant calling accuracy comparable to HiFi-WGS data, but at a fraction of the cost, thus enabling scalability and broad applicability for studying rare diseases or for complementing previously sequenced samples to gain insights into these complex genes. TADGP revealed several candidate variants across all cases and provided insight into LPA diversity when tested on samples from rare disease and cardiovascular disease cohorts. In both cohorts, we identified novel variants affecting individual disease-associated genes (e.g., IKZF1, KCNE1). Nevertheless, the annotation of the variants across these 389 medically important genes remains challenging due to their underrepresentation in ClinVar and gnomAD. Consequently, we also offer an annotation resource to enhance the evaluation and prioritization of these variants. Overall, we demonstrate that TADGP offers a cost-efficient and scalable approach to routinely assess the dark regions of the human genome with clinical relevance.

7.
Nat Genet ; 36(10): 1122-5, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15378061

ABSTRACT

Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (Phi(ST)) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.


Subjects
Chromosomes, Human, Y/genetics , DNA, Mitochondrial/genetics , Alu Elements , Emigration and Immigration , Family Characteristics , Female , Genetic Variation , Genetics, Population , Humans , Male , Models, Genetic , Molecular Sequence Data , Population Dynamics , Sex Characteristics
8.
Nat Genet ; 55(2): 301-311, 2023 02.
Article in English | MEDLINE | ID: mdl-36658436

ABSTRACT

Ixodes spp. and related ticks transmit prevalent infections, although knowledge of their biology and development of anti-tick measures have been hindered by the lack of a high-quality genome. In the present study, we present the assembly of a 2.23-Gb Ixodes scapularis genome by sequencing two haplotypes within one individual, complemented by chromosome-level scaffolding and full-length RNA isoform sequencing, yielding a fully reannotated genome featuring thousands of new protein-coding genes and various RNA species. Analyses of the repetitive DNA identified transposable elements, whereas the examination of tick-associated bacterial sequences yielded an improved Rickettsia buchneri genome. We demonstrate how the Ixodes genome advances tick science by contributing to new annotations, gene models and epigenetic functions, expansion of gene families, development of in-depth proteome catalogs and deciphering of genetic variations in wild ticks. Overall, we report critical genetic resources and biological insights impacting our understanding of tick biology and future interventions against tick-transmitted infections.


Subjects
Ixodes , Animals , Ixodes/genetics , Ixodes/microbiology , Genome/genetics , Bacteria/genetics , Base Sequence , RNA
9.
Plant Genome ; 14(1): e20072, 2021 03.
Article in English | MEDLINE | ID: mdl-33605092

ABSTRACT

Hop (Humulus lupulus L. var Lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing and contain compounds of therapeutic interest including xanthohumol. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large (2.8 Gb), repetitive, and heterozygous genome of hop. We present a draft haplotype-phased assembly of the Cascade cultivar genome. Our draft assembly and annotation of the Cascade genome is the most extensive representation of the hop genome to date. PacBio long-read sequences from hop were assembled with FALCON and partially phased with FALCON-Unzip. Comparative analysis of haplotype sequences provides insight into selective pressures that have driven evolution in hop. We discovered genes with greater sequence divergence enriched for stress-response, growth, and flowering functions in the draft phased assembly. With improved resolution of long terminal retrotransposons (LTRs) due to long-read sequencing, we found that hop is over 70% repetitive. We identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze the draft phased assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.


Subjects
Humulus , Diploidy , Genome, Plant , Genomics , Haplotypes , Humulus/genetics
10.
Front Plant Sci ; 12: 720670, 2021.
Article in English | MEDLINE | ID: mdl-34567033

ABSTRACT

A defining component of agroforestry parklands across Sahelo-Sudanian Africa (SSA), the shea tree (Vitellaria paradoxa) is central to sustaining local livelihoods and the farming environments of rural communities. Despite its economic and cultural value, however, not to mention the ecological roles it plays as a dominant parkland species, shea remains semi-domesticated with virtually no history of systematic genetic improvement. In truth, shea's extended juvenile period makes traditional breeding approaches untenable; but the opportunity for genome-assisted breeding is immense, provided the foundational resources are available. Here we report the development and public release of such resources. Using the FALCON-Phase workflow, 162.6 Gb of long-read PacBio sequence data were assembled into a 658.7 Mbp, chromosome-scale reference genome annotated with 38,505 coding genes. Whole genome duplication (WGD) analysis based on this gene space revealed clear signatures of two ancient WGD events in shea's evolutionary past, one prior to the Asterid-Rosid divergence (116-126 Mya) and the other at the root of the order Ericales (65-90 Mya). In a first genome-wide look at the suite of fatty acid (FA) biosynthesis genes that likely govern stearin content, the primary determinant of shea butter quality, relatively high copy numbers of six key enzymes were found (KASI, KASIII, FATB, FAD2, FAD3, and FAX2), some likely originating in shea's more recent WGD event. To help translate these findings into practical tools for characterization, selection, and genome-wide association studies (GWAS), resequencing data from a shea diversity panel were used to develop a database of more than 3.5 million functionally annotated, physically anchored SNPs. Two smaller, more curated sets of suggested SNPs, one for GWAS (104,211 SNPs) and the other targeting FA biosynthesis genes (90 SNPs), are also presented. With these resources, the hope is to support national programs across the shea belt in the strategic, genome-enabled conservation and long-term improvement of the shea tree for SSA.

11.
Nat Commun ; 12(1): 1935, 2021 04 28.
Article in English | MEDLINE | ID: mdl-33911078

ABSTRACT

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially phased diploid assemblies to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without having parental data and performance is better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97% compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.


Subjects
Contig Mapping/methods , Genome, Human/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Algorithms , Animals , Cattle , Haplotypes/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Zebrafish/genetics
12.
PLoS Biol ; 5(11): e293, 2007 Nov 06.
Article in English | MEDLINE | ID: mdl-17988173

ABSTRACT

The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de novo. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.


Subjects
Drosophila/genetics , Meiosis , Sex Ratio , X Chromosome/genetics , Amino Acid Sequence , Animals , Base Sequence , Chromosome Mapping , Drosophila Proteins/genetics , Evolution, Molecular , Female , Genes, Suppressor , Male , Models, Genetic , Molecular Sequence Data , Phylogeny , RNA Interference , Y Chromosome/genetics
13.
Nat Commun ; 11(1): 2071, 2020 04 29.
Article in English | MEDLINE | ID: mdl-32350247

ABSTRACT

Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and that variant detection is sensitive to the specific reference genome chosen. Six genes with immune related functions have additional copies in the indicine compared with taurine lineage and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations.


Subjects
Cattle/genetics , Genetic Variation , Genome , Haplotypes/genetics , Alleles , Allelic Imbalance , Animals , Base Sequence , Chromosomes, Mammalian/genetics , Female , Genetic Loci , INDEL Mutation/genetics , Male , Molecular Sequence Annotation , Polymorphism, Single Nucleotide/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Repetitive Sequences, Nucleic Acid/genetics
14.
Mol Biol Evol ; 25(3): 517-25, 2008 Mar.
Article in English | MEDLINE | ID: mdl-18093995

ABSTRACT

A history of Pleistocene population expansion has been inferred from the frequency spectrum of polymorphism in the mitochondrial DNA (mtDNA) of many human populations. Similar patterns are not typically observed for autosomal and X-linked loci. One explanation for this discrepancy is a recent population bottleneck, with different rates of recovery for haploid and autosomal loci as a result of their different effective population sizes. This hypothesis predicts that mitochondrial and Y chromosomal DNA will show a similar skew in the frequency spectrum in populations that have experienced a recent increase in effective population size. We test this hypothesis by resequencing 6.6 kb of noncoding Y chromosomal DNA and 780 basepairs of the mtDNA cytochrome c oxidase subunit III (COIII) gene in 172 males from 5 African populations. Four tests of population expansion are employed for each locus in each population: Fu's Fs statistic, the R(2) statistic, coalescent simulations, and the mismatch distribution. Consistent with previous results, patterns of mtDNA polymorphism better fit a model of constant population size for food-gathering populations and a model of population expansion for food-producing populations. In contrast, none of the tests reveal evidence of Y chromosome growth for either food-gatherers or food-producers. The distinct mtDNA and Y chromosome polymorphism patterns most likely reflect sex-biased demographic processes in the recent history of African populations. We hypothesize that males experienced smaller effective population sizes and/or lower rates of migration during the Bantu expansion, which occurred over the last 5,000 years.


Subjects
Chromosomes, Human, Y/genetics , DNA, Mitochondrial/genetics , Electron Transport Complex IV/genetics , Polymorphism, Genetic , Population Density , Africa , Genetics, Population , Humans , Male
15.
Genetics ; 178(1): 427-37, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18202385

ABSTRACT

A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of approximately 2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.


Subjects
Chromosomes, Human, X/genetics , Genealogy and Heraldry , Models, Genetic , DNA, Intergenic/genetics , Demography , Genetic Variation , Humans , Likelihood Functions , Phylogeny , Sequence Analysis, DNA
16.
Gigascience ; 8(10)2019 10 01.
Article in English | MEDLINE | ID: mdl-31609423

ABSTRACT

BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
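
The coverage figure quoted in this abstract can be checked with simple arithmetic: average depth of coverage is total sequenced bases divided by genome size. The following is a minimal sketch of that calculation, using only the numbers stated in the abstract (132 Gb total, 82 Gb from unique library molecules, 2.3 Gb assembly). The function name `fold_coverage` is an illustrative assumption and is not taken from the paper or from any assembly tool.

```python
def fold_coverage(sequenced_bases: float, genome_size: float) -> float:
    """Average depth of coverage = total sequenced bases / genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return sequenced_bases / genome_size


GB = 1e9  # one gigabase, in bases

# Figures as stated in the abstract; the assembly size stands in for genome size.
unique_coverage = fold_coverage(82 * GB, 2.3 * GB)
raw_coverage = fold_coverage(132 * GB, 2.3 * GB)

print(f"coverage from unique molecules: ~{unique_coverage:.0f}x")
print(f"coverage from all reads:        ~{raw_coverage:.0f}x")
```

Dividing the 82 Gb of unique-molecule sequence by the 2.3 Gb assembly gives roughly 36, in line with the ∼36× stated in the abstract.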


Subjects
Diptera/genetics , Genome, Insect , Genomics/methods , Animals , Female , Gene Library , Introduced Species , Sequence Analysis, DNA
17.
Genes (Basel) ; 10(1)2019 01 18.
Article in English | MEDLINE | ID: mdl-30669388

ABSTRACT

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives; however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.


Subjects
Anopheles/genetics , Genome, Insect , Sequence Analysis, DNA/methods , Animals , Contig Mapping/methods , Contig Mapping/standards , Ploidies , Polymorphism, Genetic , Sequence Analysis, DNA/standards
18.
Nat Commun ; 10(1): 260, 2019 01 16.
Article in English | MEDLINE | ID: mdl-30651564

ABSTRACT

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read-based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard-to-assemble gene clusters such as the major histocompatibility complex (MHC).


Subjects
Buffaloes/genetics , Chromosomes, Mammalian/genetics , Contig Mapping/methods , Genome/genetics , Goats/genetics , Animals , Chromatin/chemistry , Chromatin/genetics , Female , Genomics/methods , Haplotypes , High-Throughput Nucleotide Sequencing , Humans , Major Histocompatibility Complex/genetics , Molecular Sequence Annotation/methods , Multigene Family/genetics , Repetitive Sequences, Nucleic Acid/genetics , Whole Genome Sequencing
19.
Genetics ; 177(4): 2195-207, 2007 Dec.
Article in English | MEDLINE | ID: mdl-18073427

ABSTRACT

We estimate parameters of a general isolation-with-migration model using resequence data from mitochondrial DNA (mtDNA), the Y chromosome, and two loci on the X chromosome in samples of 25-50 individuals from each of 10 human populations. Application of a coalescent-based Markov chain Monte Carlo technique allows simultaneous inference of divergence times, rates of gene flow, as well as changes in effective population size. Results from comparisons between sub-Saharan African and Eurasian populations estimate that 1500 individuals founded the ancestral Eurasian population approximately 40 thousand years ago (KYA). Furthermore, these small Eurasian founding populations appear to have grown much more dramatically than either African or Oceanian populations. Analyses of sub-Saharan African populations provide little evidence for a history of population bottlenecks and suggest that they began diverging from one another upward of 50 KYA. We surmise that ancestral African populations had already been geographically structured prior to the founding of ancestral Eurasian populations. African populations are shown to experience low levels of mitochondrial DNA gene flow, but high levels of Y chromosome gene flow. In particular, Y chromosome gene flow appears to be asymmetric, i.e., from the Bantu-speaking population into other African populations. Conversely, mitochondrial gene flow is more extensive between non-African populations, but appears to be absent between European and Asian populations.


Subjects
Chromosomes, Human, X , Chromosomes, Human, Y , DNA, Mitochondrial , Gene Flow , Genetics, Population , Population Dynamics , Base Sequence , Humans , Markov Chains , Population Density , Racial Groups/genetics
20.
Curr Biol ; 28(8): 1289-1295.e4, 2018 04 23.
Article in English | MEDLINE | ID: mdl-29606420

ABSTRACT

Crossing over between homologous chromosomes during meiosis repairs programmed DNA double-strand breaks, ensures proper segregation at meiosis I [1], shapes the genomic distribution of nucleotide variability in populations, and enhances the efficacy of natural selection among genetically linked sites [2]. Between closely related Drosophila species, large differences exist in the rate and chromosomal distribution of crossing over. Little, however, is known about the molecular genetic changes or population genetic forces that mediate evolved differences in recombination between species [3, 4]. Here, we show that a meiosis gene with a history of rapid evolution acts as a trans-acting modifier of species differences in crossing over. In transgenic flies, the dicistronic gene, mei-217/mei-218, recapitulates a large part of the species differences in the rate and chromosomal distribution of crossing over. These phenotypic differences appear to result from changes in protein sequence not gene expression. Our population genetics analyses show that the protein-coding sequence of mei-218, but not mei-217, has a history of recurrent positive natural selection. By modulating the intensity of centromeric and telomeric suppression of crossing over, evolution at mei-217/-218 has incidentally shaped gross differences in the chromosomal distribution of nucleotide variability between species. We speculate that recurrent bouts of adaptive evolution at mei-217/-218 might reflect a history of coevolution with selfish genetic elements.


Subjects
Cell Cycle Proteins/genetics , Crossing Over, Genetic/genetics , Drosophila Proteins/genetics , Meiosis/genetics , Amino Acid Sequence , Animals , Animals, Genetically Modified/genetics , Cell Cycle Proteins/metabolism , Cell Cycle Proteins/physiology , Centromere/genetics , Centromere/physiology , DNA Breaks, Double-Stranded , Drosophila/genetics , Drosophila Proteins/metabolism , Drosophila Proteins/physiology , Drosophila melanogaster/genetics , Evolution, Molecular , Gene Expression/genetics , Recombination, Genetic/genetics , Selection, Genetic , Species Specificity