1.
Nature ; 563(7732): 501-507, 2018 Nov.
Article in English | MEDLINE | ID: mdl-30429615

ABSTRACT

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.


Subjects
Aedes/genetics , Arbovirus Infections/virology , Arboviruses , Genome, Insect/genetics , Genomics/standards , Insect Control , Mosquito Vectors/genetics , Mosquito Vectors/virology , Aedes/virology , Animals , Arbovirus Infections/transmission , Arboviruses/isolation & purification , DNA Copy Number Variations/genetics , Dengue Virus/isolation & purification , Female , Genetic Variation/genetics , Genetics, Population , Glutathione Transferase/genetics , Insecticide Resistance/drug effects , Male , Molecular Sequence Annotation , Multigene Family/genetics , Pyrethrins/pharmacology , Reference Standards , Sex Determination Processes/genetics
2.
BMC Plant Biol ; 19(1): 319, 2019 Jul 16.
Article in English | MEDLINE | ID: mdl-31311507

ABSTRACT

BACKGROUND: Non-host resistance (NHR) presents a compelling long-term plant protection strategy for global food security, yet the genetic basis of NHR remains poorly understood. For many diseases, including stem rust of wheat [causal organism Puccinia graminis (Pg)], NHR is largely unexplored due to the inherent challenge of developing a genetically tractable system within which the resistance segregates. The present study turns to the pathogen's alternate host, barberry (Berberis spp.), to overcome this challenge. RESULTS: In this study, an interspecific mapping population derived from a cross between Pg-resistant Berberis thunbergii (Bt) and Pg-susceptible B. vulgaris was developed to investigate the Pg-NHR exhibited by Bt. To facilitate QTL analysis and subsequent trait dissection, the first genetic linkage maps for the two parental species were constructed and a chromosome-scale reference genome for Bt was assembled (PacBio + Hi-C). QTL analysis resulted in the identification of a single 13 cM region (~ 5.1 Mbp spanning 13 physical contigs) on the short arm of Bt chromosome 3. Differential gene expression analysis, combined with sequence variation analysis between the two parental species, led to the prioritization of several candidate genes within the QTL region, some of which belong to gene families previously implicated in disease resistance. CONCLUSIONS: Foundational genetic and genomic resources developed for Berberis spp. enabled the identification and annotation of a QTL associated with Pg-NHR. Although subsequent validation and fine mapping studies are needed, this study demonstrates the feasibility of and lays the groundwork for dissecting Pg-NHR in the alternate host of one of agriculture's most devastating pathogens.


Subjects
Basidiomycota/physiology , Berberis/genetics , Berberis/microbiology , Plant Diseases/genetics , Chromosome Mapping , Chromosomes, Plant , Disease Resistance/genetics , Gene Expression Profiling , Genome, Plant , Hybridization, Genetic , Inheritance Patterns , Phenotype , Plant Diseases/microbiology , Plant Stems/microbiology , Quantitative Trait Loci
3.
Genome Res ; 22(8): 1499-511, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22534282

ABSTRACT

The three species of the Drosophila simulans clade--the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia--are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.


Subjects
Drosophila/classification , Genetic Speciation , Genome, Insect , Animals , Base Sequence , Chromosomes, Insect/genetics , Drosophila/genetics , Evolution, Molecular , Gene Flow , Haplotypes , Phylogeny , Population Density , Reproductive Isolation , Selection, Genetic , Sequence Alignment , Sequence Analysis, DNA
4.
Mol Biol Evol ; 30(9): 2177-86, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23827876

ABSTRACT

Adaptive mutations that accumulate during species divergence are likely to contribute to reproductive incompatibilities and hinder gene flow; however, there may also be a class of mutations that are generally advantageous and can spread across species boundaries. In this study, we characterize a 15 kb region on chromosome 3R that has introgressed from the cosmopolitan generalist species Drosophila simulans into the island endemic D. sechellia, which is an ecological specialist. The introgressed haplotype is fixed in D. sechellia over almost the entirety of the resequenced region, whereas a core region of the introgressed haplotype occurs at high frequency in D. simulans. The observed patterns of nucleotide variation and linkage disequilibrium are consistent with a recently completed selective sweep in D. sechellia and an incomplete sweep in D. simulans. Independent estimates of both the time to the introgression and sweep events are all close to 10,000 years before the present. Interestingly, the most likely target of selection is a highly occupied transcription factor binding region. This work confirms that it is possible for mutations to be globally advantageous, despite their occurrence in divergent genomic and ecological backgrounds.


Subjects
Drosophila/genetics , Evolution, Molecular , Gene Flow , Genetic Speciation , Animals , Chromosome Mapping , Chromosomes, Insect , Drosophila/classification , Female , Genetic Variation , Haplotypes , Linkage Disequilibrium , Male , Mutation , Phylogeny , Selection, Genetic , Species Specificity
5.
PLoS Biol ; 9(8): e1001126, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21857805

ABSTRACT

The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation--the equalization of X chromosome gene expression in males and females--and meiotic sex chromosome inactivation (MSCI)--the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila.


Subjects
Dosage Compensation, Genetic/genetics , Drosophila/genetics , Meiosis/genetics , Sex Chromosomes/genetics , Animals , Female , Germ Cells/metabolism , Male , Spermatogenesis/genetics , Testis/metabolism , X Chromosome Inactivation/genetics
6.
Sci Data ; 11(1): 918, 2024 Aug 24.
Article in English | MEDLINE | ID: mdl-39181902

ABSTRACT

Phlebotomine sand flies are the vectors of leishmaniasis, a neglected tropical disease. High-quality reference genomes are an important tool for understanding the biology and eco-evolutionary dynamics underpinning disease epidemiology. Previous leishmaniasis vector reference sequences were limited by the sequencing technologies available at the time and were inadequate for high-resolution genomic inquiry. Here, we present updated reference assemblies of two sand flies, Phlebotomus papatasi and Lutzomyia longipalpis. These chromosome-level assemblies were generated using an ultra-low input library protocol, PacBio HiFi long reads, and Hi-C technology. The new P. papatasi reference has a final assembly span of 351.6 Mb and contig and scaffold N50s of 926 kb and 111.8 Mb, respectively. The new Lu. longipalpis reference has a final assembly span of 147.8 Mb and contig and scaffold N50s of 1.09 Mb and 40.6 Mb, respectively. Benchmarking Universal Single-Copy Orthologue (BUSCO) assessments indicated 94.5% and 95.6% complete single-copy Insecta orthologs for P. papatasi and Lu. longipalpis, respectively. These improved assemblies will serve as an invaluable resource for future genomic work on phlebotomine sand flies.
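
The contig and scaffold N50 values quoted in this abstract follow the usual definition: sort the sequences from longest to shortest and report the length of the sequence at which the running total first reaches half of the assembly span. The sketch below is only an illustration of that definition; the function name `n50` and the toy contig lengths are assumptions, not data or code from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly span.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_span = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_span:
            return length


# Hypothetical contig lengths (arbitrary units), for illustration only.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # total span 300; 80 + 70 reaches 150, so N50 is 70
```

Because N50 depends only on the length distribution, a contig N50 and a scaffold N50 for the same assembly can differ by orders of magnitude, as they do in the figures above.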


Subjects
Genome, Insect , Psychodidae , Animals , Psychodidae/genetics , Phlebotomus/genetics , Phlebotomus/classification , Insect Vectors/genetics , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA
7.
medRxiv ; 2024 Mar 18.
Article in English | MEDLINE | ID: mdl-38562723

ABSTRACT

Comprehending the mechanism behind human diseases with an established heritable component represents the forefront of personalized medicine. Nevertheless, numerous medically important genes are inaccurately represented in short-read sequencing data analysis due to their complexity and repetitiveness or their location in the so-called 'dark regions' of the human genome. The advent of PacBio as a long-read platform has provided new insights, yet the cost of HiFi whole-genome sequencing (WGS) frequently remains prohibitive. We introduce a targeted sequencing and analysis framework, Twist Alliance Dark Genes Panel (TADGP), designed to offer phased variants across 389 medically important yet complex autosomal genes. We highlight TADGP accuracy across eleven control samples and compare it to WGS. This demonstrates that TADGP achieves variant calling accuracy comparable to HiFi-WGS data at a fraction of the cost, enabling scalability and broad applicability for studying rare diseases or complementing previously sequenced samples to gain insights into these complex genes. TADGP revealed several candidate variants across all cases and provided insight into LPA diversity when tested on samples from rare disease and cardiovascular disease cohorts. In both cohorts, we identified novel variants affecting individual disease-associated genes (e.g., IKZF1, KCNE1). Nevertheless, the annotation of the variants across these 389 medically important genes remains challenging due to their underrepresentation in ClinVar and gnomAD. Consequently, we also offer an annotation resource to enhance the evaluation and prioritization of these variants. Overall, we demonstrate that TADGP offers a cost-efficient and scalable approach to routinely assess the dark regions of the human genome with clinical relevance.

8.
bioRxiv ; 2024 Sep 22.
Article in English | MEDLINE | ID: mdl-39345378

ABSTRACT

The Genome in a Bottle Consortium (GIAB), hosted by the National Institute of Standards and Technology (NIST), is developing new matched tumor-normal samples, the first to be explicitly consented for public dissemination of genomic data and cell lines. Here, we describe a comprehensive genomic dataset from the first individual, HG008, including DNA from an adherent, epithelial-like pancreatic ductal adenocarcinoma (PDAC) tumor cell line (HG008-T) and matched normal cells from duodenal tissue (HG008-N-D) and pancreatic tissue (HG008-N-P). The data come from thirteen whole genome measurement technologies: Illumina paired-end, Element standard and long insert, Ultima UG100, PacBio (HiFi and Onso), Oxford Nanopore (standard and ultra-long), Bionano Optical Mapping, Arima and Phase Genomics Hi-C, G-banded karyotyping, directional genomic hybridization, and BioSkryb Genomics single-cell ResolveDNA. Most tumor data is from a large homogenous batch of non-viable cells after 23 passages of the primary tumor cells, along with some data from different passages to enable an initial understanding of genomic instability. These data will be used by the GIAB Consortium to develop matched tumor-normal benchmarks for somatic variant detection. In addition, extensive data from two different normal tissues from the same individual can enable understanding of mosaicism. Long reads also contain methylation tags for epigenetic analyses. We expect these data to facilitate innovation for whole genome measurement technologies, de novo assembly of tumor and normal genomes, and bioinformatic tools to identify small and structural somatic mutations. This first-of-its-kind broadly consented open-access resource will facilitate further understanding of sequencing methods used for cancer biology.

9.
Nat Genet ; 36(10): 1122-5, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15378061

ABSTRACT

Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (Phi(ST)) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.


Subjects
Chromosomes, Human, Y/genetics , DNA, Mitochondrial/genetics , Alu Elements , Emigration and Immigration , Family Characteristics , Female , Genetic Variation , Genetics, Population , Humans , Male , Models, Genetic , Molecular Sequence Data , Population Dynamics , Sex Characteristics
10.
Nat Genet ; 55(2): 301-311, 2023 Feb.
Article in English | MEDLINE | ID: mdl-36658436

ABSTRACT

Ixodes spp. and related ticks transmit prevalent infections, although knowledge of their biology and development of anti-tick measures have been hindered by the lack of a high-quality genome. In the present study, we present the assembly of a 2.23-Gb Ixodes scapularis genome by sequencing two haplotypes within one individual, complemented by chromosome-level scaffolding and full-length RNA isoform sequencing, yielding a fully reannotated genome featuring thousands of new protein-coding genes and various RNA species. Analyses of the repetitive DNA identified transposable elements, whereas the examination of tick-associated bacterial sequences yielded an improved Rickettsia buchneri genome. We demonstrate how the Ixodes genome advances tick science by contributing to new annotations, gene models and epigenetic functions, expansion of gene families, development of in-depth proteome catalogs and deciphering of genetic variations in wild ticks. Overall, we report critical genetic resources and biological insights impacting our understanding of tick biology and future interventions against tick-transmitted infections.


Subjects
Ixodes , Animals , Ixodes/genetics , Ixodes/microbiology , Genome/genetics , Bacteria/genetics , Base Sequence , RNA
11.
Plant Genome ; 14(1): e20072, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33605092

ABSTRACT

Hop (Humulus lupulus L. var. lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing and contain compounds of therapeutic interest including xanthohumol. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large (2.8 Gb), repetitive, and heterozygous genome of hop. We present a draft haplotype-phased assembly of the Cascade cultivar genome. Our draft assembly and annotation of the Cascade genome is the most extensive representation of the hop genome to date. PacBio long-read sequences from hop were assembled with FALCON and partially phased with FALCON-Unzip. Comparative analysis of haplotype sequences provides insight into selective pressures that have driven evolution in hop. We discovered genes with greater sequence divergence enriched for stress-response, growth, and flowering functions in the draft phased assembly. With improved resolution of long terminal repeat (LTR) retrotransposons due to long-read sequencing, we found that hop is over 70% repetitive. We identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze the draft phased assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.


Subjects
Humulus , Diploidy , Genome, Plant , Genomics , Haplotypes , Humulus/genetics
12.
Front Plant Sci ; 12: 720670, 2021.
Article in English | MEDLINE | ID: mdl-34567033

ABSTRACT

A defining component of agroforestry parklands across Sahelo-Sudanian Africa (SSA), the shea tree (Vitellaria paradoxa) is central to sustaining local livelihoods and the farming environments of rural communities. Despite its economic and cultural value, however, not to mention the ecological roles it plays as a dominant parkland species, shea remains semi-domesticated with virtually no history of systematic genetic improvement. In truth, shea's extended juvenile period makes traditional breeding approaches untenable; but the opportunity for genome-assisted breeding is immense, provided the foundational resources are available. Here we report the development and public release of such resources. Using the FALCON-Phase workflow, 162.6 Gb of long-read PacBio sequence data were assembled into a 658.7 Mbp, chromosome-scale reference genome annotated with 38,505 coding genes. Whole genome duplication (WGD) analysis based on this gene space revealed clear signatures of two ancient WGD events in shea's evolutionary past, one prior to the Asterid-Rosid divergence (116-126 Mya) and the other at the root of the order Ericales (65-90 Mya). In a first genome-wide look at the suite of fatty acid (FA) biosynthesis genes that likely govern stearin content, the primary determinant of shea butter quality, relatively high copy numbers of six key enzymes were found (KASI, KASIII, FATB, FAD2, FAD3, and FAX2), some likely originating in shea's more recent WGD event. To help translate these findings into practical tools for characterization, selection, and genome-wide association studies (GWAS), resequencing data from a shea diversity panel were used to develop a database of more than 3.5 million functionally annotated, physically anchored SNPs. Two smaller, more curated sets of suggested SNPs, one for GWAS (104,211 SNPs) and the other targeting FA biosynthesis genes (90 SNPs), are also presented. With these resources, the hope is to support national programs across the shea belt in the strategic, genome-enabled conservation and long-term improvement of the shea tree for SSA.

13.
Nat Commun ; 12(1): 1935, 2021 Apr 28.
Article in English | MEDLINE | ID: mdl-33911078

ABSTRACT

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially phased diploid assemblies to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without having parental data and performance is better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97% compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.


Subjects
Contig Mapping/methods , Genome, Human/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Algorithms , Animals , Cattle , Haplotypes/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Zebrafish/genetics
14.
PLoS Biol ; 5(11): e293, 2007 Nov 06.
Article in English | MEDLINE | ID: mdl-17988173

ABSTRACT

The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de novo. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.


Subjects
Drosophila/genetics , Meiosis , Sex Ratio , X Chromosome/genetics , Amino Acid Sequence , Animals , Base Sequence , Chromosome Mapping , Drosophila Proteins/genetics , Evolution, Molecular , Female , Genes, Suppressor , Male , Models, Genetic , Molecular Sequence Data , Phylogeny , RNA Interference , Y Chromosome/genetics
15.
Nat Commun ; 11(1): 2071, 2020 Apr 29.
Article in English | MEDLINE | ID: mdl-32350247

ABSTRACT

Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation, but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and show that variant detection is sensitive to the specific reference genome chosen. Six genes with immune-related functions have additional copies in the indicine compared with the taurine lineage, and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations.


Subjects
Cattle/genetics , Genetic Variation , Genome , Haplotypes/genetics , Alleles , Allelic Imbalance , Animals , Base Sequence , Chromosomes, Mammalian/genetics , Female , Genetic Loci , INDEL Mutation/genetics , Male , Molecular Sequence Annotation , Polymorphism, Single Nucleotide/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Repetitive Sequences, Nucleic Acid/genetics
16.
Mol Biol Evol ; 25(3): 517-25, 2008 Mar.
Article in English | MEDLINE | ID: mdl-18093995

ABSTRACT

A history of Pleistocene population expansion has been inferred from the frequency spectrum of polymorphism in the mitochondrial DNA (mtDNA) of many human populations. Similar patterns are not typically observed for autosomal and X-linked loci. One explanation for this discrepancy is a recent population bottleneck, with different rates of recovery for haploid and autosomal loci as a result of their different effective population sizes. This hypothesis predicts that mitochondrial and Y chromosomal DNA will show a similar skew in the frequency spectrum in populations that have experienced a recent increase in effective population size. We test this hypothesis by resequencing 6.6 kb of noncoding Y chromosomal DNA and 780 basepairs of the mtDNA cytochrome c oxidase subunit III (COIII) gene in 172 males from 5 African populations. Four tests of population expansion are employed for each locus in each population: Fu's Fs statistic, the R(2) statistic, coalescent simulations, and the mismatch distribution. Consistent with previous results, patterns of mtDNA polymorphism better fit a model of constant population size for food-gathering populations and a model of population expansion for food-producing populations. In contrast, none of the tests reveal evidence of Y chromosome growth for either food-gatherers or food-producers. The distinct mtDNA and Y chromosome polymorphism patterns most likely reflect sex-biased demographic processes in the recent history of African populations. We hypothesize that males experienced smaller effective population sizes and/or lower rates of migration during the Bantu expansion, which occurred over the last 5,000 years.


Subjects
Chromosomes, Human, Y/genetics , DNA, Mitochondrial/genetics , Electron Transport Complex IV/genetics , Polymorphism, Genetic , Population Density , Africa , Genetics, Population , Humans , Male
17.
Genetics ; 178(1): 427-37, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18202385

ABSTRACT

A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of approximately 2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.


Subjects
Chromosomes, Human, X/genetics , Genealogy and Heraldry , Models, Genetic , DNA, Intergenic/genetics , Demography , Genetic Variation , Humans , Likelihood Functions , Phylogeny , Sequence Analysis, DNA
18.
Gigascience ; 8(10), 2019 Oct 01.
Article in English | MEDLINE | ID: mdl-31609423

ABSTRACT

BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
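
The "∼36× coverage" figure in this abstract can be checked by dividing the sequenced bases by the genome size. The short sketch below does only that arithmetic with the numbers stated above (82 Gb of unique-molecule sequence, 132 Gb total, 2.3 Gb assembly); the function name `fold_coverage` is a hypothetical helper, and using the assembly size as the genome size is an assumption.

```python
def fold_coverage(sequenced_bases, genome_size_bases):
    """Average depth of coverage: sequenced bases divided by genome size."""
    if genome_size_bases <= 0:
        raise ValueError("genome size must be positive")
    return sequenced_bases / genome_size_bases


GENOME_SIZE = 2.3e9   # 2.3 Gb assembly, taken here as the genome size
UNIQUE_BASES = 82e9   # 82 Gb from unique library molecules
TOTAL_BASES = 132e9   # 132 Gb of raw long-read sequence

print(round(fold_coverage(UNIQUE_BASES, GENOME_SIZE)))  # 36
print(round(fold_coverage(TOTAL_BASES, GENOME_SIZE)))   # 57
```

The gap between the two values reflects that only reads from unique library molecules are counted toward the reported coverage.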


Subjects
Diptera/genetics , Genome, Insect , Genomics/methods , Animals , Female , Gene Library , Introduced Species , Sequence Analysis, DNA
19.
Genes (Basel) ; 10(1)2019 01 18.
Article in English | MEDLINE | ID: mdl-30669388

ABSTRACT

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. 
This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.


Subjects
Anopheles/genetics , Genome, Insect , Sequence Analysis, DNA/methods , Animals , Contig Mapping/methods , Contig Mapping/standards , Ploidies , Polymorphism, Genetic , Sequence Analysis, DNA/standards
20.
Nat Commun ; 10(1): 260, 2019 01 16.
Article in English | MEDLINE | ID: mdl-30651564

ABSTRACT

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).
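Contig N50, the contiguity statistic compared in this abstract, is conventionally defined as the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. The sketch below illustrates that standard definition only; the function name `contig_n50` and the toy contig lengths are hypothetical and are not data from the buffalo assembly.

```python
from typing import Iterable


def contig_n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length of the
    contig at which the running total first reaches half of the assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one contig length is required")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if 2 * running >= total:  # integer test for running >= total / 2
            return length
    raise AssertionError("unreachable: running total always reaches the total")


# Toy example with made-up lengths (arbitrary units), not real assembly data.
toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(contig_n50(toy_contigs))
```

In the toy example the two longest contigs (8 and 8) already account for half of the 32 units in total, so the N50 is 8. Fewer, longer contigs push this value up, which is why it is used as a contiguity measure.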


Subjects
Buffaloes/genetics , Chromosomes, Mammalian/genetics , Contig Mapping/methods , Genome/genetics , Goats/genetics , Animals , Chromatin/chemistry , Chromatin/genetics , Female , Genomics/methods , Haplotypes , High-Throughput Nucleotide Sequencing , Humans , Major Histocompatibility Complex/genetics , Molecular Sequence Annotation/methods , Multigene Family/genetics , Repetitive Sequences, Nucleic Acid/genetics , Whole Genome Sequencing