Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
1.
Nature ; 563(7732): 501-507, 2018 11.
Article in English | MEDLINE | ID: mdl-30429615

ABSTRACT

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.


Subject(s)
Aedes/genetics , Arbovirus Infections/virology , Arboviruses , Genome, Insect/genetics , Genomics/standards , Insect Control , Mosquito Vectors/genetics , Mosquito Vectors/virology , Aedes/virology , Animals , Arbovirus Infections/transmission , Arboviruses/isolation & purification , DNA Copy Number Variations/genetics , Dengue Virus/isolation & purification , Female , Genetic Variation/genetics , Genetics, Population , Glutathione Transferase/genetics , Insecticide Resistance/drug effects , Male , Molecular Sequence Annotation , Multigene Family/genetics , Pyrethrins/pharmacology , Reference Standards , Sex Determination Processes/genetics
2.
Genome Res ; 22(8): 1499-511, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22534282

ABSTRACT

The three species of the Drosophila simulans clade--the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia--are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.


Subject(s)
Drosophila/classification , Genetic Speciation , Genome, Insect , Animals , Base Sequence , Chromosomes, Insect/genetics , Drosophila/genetics , Evolution, Molecular , Gene Flow , Haplotypes , Phylogeny , Population Density , Reproductive Isolation , Selection, Genetic , Sequence Alignment , Sequence Analysis, DNA
3.
Mol Biol Evol ; 30(9): 2177-86, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23827876

ABSTRACT

Adaptive mutations that accumulate during species divergence are likely to contribute to reproductive incompatibilities and hinder gene flow; however, there may also be a class of mutations that are generally advantageous and can spread across species boundaries. In this study, we characterize a 15 kb region on chromosome 3R that has introgressed from the cosmopolitan generalist species Drosophila simulans into the island endemic D. sechellia, which is an ecological specialist. The introgressed haplotype is fixed in D. sechellia over almost the entirety of the resequenced region, whereas a core region of the introgressed haplotype occurs at high frequency in D. simulans. The observed patterns of nucleotide variation and linkage disequilibrium are consistent with a recently completed selective sweep in D. sechellia and an incomplete sweep in D. simulans. Independent estimates of both the time to the introgression and sweep events are all close to 10,000 years before the present. Interestingly, the most likely target of selection is a highly occupied transcription factor binding region. This work confirms that it is possible for mutations to be globally advantageous, despite their occurrence in divergent genomic and ecological backgrounds.


Subject(s)
Drosophila/genetics , Evolution, Molecular , Gene Flow , Genetic Speciation , Animals , Chromosome Mapping , Chromosomes, Insect , Drosophila/classification , Female , Genetic Variation , Haplotypes , Linkage Disequilibrium , Male , Mutation , Phylogeny , Selection, Genetic , Species Specificity
4.
PLoS Biol ; 9(8): e1001126, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21857805

ABSTRACT

The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation--the equalization of X chromosome gene expression in males and females--and meiotic sex chromosome inactivation (MSCI)--the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila.


Subject(s)
Dosage Compensation, Genetic/genetics , Drosophila/genetics , Meiosis/genetics , Sex Chromosomes/genetics , Animals , Female , Germ Cells/metabolism , Male , Spermatogenesis/genetics , Testis/metabolism , X Chromosome Inactivation/genetics
5.
Nat Genet ; 36(10): 1122-5, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15378061

ABSTRACT

Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (Phi(ST)) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.


Subject(s)
Chromosomes, Human, Y/genetics , DNA, Mitochondrial/genetics , Alu Elements , Emigration and Immigration , Family Characteristics , Female , Genetic Variation , Genetics, Population , Humans , Male , Models, Genetic , Molecular Sequence Data , Population Dynamics , Sex Characteristics
6.
Nat Genet ; 55(2): 301-311, 2023 02.
Article in English | MEDLINE | ID: mdl-36658436

ABSTRACT

Ixodes spp. and related ticks transmit prevalent infections, although knowledge of their biology and development of anti-tick measures have been hindered by the lack of a high-quality genome. In the present study, we present the assembly of a 2.23-Gb Ixodes scapularis genome by sequencing two haplotypes within one individual, complemented by chromosome-level scaffolding and full-length RNA isoform sequencing, yielding a fully reannotated genome featuring thousands of new protein-coding genes and various RNA species. Analyses of the repetitive DNA identified transposable elements, whereas the examination of tick-associated bacterial sequences yielded an improved Rickettsia buchneri genome. We demonstrate how the Ixodes genome advances tick science by contributing to new annotations, gene models and epigenetic functions, expansion of gene families, development of in-depth proteome catalogs and deciphering of genetic variations in wild ticks. Overall, we report critical genetic resources and biological insights impacting our understanding of tick biology and future interventions against tick-transmitted infections.


Subject(s)
Ixodes , Animals , Ixodes/genetics , Ixodes/microbiology , Genome/genetics , Bacteria/genetics , Base Sequence , RNA
7.
Plant Genome ; 14(1): e20072, 2021 03.
Article in English | MEDLINE | ID: mdl-33605092

ABSTRACT

Hop (Humulus lupulus L. var Lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing and contain compounds of therapeutic interest including xanthohumol. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large (2.8 Gb), repetitive, and heterozygous genome of hop. We present a draft haplotype-phased assembly of the Cascade cultivar genome. Our draft assembly and annotation of the Cascade genome is the most extensive representation of the hop genome to date. PacBio long-read sequences from hop were assembled with FALCON and partially phased with FALCON-Unzip. Comparative analysis of haplotype sequences provides insight into selective pressures that have driven evolution in hop. We discovered genes with greater sequence divergence enriched for stress-response, growth, and flowering functions in the draft phased assembly. With improved resolution of long terminal retrotransposons (LTRs) due to long-read sequencing, we found that hop is over 70% repetitive. We identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze the draft phased assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.


Subject(s)
Humulus , Diploidy , Genome, Plant , Genomics , Haplotypes , Humulus/genetics
8.
Front Plant Sci ; 12: 720670, 2021.
Article in English | MEDLINE | ID: mdl-34567033

ABSTRACT

A defining component of agroforestry parklands across Sahelo-Sudanian Africa (SSA), the shea tree (Vitellaria paradoxa) is central to sustaining local livelihoods and the farming environments of rural communities. Despite its economic and cultural value, however, not to mention the ecological roles it plays as a dominant parkland species, shea remains semi-domesticated with virtually no history of systematic genetic improvement. In truth, shea's extended juvenile period makes traditional breeding approaches untenable; but the opportunity for genome-assisted breeding is immense, provided the foundational resources are available. Here we report the development and public release of such resources. Using the FALCON-Phase workflow, 162.6 Gb of long-read PacBio sequence data were assembled into a 658.7 Mbp, chromosome-scale reference genome annotated with 38,505 coding genes. Whole genome duplication (WGD) analysis based on this gene space revealed clear signatures of two ancient WGD events in shea's evolutionary past, one prior to the Astrid-Rosid divergence (116-126 Mya) and the other at the root of the order Ericales (65-90 Mya). In a first genome-wide look at the suite of fatty acid (FA) biosynthesis genes that likely govern stearin content, the primary determinant of shea butter quality, relatively high copy numbers of six key enzymes were found (KASI, KASIII, FATB, FAD2, FAD3, and FAX2), some likely originating in shea's more recent WGD event. To help translate these findings into practical tools for characterization, selection, and genome-wide association studies (GWAS), resequencing data from a shea diversity panel was used to develop a database of more than 3.5 million functionally annotated, physically anchored SNPs. Two smaller, more curated sets of suggested SNPs, one for GWAS (104,211 SNPs) and the other targeting FA biosynthesis genes (90 SNPs), are also presented. With these resources, the hope is to support national programs across the shea belt in the strategic, genome-enabled conservation and long-term improvement of the shea tree for SSA.

9.
Nat Commun ; 12(1): 1935, 2021 04 28.
Article in English | MEDLINE | ID: mdl-33911078

ABSTRACT

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially-phased diploid assembles to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without having parental data and performance is better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97% compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.


Subject(s)
Contig Mapping/methods , Genome, Human/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Algorithms , Animals , Cattle , Haplotypes/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Zebrafish/genetics
10.
PLoS Biol ; 5(11): e293, 2007 Nov 06.
Article in English | MEDLINE | ID: mdl-17988173

ABSTRACT

The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de nova. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.


Subject(s)
Drosophila/genetics , Meiosis , Sex Ratio , X Chromosome/genetics , Amino Acid Sequence , Animals , Base Sequence , Chromosome Mapping , Drosophila Proteins/genetics , Evolution, Molecular , Female , Genes, Suppressor , Male , Models, Genetic , Molecular Sequence Data , Phylogeny , RNA Interference , Y Chromosome/genetics
11.
Nat Commun ; 11(1): 2071, 2020 04 29.
Article in English | MEDLINE | ID: mdl-32350247

ABSTRACT

Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and that variant detection is sensitive to the specific reference genome chosen. Six genes with immune related functions have additional copies in the indicine compared with taurine lineage and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations.


Subject(s)
Cattle/genetics , Genetic Variation , Genome , Haplotypes/genetics , Alleles , Allelic Imbalance , Animals , Base Sequence , Chromosomes, Mammalian/genetics , Female , Genetic Loci , INDEL Mutation/genetics , Male , Molecular Sequence Annotation , Polymorphism, Single Nucleotide/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Repetitive Sequences, Nucleic Acid/genetics
12.
Genetics ; 178(1): 427-37, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18202385

ABSTRACT

A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of approximately 2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.


Subject(s)
Chromosomes, Human, X/genetics , Genealogy and Heraldry , Models, Genetic , DNA, Intergenic/genetics , Demography , Genetic Variation , Humans , Likelihood Functions , Phylogeny , Sequence Analysis, DNA
13.
Gigascience ; 8(10)2019 10 01.
Article in English | MEDLINE | ID: mdl-31609423

ABSTRACT

BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.


Subject(s)
Diptera/genetics , Genome, Insect , Genomics/methods , Animals , Female , Gene Library , Introduced Species , Sequence Analysis, DNA
14.
Genes (Basel) ; 10(1)2019 01 18.
Article in English | MEDLINE | ID: mdl-30669388

ABSTRACT

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.


Subject(s)
Anopheles/genetics , Genome, Insect , Sequence Analysis, DNA/methods , Animals , Contig Mapping/methods , Contig Mapping/standards , Ploidies , Polymorphism, Genetic , Sequence Analysis, DNA/standards
15.
Nat Commun ; 10(1): 260, 2019 01 16.
Article in English | MEDLINE | ID: mdl-30651564

ABSTRACT

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).


Subject(s)
Buffaloes/genetics , Chromosomes, Mammalian/genetics , Contig Mapping/methods , Genome/genetics , Goats/genetics , Animals , Chromatin/chemistry , Chromatin/genetics , Female , Genomics/methods , Haplotypes , High-Throughput Nucleotide Sequencing , Humans , Major Histocompatibility Complex/genetics , Molecular Sequence Annotation/methods , Multigene Family/genetics , Repetitive Sequences, Nucleic Acid/genetics , Whole Genome Sequencing
16.
Genetics ; 177(4): 2195-207, 2007 Dec.
Article in English | MEDLINE | ID: mdl-18073427

ABSTRACT

We estimate parameters of a general isolation-with-migration model using resequence data from mitochondrial DNA (mtDNA), the Y chromosome, and two loci on the X chromosome in samples of 25-50 individuals from each of 10 human populations. Application of a coalescent-based Markov chain Monte Carlo technique allows simultaneous inference of divergence times, rates of gene flow, as well as changes in effective population size. Results from comparisons between sub-Saharan African and Eurasian populations estimate that 1500 individuals founded the ancestral Eurasian population approximately 40 thousand years ago (KYA). Furthermore, these small Eurasian founding populations appear to have grown much more dramatically than either African or Oceanian populations. Analyses of sub-Saharan African populations provide little evidence for a history of population bottlenecks and suggest that they began diverging from one another upward of 50 KYA. We surmise that ancestral African populations had already been geographically structured prior to the founding of ancestral Eurasian populations. African populations are shown to experience low levels of mitochondrial DNA gene flow, but high levels of Y chromosome gene flow. In particular, Y chromosome gene flow appears to be asymmetric, i.e., from the Bantu-speaking population into other African populations. Conversely, mitochondrial gene flow is more extensive between non-African populations, but appears to be absent between European and Asian populations.


Subject(s)
Chromosomes, Human, X , Chromosomes, Human, Y , DNA, Mitochondrial , Gene Flow , Genetics, Population , Population Dynamics , Base Sequence , Humans , Markov Chains , Population Density , Racial Groups/genetics
17.
Curr Biol ; 28(8): 1289-1295.e4, 2018 04 23.
Article in English | MEDLINE | ID: mdl-29606420

ABSTRACT

Crossing over between homologous chromosomes during meiosis repairs programmed DNA double-strand breaks, ensures proper segregation at meiosis I [1], shapes the genomic distribution of nucleotide variability in populations, and enhances the efficacy of natural selection among genetically linked sites [2]. Between closely related Drosophila species, large differences exist in the rate and chromosomal distribution of crossing over. Little, however, is known about the molecular genetic changes or population genetic forces that mediate evolved differences in recombination between species [3, 4]. Here, we show that a meiosis gene with a history of rapid evolution acts as a trans-acting modifier of species differences in crossing over. In transgenic flies, the dicistronic gene, mei-217/mei-218, recapitulates a large part of the species differences in the rate and chromosomal distribution of crossing over. These phenotypic differences appear to result from changes in protein sequence not gene expression. Our population genetics analyses show that the protein-coding sequence of mei-218, but not mei-217, has a history of recurrent positive natural selection. By modulating the intensity of centromeric and telomeric suppression of crossing over, evolution at mei-217/-218 has incidentally shaped gross differences in the chromosomal distribution of nucleotide variability between species. We speculate that recurrent bouts of adaptive evolution at mei-217/-218 might reflect a history of coevolution with selfish genetic elements.


Subject(s)
Cell Cycle Proteins/genetics , Crossing Over, Genetic/genetics , Drosophila Proteins/genetics , Meiosis/genetics , Amino Acid Sequence , Animals , Animals, Genetically Modified/genetics , Cell Cycle Proteins/metabolism , Cell Cycle Proteins/physiology , Centromere/genetics , Centromere/physiology , DNA Breaks, Double-Stranded , Drosophila/genetics , Drosophila Proteins/metabolism , Drosophila Proteins/physiology , Drosophila melanogaster/genetics , Evolution, Molecular , Gene Expression/genetics , Recombination, Genetic/genetics , Selection, Genetic , Species Specificity
18.
Nat Biotechnol ; 2018 Oct 22.
Article in English | MEDLINE | ID: mdl-30346939

ABSTRACT

Complex allelic variation hampers the assembly of haplotype-resolved sequences from diploid genomes. We developed trio binning, an approach that simplifies haplotype assembly by resolving allelic variation before assembly. In contrast with prior approaches, the effectiveness of our method improved with increasing heterozygosity. Trio binning uses short reads from two parental genomes to first partition long reads from an offspring into haplotype-specific sets. Each haplotype is then assembled independently, resulting in a complete diploid reconstruction. We used trio binning to recover both haplotypes of a diploid human genome and identified complex structural variants missed by alternative approaches. We sequenced an F1 cross between the cattle subspecies Bos taurus taurus and Bos taurus indicus and completely assembled both parental haplotypes with NG50 haplotig sizes of >20 Mb and 99.998% accuracy, surpassing the quality of current cattle reference genomes. We suggest that trio binning improves diploid genome assembly and will facilitate new studies of haplotype variation and inheritance.

19.
Elife ; 72018 12 13.
Article in English | MEDLINE | ID: mdl-30543325

ABSTRACT

During speciation, sex chromosomes often accumulate interspecific genetic incompatibilities faster than the rest of the genome. The drive theory posits that sex chromosomes are susceptible to recurrent bouts of meiotic drive and suppression, causing the evolutionary build-up of divergent cryptic sex-linked drive systems and, incidentally, genetic incompatibilities. To assess the role of drive during speciation, we combine high-resolution genetic mapping of X-linked hybrid male sterility with population genomics analyses of divergence and recent gene flow between the fruitfly species, Drosophila mauritiana and D. simulans. Our findings reveal a high density of genetic incompatibilities and a corresponding dearth of gene flow on the X chromosome. Surprisingly, we find that a known drive element recently migrated between species and, rather than contributing to interspecific divergence, caused a strong reduction in local sequence divergence, undermining the evolution of hybrid sterility. Gene flow can therefore mediate the effects of selfish genetic elements during speciation.


Subject(s)
Biological Evolution , Genetic Speciation , X Chromosome/genetics , Y Chromosome/genetics , Animals , Drosophila/genetics , Drosophila simulans/genetics , Gene Flow , Infertility, Male/genetics , Male , Meiosis/genetics , Species Specificity
20.
Gigascience ; 6(10): 1-16, 2017 10 01.
Article in English | MEDLINE | ID: mdl-29020750

ABSTRACT

Reference-quality genomes are expected to provide a resource for studying gene structure, function, and evolution. However, often genes of interest are not completely or accurately assembled, leading to unknown errors in analyses or additional cloning efforts for the correct sequences. A promising solution is long-read sequencing. Here we tested PacBio-based long-read sequencing and diploid assembly for potential improvements to the Sanger-based intermediate-read zebra finch reference and Illumina-based short-read Anna's hummingbird reference, 2 vocal learning avian species widely studied in neuroscience and genomics. With DNA of the same individuals used to generate the reference genomes, we generated diploid assemblies with the FALCON-Unzip assembler, resulting in contigs with no gaps in the megabase range, representing 150-fold and 200-fold improvements over the current zebra finch and hummingbird references, respectively. These long-read and phased assemblies corrected and resolved what we discovered to be numerous misassemblies in the references, including missing sequences in gaps, erroneous sequences flanking gaps, base call errors in difficult-to-sequence regions, complex repeat structure errors, and allelic differences between the 2 haplotypes. These improvements were validated by single long-genome and transcriptome reads and resulted for the first time in completely resolved protein-coding genes widely studied in neuroscience and specialized in vocal learning species. These findings demonstrate the impact of long reads, sequencing of previously difficult-to-sequence regions, and phasing of haplotypes on generating the high-quality assemblies necessary for understanding gene structure, function, and evolution.


Subject(s)
Birds/genetics , Animals , Avian Proteins/genetics , Dual Specificity Phosphatase 1/genetics , Early Growth Response Protein 1/genetics , Female , Forkhead Transcription Factors/genetics , Genome , Male , Nerve Tissue Proteins/genetics , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL