RESUMO
The expansion of agriculture is responsible for the mass conversion of biologically diverse natural environments into managed agroecosystems dominated by a handful of genetically homogeneous crop species. Agricultural ecosystems typically have very different abiotic and ecological conditions from those they replaced and create potential niches for those species that are able to exploit the abundant resources offered by crop plants. While there are well-studied examples of crop pests that have adapted into novel agricultural niches, the impact of agricultural intensification on the evolution of crop mutualists such as pollinators is poorly understood. We combined genealogical inference from genomic data with archaeological records to demonstrate that the Holocene demographic history of a wild specialist pollinator of Cucurbita (pumpkins, squashes, and gourds) has been profoundly impacted by the history of agricultural expansion in North America. Populations of the squash bee Eucera pruinosa experienced rapid growth in areas where agriculture intensified within the past 1,000 y, suggesting that the cultivation of Cucurbita in North America has increased the amount of floral resources available to these bees. In addition, we found that roughly 20% of this bee species' genome shows signatures of recent selective sweeps. These signatures are overwhelmingly concentrated in populations from eastern North America where squash bees were historically able to colonize novel environments due to human cultivation of Cucurbita pepo and now exclusively inhabit agricultural niches. These results suggest that the widespread cultivation of crops can prompt adaptation in wild pollinators through the distinct ecological conditions imposed by agricultural environments.
Assuntos
Cucurbita , Humanos , Animais , Abelhas , Cucurbita/genética , Ecossistema , Polinização , Agricultura , Produtos AgrícolasRESUMO
Life on Earth has evolved from initial simplicity to the astounding complexity we experience today. Bacteria and archaea have largely excelled in metabolic diversification, but eukaryotes additionally display abundant morphological innovation. How have these innovations come about and what constraints are there on the origins of novelty and the continuing maintenance of biodiversity on Earth? The history of life and the code for the working parts of cells and systems are written in the genome. The Earth BioGenome Project has proposed that the genomes of all extant, named eukaryotes-about 2 million species-should be sequenced to high quality to produce a digital library of life on Earth, beginning with strategic phylogenetic, ecological, and high-impact priorities. Here we discuss why we should sequence all eukaryotic species, not just a representative few scattered across the many branches of the tree of life. We suggest that many questions of evolutionary and ecological significance will only be addressable when whole-genome data representing divergences at all of the branchings in the tree of life or all species in natural ecosystems are available. We envisage that a genomic tree of life will foster understanding of the ongoing processes of speciation, adaptation, and organismal dependencies within entire ecosystems. These explorations will resolve long-standing problems in phylogenetics, evolution, ecology, conservation, agriculture, bioindustry, and medicine.
Assuntos
Sequência de Bases/genética , Eucariotos/genética , Genômica/ética , Animais , Biodiversidade , Evolução Biológica , Ecologia , Ecossistema , Genoma , Genômica/métodos , Humanos , FilogeniaRESUMO
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.
Assuntos
Sequência de Bases/genética , Eucariotos/genética , Genômica/normas , Animais , Biodiversidade , Genômica/métodos , Humanos , Padrões de Referência , Valores de Referência , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normasRESUMO
In 1977, a sample of diseased adult honeybees (Apis mellifera) from Egypt was found to contain large amounts of a previously unknown virus, Egypt bee virus, which was subsequently shown to be serologically related to deformed wing virus (DWV). By sequencing the original isolate, we demonstrate that Egypt bee virus is in fact a fourth unique, major variant of DWV (DWV-D): more closely related to DWV-C than to either DWV-A or DWV-B. DWV-A and DWV-B are the most common DWV variants worldwide due to their close relationship and transmission by Varroa destructor. However, we could not find any trace of DWV-D in several hundred RNA sequencing libraries from a worldwide selection of honeybee, varroa and bumblebee samples. This means that DWV-D has either become extinct, been replaced by other DWV variants better adapted to varroa-mediated transmission, or persists only in a narrow geographic or host range, isolated from common bee and beekeeping trade routes.
Assuntos
Vírus de RNA , Varroidae , Animais , Abelhas , Vírus de DNA , Egito , Vírus de RNA/genéticaRESUMO
The impacts of invertebrate RNA virus population dynamics on virulence and infection outcomes are poorly understood. Deformed wing virus (DWV), the main viral pathogen of honey bees, negatively impacts bee health, which can lead to colony death. Despite previous reports on the reduction of DWV diversity following the arrival of the parasitic mite Varroa destructor, the key DWV vector, we found high genetic diversity of DWV in infested United States honey bee colonies. Phylogenetic analysis showed that divergent US DWV genotypes are of monophyletic origin and were likely generated as a result of diversification after a genetic bottleneck. To investigate the population dynamics of this divergent DWV, we designed a series of novel infectious cDNA clones corresponding to coexisting DWV genotypes, thereby devising a reverse-genetics system for an invertebrate RNA virus quasispecies. Equal replication rates were observed for all clone-derived DWV variants in single infections. Surprisingly, individual clones replicated to the same high levels as their mixtures and even the parental highly diverse natural DWV population, suggesting that complementation between genotypes was not required to replicate to high levels. Mixed clone-derived infections showed a lack of strong competitive exclusion, suggesting that the DWV genotypes were adapted to coexist. Mutational and recombination events were observed across clone progeny, providing new insights into the forces that drive and constrain virus diversification. Accordingly, our results suggest that Varroa influences DWV dynamics by causing an initial selective sweep, which is followed by virus diversification fueled by negative frequency-dependent selection for new genotypes. We suggest that this selection might reflect the ability of rare lineages to evade host defenses, specifically antiviral RNA interference (RNAi). In support of this hypothesis, we show that RNAi induced against one DWV strain is less effective against an alternate strain from the same population.
Assuntos
Vetores Aracnídeos/virologia , Abelhas/virologia , Evasão da Resposta Imune/genética , Vírus de RNA/genética , Varroidae/virologia , Animais , Abelhas/genética , Abelhas/imunologia , Abelhas/parasitologia , Células Clonais , Biblioteca Gênica , Variação Genética , Genótipo , Mutação , Filogenia , Interferência de RNA/imunologia , Vírus de RNA/classificação , Vírus de RNA/imunologia , Vírus de RNA/patogenicidade , Recombinação Genética , Genética Reversa/métodos , Seleção Genética , Virulência , Replicação ViralAssuntos
Sequência de Bases/genética , Eucariotos/genética , Animais , Biodiversidade , Genômica , HumanosRESUMO
BACKGROUND: The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera. We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by scaffolding using a BioNano optical genome map and a Hi-C chromatin interaction map, complemented by a genetic linkage map. RESULTS: Each of the assembly steps reduced the number of gaps and incorporated a substantial amount of additional sequence into scaffolds. The new assembly (Amel_HAv3) is significantly more contiguous and complete than the previous one (Amel_4.5), based mainly on Sanger sequencing reads. N50 of contigs is 120-fold higher (5.381 Mbp compared to 0.053 Mbp) and we anchor > 98% of the sequence to chromosomes. All of the 16 chromosomes are represented as single scaffolds with an average of three sequence gaps per chromosome. The improvements are largely due to the inclusion of repetitive sequence that was unplaced in previous assemblies. In particular, our assembly is highly contiguous across centromeres and telomeres and includes hundreds of AvaI and AluI repeats associated with these features. CONCLUSIONS: The improved assembly will be of utility for refining gene models, studying genome function, mapping functional genetic variation, identification of structural variants, and comparative genomics.
Assuntos
Abelhas/genética , Cromossomos de Insetos/genética , Genômica , Animais , Genoma Mitocondrial/genética , Telômero/genéticaRESUMO
Our understanding of the mechanisms controlling insect diapause has increased dramatically with the introduction of global gene expression techniques, such as RNA sequencing (RNA-seq). However, little attention has been given to how ecologically relevant field conditions may affect gene expression during diapause development because previous studies have focused on laboratory-reared and -maintained insects. To determine whether gene expression differs between laboratory and field conditions, prepupae of the alfalfa leafcutting bee, Megachile rotundata, entering diapause early or late in the growing season were collected. These two groups were further subdivided in early autumn into laboratory- and field-maintained groups, resulting in four experimental treatments of diapausing prepupae: early and late field, and early and late laboratory. RNA-seq and differential expression analyses were performed on bees from the four treatment groups in November, January, March and May. The number of treatment-specific differentially expressed genes (97 to 1249) outnumbered the number of differentially regulated genes common to all four treatments (14 to 229), indicating that exposure to laboratory or field conditions had a major impact on gene expression during diapause development. Principle component analysis and hierarchical cluster analysis yielded similar grouping of treatments, confirming that the treatments form distinct clusters. Our results support the conclusion that gene expression during the course of diapause development is not a simple ordered sequence, but rather a highly plastic response determined primarily by the environmental history of the individual insect.
Assuntos
Abelhas/genética , Diapausa/genética , Meio Ambiente , Expressão Gênica , Animais , Abelhas/crescimento & desenvolvimento , Perfilação da Expressão Gênica , Estações do Ano , Análise de Sequência de RNARESUMO
Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs. However, assemblies from some third generation long read technologies, such as Pacific Biosciences (PacBio) continuous long reads (CLR), have a high error rate. Such errors can be corrected with short reads through a process called polishing. Although best practices for polishing non-model de novo genome assemblies were recently described by the Vertebrate Genome Project (VGP) Assembly community, there is a need for a publicly available, reproducible workflow that can be easily implemented and run on a conventional high performance computing environment. Here, we describe polishCLR (https://github.com/isugifNF/polishCLR), a reproducible Nextflow workflow that implements best practices for polishing assemblies made from CLR data. PolishCLR can be initiated from several input options that extend best practices to suboptimal cases. It also provides re-entry points throughout several key processes, including identifying duplicate haplotypes in purge_dups, allowing a break for scaffolding if data are available, and throughout multiple rounds of polishing and evaluation with Arrow and FreeBayes. PolishCLR is containerized and publicly available for the greater assembly community as a tool to complete assemblies from existing, error-prone long-read data.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Fluxo de Trabalho , HaplótiposRESUMO
The boll weevil, Anthonomus grandis grandis Boheman, is one of the most historically impactful insects due to its near destruction of the US cotton industry in the early 20th century. Contemporary efforts to manage this insect primarily use pheromone baited traps for detection and organophosphate insecticides for control, but this strategy is not sustainable due to financial and environmental costs. We present a high-quality boll weevil genome assembly, consisting of 306 scaffolds with approximately 24,000 annotated genes, as a first step in the identification of gene targets for novel pest control. Gene content and transposable element distribution are similar to those found in other Curculionidae genomes; however, this is the most contiguous and only assembly reported to date for a member in the species-rich genus Anthonomus. Transcriptome profiles across larval, pupal, and adult life stages led to identification of several genes and gene families that could present targets for novel control strategies.
Assuntos
Besouros , Inseticidas , Gorgulhos , Animais , Gorgulhos/genética , Besouros/genética , Larva , Biologia , GossypiumRESUMO
The pink bollworm, Pectinophora gossypiella (Saunders) (Lepidoptera: Gelechiidae), is a major global pest of cotton. Current management practices include chemical insecticides, cultural strategies, sterile insect releases, and transgenic cotton producing crystalline (Cry) protein toxins of the bacterium Bacillus thuringiensis (Bt). These strategies have contributed to the eradication of P. gossypiella from the cotton-growing areas of the United States and northern Mexico. However, this pest has evolved resistance to Bt cotton in Asia, where it remains a critical pest, and the benefits of using transgenic Bt crops have been lost. A complete annotated reference genome is needed to improve global Bt resistance management of the pink bollworm. We generated the first chromosome-level genome assembly for pink bollworm from a Bt-susceptible laboratory strain (APHIS-S) using PacBio continuous long reads for contig generation, Illumina Hi-C for scaffolding, and Illumina whole-genome re-sequencing for error correction. The pseudo-haploid assembly consists of 29 autosomes and the Z sex chromosome. The assembly exceeds the minimum Earth BioGenome Project quality standards, has a low error rate, is highly contiguous at both the contig and scaffold levels (L/N50 of 18/8.26 MB and 14/16.44 MB, respectively), and is complete, with 98.6% of lepidopteran single-copy orthologs represented without duplication. The genome was annotated with 50% repeat content and 14,107 protein-coding genes, further assigned to 41,666 functional annotations. This assembly represents the first publicly available complete annotated genome of pink bollworm and will serve as the foundation for advancing molecular genetics of this important pest species.
Assuntos
Bacillus thuringiensis , Mariposas , Animais , Resistência a Inseticidas/genética , Plantas Geneticamente Modificadas/genética , Proteínas de Bactérias/genética , Mariposas/genética , Mariposas/metabolismo , Bacillus thuringiensis/genética , Bacillus thuringiensis/metabolismo , Endotoxinas/genética , Endotoxinas/metabolismo , Cromossomos/metabolismo , Gossypium/genética , Gossypium/metabolismoRESUMO
Helicoverpa zea (Lepidoptera: Noctuidae) is an insect pest of major cultivated crops in North and South America. The species has adapted to different host plants and developed resistance to several insecticidal agents, including Bacillus thuringiensis (Bt) insecticidal proteins in transgenic cotton and maize. Helicoverpa zea populations persist year-round in tropical and subtropical regions, but seasonal migrations into temperate zones increase the geographic range of associated crop damage. To better understand the genetic basis of these physiological and ecological characteristics, we generated a high-quality chromosome-level assembly for a single H. zea male from Bt-resistant strain, HzStark_Cry1AcR. Hi-C data were used to scaffold an initial 375.2â Mb contig assembly into 30 autosomes and the Z sex chromosome (scaffold N50 = 12.8â Mb and L50 = 14). The scaffolded assembly was error-corrected with a novel pipeline, polishCLR. The mitochondrial genome was assembled through an improved pipeline and annotated. Assessment of this genome assembly indicated 98.8% of the Lepidopteran Benchmark Universal Single-Copy Ortholog set were complete (98.5% as complete single copy). Repetitive elements comprised approximately 29.5% of the assembly with the plurality (11.2%) classified as retroelements. This chromosome-scale reference assembly for H. zea, ilHelZeax1.1, will facilitate future research to evaluate and enhance sustainable crop production practices.
Assuntos
Bacillus thuringiensis , Inseticidas , Lepidópteros , Mariposas , Animais , Inseticidas/farmacologia , Bacillus thuringiensis/genética , Zea mays , Cromossomos Sexuais , Proteínas de Bactérias/genética , Plantas Geneticamente Modificadas , Proteínas Hemolisinas/genética , Mariposas/genética , Controle Biológico de Vetores , LarvaRESUMO
Genome sequencing of a diverse array of arthropod genomes is already underway, and these genomes will be used to study human health, agriculture, biodiversity, and ecology. These new genomes are intended to serve as community resources and provide the foundational information required to apply 'omics technologies to a more diverse set of species. However, biologists require genome annotation to use these genomes and derive a better understanding of complex biological systems. Genome annotation incorporates two related, but distinct, processes: Demarcating genes and other elements present in genome sequences (structural annotation); and associating a function with genetic elements (functional annotation). While there are well-established and freely available workflows for structural annotation of gene identification in newly assembled genomes, workflows for providing the functional annotation required to support functional genomics studies are less well understood. Genome-scale functional annotation is required for functional modeling (enrichment, networks, etc.). A first-pass genome-wide functional annotation effort can rapidly identify under-represented gene sets for focused community annotation efforts. We present an open-source, open access, and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to various arthropod species. We show that the performance of the predictions is consistent across a set of arthropod genomes with varying assembly and annotation quality.
RESUMO
The phylum Arthropoda includes species crucial for ecosystem stability, soil health, crop production, and others that present obstacles to crop and animal agriculture. The United States Department of Agriculture's Agricultural Research Service initiated the Ag100Pest Initiative to generate reference genome assemblies of arthropods that are (or may become) pests to agricultural production and global food security. We describe the project goals, process, status, and future. The first three years of the project were focused on species selection, specimen collection, and the construction of lab and bioinformatics pipelines for the efficient production of assemblies at scale. Contig-level assemblies of 47 species are presented, all of which were generated from single specimens. Lessons learned and optimizations leading to the current pipeline are discussed. The project name implies a target of 100 species, but the efficiencies gained during the project have supported an expansion of the original goal and a total of 158 species are currently in the pipeline. We anticipate that the processes described in the paper will help other arthropod research groups or other consortia considering genome assembly at scale.
RESUMO
Kodamaea ohmeri is a symbiont of the small hive beetle (SHB), which is a scavenger of honey bee colonies. The SHB causes absconding of the economically important honey bee (Apis mellifera) and deposits K. ohmeri in the honeycomb. We describe long-read sequencing and further analyses of the K. ohmeri genome.
RESUMO
BACKGROUND: Angelman Syndrome (AS) is a neurodevelopmental disorder with core features of intellectual disability, speech impairment, movement disorders, and a unique behavioral profile. Typically, AS results from absent maternal expression of UBE3A, but some individuals have imprinting defects in a portion of their cells. These individuals are mosaic for normal and defective UBE3A expression, resulting in mosaic AS (mAS) with a partial loss of gene expression. METHODS: This study aims to contrast the mAS phenotype to that of AS. Clinical characteristics of mAS were obtained from a parental survey of 22 mAS patients and from the Angelman Natural History study. These were contrasted with those of AS using historical data. RESULTS: Developmental delay was present in nearly all mAS patients, whereas the core features of AS were reported in less than 40%. While language and ability to manage activities of daily living were markedly improved over that expected in AS, mAS patients demonstrated a high incidence of behavioral challenges. CONCLUSION: Clinical work-up of an individual with developmental delay, hyperactivity, anxiety, and an uncharacteristically happy demeanor should prompt methylation studies to rule out mAS. We expand the phenotypic spectrum of AS to include features that overlap with Prader-Willi such as hyperphagia.
Assuntos
Síndrome de Angelman/diagnóstico , Síndrome de Angelman/genética , Idioma , Mosaicismo , Fenótipo , Adolescente , Adulto , Síndrome de Angelman/epidemiologia , Criança , Pré-Escolar , Comunicação , Feminino , Impressão Genômica , Humanos , Incidência , Masculino , Índice de Gravidade de Doença , Comportamento Social , Inquéritos e Questionários , Adulto JovemRESUMO
Multispecies host-parasite evolution is common, but how parasites evolve after speciating remains poorly understood. Shared evolutionary history and physiology may propel species along similar evolutionary trajectories whereas pursuing different strategies can reduce competition. We test these scenarios in the economically important association between honey bees and ectoparasitic mites by sequencing the genomes of the sister mite species Varroa destructor and Varroa jacobsoni. These genomes were closely related, with 99.7% sequence identity. Among the 9,628 orthologous genes, 4.8% showed signs of positive selection in at least one species. Divergent selective trajectories were discovered in conserved chemosensory gene families (IGR, SNMP), and Halloween genes (CYP) involved in moulting and reproduction. However, there was little overlap in these gene sets and associated GO terms, indicating different selective regimes operating on each of the parasites. Based on our findings, we suggest that species-specific strategies may be needed to combat evolving parasite communities.
Assuntos
Abelhas/parasitologia , Evolução Molecular , Varroidae/genética , Animais , Sistema Enzimático do Citocromo P-450/genética , DNA Mitocondrial , Feminino , Interações Hospedeiro-Parasita , Masculino , Especificidade da EspécieRESUMO
Honey bees, the primary managed insect pollinator, suffer considerable losses due to Deformed wing virus (DWV), an RNA virus vectored by the mite Varroa destructor. Mite vectoring has resulted in the emergence of virulent DWV variants. The basis for such changes in DWV is poorly understood. Most importantly, it remains unclear whether replication of DWV occurs in the mite. In this study, we exposed Varroa mites to DWV type A via feeding on artificially infected honey bees. A significant, 357-fold increase in DWV load was observed in these mites after 2 days. However, after 8 additional days of passage on honey bee pupae with low viral loads, the DWV load dropped by 29-fold. This decrease significantly reduced the mites' ability to transmit DWV to honey bees. Notably, negative-strand DWV RNA, which could indicate viral replication, was detected only in mites collected from pupae with high DWV levels but not in the passaged mites. We also found that Varroa mites contain honey bee mRNAs, consistent with the acquisition of honey bee cells which would additionally contain DWV replication complexes with negative-strand DWV RNA. We propose that transmission of DWV type A by Varroa mites occurs in a non-propagative manner.
Assuntos
Vetores Artrópodes/virologia , Abelhas , Vírus de RNA/metabolismo , Varroidae/virologia , Animais , Abelhas/parasitologia , Abelhas/virologiaRESUMO
BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of â¼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing â¼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
Assuntos
Dípteros/genética , Genoma de Inseto , Genômica/métodos , Animais , Feminino , Biblioteca Gênica , Espécies Introduzidas , Análise de Sequência de DNARESUMO
Hymenoptera is the second-most sequenced arthropod order, with 52 publically archived genomes (71 with ants, reviewed elsewhere), however these genomes do not capture the breadth of this very diverse order (Figure 1, Table 1). These sequenced genomes represent only 15 of the 97 extant families. Although at least 55 other genomes are in progress in an additional 11 families (see Table 2), stinging wasps represent 35 (67%) of the available and 42 (76%) of the in progress genomes. A more comprehensive catalog of hymenopteran genomes is needed for research into the evolutionary processes underlying the expansive diversity in terms of ecology, behavior, and physiological traits within this group. Additional sequencing is needed to generate an assembly for even 0.05% of the estimated 1 million hymenopteran species, and we recommend premier level assemblies for at least 0.1% of the >150,000 named species dispersed across the order. Given the haplodiploid sex determination in Hymenoptera, haploid male sequencing will help minimize genome assembly issues to enable higher quality genome assemblies.