RESUMEN
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.
Asunto(s)
Secuencia de Bases/genética , Eucariontes/genética , Genómica/normas , Animales , Biodiversidad , Genómica/métodos , Humanos , Estándares de Referencia , Valores de Referencia , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ADN/normasRESUMEN
BACKGROUND: The western flower thrips, Frankliniella occidentalis (Pergande), is a globally invasive pest and plant virus vector on a wide array of food, fiber, and ornamental crops. The underlying genetic mechanisms of the processes governing thrips pest and vector biology, feeding behaviors, ecology, and insecticide resistance are largely unknown. To address this gap, we present the F. occidentalis draft genome assembly and official gene set. RESULTS: We report on the first genome sequence for any member of the insect order Thysanoptera. Benchmarking Universal Single-Copy Ortholog (BUSCO) assessments of the genome assembly (size = 415.8 Mb, scaffold N50 = 948.9 kb) revealed a relatively complete and well-annotated assembly in comparison to other insect genomes. The genome is unusually GC-rich (50%) compared to other insect genomes to date. The official gene set (OGS v1.0) contains 16,859 genes, of which ~ 10% were manually verified and corrected by our consortium. We focused on manual annotation, phylogenetic, and expression evidence analyses for gene sets centered on primary themes in the life histories and activities of plant-colonizing insects. Highlights include the following: (1) divergent clades and large expansions in genes associated with environmental sensing (chemosensory receptors) and detoxification (CYP4, CYP6, and CCE enzymes) of substances encountered in agricultural environments; (2) a comprehensive set of salivary gland genes supported by enriched expression; (3) apparent absence of members of the IMD innate immune defense pathway; and (4) developmental- and sex-specific expression analyses of genes associated with progression from larvae to adulthood through neometaboly, a distinct form of maturation differing from either incomplete or complete metamorphosis in the Insecta. CONCLUSIONS: Analysis of the F. occidentalis genome offers insights into the polyphagous behavior of this insect pest that finds, colonizes, and survives on a widely diverse array of plants. The genomic resources presented here enable a more complete analysis of insect evolution and biology, providing a missing taxon for contemporary insect genomics-based analyses. Our study also offers a genomic benchmark for molecular and evolutionary investigations of other Thysanoptera species.
Asunto(s)
Genoma de los Insectos , Rasgos de la Historia de Vida , Thysanoptera/fisiología , Transcriptoma , Animales , Productos Agrícolas , Conducta Alimentaria , Cadena Alimentaria , Inmunidad Innata/genética , Percepción , Filogenia , Reproducción/genética , Thysanoptera/genética , Thysanoptera/inmunologíaRESUMEN
An amendment to this paper has been published and can be accessed via the original article.
RESUMEN
BACKGROUND: Halyomorpha halys (Stål), the brown marmorated stink bug, is a highly invasive insect species due in part to its exceptionally high levels of polyphagy. This species is also a nuisance due to overwintering in human-made structures. It has caused significant agricultural losses in recent years along the Atlantic seaboard of North America and in continental Europe. Genomic resources will assist with determining the molecular basis for this species' feeding and habitat traits, defining potential targets for pest management strategies. RESULTS: Analysis of the 1.15-Gb draft genome assembly has identified a wide variety of genetic elements underpinning the biological characteristics of this formidable pest species, encompassing the roles of sensory functions, digestion, immunity, detoxification and development, all of which likely support H. halys' capacity for invasiveness. Many of the genes identified herein have potential for biomolecular pesticide applications. CONCLUSIONS: Availability of the H. halys genome sequence will be useful for the development of environmentally friendly biomolecular pesticides to be applied in concert with more traditional, synthetic chemical-based controls.
Asunto(s)
Heterópteros/genética , Proteínas de Insectos/genética , Resistencia a los Insecticidas , Secuenciación Completa del Genoma/métodos , Animales , Ecosistema , Transferencia de Gen Horizontal , Tamaño del Genoma , Heterópteros/clasificación , Especies Introducidas , FilogeniaRESUMEN
BACKGROUND: Having conquered water surfaces worldwide, the semi-aquatic bugs occupy ponds, streams, lakes, mangroves, and even open oceans. The diversity of this group has inspired a range of scientific studies from ecology and evolution to developmental genetics and hydrodynamics of fluid locomotion. However, the lack of a representative water strider genome hinders our ability to more thoroughly investigate the molecular mechanisms underlying the processes of adaptation and diversification within this group. RESULTS: Here we report the sequencing and manual annotation of the Gerris buenoi (G. buenoi) genome; the first water strider genome to be sequenced thus far. The size of the G. buenoi genome is approximately 1,000 Mb, and this sequencing effort has recovered 20,949 predicted protein-coding genes. Manual annotation uncovered a number of local (tandem and proximal) gene duplications and expansions of gene families known for their importance in a variety of processes associated with morphological and physiological adaptations to a water surface lifestyle. These expansions may affect key processes associated with growth, vision, desiccation resistance, detoxification, olfaction and epigenetic regulation. Strikingly, the G. buenoi genome contains three insulin receptors, suggesting key changes in the rewiring and function of the insulin pathway. Other genomic changes affecting with opsin genes may be associated with wavelength sensitivity shifts in opsins, which is likely to be key in facilitating specific adaptations in vision for diverse water habitats. CONCLUSIONS: Our findings suggest that local gene duplications might have played an important role during the evolution of water striders. Along with these findings, the sequencing of the G. buenoi genome now provides us the opportunity to pursue exciting research opportunities to further understand the genomic underpinnings of traits associated with the extreme body plan and life history of water striders.
Asunto(s)
Genoma , Heterópteros/genética , Heterópteros/fisiología , Proteínas de Insectos/genética , Adaptación Fisiológica , Animales , Evolución Molecular , Genómica , Heterópteros/clasificación , Fenotipo , FilogeniaRESUMEN
Hyalella azteca is a cryptic species complex of epibenthic amphipods of interest to ecotoxicology and evolutionary biology. It is the primary crustacean used in North America for sediment toxicity testing and an emerging model for molecular ecotoxicology. To provide molecular resources for sediment quality assessments and evolutionary studies, we sequenced, assembled, and annotated the genome of the H. azteca U.S. Lab Strain. The genome quality and completeness is comparable with other ecotoxicological model species. Through targeted investigation and use of gene expression data sets of H. azteca exposed to pesticides, metals, and other emerging contaminants, we annotated and characterized the major gene families involved in sequestration, detoxification, oxidative stress, and toxicant response. Our results revealed gene loss related to light sensing, but a large expansion in chemoreceptors, likely underlying sensory shifts necessary in their low light habitats. Gene family expansions were also noted for cytochrome P450 genes, cuticle proteins, ion transporters, and include recent gene duplications in the metal sequestration protein, metallothionein. Mapping of differentially expressed transcripts to the genome significantly increased the ability to functionally annotate toxicant responsive genes. The H. azteca genome will greatly facilitate development of genomic tools for environmental assessments and promote an understanding of how evolution shapes toxicological pathways with implications for environmental and human health.
Asunto(s)
Anfípodos , Contaminantes Químicos del Agua , Animales , Ecotoxicología , Sedimentos Geológicos , América del Norte , Pruebas de ToxicidadRESUMEN
Seasonal environments present fundamental physiological challenges to a wide range of insects. Many temperate insects surmount the exigencies of winter by undergoing photoperiodic diapause, in which photoperiod provides a token cue that initiates an alternative developmental programme leading to dormancy. Pre-diapause is a crucial preparatory phase of this process, preceding developmental arrest. However, the regulatory and physiological mechanisms of diapause preparation are largely unknown. Using high-throughput gene expression profiling in the Asian tiger mosquito, Aedes albopictus, we reveal major shifts in endocrine signalling, cell proliferation, metabolism, energy production and cellular structure across pre-diapause development. While some hallmarks of diapause, such as insulin signalling and stress response, were not important at the transcriptional level, two genes, Pepck and PCNA, appear to show diapause-induced transcriptional changes across insect taxa. These processes demonstrate physiological commonalities between Ae. albopictus pre-diapause and diapause strategies across insects, and support the idea of a genetic 'toolkit' for diapause. Observations of gene expression trends from a comparative developmental perspective suggest that individual physiological processes are delayed against a background of a fixed morphological ontogeny. Our results demonstrate how deep sequencing can provide new insights into elusive molecular bases of complex ecological adaptations.
Asunto(s)
Aedes/fisiología , Transcriptoma , Adaptación Fisiológica , Aedes/genética , Aedes/crecimiento & desarrollo , Animales , Drosophila melanogaster/genética , Drosophila melanogaster/crecimiento & desarrollo , Drosophila melanogaster/fisiología , Regulación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Datos de Secuencia Molecular , Fotoperiodo , Reacción en Cadena de la Polimerasa , Pupa/genética , Pupa/crecimiento & desarrollo , Pupa/fisiología , Estaciones del Año , Análisis de Secuencia de ADN , Factores de TiempoRESUMEN
Dormancy is a crucial adaptation allowing insects to withstand harsh environmental conditions. The pre-programmed developmental arrest of diapause is a form of dormancy that is distinct from quiescence, in which development arrests in immediate response to hardship. Much progress has been made in understanding the environmental and hormonal controls of diapause. However, studies identifying transcriptional changes unique to diapause, rather than quiescence, are lacking, making it difficult to disentangle the transcriptional profiles of diapause from dormancy in general. The Asian tiger mosquito, Aedes albopictus, presents an ideal model for such a study, as diapausing and quiescent eggs can be staged and collected for global gene expression profiling using a newly developed transcriptome. Here, we use RNA-Seq to contrast gene expression during diapause with quiescence to identify transcriptional changes specific to the diapause response. We identify global trends in gene expression that show gradual convergence of diapause gene expression upon gene expression during quiescence. Functionally, early diapause A. albopictus show strong expression differences of genes involved in metabolism, which diminish over time. Of these, only expression of lipid metabolism genes remained distinct in late diapause. We identify several genes putatively related to hormonal control of development that are persistently differentially expressed throughout diapause, suggesting these might be involved in the maintenance of diapause. Our results identify key biological differences between diapausing and quiescent pharate larvae, and suggest candidate pathways for studying metabolism and the hormonal control of development during diapause in other species.
Asunto(s)
Aedes/crecimiento & desarrollo , Aedes/genética , Regulación de la Expresión Génica , Transcriptoma , Aedes/metabolismo , Animales , Femenino , Larva/genética , Larva/crecimiento & desarrollo , Larva/metabolismo , Metabolismo de los Lípidos , Metamorfosis Biológica , Datos de Secuencia Molecular , Análisis de Secuencia de ARN , Factores de TiempoRESUMEN
Over the last couple of decades, there has been a rapid growth in the number and scope of agricultural genetics, genomics and breeding databases and resources. The AgBioData Consortium (https://www.agbiodata.org/) currently represents 44 databases and resources (https://www.agbiodata.org/databases) covering model or crop plant and animal GGB data, ontologies, pathways, genetic variation and breeding platforms (referred to as 'databases' throughout). One of the goals of the Consortium is to facilitate FAIR (Findable, Accessible, Interoperable, and Reusable) data management and the integration of datasets which requires data sharing, along with structured vocabularies and/or ontologies. Two AgBioData working groups, focused on Data Sharing and Ontologies, respectively, conducted a Consortium-wide survey to assess the current status and future needs of the members in those areas. A total of 33 researchers responded to the survey, representing 37 databases. Results suggest that data-sharing practices by AgBioData databases are in a fairly healthy state, but it is not clear whether this is true for all metadata and data types across all databases; and that, ontology use has not substantially changed since a similar survey was conducted in 2017. Based on our evaluation of the survey results, we recommend (i) providing training for database personnel in a specific data-sharing techniques, as well as in ontology use; (ii) further study on what metadata is shared, and how well it is shared among databases; (iii) promoting an understanding of data sharing and ontologies in the stakeholder community; (iv) improving data sharing and ontologies for specific phenotypic data types and formats; and (v) lowering specific barriers to data sharing and ontology use, by identifying sustainability solutions, and the identification, promotion, or development of data standards. Combined, these improvements are likely to help AgBioData databases increase development efforts towards improved ontology use, and data sharing via programmatic means. Database URL https://www.agbiodata.org/databases.
Asunto(s)
Manejo de Datos , Fitomejoramiento , Animales , Genómica/métodos , Bases de Datos Factuales , Difusión de la InformaciónRESUMEN
Landscape genetic studies use spatially explicit population genetic information to determine the physical and environmental causes of population genetic structure on regional scales. Comparative studies that identify common barriers to gene flow across multiple species within a community are important to both understand the evolutionary trajectories of populations and prioritize habitat conservation. Here, we use a comparative landscape genetic approach to ask whether gradients in temperature or precipitation seasonality structure genetic variation across three codistributed tree species in Central America, or whether a simpler (geographic distance) or more complex, species-specific environmental niche model is necessary to individually explain population genetic structure. Using descriptive statistics and causal modelling, we find that different factors best explain genetic distance in each of the three species: environmental niche distance in Bursera simaruba, geographic distance in Ficus insipida and historical barriers to gene flow or cryptic reproductive barriers for Brosimum alicastrum. This study confirms suggestions from previous studies of Central American tree species that imply that population genetic structure of trees in this region is determined by complex interactions of both historical and current barriers to gene flow.
Asunto(s)
Ambiente , Flujo Génico , Variación Genética , Árboles/genética , Bursera/genética , América Central , ADN de Plantas/genética , Ecosistema , Ficus/genética , Genética de Población , Modelos Genéticos , Moraceae/genéticaRESUMEN
BACKGROUND: Many temperate insects survive the harsh conditions of winter by undergoing photoperiodic diapause, a pre-programmed developmental arrest initiated by short day lengths. Despite the well-established ecological significance of photoperiodic diapause, the molecular basis of this crucial adaptation remains largely unresolved. The Asian tiger mosquito, Aedes albopictus (Skuse), represents an outstanding emerging model to investigate the molecular basis of photoperiodic diapause in a well-defined ecological and evolutionary context. Ae. albopictus is a medically significant vector and is currently considered the most invasive mosquito in the world. Traits related to diapause appear to be important factors contributing to the rapid spread of this mosquito. To generate novel sequence information for this species, as well as to discover transcripts involved in diapause preparation, we sequenced the transcriptome of Ae. albopictus oocytes destined to become diapausing or non-diapausing pharate larvae. RESULTS: 454 GS-FLX transcriptome sequencing yielded >1.1 million quality-filtered reads, which we assembled into 69,474 contigs (N50 = 1,009 bp). Our contig filtering approach, where we took advantage of strong sequence similarity to the fully sequenced genome of Aedes aegypti, as well as other reference organisms, resulted in 11,561 high-quality, conservative ESTs. Differential expression estimates based on normalized read counts revealed 57 genes with higher expression, and 257 with lower expression under diapause-inducing conditions. Analysis of expression by qPCR for 47 of these genes indicated a high correlation of expression levels between 454 sequence data and qPCR, but congruence of statistically significant differential expression was low. Seven genes identified as differentially expressed based on qPCR have putative functions that are consistent with the insect diapause syndrome; three genes have unknown function and represent novel candidates for the transcriptional basis of diapause. CONCLUSIONS: Our transcriptome database provides a rich resource for the comparative genomics and functional genetics of Ae. albopictus, an invasive and medically important mosquito. Additionally, the identification of differentially expressed transcripts related to diapause enriches the limited knowledge base for the molecular basis of insect diapause, in particular for the preparatory stage. Finally, our analysis illustrates a useful approach that draws from a closely related reference genome to generate high-confidence ESTs in a non-model organism.
Asunto(s)
Aedes/genética , ARN Mensajero/genética , Transcriptoma , Aedes/fisiología , Animales , Insectos Vectores , Fotoperiodo , Reacción en Cadena de la PolimerasaRESUMEN
Genome sequencing of a diverse array of arthropod genomes is already underway, and these genomes will be used to study human health, agriculture, biodiversity, and ecology. These new genomes are intended to serve as community resources and provide the foundational information required to apply 'omics technologies to a more diverse set of species. However, biologists require genome annotation to use these genomes and derive a better understanding of complex biological systems. Genome annotation incorporates two related, but distinct, processes: Demarcating genes and other elements present in genome sequences (structural annotation); and associating a function with genetic elements (functional annotation). While there are well-established and freely available workflows for structural annotation of gene identification in newly assembled genomes, workflows for providing the functional annotation required to support functional genomics studies are less well understood. Genome-scale functional annotation is required for functional modeling (enrichment, networks, etc.). A first-pass genome-wide functional annotation effort can rapidly identify under-represented gene sets for focused community annotation efforts. We present an open-source, open access, and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to various arthropod species. We show that the performance of the predictions is consistent across a set of arthropod genomes with varying assembly and annotation quality.
RESUMEN
The phylum Arthropoda includes species crucial for ecosystem stability, soil health, crop production, and others that present obstacles to crop and animal agriculture. The United States Department of Agriculture's Agricultural Research Service initiated the Ag100Pest Initiative to generate reference genome assemblies of arthropods that are (or may become) pests to agricultural production and global food security. We describe the project goals, process, status, and future. The first three years of the project were focused on species selection, specimen collection, and the construction of lab and bioinformatics pipelines for the efficient production of assemblies at scale. Contig-level assemblies of 47 species are presented, all of which were generated from single specimens. Lessons learned and optimizations leading to the current pipeline are discussed. The project name implies a target of 100 species, but the efficiencies gained during the project have supported an expansion of the original goal and a total of 158 species are currently in the pipeline. We anticipate that the processes described in the paper will help other arthropod research groups or other consortia considering genome assembly at scale.
RESUMEN
The GFF3toolkit ( https://github.com/NAL-i5K/GFF3toolkit ) supported by the i5k Workspace@NAL provides a suite of tools to handle gene annotations in GFF3 format from arthropod genome projects and their research communities. To improve GFF3 formatting of gene annotations, a quality control and merge procedure is proposed along with the GFF3toolkit. In particular, the toolkit provides functions to sort a GFF3 file, detect GFF3 format errors, merge two GFF3 files, and generate biological sequences from a GFF3 file. This chapter explains when and how to use the provided tools to obtain nonredundant arthropod gene sets in high quality.
Asunto(s)
Biología Computacional/métodos , Genoma de los Insectos , Insectos/genética , Anotación de Secuencia Molecular/métodos , Control de Calidad , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Animales , Secuenciación de Nucleótidos de Alto Rendimiento/métodosRESUMEN
BACKGROUND: The Hemiptera (aphids, cicadas, and true bugs) are a key insect order, with high diversity for feeding ecology and excellent experimental tractability for molecular genetics. Building upon recent sequencing of hemipteran pests such as phloem-feeding aphids and blood-feeding bed bugs, we present the genome sequence and comparative analyses centered on the milkweed bug Oncopeltus fasciatus, a seed feeder of the family Lygaeidae. RESULTS: The 926-Mb Oncopeltus genome is well represented by the current assembly and official gene set. We use our genomic and RNA-seq data not only to characterize the protein-coding gene repertoire and perform isoform-specific RNAi, but also to elucidate patterns of molecular evolution and physiology. We find ongoing, lineage-specific expansion and diversification of repressive C2H2 zinc finger proteins. The discovery of intron gain and turnover specific to the Hemiptera also prompted the evaluation of lineage and genome size as predictors of gene structure evolution. Furthermore, we identify enzymatic gains and losses that correlate with feeding biology, particularly for reductions associated with derived, fluid nutrition feeding. CONCLUSIONS: With the milkweed bug, we now have a critical mass of sequenced species for a hemimetabolous insect order and close outgroup to the Holometabola, substantially improving the diversity of insect genomics. We thereby define commonalities among the Hemiptera and delve into how hemipteran genomes reflect distinct feeding ecologies. Given Oncopeltus's strength as an experimental model, these new sequence resources bolster the foundation for molecular research and highlight technical considerations for the analysis of medium-sized invertebrate genomes.
Asunto(s)
Evolución Molecular , Genoma de los Insectos , Hemípteros/genética , Secuencia de Aminoácidos , Animales , Dedos de Zinc CYS2-HIS2 , Conducta Alimentaria , Dosificación de Gen , Perfilación de la Expresión Génica , Transferencia de Gen Horizontal , Genes Homeobox , Hemípteros/crecimiento & desarrollo , Hemípteros/metabolismo , Pigmentación/genética , Olfato , Factores de Transcripción/genéticaRESUMEN
The i5k Workspace@NAL is a genome database tailored toward newly sequenced arthropod genomes and their research communities. With 56 arthropod genomes and counting, the i5k Workspace strives to facilitate public data access, visualization, and community curation across arthropod species. Any researcher with an arthropod genome project who would like to take advantage of the i5k Workspace facilities is encouraged to submit their data. In this chapter, we explain how to use the i5k Workspace@NAL to submit, find, and improve arthropod genomics data.
Asunto(s)
Artrópodos/genética , Bases de Datos Genéticas , Genoma , Genómica , Animales , Biología Computacional/métodos , Genómica/métodos , Anotación de Secuencia Molecular , Programas Informáticos , Interfaz Usuario-Computador , Navegador WebRESUMEN
The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome and are rapidly evolving compared to other Coleoptera, and high levels of nucleotide diversity in rapidly growing pest populations. Adaptations to plant feeding are evident in gene expansions and differential expression of digestive enzymes in gut tissues, as well as expansions of gustatory receptors for bitter tasting. Surprisingly, the suite of genes involved in insecticide resistance is similar to other beetles. Finally, duplications in the RNAi pathway might explain why Leptinotarsa decemlineata has high sensitivity to dsRNA. The L. decemlineata genome provides opportunities to investigate a broad range of phenotypes and to develop sustainable methods to control this widely successful pest.
Asunto(s)
Agricultura , Escarabajos/genética , Genoma de los Insectos , Genómica , Solanum tuberosum/parasitología , Animales , Elementos Transponibles de ADN/genética , Evolución Molecular , Femenino , Regulación de la Expresión Génica , Variación Genética , Genética de Población , Interacciones Huésped-Parásitos/genética , Proteínas de Insectos/genética , Proteínas de Insectos/metabolismo , Resistencia a los Insecticidas/genética , Masculino , Anotación de Secuencia Molecular , Familia de Multigenes , Control Biológico de Vectores , Filogenia , Interferencia de ARN , Factores de Transcripción/metabolismoRESUMEN
Database URL: https://citrusgreening.org/.
Asunto(s)
Bases de Datos Genéticas , Genoma de los Insectos/genética , Hemípteros , Insectos Vectores/genética , Control de Plagas , Enfermedades de las Plantas , Animales , Citrus/microbiología , Citrus/parasitología , Hemípteros/genética , Hemípteros/microbiología , RhizobiaceaeRESUMEN
Agricultural entomology is poised to benefit from the application of ecological genomics, particularly the fields of biofuels generation and pest control. Metagenomic methods can characterize microbial communities of termites, wood-boring beetles and livestock pests, and transcriptomic approaches reveal molecular bases behind wood-digesting capabilities of these insects, leading to potential mechanisms for biofuel generation. Genome sequences are being exploited to develop new pest control methods, identify candidate antigens to vaccinate livestock, and discover RNAi target sequences and potential non-target effects in other insects. Gene content analyses of pest genome sequences and their endosymbionts suggest metabolic interdependencies between organisms, exposing potential gene targets for insect control. Finally, genome-wide association studies and genotyping by high-throughput sequencing promise to improve management of pesticide resistance.
Asunto(s)
Agricultura/métodos , Genoma de los Insectos/genética , Genómica , Control de Insectos/métodos , Agricultura/tendencias , Animales , Estudio de Asociación del Genoma Completo , Genómica/tendencias , Insectos/genéticaRESUMEN
BACKGROUND: Relatively little is known about the genomic basis and evolution of wood-feeding in beetles. We undertook genome sequencing and annotation, gene expression assays, studies of plant cell wall degrading enzymes, and other functional and comparative studies of the Asian longhorned beetle, Anoplophora glabripennis, a globally significant invasive species capable of inflicting severe feeding damage on many important tree species. Complementary studies of genes encoding enzymes involved in digestion of woody plant tissues or detoxification of plant allelochemicals were undertaken with the genomes of 14 additional insects, including the newly sequenced emerald ash borer and bull-headed dung beetle. RESULTS: The Asian longhorned beetle genome encodes a uniquely diverse arsenal of enzymes that can degrade the main polysaccharide networks in plant cell walls, detoxify plant allelochemicals, and otherwise facilitate feeding on woody plants. It has the metabolic plasticity needed to feed on diverse plant species, contributing to its highly invasive nature. Large expansions of chemosensory genes involved in the reception of pheromones and plant kairomones are consistent with the complexity of chemical cues it uses to find host plants and mates. CONCLUSIONS: Amplification and functional divergence of genes associated with specialized feeding on plants, including genes originally obtained via horizontal gene transfer from fungi and bacteria, contributed to the addition, expansion, and enhancement of the metabolic repertoire of the Asian longhorned beetle, certain other phytophagous beetles, and to a lesser degree, other phytophagous insects. Our results thus begin to establish a genomic basis for the evolutionary success of beetles on plants.