Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 104
Filtrar
1.
Cell Death Dis ; 14(9): 630, 2023 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-37749143

RESUMO

Glioblastoma (GBM) is a deadly and the most common primary brain tumor in adults. Due to their regulation of a high number of mRNA transcripts, microRNAs (miRNAs) are key molecules in the control of biological processes and are thereby promising therapeutic targets for GBM patients. In this regard, we recently reported miRNAs as strong modulators of GBM aggressiveness. Here, using an integrative and comprehensive analysis of the TCGA database and the transcriptome of GBM biopsies, we identified three critical and clinically relevant miRNAs for GBM, miR-17-3p, miR-222, and miR-340. In addition, we showed that the combinatorial modulation of three of these miRNAs efficiently inhibited several biological processes in patient-derived GBM cells of all these three GBM subtypes (Mesenchymal, Proneural, Classical), induced cell death, and delayed tumor growth in a mouse tumor model. Finally, in a doxycycline-inducible model, we observed a significant inhibition of GBM stem cell viability and a significant delay of orthotopic tumor growth. Collectively, our results reveal, for the first time, the potential of miR-17-3p, miR-222 and miR-340 multi-targeting as a promising therapeutic strategy for GBM patients.


Assuntos
Glioblastoma , MicroRNAs , Adulto , Humanos , Animais , Camundongos , MicroRNAs/genética , Glioblastoma/genética , Agressão , Biópsia , Morte Celular , Modelos Animais de Doenças
2.
ISME J ; 17(6): 903-915, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37031343

RESUMO

Meromictic Lake Cadagno is a permanently stratified system with a persistent microbial bloom within the oxic-anoxic boundary called the chemocline. The association between oxygenic and anoxygenic photosynthesis within the chemocline has been known for at least two decades. Although anoxygenic purple and green sulfur bacteria have been well studied, reports on oxygenic phytoplankton have remained sparse since their discovery in the 1920s. Nearly a century later, this study presents the first near-complete genome of a photosynthetic microbial eukaryote from the chemocline of Lake Cadagno, provisionally named Chlorella-like MAG. The 18.9 Mbp nuclear genome displays a high GC content (71.5%), and the phylogenetic placement suggests that it is a novel species of the genus Chlorella of Chlorophytes. Functional annotation of the Chlorella-like metagenome-assembled genome predicted 10,732 protein-coding genes, with an approximate 0.6% proportion potentially involved in carbon, sulfur, and nitrogen (C, N, and S) metabolism. In addition to C4 photosynthesis, this study detected genes for heat shock proteins (HSPs) in the Chlorella-like algae, consistent with the other Chlorella species. Altogether, the genomic insights in this study suggest the cooperation of photosynthetic algae with phototrophic sulfur bacteria via C, N, and S metabolism, which may aid their collective persistence in the Lake Cadagno chemocline. Furthermore, this work additionally presents the chloroplast genome of Cryptomonas-like species, which was likely to be presumed as cyanobacteria in previous studies because of the presence of phycobilisomes.


Assuntos
Chlorella , Lagos , Lagos/microbiologia , Chlorella/genética , Filogenia , Bactérias/metabolismo , Genômica , Enxofre/metabolismo
3.
Nucleic Acids Res ; 51(D1): D445-D451, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36350662

RESUMO

OrthoDB provides evolutionary and functional annotations of genes in a diverse sampling of eukaryotes, prokaryotes, and viruses. Genomics continues to accelerate our exploration of gene diversity and orthology is the most precise way of bridging gene functional knowledge with the rapidly expanding universe of genomic sequences. OrthoDB samples the most diverse organisms with the best quality genomics data to provide the leading coverage of species diversity. This update of the underlying data to over 18 000 prokaryotes and almost 2000 eukaryotes with over 100 million genes propels the coverage to another level. This achievement also demonstrates the scalability of the underlying OrthoLoger software for delineation of orthologs, freely available from https://orthologer.ezlab.org. In addition to the ab-initio computations of gene orthology used for the OrthoDB release, the OrthoLoger software allows mapping of novel gene sets to precomputed orthologs and thereby links to their annotations. The LEMMI-style benchmarking of OrthoLoger ensures its state-of-the-art performance and is available from https://lemortho.ezlab.org. The OrthoDB web interface has been further developed to include a pairwise orthology view from any gene to any other sampled species. OrthoDB-computed evolutionary annotations as well as extensively collated functional annotations can be accessed via REST API or SPARQL/RDF, downloaded or browsed online from https://www.orthodb.org.


Assuntos
Bases de Dados Genéticas , Evolução Molecular , Eucariotos/genética , Genômica , Evolução Biológica , Software , Anotação de Sequência Molecular
4.
Nucleic Acids Res ; 50(18): 10264-10277, 2022 10 14.
Artigo em Inglês | MEDLINE | ID: mdl-36130228

RESUMO

The mutational spectrum of the mitochondrial DNA (mtDNA) does not resemble any of the known mutational signatures of the nuclear genome and variation in mtDNA mutational spectra between different organisms is still incomprehensible. Since mitochondria are responsible for aerobic respiration, it is expected that mtDNA mutational spectrum is affected by oxidative damage. Assuming that oxidative damage increases with age, we analyse mtDNA mutagenesis of different species in regards to their generation length. Analysing, (i) dozens of thousands of somatic mtDNA mutations in samples of different ages (ii) 70053 polymorphic synonymous mtDNA substitutions reconstructed in 424 mammalian species with different generation lengths and (iii) synonymous nucleotide content of 650 complete mitochondrial genomes of mammalian species we observed that the frequency of AH > GH substitutions (H: heavy strand notation) is twice bigger in species with high versus low generation length making their mtDNA more AH poor and GH rich. Considering that AH > GH substitutions are also sensitive to the time spent single-stranded (TSSS) during asynchronous mtDNA replication we demonstrated that AH > GH substitution rate is a function of both species-specific generation length and position-specific TSSS. We propose that AH > GH is a mitochondria-specific signature of oxidative damage associated with both aging and TSSS.


Assuntos
Envelhecimento , DNA Mitocondrial , Mamíferos , Envelhecimento/genética , Animais , DNA Mitocondrial/genética , Mamíferos/genética , Mitocôndrias/genética , Mutação , Nucleotídeos
5.
mBio ; 13(4): e0005222, 2022 08 30.
Artigo em Inglês | MEDLINE | ID: mdl-35726916

RESUMO

Lake Cadagno, a permanently stratified high-alpine lake with a persistent microbial bloom in its chemocline, has long been considered a model for the low-oxygen, high-sulfide Proterozoic ocean. Although the lake has been studied for over 25 years, the absence of concerted study of the bacteria, phytoplankton, and viruses, together with primary and secondary production, has hindered a comprehensive understanding of its microbial food web. Here, the identities, abundances, and productivity of microbes were evaluated in the context of Lake Cadagno biogeochemistry. Photosynthetic pigments together with 16S rRNA gene phylogenies suggest the prominence of eukaryotic phytoplankton chloroplasts, primarily chlorophytes. Chloroplasts closely related to those of high-alpine-adapted Ankyra judayi persisted with oxygen in the mixolimnion, where photosynthetic efficiency was high, while chloroplasts of Closteriopsis-related chlorophytes peaked in the chemocline and monimolimnion. The anoxygenic phototrophic sulfur bacterium Chromatium dominated the chemocline along with Lentimicrobium, a genus of known fermenters. Secondary production peaked in the chemocline, which suggested that anoxygenic primary producers depended on heterotrophic nutrient remineralization. The virus-to-microbe ratio peaked with phytoplankton abundances in the mixolimnion and were at a minimum where Chromatium abundance was highest, trends that suggest that viruses may play a role in the modulation of primary production. Through the combined analysis of bacterial, eukaryotic, viral, and biogeochemical spatial dynamics, we provide a comprehensive synthesis of the Lake Cadagno microbial loop. This study offers a new ecological perspective on how biological and geochemical connections may have occurred in the chemocline of the Proterozoic ocean, where eukaryotic microbial life is thought to have evolved. IMPORTANCE As a window into the past, this study offers insights into the potential role that microbial guilds may have played in the production and recycling of organic matter in ancient Proterozoic ocean chemoclines. The new observations described here suggest that chloroplasts of eukaryotic algae were persistent in the low-oxygen upper chemocline along with the purple and green sulfur bacteria known to dominate the lower half of the chemocline. This study provides the first insights into Lake Cadagno's viral ecology. High viral abundances suggested that viruses may be essential components of the chemocline, where their activity may result in the release and recycling of organic matter. The integration of diverse geochemical and biological data types provides a framework that lays the foundation to quantitatively resolve the processes performed by the discrete populations that comprise the microbial loop in this early anoxic ocean analogue.


Assuntos
Lagos , Fitoplâncton , Bactérias/genética , Lagos/microbiologia , Oceanos e Mares , Oxigênio , RNA Ribossômico 16S/genética
6.
PLoS Comput Biol ; 18(3): e1009947, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35259160

RESUMO

Mouse is the most used model for studying the impact of microbiota on its host, but the repertoire of species from the mouse gut microbiome remains largely unknown. Accordingly, the similarity between human and mouse microbiomes at a low taxonomic level is not clear. We construct a comprehensive mouse microbiota genome (CMMG) catalog by assembling all currently available mouse gut metagenomes and combining them with published reference and metagenome-assembled genomes. The 41'798 genomes cluster into 1'573 species, of which 78.1% are uncultured, and we discovered 226 new genera, seven new families, and one new order. CMMG enables an unprecedented coverage of the mouse gut microbiome exceeding 86%, increases the mapping rate over four-fold, and allows functional microbiota analyses of human and mouse linking them to the driver species. Comparing CMMG to microbiota from the unified human gastrointestinal genomes shows an overlap of 62% at the genus but only 10% at the species level, demonstrating that human and mouse gut microbiota are largely distinct. CMMG contains the most comprehensive collection of consistently functionally annotated species of the mouse and human microbiome to date, setting the ground for analysis of new and reanalysis of existing datasets at an unprecedented depth.


Assuntos
Microbioma Gastrointestinal , Microbiota , Animais , Bactérias/genética , Microbioma Gastrointestinal/genética , Humanos , Metagenoma/genética , Metagenômica , Camundongos , Microbiota/genética , Filogenia
7.
Curr Protoc ; 1(12): e323, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34936221

RESUMO

Evaluation of the quality of genomic "data products" such as genome assemblies or gene sets is of critical importance in order to recognize possible issues and correct them during the generation of new data. It is equally essential to guide subsequent or comparative analyses with existing data, as the correct interpretation of the results necessarily requires knowledge about the quality level and reliability of the inputs. Using datasets of near universal single-copy orthologs derived from OrthoDB, BUSCO can estimate the completeness and redundancy of genomic data by providing biologically meaningful metrics based on expected gene content. These can complement technical metrics such as contiguity measures (e.g., number of contigs/scaffolds, and N50 values). Here, we describe the use of the BUSCO tool suite to assess different data types that can range from genome assemblies of single isolates and assembled transcriptomes and annotated gene sets to metagenome-assembled genomes where the taxonomic origin of the species is unknown. BUSCO is the only tool capable of assessing all these types of sequences from both eukaryotic and prokaryotic species. The protocols detail the various BUSCO running modes and the novel workflows introduced in versions 4 and 5, including the batch analysis on multiple inputs, the auto-lineage workflow to run assessments without specifying a dataset, and a workflow for the evaluation of (large) eukaryotic genomes. The protocols further cover the BUSCO setup, guidelines to interpret the results, and BUSCO "plugin" workflows for performing common operations in genomics using BUSCO results, such as building phylogenomic trees and visualizing syntenies. © 2021 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Assessing an input sequence with a BUSCO dataset specified manually Basic Protocol 2: Assessing an input sequence with a dataset automatically selected by BUSCO Basic Protocol 3: Assessing multiple inputs Alternate Protocol: Decreasing analysis runtime when assessing a large number of small genomes with BUSCO auto-lineage workflow and Snakemake Support Protocol 1: BUSCO setup Support Protocol 2: Visualizing BUSCO results Support Protocol 3: Building phylogenomic trees.


Assuntos
Confiabilidade dos Dados , Genoma , Genômica , Filogenia , Reprodutibilidade dos Testes
8.
Microorganisms ; 9(8)2021 Aug 17.
Artigo em Inglês | MEDLINE | ID: mdl-34442837

RESUMO

Enteroviruses (EVs) from the D species are the causative agents of a diverse range of infectious diseases in spite of comprising only five known members. This small clade has a diverse host range and tissue tropism. It contains types infecting non-human primates and/or humans, and for the latter, they preferentially infect the eye, respiratory tract, gastrointestinal tract, and nervous system. Although several Enterovirus D members, in particular EV-D68, have been associated with neurological complications, including acute myelitis, there is currently no effective treatment or vaccine against any of them. This review highlights the peculiarities of this viral species, focusing on genome organization, functional elements, receptor usage, and pathogenesis.

9.
Mol Biol Evol ; 38(10): 4647-4654, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34320186

RESUMO

Methods for evaluating the quality of genomic and metagenomic data are essential to aid genome assembly procedures and to correctly interpret the results of subsequent analyses. BUSCO estimates the completeness and redundancy of processed genomic data based on universal single-copy orthologs. Here, we present new functionalities and major improvements of the BUSCO software, as well as the renewal and expansion of the underlying data sets in sync with the OrthoDB v10 release. Among the major novelties, BUSCO now enables phylogenetic placement of the input sequence to automatically select the most appropriate BUSCO data set for the assessment, allowing the analysis of metagenome-assembled genomes of unknown origin. A newly introduced genome workflow increases the efficiency and runtimes especially on large eukaryotic genomes. BUSCO is the only tool capable of assessing both eukaryotic and prokaryotic species, and can be applied to various data types, from genome assemblies and metagenomic bins, to transcriptomes and gene sets.


Assuntos
Eucariotos , Genômica , Eucariotos/genética , Genoma Viral , Genômica/métodos , Filogenia , Fluxo de Trabalho
12.
BMC Biol ; 19(1): 41, 2021 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-33750380

RESUMO

BACKGROUND: The stable fly, Stomoxys calcitrans, is a major blood-feeding pest of livestock that has near worldwide distribution, causing an annual cost of over $2 billion for control and product loss in the USA alone. Control of these flies has been limited to increased sanitary management practices and insecticide application for suppressing larval stages. Few genetic and molecular resources are available to help in developing novel methods for controlling stable flies. RESULTS: This study examines stable fly biology by utilizing a combination of high-quality genome sequencing and RNA-Seq analyses targeting multiple developmental stages and tissues. In conjunction, 1600 genes were manually curated to characterize genetic features related to stable fly reproduction, vector host interactions, host-microbe dynamics, and putative targets for control. Most notable was characterization of genes associated with reproduction and identification of expanded gene families with functional associations to vision, chemosensation, immunity, and metabolic detoxification pathways. CONCLUSIONS: The combined sequencing, assembly, and curation of the male stable fly genome followed by RNA-Seq and downstream analyses provide insights necessary to understand the biology of this important pest. These resources and new data will provide the groundwork for expanding the tools available to control stable fly infestations. The close relationship of Stomoxys to other blood-feeding (horn flies and Glossina) and non-blood-feeding flies (house flies, medflies, Drosophila) will facilitate understanding of the evolutionary processes associated with development of blood feeding among the Cyclorrhapha.


Assuntos
Genoma de Inseto , Interações Hospedeiro-Parasita/genética , Controle de Insetos , Muscidae/genética , Animais , Reprodução/genética
13.
Sci Rep ; 11(1): 4586, 2021 02 25.
Artigo em Inglês | MEDLINE | ID: mdl-33633271

RESUMO

Epstein-Barr virus (EBV) is one of the most common viruses latently infecting humans. Little is known about the impact of human genetic variation on the large inter-individual differences observed in response to EBV infection. To search for a potential imprint of host genomic variation on the EBV sequence, we jointly analyzed paired viral and human genomic data from 268 HIV-coinfected individuals with CD4 + T cell count < 200/mm3 and elevated EBV viremia. We hypothesized that the reactivated virus circulating in these patients could carry sequence variants acquired during primary EBV infection, thereby providing a snapshot of early adaptation to the pressure exerted on EBV by the individual immune response. We searched for associations between host and pathogen genetic variants, taking into account human and EBV population structure. Our analyses revealed significant associations between human and EBV sequence variation. Three polymorphic regions in the human genome were found to be associated with EBV variation: one at the amino acid level (BRLF1:p.Lys316Glu); and two at the gene level (burden testing of rare variants in BALF5 and BBRF1). Our findings confirm that jointly analyzing host and pathogen genomes can identify sites of genomic interactions, which could help dissect pathogenic mechanisms and suggest new therapeutic avenues.


Assuntos
Variação Genética , Genoma Viral , Herpesvirus Humano 4/genética , Estudos de Coortes , Infecções por Vírus Epstein-Barr/virologia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos
14.
Nucleic Acids Res ; 49(D1): D389-D393, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33196836

RESUMO

OrthoDB provides evolutionary and functional annotations of orthologs, inferred for a vast number of available organisms. OrthoDB is leading in the coverage and genomic diversity sampling of Eukaryotes, Prokaryotes and Viruses, and the sampling of Bacteria is further set to increase three-fold. The user interface has been enhanced in response to the massive growth in data. OrthoDB provides three views on the data: (i) a list of orthologous groups related to a user query, which are now arranged to visualize their hierarchical relations, (ii) a detailed view of an orthologous group, now featuring a Sankey diagram to facilitate navigation between the levels of orthology, from more finely-resolved to more general groups of orthologs, as well as an arrangement of orthologs into an interactive organism taxonomy structure, and (iii) we added a gene-centric view, showing the gene functional annotations and the pair-wise orthologs in example species. The OrthoDB standalone software for delineation of orthologs, Orthologer, is freely available. Online BUSCO assessments and mapping to OrthoDB of user-uploaded data enable interactive exploration of related annotations and generation of comparative charts. OrthoDB strives to predict orthologs from the broadest coverage of species, as well as to extensively collate available functional annotations, and to compute evolutionary annotations such as evolutionary rate and phyletic profile. OrthoDB data can be assessed via SPARQL RDF, REST API, downloaded or browsed online from https://orthodb.org.


Assuntos
Bases de Dados Genéticas , Evolução Molecular , Anotação de Sequência Molecular , Homologia de Sequência do Ácido Nucleico , Animais , Software , Interface Usuário-Computador
15.
Viruses ; 12(11)2020 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-33172032

RESUMO

The Asian tiger mosquito Aedes albopictus is a competent vector for several human arboviruses including dengue, chikungunya and Zika viruses. Mosquitoes also harbor insect-specific viruses (ISVs) that may modulate host physiology and potentially affect the transmission of viruses that are pathogenic to vertebrates, thus representing a potential tool for vector control strategies. In Ae. albopictus we identified a novel anphevirus (family Xinmoviridae; order Mononegavirales) provisionally designated here as Aedes albopictus anphevirus (AealbAV). AealbAV contains a ~12.4 kb genome that is highly divergent from currently known viruses but displays gene content and genomic organization typical of known anpheviruses. We identified AealbAV in several publicly available RNA-Seq datasets from different geographical regions both in laboratory colonies and field collected mosquitoes. Coding-complete genomes of AealbAV strains are highly similar worldwide (>96% nucleotide identity) and cluster according to the geographical origin of their hosts. AealbAV appears to be present in various body compartments and mosquito life stages, including eggs. We further detected AealbAV-derived vsiRNAs and vpiRNAs in publicly available miRNA-Seq libraries of Ae. albopictus and in samples experimentally coinfected with chikungunya virus. This suggests that AealbAV is targeted by the host RNA interference (RNAi) response, consistent with persistent virus replication. The discovery and characterization of AealbAV in Ae. albopictus will now allow us to identify its infection in mosquito populations and laboratory strains, and to assess its potential impact on Ae. albopictus physiology and ability to transmit arboviruses.


Assuntos
Aedes/virologia , Interações entre Hospedeiro e Microrganismos , Mononegavirais/classificação , Interferência de RNA , Animais , Vírus Chikungunya/genética , Coinfecção/virologia , Feminino , Genoma Viral , Vírus de Insetos/genética , Masculino , Mononegavirais/isolamento & purificação , Mononegavirais/fisiologia , Filogenia , Replicação Viral
16.
Genome Res ; 30(8): 1208-1216, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32616517

RESUMO

Studies of microbiomes are booming, along with the diversity of computational approaches to make sense out of the sequencing data and the volumes of accumulated microbial genotypes. A swift evaluation of newly published methods and their improvements against established tools is necessary to reduce the time between the methods' release and their adoption in microbiome analyses. The LEMMI platform offers a novel approach for benchmarking software dedicated to metagenome composition assessments based on read classification. It enables the integration of newly published methods in an independent and centralized benchmark designed to be continuously open to new submissions. This allows developers to be proactive regarding comparative evaluations and guarantees that any promising methods can be assessed side by side with established tools quickly after their release. Moreover, LEMMI enforces an effective distribution through software containers to ensure long-term availability of all methods. Here, we detail the LEMMI workflow and discuss the performances of some previously unevaluated tools. We see this platform eventually as a community-driven effort in which method developers can showcase novel approaches and get unbiased benchmarks for publications, and users can make informed choices and obtain standardized and easy-to-use tools.


Assuntos
Bactérias/classificação , Bactérias/genética , Biologia Computacional/métodos , Genoma Bacteriano/genética , Metagenômica/métodos , Algoritmos , Benchmarking/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Metagenoma/genética , Microbiota/genética , Análise de Sequência de DNA/métodos , Software
17.
BMC Bioinformatics ; 21(1): 257, 2020 Jun 22.
Artigo em Inglês | MEDLINE | ID: mdl-32571209

RESUMO

BACKGROUND: Metagenomics studies provide valuable insight into the composition and function of microbial populations from diverse environments; however, the data processing pipelines that rely on mapping reads to gene catalogs or genome databases for cultured strains yield results that underrepresent the genes and functional potential of uncultured microbes. Recent improvements in sequence assembly methods have eased the reliance on genome databases, thereby allowing the recovery of genomes from uncultured microbes. However, configuring these tools, linking them with advanced binning and annotation tools, and maintaining provenance of the processing continues to be challenging for researchers. RESULTS: Here we present ATLAS, a software package for customizable data processing from raw sequence reads to functional and taxonomic annotations using state-of-the-art tools to assemble, annotate, quantify, and bin metagenome data. Abundance estimates at genome resolution are provided for each sample in a dataset. ATLAS is written in Python and the workflow implemented in Snakemake; it operates in a Linux environment, and is compatible with Python 3.5+ and Anaconda 3+ versions. The source code for ATLAS is freely available, distributed under a BSD-3 license. CONCLUSIONS: ATLAS provides a user-friendly, modular and customizable Snakemake workflow for metagenome data processing; it is easily installable with conda and maintained as open-source on GitHub at https://github.com/metagenome-atlas/atlas.


Assuntos
Metagenômica/métodos , Software , Metagenoma , Anotação de Sequência Molecular , Fluxo de Trabalho
18.
Bioinformatics ; 36(12): 3882-3884, 2020 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-32311023

RESUMO

SUMMARY: Phigaro is a standalone command-line application that is able to detect prophage regions taking raw genome and metagenome assemblies as an input. It also produces dynamic annotated 'prophage genome maps' and marks possible transposon insertion spots inside prophages. It is applicable for mining prophage regions from large metagenomic datasets. AVAILABILITY AND IMPLEMENTATION: Source code for Phigaro is freely available for download at https://github.com/bobeobibo/phigaro along with test data. The code is written in Python. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Prófagos , Metagenoma , Metagenômica , Prófagos/genética , Software
19.
BMC Genomics ; 21(1): 47, 2020 Jan 14.
Artigo em Inglês | MEDLINE | ID: mdl-31937263

RESUMO

BACKGROUND: The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established and the resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, and with a small set of RNA sequence data limiting annotation quality. RESULTS: Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads of different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to Genbank and accepted as a RefSeq genome by NCBI. CONCLUSIONS: The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.


Assuntos
Genes de Insetos , Genoma de Inseto , Genômica , Tribolium/genética , Animais , Sítios de Ligação , Biologia Computacional/métodos , Genômica/métodos , MicroRNAs/genética , Anotação de Sequência Molecular , Filogenia , Interferência de RNA , Reprodutibilidade dos Testes
20.
Genome Biol Evol ; 12(1): 3534-3549, 2020 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-31778187

RESUMO

The dipluran two-pronged bristletail Campodea augens is a blind ancestrally wingless hexapod with the remarkable capacity to regenerate lost body appendages such as its long antennae. As sister group to Insecta (sensu stricto), Diplura are key to understanding the early evolution of hexapods and the origin and evolution of insects. Here we report the 1.2-Gb draft genome of C. augens and results from comparative genomic analyses with other arthropods. In C. augens, we uncovered the largest chemosensory gene repertoire of ionotropic receptors in the animal kingdom, a massive expansion that might compensate for the loss of vision. We found a paucity of photoreceptor genes mirroring at the genomic level the secondary loss of an ancestral external photoreceptor organ. Expansions of detoxification and carbohydrate metabolism gene families might reflect adaptations for foraging behavior, and duplicated apoptotic genes might underlie its high regenerative potential. The C. augens genome represents one of the key references for studying the emergence of genomic innovations in insects, the most diverse animal group, and opens up novel opportunities to study the under-explored biology of diplurans.


Assuntos
Artrópodes/genética , Evolução Molecular , Genoma , Animais , Apoptose/genética , Artrópodes/classificação , Metabolismo dos Carboidratos/genética , Insetos/genética , Família Multigênica , Filogenia , Domínios Proteicos , Vírus de RNA/genética , Receptores de Superfície Celular/genética , Visão Ocular/genética , Xenobióticos/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA