RESUMEN
Understanding microRNA (miRNA) functions has been hampered by major difficulties in identifying their biological target(s). Currently, the main limitation is the lack of a suitable strategy to identify biologically relevant targets among a high number of putative targets. Here we provide a proof of concept of successful de novo (i.e. without prior knowledge of its identity) miRNA phenotypic target (i.e. target whose de-repression contributes to the phenotypic outcomes) identification from RNA-seq data. Using the medaka mir-202 knock-out (KO) model in which inactivation leads to a major organism-level reproductive phenotype, including reduced egg production, we introduced novel criteria including limited fold-change in KO and low interindividual variability in gene expression to reduce the list of 2853 putative targets to a short list of 5. We selected tead3b, a member of the evolutionarily-conserved Hippo pathway, known to regulate ovarian functions, due to its remarkably strong and evolutionarily conserved binding affinity for miR-202-5p. Deleting the miR-202-5p binding site in the 3' UTR of tead3b, but not of other Hippo pathway members sav1 and vgll4b, triggered a reduced egg production phenotype. This is one of the few successful examples of de novo functional assignment of a miRNA phenotypic target in vivo in vertebrates.
Asunto(s)
Vía de Señalización Hippo , MicroARNs , Oryzias , Animales , Sitios de Unión , MicroARNs/genética , MicroARNs/metabolismo , Fenotipo , RNA-Seq , Oryzias/metabolismoRESUMEN
Genome annotation plays a crucial role in providing comprehensive catalog of genes and transcripts for a particular species. As research projects generate new transcriptome data worldwide, integrating this information into existing annotations becomes essential. However, most bioinformatics pipelines are limited in their ability to effectively and consistently update annotations using new RNA-seq data. Here we introduce TAGADA, an RNA-seq pipeline for Transcripts And Genes Assembly, Deconvolution, and Analysis. Given a genomic sequence, a reference annotation and RNA-seq reads, TAGADA enhances existing gene models by generating an improved annotation. It also computes expression values for both the reference and novel annotation, identifies long non-coding transcripts (lncRNAs), and provides a comprehensive quality control report. Developed using Nextflow DSL2, TAGADA offers user-friendly functionalities and ensures reproducibility across different computing platforms through its containerized environment. In this study, we demonstrate the efficacy of TAGADA using RNA-seq data from the GENE-SWiTCH project alongside chicken and pig genome annotations as references. Results indicate that TAGADA can substantially increase the number of annotated transcripts by approximately [Formula: see text] in these species. Furthermore, we illustrate how TAGADA can integrate Illumina NovaSeq short reads with PacBio Iso-Seq long reads, showcasing its versatility. TAGADA is available at github.com/FAANG/analysis-TAGADA.
RESUMEN
Like aboveground herbivores, belowground herbivores are confronted with multiple plant defense mechanisms including complex chemical cocktails in plant tissue. Roots and shoots of Brassicaceae plants contain the two-component glucosinolate (GSL)-myrosinase defense system. Upon cell damage, for example by herbivore feeding, toxic and pungent isothiocyanates (ITCs) can be formed. Several aboveground-feeding herbivores have developed biochemical adaptation strategies to overcome the GSL-ITC defenses of their host plant. Whether belowground herbivores feeding on Brassica roots possess similar mechanisms has received little attention. Here, we analyze how two related belowground specialist herbivores detoxify the GSL-ITC defenses of their host plants. The larvae of the fly species Delia radicum and D. floralis are common pests and specialized herbivores on the roots of Brassicaceae. We used chemical analyses (HPLC-MS/MS and HPLC-UV) to examine how the GSL-ITC defense system is metabolized by these congeneric larvae. In addition, we screened for candidate genes involved in the detoxification process using RNAseq and qPCR. The chemical analyses yielded glutathione conjugates and amines. This indicates that both species detoxify ITCs using potentially the general mercapturic acid pathway, which is also found in aboveground herbivores, and an ITC-specific hydrolytic pathway previously characterized in microbes. Performance assays confirmed that ITCs negatively affect the survival of both species, in spite of their known specialization to ITC-producing plants and tissues, whereas ITC breakdown products are less toxic. Interestingly, the RNAseq analyses showed that the two congeneric species activate different sets of genes upon ITC exposure, which was supported by qPCR data. Based on our findings, we conclude that these specialist larvae use combinations of general and compound-specific detoxification mechanisms with differing efficacies and substrate preferences. This indicates that combining detoxification mechanisms can be an evolutionarily successful strategy to handle plant defenses in herbivores.
RESUMEN
Belowground herbivores are overseen and underestimated, even though they can cause significant economic losses in agriculture. The cabbage root fly Delia radicum (Anthomyiidae) is a common pest in Brassica species, including agriculturally important crops, such as oilseed rape. The damage is caused by the larvae, which feed specifically on the taproots of Brassica plants until they pupate. The adults are aboveground-living generalists feeding on pollen and nectar. Female flies are attracted by chemical cues in Brassica plants for oviposition. An assembled and annotated genome can elucidate which genetic mechanisms underlie the adaptation of D. radicum to its host plants and their specific chemical defences, in particular isothiocyanates. Therefore, we assembled, annotated and analysed the D. radicum genome using a combination of different next-generation sequencing and bioinformatic approaches. We assembled a chromosome-level D. radicum genome using PacBio and Hi-C Illumina sequence data. Combining Canu and 3D-DNA genome assembler, we constructed a 1.3 Gbp genome with an N50 of 242 Mbp and 6 pseudo-chromosomes. To annotate the assembled D. radicum genome, we combined homology-, transcriptome- and ab initio-prediction approaches. In total, we annotated 13,618 genes that were predicted by at least two approaches. We analysed egg, larval, pupal and adult transcriptomes in relation to life-stage specific molecular functions. This high-quality annotated genome of D. radicum is a first step to understanding the genetic mechanisms underlying host plant adaptation. As such, it will be an important resource to find novel and sustainable approaches to reduce crop losses to these pests.
Asunto(s)
Brassica , Dípteros , Animales , Productos Agrícolas , Dípteros/genética , Femenino , Herbivoria , Larva/genéticaRESUMEN
MicroRNAs (miRNAs) are important post-transcriptional regulators of gene expression involved in countless biological processes and are widely studied across metazoans. Although miRNA research continues to grow, the large community of fish miRNA researchers lacks exhaustive resources consistent among species. To fill this gap, we developed FishmiRNA, an evolutionarily supported miRNA annotation and expression database for ray-finned fishes: www.fishmirna.org. The self-explanatory database contains detailed, manually curated miRNA annotations with orthology relationships rigorously established by sequence similarity and conserved syntenies, and expression data provided for each detected mature miRNA. In just few clicks, users can download the annotation and expression database in several convenient formats either in its entirety or a subset. Simple filters and Blast search options also permit the simultaneous exploration and visual comparison of expression data for up to any ten mature miRNAs across species and organs. FishmiRNA was specifically designed for ease of use to reach a wide audience.
Asunto(s)
MicroARNs , Animales , Peces/genética , Peces/metabolismo , MicroARNs/genética , MicroARNs/metabolismoRESUMEN
BACKGROUND: Circulating miRNAs (c-miRNAs) are found in most, if not all, biological fluids and are becoming well-established non-invasive biomarkers of many human pathologies. However, their features in non-pathological contexts and whether their expression profiles reflect normal life history events have received little attention, especially in non-mammalian species. The aim of the present study was to investigate the potential of c-miRNAs to serve as biomarkers of reproductive and metabolic states in fish. RESULTS: The blood plasma was sampled throughout the reproductive cycle of female rainbow trout subjected to two different feeding regimes that triggered contrasting metabolic states. In addition, ovarian fluid was sampled at ovulation, and all samples were subjected to small RNA-seq analysis, leading to the establishment of a comprehensive miRNA repertoire (i.e., miRNAome) and enabling subsequent comparative analyses to a panel of RNA-seq libraries from a wide variety of tissues and organs. We showed that biological fluid miRNAomes are complex and encompass a high proportion of the overall rainbow trout miRNAome. While sharing a high proportion of common miRNAs, the blood plasma and ovarian fluid miRNAomes exhibited strong fluid-specific signatures. We further revealed that the blood plasma miRNAome significantly changed depending on metabolic and reproductive states. We subsequently identified three evolutionarily conserved muscle-specific miRNAs or myomiRs (miR-1-1/2-3p, miR-133a-1/2-3p, and miR-206-3p) that accumulated in the blood plasma in response to high feeding rates, making these myomiRs strong candidate biomarkers of active myogenesis. We also identified miR-202-5p as a candidate biomarker for reproductive success that could be used to predict ovulation and/or egg quality. CONCLUSIONS: Together, these promising results reveal the high potential of c-miRNAs, including evolutionarily conserved myomiRs, as physiologically relevant biomarker candidates and pave the way for the use of c-miRNAs for non-invasive phenotyping in various fish species.
Asunto(s)
MicroARNs , Oncorhynchus mykiss , Animales , Biomarcadores , Femenino , Humanos , MicroARNs/genética , Oncorhynchus mykiss/genética , Reproducción/genéticaRESUMEN
The Eurasian plant Stipa capillata is the most widespread species within feather grasses. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae.
Asunto(s)
Genoma del Cloroplasto , Genoma Mitocondrial , Genoma de Planta , Proteínas de Plantas/genética , Poaceae/genética , Mapeo Contig , Europa (Continente) , Tamaño del Genoma , Heteroplasmia , Repeticiones de Microsatélite , Filogenia , Fitomejoramiento/métodos , Proteínas de Plantas/clasificación , Ploidias , Poaceae/clasificaciónRESUMEN
Most metazoans are associated with symbionts. Characterizing the effect of a particular symbiont often requires getting access to its genome, which is usually done by sequencing the whole community. We present MinYS, a targeted assembly approach to assemble a particular genome of interest from such metagenomic data. First, taking advantage of a reference genome, a subset of the reads is assembled into a set of backbone contigs. Then, this draft assembly is completed using the whole metagenomic readset in a de novo manner. The resulting assembly is output as a genome graph, enabling different strains with potential structural variants coexisting in the sample to be distinguished. MinYS was applied to 50 pea aphid resequencing samples, with variable diversity in symbiont communities, in order to recover the genome sequence of its obligatory bacterial symbiont, Buchnera aphidicola. It was able to return high-quality assemblies (one contig assembly in 90% of the samples), even when using increasingly distant reference genomes, and to retrieve large structural variations in the samples. Because of its targeted essence, it outperformed standard metagenomic assemblers in terms of both time and assembly quality.
RESUMEN
BACKGROUND: Most metazoans are involved in durable relationships with microbes which can take several forms, from mutualism to parasitism. The advances of NGS technologies and bioinformatics tools have opened opportunities to shed light on the diversity of microbial communities and to give some insights into the functions they perform in a broad array of hosts. The pea aphid is a model system for the study of insect-bacteria symbiosis. It is organized in a complex of biotypes, each adapted to specific host plants. It harbors both an obligatory symbiont supplying key nutrients and several facultative symbionts bringing additional functions to the host, such as protection against biotic and abiotic stresses. However, little is known on how the symbiont genomic diversity is structured at different scales: across host biotypes, among individuals of the same biotype, or within individual aphids, which limits our understanding on how these multi-partner symbioses evolve and interact. RESULTS: We present a framework well adapted to the study of genomic diversity and evolutionary dynamics of the pea aphid holobiont from metagenomic read sets, based on mapping to reference genomes and whole genome variant calling. Our results revealed that the pea aphid microbiota is dominated by a few heritable bacterial symbionts reported in earlier works, with no discovery of new microbial associates. However, we detected a large and heterogeneous genotypic diversity associated with the different symbionts of the pea aphid. Partitioning analysis showed that this fine resolution diversity is distributed across the three considered scales. Phylogenetic analyses highlighted frequent horizontal transfers of facultative symbionts between host lineages, indicative of flexible associations between the pea aphid and its microbiota. However, the evolutionary dynamics of symbiotic associations strongly varied depending on the symbiont, reflecting different histories and possible constraints. In addition, at the intra-host scale, we showed that different symbiont strains may coexist inside the same aphid host. CONCLUSIONS: We present a methodological framework for the detailed analysis of NGS data from microbial communities of moderate complexity and gave major insights into the extent of diversity in pea aphid-symbiont associations and the range of evolutionary trajectories they could take.
Asunto(s)
Áfidos/microbiología , Buchnera/aislamiento & purificación , Microbiota/genética , Rickettsia/aislamiento & purificación , Simbiosis/fisiología , Animales , Buchnera/clasificación , Buchnera/genética , Genoma Bacteriano/genética , Metagenoma/genética , Metagenómica , Filogenia , ARN Ribosómico 16S/genética , Rickettsia/clasificación , Rickettsia/genéticaRESUMEN
BACKGROUND AND AIMS: Bacterial Non-Specific Acid Phosphatase (NSAP) enzymes are capable of dephosphorylating diverse organic phosphoesters but are rarely studied: their distribution in natural and managed environments is poorly understood. The aim of this study was to generate new insight into the environmental distribution of NSAPs and establish their potential global relevance to cycling of organic phosphorus. METHODS: We employed bioinformatic tools to determine NSAP diversity and subcellular localization in microbial genomes; used the corresponding NSAP gene sequences to census metagenomes from diverse ecosystems; studied the effect of long-term land management upon NSAP diversity and abundance. RESULTS: Periplasmic class B NSAPs are poorly represented in marine and terrestrial environments, reflecting their association with enteric and pathogenic bacteria. Periplasmic class A and outer membrane-associated class C NSAPs are cosmopolitan. NSAPs are more abundant in marine than terrestrial ecosystems and class C more abundant than class A genes, except in an acidic peat where class A genes dominate. A clear effect of land management upon gene abundance was identified. CONCLUSIONS: NSAP genes are cosmopolitan. Class C genes are more widely distributed: their association with the outer-membrane of cells gives them a clear role in the cycling of organic phosphorus, particularly in soils.
RESUMEN
Phosphorus cycling exerts significant influence upon soil fertility and productivity - processes largely controlled by microbial activity. We adopted phenotypic and metagenomic approaches to investigate phosphatase genes within soils. Microbial communities in bare fallowed soil showed a marked capacity to utilise phytate for growth compared with arable or grassland soil communities. Bare fallowed soil contained lowest concentrations of orthophosphate. Analysis of metagenomes indicated phoA, phoD and phoX, and histidine acid and cysteine phytase genes were most abundant in grassland soil which contained the greatest amount of NaOH-EDTA extractable orthophosphate. Beta-propeller phytase genes were most abundant in bare fallowed soil. Phylogenetic analysis of metagenome sequences indicated the phenotypic shift observed in the capacity to mineralise phytate in bare fallow soil was accompanied by an increase in phoD, phoX and beta-propeller phytase genes coding for exoenzymes. However, there was a remarkable degree of genetic similarity across the soils despite the differences in land-use. Predicted extracellular ecotypes were distributed across a greater range of soil structure than predicted intracellular ecotypes, suggesting that microbial communities subject to the dual stresses of low nutrient availability and reduced access to organic material in bare fallowed soils rely upon the action of exoenzymes.