Pesquisa | BVS Doenças Infecciosas e Parasitárias

1.

KaMRaT: a C++ toolkit for k-mer count matrix dimension reduction.

Xue, Haoliang; Gallopin, Mélina; Marchet, Camille; Nguyen, Ha N; Wang, Yunfeng; Lainé, Antoine; Bessiere, Chloé; Gautheret, Daniel.

Bioinformatics ; 40(3)2024 Mar 04.

Artigo em Inglês | MEDLINE | ID: mdl-38444086

RESUMO

MOTIVATION: KaMRaT is designed for processing large k-mer count tables derived from multi-sample, RNA-seq data. Its primary objective is to identify condition-specific or differentially expressed sequences, regardless of gene or transcript annotation. RESULTS: KaMRaT is implemented in C++. Major functions include scoring k-mers based on count statistics, merging overlapping k-mers into contigs and selecting k-mers based on their occurrence across specific samples. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available via https://github.com/Transipedia/KaMRaT.

Assuntos

Algoritmos , Software , Análise de Sequência de DNA/métodos , RNA-Seq , Documentação

2.

Transinteractome analysis reveals distinct niche requirements for isotype-based plasma cell subsets in the bone marrow.

Bonaud, Amélie; Larraufie, Pierre; Khamyath, Mélanie; Szachnowski, Ugo; Flint, Shaun M; Brunel-Meunier, Nadège; Delhommeau, François; Munier, Annie; Lönnberg, Tapio; Toffano-Nioche, Claire; Gautheret, Daniel; Balabanian, Karl; Espéli, Marion.

Eur J Immunol ; 53(9): e2250334, 2023 09.

Artigo em Inglês | MEDLINE | ID: mdl-37377335

RESUMO

Bone marrow (BM) long-lived plasma cells (PCs) are essential for long-term protection against infection, and their persistence within this organ relies on interactions with Cxcl12-expressing stromal cells that are still not clearly identified. Here, using single cell RNAseq and in silico transinteractome analyses, we identified Leptin receptor positive (LepR+ ) mesenchymal cells as the stromal cell subset most likely to interact with PCs within the BM. Moreover, we demonstrated that depending on the isotype they express, PCs may use different sets of integrins and adhesion molecules to interact with these stromal cells. Altogether, our results constitute an unprecedented characterization of PC subset stromal niches and open new avenues for the specific targeting of BM PCs based on their isotype.

Assuntos

Medula Óssea , Células-Tronco Mesenquimais , Medula Óssea/metabolismo , Plasmócitos , Células Estromais , Moléculas de Adesão Celular/metabolismo , Células da Medula Óssea

3.

Nonsense-Mediated Decay Restricts LncRNA Levels in Yeast Unless Blocked by Double-Stranded RNA Structure.

Wery, Maxime; Descrimes, Marc; Vogt, Nicolas; Dallongeville, Anne-Sophie; Gautheret, Daniel; Morillon, Antonin.

Mol Cell ; 61(3): 379-392, 2016 Feb 04.

Artigo em Inglês | MEDLINE | ID: mdl-26805575

RESUMO

Antisense long non-coding (aslnc)RNAs represent a substantial part of eukaryotic transcriptomes that are, in yeast, controlled by the Xrn1 exonuclease. Nonsense-Mediated Decay (NMD) destabilizes the Xrn1-sensitive aslncRNAs (XUT), but what determines their sensitivity remains unclear. We report that 3' single-stranded (3'-ss) extension mediates XUTs degradation by NMD, assisted by the Mtr4 and Dbp2 helicases. Single-gene investigation, genome-wide RNA analyses, and double-stranded (ds)RNA mapping revealed that 3'-ss extensions discriminate the NMD-targeted XUTs from stable lncRNAs. Ribosome profiling showed that XUT are translated, locking them for NMD activity. Interestingly, mutants of the Mtr4 and Dbp2 helicases accumulated XUTs, suggesting that dsRNA unwinding is a critical step for degradation. Indeed, expression of anticomplementary transcripts protects cryptic intergenic lncRNAs from NMD. Our results indicate that aslncRNAs form dsRNA that are only translated and targeted to NMD if dissociated by Mtr4 and Dbp2. We propose that NMD buffers genome expression by discarding pervasive regulatory transcripts.

Assuntos

Degradação do RNAm Mediada por Códon sem Sentido , RNA de Cadeia Dupla/metabolismo , RNA Fúngico/metabolismo , RNA Longo não Codificante/metabolismo , Saccharomyces cerevisiae/metabolismo , RNA Helicases DEAD-box/genética , RNA Helicases DEAD-box/metabolismo , Exorribonucleases/genética , Exorribonucleases/metabolismo , Regulação Fúngica da Expressão Gênica , Genoma Fúngico , Mutação , Conformação de Ácido Nucleico , Fases de Leitura Aberta , RNA Antissenso/genética , RNA Antissenso/metabolismo , RNA de Cadeia Dupla/química , RNA de Cadeia Dupla/genética , RNA Fúngico/química , RNA Fúngico/genética , RNA Longo não Codificante/química , RNA Longo não Codificante/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Transcrição Gênica

4.

Rfam 14: expanded coverage of metagenomic, viral and microRNA families.

Kalvari, Ioanna; Nawrocki, Eric P; Ontiveros-Palacios, Nancy; Argasinska, Joanna; Lamkiewicz, Kevin; Marz, Manja; Griffiths-Jones, Sam; Toffano-Nioche, Claire; Gautheret, Daniel; Weinberg, Zasha; Rivas, Elena; Eddy, Sean R; Finn, Robert D; Bateman, Alex; Petrov, Anton I.

Nucleic Acids Res ; 49(D1): D192-D200, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-33211869

RESUMO

Rfam is a database of RNA families where each of the 3444 families is represented by a multiple sequence alignment of known RNA sequences and a covariance model that can be used to search for additional members of the family. Recent developments have involved expert collaborations to improve the quality and coverage of Rfam data, focusing on microRNAs, viral and bacterial RNAs. We have completed the first phase of synchronising microRNA families in Rfam and miRBase, creating 356 new Rfam families and updating 40. We established a procedure for comprehensive annotation of viral RNA families starting with Flavivirus and Coronaviridae RNAs. We have also increased the coverage of bacterial and metagenome-based RNA families from the ZWD database. These developments have enabled a significant growth of the database, with the addition of 759 new families in Rfam 14. To facilitate further community contribution to Rfam, expert users are now able to build and submit new families using the newly developed Rfam Cloud family curation system. New Rfam website features include a new sequence similarity search powered by RNAcentral, as well as search and visualisation of families with pseudoknots. Rfam is freely available at https://rfam.org.

Assuntos

Bases de Dados de Ácidos Nucleicos , Metagenoma , MicroRNAs/genética , RNA Bacteriano/genética , RNA não Traduzido/genética , RNA Viral/genética , Bactérias/genética , Bactérias/metabolismo , Pareamento de Bases , Sequência de Bases , Humanos , Internet , MicroRNAs/classificação , MicroRNAs/metabolismo , Anotação de Sequência Molecular , Conformação de Ácido Nucleico , RNA Bacteriano/classificação , RNA Bacteriano/metabolismo , RNA não Traduzido/classificação , RNA não Traduzido/metabolismo , RNA Viral/classificação , RNA Viral/metabolismo , Alinhamento de Sequência , Análise de Sequência de RNA , Software , Vírus/genética , Vírus/metabolismo

5.

2-kupl: mapping-free variant detection from DNA-seq data of matched samples.

Wang, Yunfeng; Xue, Haoliang; Pourcel, Christine; Du, Yang; Gautheret, Daniel.

BMC Bioinformatics ; 22(1): 304, 2021 Jun 05.

Artigo em Inglês | MEDLINE | ID: mdl-34090332

RESUMO

BACKGROUND: The detection of genome variants, including point mutations, indels and structural variants, is a fundamental and challenging computational problem. We address here the problem of variant detection between two deep-sequencing (DNA-seq) samples, such as two human samples from an individual patient, or two samples from distinct bacterial strains. The preferred strategy in such a case is to align each sample to a common reference genome, collect all variants and compare these variants between samples. Such mapping-based protocols have several limitations. DNA sequences with large indels, aggregated mutations and structural variants are hard to map to the reference. Furthermore, DNA sequences cannot be mapped reliably to genomic low complexity regions and repeats. RESULTS: We introduce 2-kupl, a k-mer based, mapping-free protocol to detect variants between two DNA-seq samples. On simulated and actual data, 2-kupl achieves higher accuracy than other mapping-free protocols. Applying 2-kupl to prostate cancer whole exome sequencing data, we identify a number of candidate variants in hard-to-map regions and propose potential novel recurrent variants in this disease. CONCLUSIONS: We developed a mapping-free protocol for variant calling between matched DNA-seq samples. Our protocol is suitable for variant detection in unmappable genome regions or in the absence of a reference genome.

Assuntos

Genômica , Sequenciamento de Nucleotídeos em Larga Escala , DNA , Genoma Humano , Humanos , Análise de Sequência de DNA

6.

REINDEER: efficient indexing of k-mer presence and abundance in sequencing datasets.

Marchet, Camille; Iqbal, Zamin; Gautheret, Daniel; Salson, Mikaël; Chikhi, Rayan.

Bioinformatics ; 36(Suppl_1): i177-i185, 2020 07 01.

Artigo em Inglês | MEDLINE | ID: mdl-32657392

RESUMO

MOTIVATION: In this work we present REINDEER, a novel computational method that performs indexing of sequences and records their abundances across a collection of datasets. To the best of our knowledge, other indexing methods have so far been unable to record abundances efficiently across large datasets. RESULTS: We used REINDEER to index the abundances of sequences within 2585 human RNA-seq experiments in 45 h using only 56 GB of RAM. This makes REINDEER the first method able to record abundances at the scale of â¼4 billion distinct k-mers across 2585 datasets. REINDEER also supports exact presence/absence queries of k-mers. Briefly, REINDEER constructs the compacted de Bruijn graph of each dataset, then conceptually merges those de Bruijn graphs into a single global one. Then, REINDEER constructs and indexes monotigs, which in a nutshell are groups of k-mers of similar abundances. AVAILABILITY AND IMPLEMENTATION: https://github.com/kamimrcht/REINDEER. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Análise de Sequência de DNA , Software , Algoritmos , Humanos , Análise de Sequência de RNA

7.

Landscape of the Noncoding Transcriptome Response of Two Arabidopsis Ecotypes to Phosphate Starvation.

Blein, Thomas; Balzergue, Coline; Roulé, Thomas; Gabriel, Marc; Scalisi, Laetitia; François, Tracy; Sorin, Céline; Christ, Aurélie; Godon, Christian; Delannoy, Etienne; Martin-Magniette, Marie-Laure; Nussaume, Laurent; Hartmann, Caroline; Gautheret, Daniel; Desnos, Thierry; Crespi, Martin.

Plant Physiol ; 183(3): 1058-1072, 2020 07.

Artigo em Inglês | MEDLINE | ID: mdl-32404413

RESUMO

Root architecture varies widely between species; it even varies between ecotypes of the same species, despite strong conservation of the coding portion of their genomes. By contrast, noncoding RNAs evolve rapidly between ecotypes and may control their differential responses to the environment, since several long noncoding RNAs (lncRNAs) are known to quantitatively regulate gene expression. Roots from ecotypes Columbia and Landsberg erecta of Arabidopsis (Arabidopsis thaliana) respond differently to phosphate starvation. Here, we compared transcriptomes (mRNAs, lncRNAs, and small RNAs) of root tips from these two ecotypes during early phosphate starvation. We identified thousands of lncRNAs that were largely conserved at the DNA level in these ecotypes. In contrast to coding genes, many lncRNAs were specifically transcribed in one ecotype and/or differentially expressed between ecotypes independent of phosphate availability. We further characterized these ecotype-related lncRNAs and studied their link with small interfering RNAs. Our analysis identified 675 lncRNAs differentially expressed between the two ecotypes, including antisense RNAs targeting key regulators of root-growth responses. Misregulation of several lincRNAs showed that at least two ecotype-related lncRNAs regulate primary root growth in ecotype Columbia. RNA-sequencing analysis following deregulation of lncRNA NPC48 revealed a potential link with root growth and transport functions. This exploration of the noncoding transcriptome identified ecotype-specific lncRNA-mediated regulation in root apexes. The noncoding genome may harbor further mechanisms involved in ecotype adaptation of roots to different soil environments.

Assuntos

Arabidopsis/genética , Ecótipo , Fosfatos/deficiência , Raízes de Plantas/anatomia & histologia , Raízes de Plantas/genética , RNA Longo não Codificante/genética , Estresse Fisiológico/genética , Adaptação Fisiológica/genética , Adaptação Fisiológica/fisiologia , Arabidopsis/fisiologia , Regulação da Expressão Gênica de Plantas , Variação Genética , Raízes de Plantas/fisiologia , Estresse Fisiológico/fisiologia , Transcriptoma

8.

Reference-free transcriptome signatures for prostate cancer prognosis.

Nguyen, Ha T N; Xue, Haoliang; Firlej, Virginie; Ponty, Yann; Gallopin, Melina; Gautheret, Daniel.

BMC Cancer ; 21(1): 394, 2021 Apr 12.

Artigo em Inglês | MEDLINE | ID: mdl-33845808

RESUMO

BACKGROUND: RNA-seq data are increasingly used to derive prognostic signatures for cancer outcome prediction. A limitation of current predictors is their reliance on reference gene annotations, which amounts to ignoring large numbers of non-canonical RNAs produced in disease tissues. A recently introduced kind of transcriptome classifier operates entirely in a reference-free manner, relying on k-mers extracted from patient RNA-seq data. METHODS: In this paper, we set out to compare conventional and reference-free signatures in risk and relapse prediction of prostate cancer. To compare the two approaches as fairly as possible, we set up a common procedure that takes as input either a k-mer count matrix or a gene expression matrix, extracts a signature and evaluates this signature in an independent dataset. RESULTS: We find that both gene-based and k-mer based classifiers had similarly high performances for risk prediction and a markedly lower performance for relapse prediction. Interestingly, the reference-free signatures included a set of sequences mapping to novel lncRNAs or variable regions of cancer driver genes that were not part of gene-based signatures. CONCLUSIONS: Reference-free classifiers are thus a promising strategy for the identification of novel prognostic RNA biomarkers.

Assuntos

Biomarcadores Tumorais , Neoplasias da Próstata/genética , Neoplasias da Próstata/mortalidade , Transcriptoma , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Prognóstico , Neoplasias da Próstata/patologia , Recidiva , Reprodutibilidade dos Testes , Aprendizado de Máquina Supervisionado

9.

Accurate characterization of Escherichia coli tRNA modifications with a simple method of deep-sequencing library preparation.

Wang, Ji; Toffano-Nioche, Claire; Lorieux, Florence; Gautheret, Daniel; Lehmann, Jean.

RNA Biol ; 18(1): 33-46, 2021 01.

Artigo em Inglês | MEDLINE | ID: mdl-32618488

RESUMO

In conventional RNA high-throughput sequencing, modified bases prevent a large fraction of tRNA transcripts to be converted into cDNA libraries. Recent proposals aiming at resolving this issue take advantage of the interference of base modifications with RT enzymes to detect and identify them by establishing signals from aborted cDNA transcripts. Because some modifications, such as methyl groups, do almost not allow RT bypassing, demethylation and highly processive RT enzymes have been used to overcome these obstacles. Working with Escherichia coli as a model system, we show that with a conventional (albeit still engineered) RT enzyme and key optimizations in library preparation, all RT-impairing modifications can be highlighted along the entire tRNA length without demethylation procedure. This is achieved by combining deep-sequencing samples, which allows to establish aborted transcription signal of higher accuracy and reproducibility, with the potential for differentiating tiny differences in the state of modification of all cellular tRNAs. In addition, our protocol provides estimates of the relative tRNA abundance.

Assuntos

Escherichia coli/genética , Regulação Bacteriana da Expressão Gênica , Processamento Pós-Transcricional do RNA , RNA Bacteriano , RNA de Transferência/genética , Biologia Computacional/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Conformação de Ácido Nucleico , RNA de Transferência/química , Análise de Sequência de RNA

10.

Identification of RNAs bound by Hfq reveals widespread RNA partners and a sporulation regulator in the human pathogen Clostridioides difficile.

Boudry, Pierre; Piattelli, Emma; Drouineau, Emilie; Peltier, Johann; Boutserin, Anaïs; Lejars, Maxence; Hajnsdorf, Eliane; Monot, Marc; Dupuy, Bruno; Martin-Verstraete, Isabelle; Gautheret, Daniel; Toffano-Nioche, Claire; Soutourina, Olga.

RNA Biol ; 18(11): 1931-1952, 2021 11.

Artigo em Inglês | MEDLINE | ID: mdl-33629931

RESUMO

Noncoding RNAs (ncRNA) have emerged as important components of regulatory networks governing bacterial physiology and virulence. Previous deep-sequencing analysis identified a large diversity of ncRNAs in the human enteropathogen Clostridioides (Clostridium) difficile. Some of them are trans-encoded RNAs that could require the RNA chaperone protein Hfq for their action. Recent analysis suggested a pleiotropic role of Hfq in C. difficile with the most pronounced effect on sporulation, a key process during the infectious cycle of this pathogen. However, a global view of RNAs interacting with C. difficile Hfq is missing. In the present study, we performed RNA immunoprecipitation high-throughput sequencing (RIP-Seq) to identify Hfq-associated RNAs in C. difficile. Our work revealed a large set of Hfq-interacting mRNAs and ncRNAs, including mRNA leaders and coding regions, known and potential new ncRNAs. In addition to trans-encoded RNAs, new categories of Hfq ligands were found including cis-antisense RNAs, riboswitches and CRISPR RNAs. ncRNA-mRNA and ncRNA-ncRNA pairings were postulated through computational predictions. Investigation of one of the Hfq-associated ncRNAs, RCd1, suggests that this RNA contributes to the control of late stages of sporulation in C. difficile. Altogether, these data provide essential molecular basis for further studies of post-transcriptional regulatory network in this enteropathogen.

Assuntos

Clostridioides difficile/crescimento & desenvolvimento , Clostridioides/fisiologia , Regulação Bacteriana da Expressão Gênica , Fator Proteico 1 do Hospedeiro/metabolismo , RNA Bacteriano/metabolismo , Esporos Bacterianos/fisiologia , Virulência , Clostridioides difficile/genética , Clostridioides difficile/metabolismo , Genoma Bacteriano , Fator Proteico 1 do Hospedeiro/genética , Humanos , RNA Bacteriano/genética

11.

Native elongating transcript sequencing reveals global anti-correlation between sense and antisense nascent transcription in fission yeast.

Wery, Maxime; Gautier, Camille; Descrimes, Marc; Yoda, Mayuko; Vennin-Rendos, Hervé; Migeot, Valérie; Gautheret, Daniel; Hermand, Damien; Morillon, Antonin.

RNA ; 24(2): 196-208, 2018 02.

Artigo em Inglês | MEDLINE | ID: mdl-29114019

RESUMO

Antisense transcription can regulate sense gene expression. However, previous annotations of antisense transcription units have been based on detection of mature antisense long noncoding (aslnc)RNAs by RNA-seq and/or microarrays, only giving a partial view of the antisense transcription landscape and incomplete molecular bases for antisense-mediated regulation. Here, we used native elongating transcript sequencing to map genome-wide nascent antisense transcription in fission yeast. Strikingly, antisense transcription was detected for most protein-coding genes, correlating with low sense transcription, especially when overlapping the mRNA start site. RNA profiling revealed that the resulting aslncRNAs mainly correspond to cryptic Xrn1/Exo2-sensitive transcripts (XUTs). ChIP-seq analyses showed that antisense (as)XUT's expression is associated with specific histone modification patterns. Finally, we showed that asXUTs are controlled by the histone chaperone Spt6 and respond to meiosis induction, in both cases anti-correlating with levels of the paired-sense mRNAs, supporting physiological significance to antisense-mediated gene attenuation. Our work highlights that antisense transcription is much more extended than anticipated and might constitute an additional nonpromoter determinant of gene regulation complexity.

Assuntos

Regulação Fúngica da Expressão Gênica , RNA Antissenso/biossíntese , Schizosaccharomyces/genética , Transcrição Gênica , Chaperonas de Histonas/metabolismo , Código das Histonas , Meiose/genética , Elongação Traducional da Cadeia Peptídica , Interferência de RNA , Estabilidade de RNA , RNA Antissenso/metabolismo , Schizosaccharomyces/metabolismo , Proteínas de Schizosaccharomyces pombe/metabolismo , Análise de Sequência de RNA

12.

The conserved regulatory RNA RsaE down-regulates the arginine degradation pathway in Staphylococcus aureus.

Rochat, Tatiana; Bohn, Chantal; Morvan, Claire; Le Lam, Thao Nguyen; Razvi, Fareha; Pain, Adrien; Toffano-Nioche, Claire; Ponien, Prishila; Jacq, Annick; Jacquet, Eric; Fey, Paul D; Gautheret, Daniel; Bouloc, Philippe.

Nucleic Acids Res ; 46(17): 8803-8816, 2018 09 28.

Artigo em Inglês | MEDLINE | ID: mdl-29986060

RESUMO

RsaE is a regulatory RNA highly conserved amongst Firmicutes that lowers the amount of mRNAs associated with the TCA cycle and folate metabolism. A search for new RsaE targets in Staphylococcus aureus revealed that in addition to previously described substrates, RsaE down-regulates several genes associated with arginine catabolism. In particular, RsaE targets the arginase rocF mRNA via direct interactions involving G-rich motifs. Two duplicated C-rich motifs of RsaE can independently downregulate rocF expression. The faster growth rate of ΔrsaE compared to its parental strain in media containing amino acids as sole carbon source points to an underlying role for RsaE in amino acid catabolism. Collectively, the data support a model in which RsaE acts as a global regulator of functions associated with metabolic adaptation.

Assuntos

Arginina/metabolismo , RNA Bacteriano/fisiologia , Sequências Reguladoras de Ácido Ribonucleico , Staphylococcus aureus/genética , Staphylococcus aureus/metabolismo , Aminoácidos/metabolismo , Aminoácidos/farmacologia , Sequência Conservada , Meios de Cultura/química , Meios de Cultura/farmacologia , Regulação para Baixo/efeitos dos fármacos , Regulação para Baixo/genética , Regulação Bacteriana da Expressão Gênica/efeitos dos fármacos , Regulação Bacteriana da Expressão Gênica/genética , Redes e Vias Metabólicas/efeitos dos fármacos , Redes e Vias Metabólicas/genética , Organismos Geneticamente Modificados , Sequências Reguladoras de Ácido Ribonucleico/genética , Staphylococcus aureus/efeitos dos fármacos , Staphylococcus aureus/crescimento & desenvolvimento

13.

CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins.

Couvin, David; Bernheim, Aude; Toffano-Nioche, Claire; Touchon, Marie; Michalik, Juraj; Néron, Bertrand; Rocha, Eduardo P C; Vergnaud, Gilles; Gautheret, Daniel; Pourcel, Christine.

Nucleic Acids Res ; 46(W1): W246-W251, 2018 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-29790974

RESUMO

CRISPR (clustered regularly interspaced short palindromic repeats) arrays and their associated (Cas) proteins confer bacteria and archaea adaptive immunity against exogenous mobile genetic elements, such as phages or plasmids. CRISPRCasFinder allows the identification of both CRISPR arrays and Cas proteins. The program includes: (i) an improved CRISPR array detection tool facilitating expert validation based on a rating system, (ii) prediction of CRISPR orientation and (iii) a Cas protein detection and typing tool updated to match the latest classification scheme of these systems. CRISPRCasFinder can either be used online or as a standalone tool compatible with Linux operating system. All third-party software packages employed by the program are freely available. CRISPRCasFinder is available at https://crisprcas.i2bc.paris-saclay.fr.

Assuntos

Proteínas Associadas a CRISPR/genética , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Software , Proteínas Associadas a CRISPR/química , Sistemas CRISPR-Cas , Internet

14.

A benchmark study of scoring methods for non-coding mutations.

Drubay, Damien; Gautheret, Daniel; Michiels, Stefan.

Bioinformatics ; 34(10): 1635-1641, 2018 05 15.

Artigo em Inglês | MEDLINE | ID: mdl-29340599

RESUMO

Motivation: Detailed knowledge of coding sequences has led to different candidate models for pathogenic variant prioritization. Several deleteriousness scores have been proposed for the non-coding part of the genome, but no large-scale comparison has been realized to date to assess their performance. Results: We compared the leading scoring tools (CADD, FATHMM-MKL, Funseq2 and GWAVA) and some recent competitors (DANN, SNP and SOM scores) for their ability to discriminate assumed pathogenic variants from assumed benign variants (using the ClinVar, COSMIC and 1000 genomes project databases). Using the ClinVar benchmark, CADD was the best tool for detecting the pathogenic variants that are mainly located in protein coding gene regions. Using the COSMIC benchmark, FATHMM-MKL, GWAVA and SOMliver outperformed the other tools for pathogenic variants that are typically located in lincRNAs, pseudogenes and other parts of the non-coding genome. However, all tools had low precision, which could potentially be improved by future non-coding genome feature discoveries. These results may have been influenced by the presence of potential benign variants in the COSMIC database. The development of a gold standard as consistent as ClinVar for these regions will be necessary to confirm our tool ranking. Availability and implementation: The Snakemake, C++ and R codes are freely available from https://github.com/Oncostat/BenchmarkNCVTools and supported on Linux. Contact: damien.drubay@gustaveroussy.fr or stefan.michiels@gustaveroussy.fr. Supplementary information: Supplementary data are available at Bioinformatics online.

Assuntos

Genoma Humano , Mutação , Análise de Sequência de DNA/métodos , Software , Benchmarking , Humanos

15.

Revisiting the structure/function relationships of H/ACA(-like) RNAs: a unified model for Euryarchaea and Crenarchaea.

Toffano-Nioche, Claire; Gautheret, Daniel; Leclerc, Fabrice.

Nucleic Acids Res ; 43(16): 7744-61, 2015 Sep 18.

Artigo em Inglês | MEDLINE | ID: mdl-26240384

RESUMO

A structural and functional classification of H/ACA and H/ACA-like motifs is obtained from the analysis of the H/ACA guide RNAs which have been identified previously in the genomes of Euryarchaea (Pyrococcus) and Crenarchaea (Pyrobaculum). A unified structure/function model is proposed based on the common structural determinants shared by H/ACA and H/ACA-like motifs in both Euryarchaea and Crenarchaea. Using a computational approach, structural and energetic rules for the guide:target RNA-RNA interactions are derived from structural and functional data on the H/ACA RNP particles. H/ACA(-like) motifs found in Pyrococcus are evaluated through the classification and their biological relevance is discussed. Extra-ribosomal targets found in both Pyrococcus and Pyrobaculum might support the hypothesis of a gene regulation mediated by H/ACA(-like) guide RNAs in archaea.

Assuntos

Pyrobaculum/genética , Pyrococcus abyssi/genética , RNA Arqueal/química , RNA Guia de Cinetoplastídeos/química , RNA Nucleolar Pequeno/química , Pareamento de Bases , Euryarchaeota/genética , Regulação da Expressão Gênica em Archaea , Modelos Moleculares , Motivos de Nucleotídeos , Dobramento de RNA , RNA Arqueal/metabolismo , RNA Guia de Cinetoplastídeos/metabolismo , RNA Nucleolar Pequeno/metabolismo , RNA de Transferência/metabolismo , Relação Estrutura-Atividade

16.

In plants, decapping prevents RDR6-dependent production of small interfering RNAs from endogenous mRNAs.

Martínez de Alba, Angel Emilio; Moreno, Ana Beatriz; Gabriel, Marc; Mallory, Allison C; Christ, Aurélie; Bounon, Rémi; Balzergue, Sandrine; Aubourg, Sebastien; Gautheret, Daniel; Crespi, Martin D; Vaucheret, Hervé; Maizel, Alexis.

Nucleic Acids Res ; 43(5): 2902-13, 2015 Mar 11.

Artigo em Inglês | MEDLINE | ID: mdl-25694514

RESUMO

Cytoplasmic degradation of endogenous RNAs is an integral part of RNA quality control (RQC) and often relies on the removal of the 5' cap structure and their subsequent 5' to 3' degradation in cytoplasmic processing (P-)bodies. In parallel, many eukaryotes degrade exogenous and selected endogenous RNAs through post-transcriptional gene silencing (PTGS). In plants, PTGS depends on small interfering (si)RNAs produced after the conversion of single-stranded RNAs to double-stranded RNAs by the cellular RNA-dependent RNA polymerase 6 (RDR6) in cytoplasmic siRNA-bodies. PTGS and RQC compete for transgene-derived RNAs, but it is unknown whether this competition also occurs for endogenous transcripts. We show that the lethality of decapping mutants is suppressed by impairing RDR6 activity. We establish that upon decapping impairment hundreds of endogenous mRNAs give rise to a new class of rqc-siRNAs, that over-accumulate when RQC processes are impaired, a subset of which depending on RDR6 for their production. We observe that P- and siRNA-bodies often are dynamically juxtaposed, potentially allowing for cross-talk of the two machineries. Our results suggest that the decapping of endogenous RNA limits their entry into the PTGS pathway. We anticipate that the rqc-siRNAs identified in decapping mutants represent a subset of a larger ensemble of endogenous siRNAs.

Assuntos

Proteínas de Arabidopsis/genética , Capuzes de RNA/genética , RNA Mensageiro/genética , RNA de Plantas/genética , RNA Interferente Pequeno/genética , RNA Polimerase Dependente de RNA/genética , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Endorribonucleases/genética , Endorribonucleases/metabolismo , Regulação da Expressão Gênica de Plantas , Mutação , Análise de Sequência com Séries de Oligonucleotídeos , Plantas Geneticamente Modificadas , Capuzes de RNA/metabolismo , Interferência de RNA , RNA Mensageiro/metabolismo , RNA de Plantas/metabolismo , RNA Interferente Pequeno/metabolismo , Pequeno RNA não Traduzido/genética , Pequeno RNA não Traduzido/metabolismo , RNA Polimerase Dependente de RNA/metabolismo , Transcriptoma

17.

A Dual Model for Prioritizing Cancer Mutations in the Non-coding Genome Based on Germline and Somatic Events.

Li, Jia; Poursat, Marie-Anne; Drubay, Damien; Motz, Arnaud; Saci, Zohra; Morillon, Antonin; Michiels, Stefan; Gautheret, Daniel.

PLoS Comput Biol ; 11(11): e1004583, 2015 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-26588488

RESUMO

We address here the issue of prioritizing non-coding mutations in the tumoral genome. To this aim, we created two independent computational models. The first (germline) model estimates purifying selection based on population SNP data. The second (somatic) model estimates tumor mutation density based on whole genome tumor sequencing. We show that each model reflects a different set of constraints acting either on the normal or tumor genome, and we identify the specific genome features that most contribute to these constraints. Importantly, we show that the somatic mutation model carries independent functional information that can be used to narrow down the non-coding regions that may be relevant to cancer progression. On this basis, we identify positions in non-coding RNAs and the non-coding parts of mRNAs that are both under purifying selection in the germline and protected from mutation in tumors, thus introducing a new strategy for future detection of cancer driver elements in the expressed non-coding genome.

Assuntos

Biologia Computacional/métodos , Genoma Humano/genética , Modelos Genéticos , Mutação/genética , Neoplasias/genética , RNA não Traduzido/genética , Humanos , Análise de Sequência de DNA

18.

Detection of generic differential RNA processing events from RNA-seq data.

Tran, Van Du T; Souiai, Oussema; Romero-Barrios, Natali; Crespi, Martin; Gautheret, Daniel.

RNA Biol ; 13(1): 59-67, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-26849165

RESUMO

RNA-seq data analysis has revealed abundant alternative splicing in eukaryotic mRNAs. However, splicing is only one of many processing events that transcripts may undergo during their lifetime. We present here RNAprof (RNA profile analysis), a program for the detection of differential processing events from the comparison of RNA-seq experiments. RNAprof implements a specific gene-level normalization procedure and compares RNA-seq coverage profiles at nucleotide resolution to detect regions of significant coverage differences, independently of splice sites or other gene features. We used RNAprof to analyze the effect of alternative-splicing regulators NSRa and NSRb on the Arabidopsis thaliana transcriptome. A number of intron retention events and alternative transcript structures were specifically detected by RNAprof and confirmed by qRT-PCR. Further tests using a public Mus musculus RNA-seq dataset and comparisons with other RNA isoform predictors showed that RNAprof uniquely identified sets of highly significant processing events as well as other relevant library-specific differences in RNA-seq profiles. This highlights an important layer of variation that remains undetected by current protocols for RNA-seq analysis.

Assuntos

Biologia Computacional/métodos , Processamento Pós-Transcricional do RNA , RNA/genética , Análise de Sequência de RNA/métodos , Processamento Alternativo , Animais , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Biologia Computacional/normas , Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Perfilação da Expressão Gênica/normas , Camundongos , Análise de Sequência de RNA/normas

19.

An assessment of bacterial small RNA target prediction programs.

Pain, Adrien; Ott, Alban; Amine, Hamza; Rochat, Tatiana; Bouloc, Philippe; Gautheret, Daniel.

RNA Biol ; 12(5): 509-13, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-25760244

RESUMO

Most bacterial regulatory RNAs exert their function through base-pairing with target RNAs. Computational prediction of targets is a busy research field that offers biologists a variety of web sites and software. However, it is difficult for a non-expert to evaluate how reliable those programs are. Here, we provide a simple benchmark for bacterial sRNA target prediction based on trusted E. coli sRNA/target pairs. We use this benchmark to assess the most recent RNA target predictors as well as earlier programs for RNA-RNA hybrid prediction. Moreover, we consider how the definition of mRNA boundaries can impact overall predictions. Recent algorithms that exploit both conservation of targets and accessibility information offer improved accuracy over previous software. However, even with the best predictors, the number of true biological targets with low scores and non-targets with high scores remains puzzling.

Assuntos

Biologia Computacional/métodos , Escherichia coli/genética , RNA Bacteriano/genética , Pareamento de Bases/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Regiões não Traduzidas/genética

20.

A universal RNA structural motif docking the elbow of tRNA in the ribosome, RNAse P and T-box leaders.

Lehmann, Jean; Jossinet, Fabrice; Gautheret, Daniel.

Nucleic Acids Res ; 41(10): 5494-502, 2013 May 01.

Artigo em Inglês | MEDLINE | ID: mdl-23580544

RESUMO

The structure and function of conserved motifs constituting the apex of Stem I in T-box mRNA leaders are investigated. We point out that this apex shares striking similarities with the L1 stalk (helices 76-78) of the ribosome. A sequence and structure analysis of both elements shows that, similarly to the head of the L1 stalk, the function of the apex of Stem I lies in the docking of tRNA through a stacking interaction with the conserved G19:C56 base pair platform. The inferred structure in the apex of Stem I consists of a module of two T-loops bound together head to tail, a module that is also present in the head of the L1 stalk, but went unnoticed. Supporting the analysis, we show that a highly conserved structure in RNAse P formerly described as the J11/12-J12/11 module, which is precisely known to bind the elbow of tRNA, constitutes a third instance of this T-loop module. A structural analysis explains why six nucleotides constituting the core of this module are highly invariant among all three types of RNA. Our finding that major RNA partners of tRNA bind the elbow with a same RNA structure suggests an explanation for the origin of the tRNA L-shape.

Assuntos

Regiões 5' não Traduzidas , RNA Ribossômico 23S/química , RNA de Transferência/química , Ribonuclease P/química , Sequência de Bases , Modelos Moleculares , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Motivos de Nucleotídeos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA