Your browser doesn't support javascript.
loading
Deep learning predicts short non-coding RNA functions from only raw sequence data.
Noviello, Teresa Maria Rosaria; Ceccarelli, Francesco; Ceccarelli, Michele; Cerulo, Luigi.
Affiliation
  • Noviello TMR; Department of Electrical Engineering and Information Technology, University of Naples "Federico II", Napoli, Italy.
  • Ceccarelli F; Biogem Scarl, Istituto di Ricerche Genetiche "Gaetano Salvatore", Ariano Irpino, Italy.
  • Ceccarelli M; CaReBios srl, Ariano Irpino, Italy.
  • Cerulo L; Computer Laboratory, University of Cambridge, Cambridge, UK.
PLoS Comput Biol ; 16(11): e1008415, 2020 11.
Article in En | MEDLINE | ID: mdl-33175836
Small non-coding RNAs (ncRNAs) are short non-coding sequences involved in gene regulation in many biological processes and diseases. The lack of a complete comprehension of their biological functionality, especially in a genome-wide scenario, has demanded new computational approaches to annotate their roles. It is widely known that secondary structure is determinant to know RNA function and machine learning based approaches have been successfully proven to predict RNA function from secondary structure information. Here we show that RNA function can be predicted with good accuracy from a lightweight representation of sequence information without the necessity of computing secondary structure features which is computationally expensive. This finding appears to go against the dogma of secondary structure being a key determinant of function in RNA. Compared to recent secondary structure based methods, the proposed solution is more robust to sequence boundary noise and reduces drastically the computational cost allowing for large data volume annotations. Scripts and datasets to reproduce the results of experiments proposed in this study are available at: https://github.com/bioinformatics-sannio/ncrna-deep.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA, Untranslated / Deep Learning Type of study: Health_economic_evaluation / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: Italy Country of publication: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA, Untranslated / Deep Learning Type of study: Health_economic_evaluation / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: Italy Country of publication: United States