Machine Learning-Based Annotation of Long Noncoding RNAs Using PLncPRO.
Methods Mol Biol
; 2107: 253-260, 2020.
Article
in En
| MEDLINE
| ID: mdl-31893451
ABSTRACT
Long noncoding RNAs (lncRNAs) are noncoding RNAs with transcript length more than 200 nucleotides. Although poorly conserved, lncRNAs are expressed across diverse species, including plants and animals, and are known to be involved in regulation of various biological processes. To understand their biological significance, we first need to identify the lncRNAs accurately. However, distinguishing lncRNAs from coding transcripts is still a challenging task. Here, we describe a machine learning-based approach to accurately identify the plant lncRNAs. We describe the usage of plant long noncoding RNA prediction by random forests (PLncPRO), which employs machine learning-based random forest algorithm to recognize the lncRNAs from the set of given transcript sequences. Stepwise instructions have been provided to use PLncPRO to annotate the lncRNA sequences.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Plants
/
Computational Biology
/
RNA, Long Noncoding
Language:
En
Journal:
Methods Mol Biol
Journal subject:
BIOLOGIA MOLECULAR
Year:
2020
Document type:
Article
Affiliation country:
India