Your browser doesn't support javascript.
loading
Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.
Li, Fuyi; Guo, Xudong; Jin, Peipei; Chen, Jinxiang; Xiang, Dongxu; Song, Jiangning; Coin, Lachlan J M.
Afiliación
  • Li F; Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia.
  • Guo X; Ningxia University, China.
  • Jin P; Department of Clinical Laboratory of Ruijin Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Chen J; Northwest A&F University, China.
  • Xiang D; Faculty of Engineering and Information Technology, The University of Melbourne, Australia.
  • Song J; Monash Biomedicine Discovery Institute, Monash University, Australia.
  • Coin LJM; Department of Microbiology and Immunology at the University of Melbourne, Australia.
Brief Bioinform ; 22(6)2021 11 05.
Article en En | MEDLINE | ID: mdl-34226915
ABSTRACT
Pseudouridine is a ubiquitous RNA modification type present in eukaryotes and prokaryotes, which plays a vital role in various biological processes. Almost all kinds of RNAs are subject to this modification. However, it remains a great challenge to identify pseudouridine sites via experimental approaches, requiring expensive and time-consuming experimental research. Therefore, computational approaches that can be used to perform accurate in silico identification of pseudouridine sites from the large amount of RNA sequence data are highly desirable and can aid in the functional elucidation of this critical modification. Here, we propose a new computational approach, termed Porpoise, to accurately identify pseudouridine sites from RNA sequence data. Porpoise builds upon a comprehensive evaluation of 18 frequently used feature encoding schemes based on the selection of four types of features, including binary features, pseudo k-tuple composition, nucleotide chemical property and position-specific trinucleotide propensity based on single-strand (PSTNPss). The selected features are fed into the stacked ensemble learning framework to enable the construction of an effective stacked model. Both cross-validation tests on the benchmark dataset and independent tests show that Porpoise achieves superior predictive performance than several state-of-the-art approaches. The application of model interpretation tools demonstrates the importance of PSTNPs for the performance of the trained models. This new method is anticipated to facilitate community-wide efforts to identify putative pseudouridine sites and formulate novel testable biological hypothesis.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Seudouridina / ARN / Biología Computacional Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Australia

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Seudouridina / ARN / Biología Computacional Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Australia