Your browser doesn't support javascript.
loading
BiPSTP: Sequence feature encoding method for identifying different RNA modifications with bidirectional position-specific trinucleotides propensities.
Wang, Mingzhao; Ali, Haider; Xu, Yandi; Xie, Juanying; Xu, Shengquan.
Afiliación
  • Wang M; School of Computer Science, Shaanxi Normal University, Xi'an, China.
  • Ali H; School of Computer Science, Shaanxi Normal University, Xi'an, China.
  • Xu Y; School of Computer Science, Shaanxi Normal University, Xi'an, China; College of Life Sciences, Shaanxi Normal University, Xi'an, China.
  • Xie J; School of Computer Science, Shaanxi Normal University, Xi'an, China. Electronic address: xiejuany@snnu.edu.cn.
  • Xu S; College of Life Sciences, Shaanxi Normal University, Xi'an, China. Electronic address: xushengquan@snnu.edu.cn.
J Biol Chem ; 300(4): 107140, 2024 Apr.
Article en En | MEDLINE | ID: mdl-38447795
ABSTRACT
RNA modification, a posttranscriptional regulatory mechanism, significantly influences RNA biogenesis and function. The accurate identification of modification sites is paramount for investigating their biological implications. Methods for encoding RNA sequence into numerical data play a crucial role in developing robust models for predicting modification sites. However, existing techniques suffer from limitations, including inadequate information representation, challenges in effectively integrating positional and sequential information, and the generation of irrelevant or redundant features when combining multiple approaches. These deficiencies hinder the effectiveness of machine learning models in addressing the performance challenges associated with predicting RNA modification sites. Here, we introduce a novel RNA sequence feature representation method, named BiPSTP, which utilizes bidirectional trinucleotide position-specific propensities. We employ the parameter ξ to denote the interval between the current nucleotide and its adjacent forward or backward dinucleotide, enabling the extraction of positional and sequential information from RNA sequences. Leveraging the BiPSTP method, we have developed the prediction model mRNAPred using support vector machine classifier to identify multiple types of RNA modification sites. We evaluate the performance of our BiPSTP method and mRNAPred model across 12 distinct RNA modification types. Our experimental results demonstrate the superiority of the mRNAPred model compared to state-of-art models in the domain of RNA modification sites identification. Importantly, our BiPSTP method enhances the robustness and generalization performance of prediction models. Notably, it can be applied to feature extraction from DNA sequences to predict other biological modification sites.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN / Procesamiento Postranscripcional del ARN / Máquina de Vectores de Soporte Idioma: En Revista: J Biol Chem Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN / Procesamiento Postranscripcional del ARN / Máquina de Vectores de Soporte Idioma: En Revista: J Biol Chem Año: 2024 Tipo del documento: Article País de afiliación: China