Your browser doesn't support javascript.
loading
m5CPred-SVM: a novel method for predicting m5C sites of RNA.
Chen, Xiao; Xiong, Yi; Liu, Yinbo; Chen, Yuqing; Bi, Shoudong; Zhu, Xiaolei.
  • Chen X; School of Sciences, Anhui Agricultural University, Hefei, 230036, Anhui, China.
  • Xiong Y; School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
  • Liu Y; School of Sciences, Anhui Agricultural University, Hefei, 230036, Anhui, China.
  • Chen Y; School of Sciences, Anhui Agricultural University, Hefei, 230036, Anhui, China.
  • Bi S; School of Sciences, Anhui Agricultural University, Hefei, 230036, Anhui, China. bishoudong@163.com.
  • Zhu X; School of Sciences, Anhui Agricultural University, Hefei, 230036, Anhui, China. xlzhu_mdl@hotmail.com.
BMC Bioinformatics ; 21(1): 489, 2020 Oct 30.
Article en En | MEDLINE | ID: mdl-33126851
BACKGROUND: As one of the most common post-transcriptional modifications (PTCM) in RNA, 5-cytosine-methylation plays important roles in many biological functions such as RNA metabolism and cell fate decision. Through accurate identification of 5-methylcytosine (m5C) sites on RNA, researchers can better understand the exact role of 5-cytosine-methylation in these biological functions. In recent years, computational methods of predicting m5C sites have attracted lots of interests because of its efficiency and low-cost. However, both the accuracy and efficiency of these methods are not satisfactory yet and need further improvement. RESULTS: In this work, we have developed a new computational method, m5CPred-SVM, to identify m5C sites in three species, H. sapiens, M. musculus and A. thaliana. To build this model, we first collected benchmark datasets following three recently published methods. Then, six types of sequence-based features were generated based on RNA segments and the sequential forward feature selection strategy was used to obtain the optimal feature subset. After that, the performance of models based on different learning algorithms were compared, and the model based on the support vector machine provided the highest prediction accuracy. Finally, our proposed method, m5CPred-SVM was compared with several existing methods, and the result showed that m5CPred-SVM offered substantially higher prediction accuracy than previously published methods. It is expected that our method, m5CPred-SVM, can become a useful tool for accurate identification of m5C sites. CONCLUSION: In this study, by introducing position-specific propensity related features, we built a new model, m5CPred-SVM, to predict RNA m5C sites of three different species. The result shows that our model outperformed the existing state-of-art models. Our model is available for users through a web server at https://zhulab.ahu.edu.cn/m5CPred-SVM .
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: ARN / 5-Metilcitosina / Máquina de Vectores de Soporte Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Animals / Humans Idioma: En Año: 2020 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: ARN / 5-Metilcitosina / Máquina de Vectores de Soporte Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Animals / Humans Idioma: En Año: 2020 Tipo del documento: Article