Your browser doesn't support javascript.
loading
ILGBMSH: an interpretable classification model for the shRNA target prediction with ensemble learning algorithm.
Zhao, Chengkui; Xu, Nan; Tan, Jingwen; Cheng, Qi; Xie, Weixin; Xu, Jiayu; Wei, Zhenyu; Ye, Jing; Yu, Lei; Feng, Weixing.
Affiliation
  • Zhao C; Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
  • Xu N; Institute of Biomedical Engineering and Technology, Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, No, 3663 North Zhongshan Road, Shanghai 200065, China.
  • Tan J; Shanghai Unicar-Therapy Bio-medicine Technology Co., Ltd, No 1525 Minqiang Road, Shanghai, 201612, China.
  • Cheng Q; Institute of Biomedical Engineering and Technology, Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, No, 3663 North Zhongshan Road, Shanghai 200065, China.
  • Xie W; Shanghai Unicar-Therapy Bio-medicine Technology Co., Ltd, No 1525 Minqiang Road, Shanghai, 201612, China.
  • Xu J; Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
  • Wei Z; Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
  • Ye J; Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
  • Yu L; Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
  • Feng W; Institute of Biomedical Engineering and Technology, Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, No, 3663 North Zhongshan Road, Shanghai 200065, China.
Brief Bioinform ; 23(6)2022 11 19.
Article in En | MEDLINE | ID: mdl-36184189
ABSTRACT
Short hairpin RNA (shRNA)-mediated gene silencing is an important technology to achieve RNA interference, in which the design of potent and reliable shRNA molecules plays a crucial role. However, efficient shRNA target selection through biological technology is expensive and time consuming. Hence, it is crucial to develop a more precise and efficient computational method to design potent and reliable shRNA molecules. In this work, we present an interpretable classification model for the shRNA target prediction using the Light Gradient Boosting Machine algorithm called ILGBMSH. Rather than utilizing only the shRNA sequence feature, we extracted 554 biological and deep learning features, which were not considered in previous shRNA prediction research. We evaluated the performance of our model compared with the state-of-the-art shRNA target prediction models. Besides, we investigated the feature explanation from the model's parameters and interpretable method called Shapley Additive Explanations, which provided us with biological insights from the model. We used independent shRNA experiment data from other resources to prove the predictive ability and robustness of our model. Finally, we used our model to design the miR30-shRNA sequences and conducted a gene knockdown experiment. The experimental result was perfectly in correspondence with our expectation with a Pearson's coefficient correlation of 0.985. In summary, the ILGBMSH model can achieve state-of-the-art shRNA prediction performance and give biological insights from the machine learning model parameters.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2022 Document type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2022 Document type: Article Affiliation country: China