Your browser doesn't support javascript.
loading
CGD: Comprehensive guide designer for CRISPR-Cas systems.
Menon, A Vipin; Sohn, Jang-Il; Nam, Jin-Wu.
Afiliação
  • Menon AV; Department of Life Science, College of Natural Sciences, Hanyang University, Seoul 04763, Republic of Korea.
  • Sohn JI; Department of Life Science, College of Natural Sciences, Hanyang University, Seoul 04763, Republic of Korea.
  • Nam JW; Research Institute for Convergence of Basic Sciences, Hanyang University, Seoul 04763, Republic of Korea.
Comput Struct Biotechnol J ; 18: 814-820, 2020.
Article em En | MEDLINE | ID: mdl-32308928
ABSTRACT
The Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas systems, including dead Cas9 (dCas9), Cas9, and Cas12a, have revolutionized genome engineering in mammalian somatic cells. Although computational tools that assess the target sites of CRISPR-Cas systems are inevitably important for designing efficient guide RNAs (gRNAs), they exhibit generalization issues in selecting features and do not provide optimal results in a comprehensive manner. Here, we introduce a Comprehensive Guide Designer (CGD) for four different CRISPR systems, which utilizes the machine learning algorithm, Elastic Net Logistic Regression (ENLOR), to autonomously generalize the models. CGD contains specific models trained with public datasets generated by CRISPRi, CRISPRa, CRISPR-Cas9, and CRISPR-Cas12a (designated as CGDi, CGDa, CGD9, and CGD12a, respectively) in an unbiased manner. The trained CGD models were benchmarked to other regression-based machine learning models, such as ElasticNet Linear Regression (ENLR), Random Forest and Boruta (RFB), and Extreme Gradient Boosting (Xgboost) with inbuilt feature selection. Evaluation with independent test datasets showed that CGD models outperformed the pre-existing methods in predicting the efficacy of gRNAs. All CGD source codes and datasets are available at GitHub (https//github.com/vipinmenon1989/CGD), and the CGD webserver can be accessed at http//big.hanyang.ac.kr2195/CGD.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Revista: Comput Struct Biotechnol J Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Revista: Comput Struct Biotechnol J Ano de publicação: 2020 Tipo de documento: Article