Your browser doesn't support javascript.
loading
CTISL: a dynamic stacking multi-class classification approach for identifying cell types from single-cell RNA-seq data.
Wang, Xiao; Chai, Ziyi; Li, Shaohua; Liu, Yan; Li, Chen; Jiang, Yu; Liu, Quanzhong.
Afiliação
  • Wang X; Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Chai Z; Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Li S; Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Liu Y; School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.
  • Li C; Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
  • Jiang Y; Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China.
  • Liu Q; Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China.
Bioinformatics ; 40(2)2024 02 01.
Article em En | MEDLINE | ID: mdl-38317054
ABSTRACT
MOTIVATION Effective identification of cell types is of critical importance in single-cell RNA-sequencing (scRNA-seq) data analysis. To date, many supervised machine learning-based predictors have been implemented to identify cell types from scRNA-seq datasets. Despite the technical advances of these state-of-the-art tools, most existing predictors were single classifiers, of which the performances can still be significantly improved. It is therefore highly desirable to employ the ensemble learning strategy to develop more accurate computational models for robust and comprehensive identification of cell types on scRNA-seq datasets.

RESULTS:

We propose a two-layer stacking model, termed CTISL (Cell Type Identification by Stacking ensemble Learning), which integrates multiple classifiers to identify cell types. In the first layer, given a reference scRNA-seq dataset with known cell types, CTISL dynamically combines multiple cell-type-specific classifiers (i.e. support-vector machine and logistic regression) as the base learners to deliver the outcomes for the input of a meta-classifier in the second layer. We conducted a total of 24 benchmarking experiments on 17 human and mouse scRNA-seq datasets to evaluate and compare the prediction performance of CTISL and other state-of-the-art predictors. The experiment results demonstrate that CTISL achieves superior or competitive performance compared to these state-of-the-art approaches. We anticipate that CTISL can serve as a useful and reliable tool for cost-effective identification of cell types from scRNA-seq datasets. AVAILABILITY AND IMPLEMENTATION The webserver and source code are freely available at http//bigdata.biocie.cn/CTISLweb/home and https//zenodo.org/records/10568906, respectively.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Célula Única / Análise da Expressão Gênica de Célula Única Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Célula Única / Análise da Expressão Gênica de Célula Única Idioma: En Ano de publicação: 2024 Tipo de documento: Article