Your browser doesn't support javascript.
loading
Evaluation of single-cell RNAseq labelling algorithms using cancer datasets.
Christensen, Erik; Luo, Ping; Turinsky, Andrei; Husic, Mia; Mahalanabis, Alaina; Naidas, Alaine; Diaz-Mejia, Juan Javier; Brudno, Michael; Pugh, Trevor; Ramani, Arun; Shooshtari, Parisa.
Afiliación
  • Christensen E; Department of Computer Science, University of Western Ontario, London, ON, Canada.
  • Luo P; Children's Health Research Institute, Lawson Research Institute, London, ON, Canada.
  • Turinsky A; Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
  • Husic M; Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada.
  • Mahalanabis A; Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada.
  • Naidas A; Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada.
  • Diaz-Mejia JJ; Children's Health Research Institute, Lawson Research Institute, London, ON, Canada.
  • Brudno M; Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada.
  • Pugh T; Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
  • Ramani A; Department of Computer Science, University of Toronto, Toronto, ON, Canada.
  • Shooshtari P; Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
Brief Bioinform ; 24(1)2023 01 19.
Article en En | MEDLINE | ID: mdl-36585784
ABSTRACT
Single-cell RNA sequencing (scRNA-seq) clustering and labelling methods are used to determine precise cellular composition of tissue samples. Automated labelling methods rely on either unsupervised, cluster-based approaches or supervised, cell-based approaches to identify cell types. The high complexity of cancer poses a unique challenge, as tumor microenvironments are often composed of diverse cell subpopulations with unique functional effects that may lead to disease progression, metastasis and treatment resistance. Here, we assess 17 cell-based and 9 cluster-based scRNA-seq labelling algorithms using 8 cancer datasets, providing a comprehensive large-scale assessment of such methods in a cancer-specific context. Using several performance metrics, we show that cell-based methods generally achieved higher performance and were faster compared to cluster-based methods. Cluster-based methods more successfully labelled non-malignant cell types, likely because of a lack of gene signatures for relevant malignant cell subpopulations. Larger cell numbers present in some cell types in training data positively impacted prediction scores for cell-based methods. Finally, we examined which methods performed favorably when trained and tested on separate patient cohorts in scenarios similar to clinical applications, and which were able to accurately label particularly small or under-represented cell populations in the given datasets. We conclude that scPred and SVM show the best overall performances with cancer-specific data and provide further suggestions for algorithm selection. Our analysis pipeline for assessing the performance of cell type labelling algorithms is available in https//github.com/shooshtarilab/scRNAseq-Automated-Cell-Type-Labelling.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Expresión Génica de una Sola Célula / Neoplasias Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Expresión Génica de una Sola Célula / Neoplasias Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Canadá