Your browser doesn't support javascript.
loading
Supervised learning of high-confidence phenotypic subpopulations from single-cell data.
Ren, Tao; Chen, Canping; Danilov, Alexey V; Liu, Susan; Guan, Xiangnan; Du, Shunyi; Wu, Xiwei; Sherman, Mara H; Spellman, Paul T; Coussens, Lisa M; Adey, Andrew C; Mills, Gordon B; Wu, Ling-Yun; Xia, Zheng.
Afiliación
  • Ren T; Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
  • Chen C; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China.
  • Danilov AV; Computational Biology Program, Oregon Health & Science University, Portland, OR, USA.
  • Liu S; Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA.
  • Guan X; City of Hope National Medical Center, Duarte, CA, USA.
  • Du S; Computational Biology Program, Oregon Health & Science University, Portland, OR, USA.
  • Wu X; Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA.
  • Sherman MH; Department of Oncology Biomarker Development, Genentech Inc, South San Francisco, CA, USA.
  • Spellman PT; Computational Biology Program, Oregon Health & Science University, Portland, OR, USA.
  • Coussens LM; Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA.
  • Adey AC; City of Hope National Medical Center, Duarte, CA, USA.
  • Mills GB; Department of Cell, Developmental & Cancer Biology, Oregon Health & Science University, Portland, OR, USA.
  • Wu LY; Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA.
  • Xia Z; Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA.
bioRxiv ; 2023 Mar 25.
Article en En | MEDLINE | ID: mdl-36993424
ABSTRACT
Accurately identifying phenotype-relevant cell subsets from heterogeneous cell populations is crucial for delineating the underlying mechanisms driving biological or clinical phenotypes. Here, by deploying a learning with rejection strategy, we developed a novel supervised learning framework called PENCIL to identify subpopulations associated with categorical or continuous phenotypes from single-cell data. By embedding a feature selection function into this flexible framework, for the first time, we were able to select informative features and identify cell subpopulations simultaneously, which enables the accurate identification of phenotypic subpopulations otherwise missed by methods incapable of concurrent gene selection. Furthermore, the regression mode of PENCIL presents a novel ability for supervised phenotypic trajectory learning of subpopulations from single-cell data. We conducted comprehensive simulations to evaluate PENCILs versatility in simultaneous gene selection, subpopulation identification and phenotypic trajectory prediction. PENCIL is fast and scalable to analyze 1 million cells within 1 hour. Using the classification mode, PENCIL detected T-cell subpopulations associated with melanoma immunotherapy outcomes. Moreover, when applied to scRNA-seq of a mantle cell lymphoma patient with drug treatment across multiple time points, the regression mode of PENCIL revealed a transcriptional treatment response trajectory. Collectively, our work introduces a scalable and flexible infrastructure to accurately identify phenotype-associated subpopulations from single-cell data.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: BioRxiv Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: BioRxiv Año: 2023 Tipo del documento: Article País de afiliación: China
...