Your browser doesn't support javascript.
loading
PIPET: predicting relevant subpopulations in single-cell data using phenotypic information from bulk data.
Ruan, Xinjia; Cheng, Yu; Ye, Yuqing; Wang, Yuhang; Chen, Xinyi; Yang, Yuqing; Liu, Tiantian; Yan, Fangrong.
Afiliação
  • Ruan X; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Cheng Y; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Ye Y; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Wang Y; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Chen X; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Yang Y; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Liu T; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
  • Yan F; Research Center of Biostatistics and Computational Pharmacy, China Pharmaceutical University, Nanjing 211198, P.R. China.
Brief Bioinform ; 25(4)2024 May 23.
Article em En | MEDLINE | ID: mdl-38819254
ABSTRACT
Single-cell RNA sequencing has revealed cellular heterogeneity in complex tissues, notably benefiting research on diseases such as cancer. However, the integration of single-cell data from small samples with extensive clinical features in bulk data remains underexplored. In this study, we introduce PIPET, an algorithmic method for predicting relevant subpopulations in single-cell data based on multivariate phenotypic information from bulk data. PIPET generates feature vectors for each phenotype from differentially expressed genes in bulk data and then identifies relevant cellular subpopulations by assessing the similarity between single-cell data and these vectors. Subsequently, phenotype-related cell states can be analyzed based on these subpopulations. In simulated datasets, PIPET showed robust performance in predicting multiclassification cellular subpopulations. Application of PIPET to lung adenocarcinoma single-cell RNA sequencing data revealed cellular subpopulations with poor survival and associations with TP53 mutations. Similarly, in breast cancer single-cell data, PIPET identified cellular subpopulations associated with the PAM50 clinical subtypes and triple-negative breast cancer subtypes. Overall, PIPET effectively identified relevant cellular subpopulations in single-cell data, guided by phenotypic information from bulk data. This approach comprehensively delineates the molecular characteristics of each cellular subpopulation, offering insights into disease-related subpopulations and guiding personalized treatment strategies.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Algoritmos / Análise de Célula Única Limite: Female / Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Algoritmos / Análise de Célula Única Limite: Female / Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article