scTPC: a novel semisupervised deep clustering model for scRNA-seq data.
Qiu, Yushan; Yang, Lingfei; Jiang, Hao; Zou, Quan.
Affiliation
  • Qiu Y; School of Mathematical Sciences, Shenzhen University, Shenzhen, Guangdong 518000, China.
  • Yang L; School of Mathematical Sciences, Shenzhen University, Shenzhen, Guangdong 518000, China.
  • Jiang H; School of Mathematics, Renmin University of China, Haidian District, Beijing 100872, China.
  • Zou Q; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610056, China.
Bioinformatics; 40(5), 2024 May 02.
Article in English | MEDLINE | ID: mdl-38684178
ABSTRACT
MOTIVATION:

Continuous advancements in single-cell RNA sequencing (scRNA-seq) technology have enabled researchers to further explore cell heterogeneity, trajectory inference, the identification of rare cell types, and neurology. Accurate clustering of scRNA-seq data is crucial to single-cell sequencing data analysis. However, the high dimensionality, sparsity, and presence of "false" zero values in the data pose challenges to clustering. Furthermore, current unsupervised clustering algorithms do not effectively leverage prior biological knowledge, which makes cell clustering even more challenging.

RESULTS:

This study investigates a semisupervised clustering model called scTPC, which integrates a triplet constraint, a pairwise constraint, and a cross-entropy constraint in a deep learning framework. Specifically, the model begins by pretraining a denoising autoencoder based on a zero-inflated negative binomial distribution. Deep clustering is then performed in the learned latent feature space using triplet constraints and pairwise constraints generated from a partially labeled set of cells. Finally, to address imbalanced cell-type datasets, a weighted cross-entropy loss is introduced to optimize the model. Experiments on 10 real scRNA-seq datasets and five simulated datasets demonstrate that scTPC achieves accurate clustering.

AVAILABILITY AND IMPLEMENTATION:

scTPC is a Python-based algorithm, and the code is available from https://github.com/LF-Yang/Code or https://zenodo.org/records/10951780.
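
To illustrate how constraints might be derived from a labeled subset of cells, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the random triplet sampling, the inverse-frequency class weights, and the normalization of the weighted cross-entropy are all assumptions made for illustration, since the abstract does not specify these details.

```python
import itertools
import math
import random
from collections import Counter


def pairwise_constraints(labels):
    """Build must-link and cannot-link pairs from labeled cells.

    labels: dict mapping cell index -> cell-type label (labeled subset only).
    """
    must_link, cannot_link = [], []
    for i, j in itertools.combinations(sorted(labels), 2):
        if labels[i] == labels[j]:
            must_link.append((i, j))
        else:
            cannot_link.append((i, j))
    return must_link, cannot_link


def triplet_constraints(labels, n_triplets, seed=0):
    """Sample (anchor, positive, negative) triplets (hypothetical scheme).

    The anchor and positive share a label; the negative has a different one.
    """
    rng = random.Random(seed)
    by_type = {}
    for idx, lab in labels.items():
        by_type.setdefault(lab, []).append(idx)
    # An anchor type needs at least two cells to supply a positive.
    anchor_types = [t for t, cells in by_type.items() if len(cells) >= 2]
    if not anchor_types or len(by_type) < 2:
        return []
    triplets = []
    for _ in range(n_triplets):
        t = rng.choice(anchor_types)
        anchor, positive = rng.sample(by_type[t], 2)
        neg_type = rng.choice([u for u in by_type if u != t])
        negative = rng.choice(by_type[neg_type])
        triplets.append((anchor, positive, negative))
    return triplets


def inverse_frequency_weights(labels):
    """Class weights n / (k * count), so rare cell types weigh more (assumed)."""
    counts = Counter(labels.values())
    n, k = len(labels), len(counts)
    return {t: n / (k * c) for t, c in counts.items()}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean of -log p(true label) over the labeled cells.

    probs: dict mapping cell index -> {label: predicted probability}.
    """
    total = sum(-weights[labels[i]] * math.log(probs[i][labels[i]]) for i in labels)
    return total / sum(weights[labels[i]] for i in labels)
```

In a deep clustering setting, pairs and triplets like these would typically feed distance-based penalties on the latent embeddings, while the class weights would scale the per-cell classification loss; how scTPC combines them is described in the full paper and the linked code.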
Subject(s)

Full text: 1 Collection: 01-international Database: MEDLINE Main subject: Algorithms / Single-Cell Analysis Limits: Humans Language: En Journal: Bioinformatics Journal subject: MEDICAL INFORMATICS Year: 2024 Document type: Article Affiliation country: China
