Your browser doesn't support javascript.
loading
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.
Xiong, Zehao; Luo, Jiawei; Shi, Wanwan; Liu, Ying; Xu, Zhongyuan; Wang, Bo.
Afiliación
  • Xiong Z; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
  • Luo J; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
  • Shi W; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
  • Liu Y; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
  • Xu Z; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
  • Wang B; College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China.
Bioinformatics ; 39(3)2023 03 01.
Article en En | MEDLINE | ID: mdl-36825817
ABSTRACT
MOTIVATION Single-cell RNA-sequencing (scRNA-seq) is widely used to reveal cellular heterogeneity, complex disease mechanisms and cell differentiation processes. Due to high sparsity and complex gene expression patterns, scRNA-seq data present a large number of dropout events, affecting downstream tasks such as cell clustering and pseudo-time analysis. Restoring the expression levels of genes is essential for reducing technical noise and facilitating downstream analysis. However, existing scRNA-seq data imputation methods ignore the topological structure information of scRNA-seq data and cannot comprehensively utilize the relationships between cells.

RESULTS:

Here, we propose a single-cell Graph Contrastive Learning method for scRNA-seq data imputation, named scGCL, which integrates graph contrastive learning and Zero-inflated Negative Binomial (ZINB) distribution to estimate dropout values. scGCL summarizes global and local semantic information through contrastive learning and selects positive samples to enhance the representation of target nodes. To capture the global probability distribution, scGCL introduces an autoencoder based on the ZINB distribution, which reconstructs the scRNA-seq data based on the prior distribution. Through extensive experiments, we verify that scGCL outperforms existing state-of-the-art imputation methods in clustering performance and gene imputation on 14 scRNA-seq datasets. Further, we find that scGCL can enhance the expression patterns of specific genes in Alzheimer's disease datasets. AVAILABILITY AND IMPLEMENTATION The code and data of scGCL are available on Github https//github.com/zehaoxiong123/scGCL. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Perfilación de la Expresión Génica Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Perfilación de la Expresión Génica Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China