Batch correction of single-cell sequencing data via an autoencoder architecture.

Danino, Reut; Nachman, Iftach; Sharan, Roded

Danino, Reut; Nachman, Iftach; Sharan, Roded.

Afiliación

Danino R; Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 6997801, Israel.
Nachman I; School of Neurobiology, Biochemistry and Biophysics, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
Sharan R; Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 6997801, Israel.

Bioinform Adv ; 4(1): vbad186, 2024.

Article en En | MEDLINE | ID: mdl-38213820

ABSTRACT

ABSTRACT

Motivation Technical differences between gene expression sequencing experiments can cause variations in the data in the form of batch effect biases. These do not represent true biological variations between samples and can lead to false conclusions or hinder the ability to integrate multiple datasets. Since there is a growing need for the joint analysis of single-cell sequencing datasets from different sources, there is also a need to correct the resulting batch effects while maintaining the true biological variations in the data.

Results:

We developed a semi-supervised deep learning architecture called Autoencoder-based Batch Correction (ABC) for integrating single-cell sequencing datasets. Our method removes batch effects through a guided process of data compression using supervised cell type classifier branches for biological signal retention. It aligns the different batches using an adversarial training approach. We comprehensively evaluate the performance of our method using four single-cell sequencing datasets and multiple measures for batch effect removal and biological variation conservation. ABC outperforms 10 state-of-the-art methods for this task including Seurat, scGen, ComBat, scanorama, scVI, scANVI, AutoClass, Harmony, scDREAMER, and CLEAR, correcting various types of batch effects while preserving intricate biological variations.

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: Bioinform Adv Año: 2024 Tipo del documento: Article País de afiliación: Israel

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: Bioinform Adv Año: 2024 Tipo del documento: Article País de afiliación: Israel