Your browser doesn't support javascript.
loading
InClust+: the deep generative framework with mask modules for multimodal data integration, imputation, and cross-modal generation.
Wang, Lifei; Nie, Rui; Miao, Xuexia; Cai, Yankai; Wang, Anqi; Zhang, Hanwen; Zhang, Jiang; Cai, Jun.
Afiliação
  • Wang L; Shulan (Hangzhou) Hospital, Affiliated to Zhejiang Shuren University Shulan International Medical College, Hangzhou, China. wanglf1020@gmail.com.
  • Nie R; China National Center for Bioinformation, Beijing, China.
  • Miao X; Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.
  • Cai Y; University of Chinese Academy of Sciences, Beijing, 100049, China.
  • Wang A; China National Center for Bioinformation, Beijing, China.
  • Zhang H; Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.
  • Zhang J; School of Economic and Management, China University of Geoscience, Wuhan, China.
  • Cai J; Shulan (Hangzhou) Hospital, Affiliated to Zhejiang Shuren University Shulan International Medical College, Hangzhou, China.
BMC Bioinformatics ; 25(1): 41, 2024 Jan 24.
Article em En | MEDLINE | ID: mdl-38267858
ABSTRACT

BACKGROUND:

With the development of single-cell technology, many cell traits can be measured. Furthermore, the multi-omics profiling technology could jointly measure two or more traits in a single cell simultaneously. In order to process the various data accumulated rapidly, computational methods for multimodal data integration are needed.

RESULTS:

Here, we present inClust+, a deep generative framework for the multi-omics. It's built on previous inClust that is specific for transcriptome data, and augmented with two mask modules designed for multimodal data processing an input-mask module in front of the encoder and an output-mask module behind the decoder. InClust+ was first used to integrate scRNA-seq and MERFISH data from similar cell populations, and to impute MERFISH data based on scRNA-seq data. Then, inClust+ was shown to have the capability to integrate the multimodal data (e.g. tri-modal data with gene expression, chromatin accessibility and protein abundance) with batch effect. Finally, inClust+ was used to integrate an unlabeled monomodal scRNA-seq dataset and two labeled multimodal CITE-seq datasets, transfer labels from CITE-seq datasets to scRNA-seq dataset, and generate the missing modality of protein abundance in monomodal scRNA-seq data. In the above examples, the performance of inClust+ is better than or comparable to the most recent tools in the corresponding task.

CONCLUSIONS:

The inClust+ is a suitable framework for handling multimodal data. Meanwhile, the successful implementation of mask in inClust+ means that it can be applied to other deep learning methods with similar encoder-decoder architecture to broaden the application scope of these models.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Cromatina / Transcriptoma Tipo de estudo: Clinical_trials Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Cromatina / Transcriptoma Tipo de estudo: Clinical_trials Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China
...