Your browser doesn't support javascript.
loading
BEENE: deep learning-based nonlinear embedding improves batch effect estimation.
Rahman, Md Ashiqur; Tutul, Abdullah Aman; Sharmin, Mahfuza; Bayzid, Md Shamsuzzoha.
Afiliação
  • Rahman MA; Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.
  • Tutul AA; Department of Computer Science, Purdue University, West Lafayette, IN 47907, United States.
  • Sharmin M; Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.
  • Bayzid MS; Department of Computer Science and Engineering, Texas A&M University, College Station, TX 77843, United States.
Bioinformatics ; 39(8)2023 08 01.
Article em En | MEDLINE | ID: mdl-37561107
MOTIVATION: Analyzing large-scale single-cell transcriptomic datasets generated using different technologies is challenging due to the presence of batch-specific systematic variations known as batch effects. Since biological and technological differences are often interspersed, detecting and accounting for batch effects in RNA-seq datasets are critical for effective data integration and interpretation. Low-dimensional embeddings, such as principal component analysis (PCA) are widely used in visual inspection and estimation of batch effects. Linear dimensionality reduction methods like PCA are effective in assessing the presence of batch effects, especially when batch effects exhibit linear patterns. However, batch effects are inherently complex and existing linear dimensionality reduction methods could be inadequate and imprecise in the presence of sophisticated nonlinear batch effects. RESULTS: We present Batch Effect Estimation using Nonlinear Embedding (BEENE), a deep nonlinear auto-encoder network which is specially tailored to generate an alternative lower dimensional embedding suitable for both linear and nonlinear batch effects. BEENE simultaneously learns the batch and biological variables from RNA-seq data, resulting in an embedding that is more robust and sensitive than PCA embedding in terms of detecting and quantifying batch effects. BEENE was assessed on a collection of carefully controlled simulated datasets as well as biological datasets, including two technical replicates of mouse embryogenesis cells, peripheral blood mononuclear cells from three largely different experiments and five studies of pancreatic islet cells. AVAILABILITY AND IMPLEMENTATION: BEENE is freely available as an open source project at https://github.com/ashiq24/BEENE.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Limite: Animals Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Bangladesh

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Limite: Animals Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Bangladesh