Your browser doesn't support javascript.
loading
Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.
Argelaguet, Ricard; Velten, Britta; Arnol, Damien; Dietrich, Sascha; Zenz, Thorsten; Marioni, John C; Buettner, Florian; Huber, Wolfgang; Stegle, Oliver.
Afiliação
  • Argelaguet R; European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK.
  • Velten B; European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.
  • Arnol D; European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK.
  • Dietrich S; Heidelberg University Hospital, Heidelberg, Germany.
  • Zenz T; Heidelberg University Hospital, Heidelberg, Germany.
  • Marioni JC; German Cancer Research Center (dkfz) and National Center for Tumor Diseases (NCT), Heidelberg, Germany.
  • Buettner F; Germany & Hematology, University Hospital Zurich and University of Zurich, Zurich, Switzerland.
  • Huber W; European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK.
  • Stegle O; Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK.
Mol Syst Biol ; 14(6): e8124, 2018 06 20.
Article em En | MEDLINE | ID: mdl-29925568
Multi-omics studies promise the improved characterization of biological processes across molecular layers. However, methods for the unsupervised integration of the resulting heterogeneous data sets are lacking. We present Multi-Omics Factor Analysis (MOFA), a computational method for discovering the principal sources of variation in multi-omics data sets. MOFA infers a set of (hidden) factors that capture biological and technical sources of variability. It disentangles axes of heterogeneity that are shared across multiple modalities and those specific to individual data modalities. The learnt factors enable a variety of downstream analyses, including identification of sample subgroups, data imputation and the detection of outlier samples. We applied MOFA to a cohort of 200 patient samples of chronic lymphocytic leukaemia, profiled for somatic mutations, RNA expression, DNA methylation and ex vivo drug responses. MOFA identified major dimensions of disease heterogeneity, including immunoglobulin heavy-chain variable region status, trisomy of chromosome 12 and previously underappreciated drivers, such as response to oxidative stress. In a second application, we used MOFA to analyse single-cell multi-omics data, identifying coordinated transcriptional and epigenetic changes along cell differentiation.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Biologia Computacional / Conjuntos de Dados como Assunto Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Biologia Computacional / Conjuntos de Dados como Assunto Idioma: En Ano de publicação: 2018 Tipo de documento: Article