Your browser doesn't support javascript.
loading
A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data.
Mo, Qianxing; Shen, Ronglai; Guo, Cui; Vannucci, Marina; Chan, Keith S; Hilsenbeck, Susan G.
Affiliation
  • Mo Q; Division of Biostatistics, Dan L. Duncan Cancer Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA qmo@bcm.edu.
  • Shen R; Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, 485 Lexington Ave, New York, NY 10017, USA.
  • Guo C; Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, USA.
  • Vannucci M; Department of Statistics, Rice University, 6100 Main Street, Houston, TX 77030, USA.
  • Chan KS; Molecular & Cellular Biology/Scott Department of Urology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
  • Hilsenbeck SG; Division of Biostatistics, Dan L. Duncan Cancer Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
Biostatistics ; 19(1): 71-86, 2018 01 01.
Article in En | MEDLINE | ID: mdl-28541380
Identification of clinically relevant tumor subtypes and omics signatures is an important task in cancer translational research for precision medicine. Large-scale genomic profiling studies such as The Cancer Genome Atlas (TCGA) Research Network have generated vast amounts of genomic, transcriptomic, epigenomic, and proteomic data. While these studies have provided great resources for researchers to discover clinically relevant tumor subtypes and driver molecular alterations, there are few computationally efficient methods and tools for integrative clustering analysis of these multi-type omics data. Therefore, the aim of this article is to develop a fully Bayesian latent variable method (called iClusterBayes) that can jointly model omics data of continuous and discrete data types for identification of tumor subtypes and relevant omics features. Specifically, the proposed method uses a few latent variables to capture the inherent structure of multiple omics data sets to achieve joint dimension reduction. As a result, the tumor samples can be clustered in the latent variable space and relevant omics features that drive the sample clustering are identified through Bayesian variable selection. This method significantly improve on the existing integrative clustering method iClusterPlus in terms of statistical inference and computational speed. By analyzing TCGA and simulated data sets, we demonstrate the excellent performance of the proposed method in revealing clinically meaningful tumor subtypes and driver omics features.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Models, Statistical / Bayes Theorem / Genomics / Neoplasms Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Biostatistics Year: 2018 Document type: Article Affiliation country: United States Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Models, Statistical / Bayes Theorem / Genomics / Neoplasms Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Biostatistics Year: 2018 Document type: Article Affiliation country: United States Country of publication: United kingdom