Your browser doesn't support javascript.
loading
Integrative phenotyping framework (iPF): integrative clustering of multiple omics data identifies novel lung disease subphenotypes.
Kim, SungHwan; Herazo-Maya, Jose D; Kang, Dongwan D; Juan-Guardela, Brenda M; Tedrow, John; Martinez, Fernando J; Sciurba, Frank C; Tseng, George C; Kaminski, Naftali.
Afiliación
  • Kim S; Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA. swiss747@gmail.com.
  • Herazo-Maya JD; Department of Statistics, Korea University, Seoul, 5062, South Korea. swiss747@gmail.com.
  • Kang DD; Department of Internal Medicine (Pulmonary, Critical Care and Sleep Medicine), Yale School of Medicine, New Haven, CT, 06520, USA. jose.herazo-maya@yale.edu.
  • Juan-Guardela BM; Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA. donkang75@gmail.com.
  • Tedrow J; Department of Internal Medicine (Pulmonary, Critical Care and Sleep Medicine), Yale School of Medicine, New Haven, CT, 06520, USA. brendyjuan@hotmail.com.
  • Martinez FJ; Department of Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA. tedrowjr@upmc.edu.
  • Sciurba FC; Department of Medicine, Weill Cornell Medical College, New York, NY, 15261, USA. fjm2003@med.cornell.edu.
  • Tseng GC; Department of Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA. sciurbafc@upmc.edu.
  • Kaminski N; Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA. ctseng@pitt.edu.
BMC Genomics ; 16: 924, 2015 Nov 11.
Article en En | MEDLINE | ID: mdl-26560100
ABSTRACT

BACKGROUND:

The increased multi-omics information on carefully phenotyped patients in studies of complex diseases requires novel methods for data integration. Unlike continuous intensity measurements from most omics data sets, phenome data contain clinical variables that are binary, ordinal and categorical.

RESULTS:

In this paper we introduce an integrative phenotyping framework (iPF) for disease subtype discovery. A feature topology plot was developed for effective dimension reduction and visualization of multi-omics data. The approach is free of model assumption and robust to data noises or missingness. We developed a workflow to integrate homogeneous patient clustering from different omics data in an agglomerative manner and then visualized heterogeneous clustering of pairwise omics sources. We applied the framework to two batches of lung samples obtained from patients diagnosed with chronic obstructive lung disease (COPD) or interstitial lung disease (ILD) with well-characterized clinical (phenomic) data, mRNA and microRNA expression profiles. Application of iPF to the first training batch identified clusters of patients consisting of homogenous disease phenotypes as well as clusters with intermediate disease characteristics. Analysis of the second batch revealed a similar data structure, confirming the presence of intermediate clusters. Genes in the intermediate clusters were enriched with inflammatory and immune functional annotations, suggesting that they represent mechanistically distinct disease subphenotypes that may response to immunomodulatory therapies. The iPF software package and all source codes are publicly available.

CONCLUSIONS:

Identification of subclusters with distinct clinical and biomolecular characteristics suggests that integration of phenomic and other omics information could lead to identification of novel mechanism-based disease sub-phenotypes.
Asunto(s)

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Fenotipo / Biología Computacional Tipo de estudio: Etiology_studies / Prognostic_studies Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2015 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Fenotipo / Biología Computacional Tipo de estudio: Etiology_studies / Prognostic_studies Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2015 Tipo del documento: Article