Your browser doesn't support javascript.
loading
Joint clinical and molecular subtyping of COPD with variational autoencoders.
Maiorino, Enrico; De Marzio, Margherita; Xu, Zhonghui; Yun, Jeong H; Chase, Robert P; Hersh, Craig P; Weiss, Scott T; Silverman, Edwin K; Castaldi, Peter J; Glass, Kimberly.
Affiliation
  • Maiorino E; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • De Marzio M; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Xu Z; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Yun JH; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Chase RP; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Hersh CP; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Weiss ST; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Silverman EK; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Castaldi PJ; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
  • Glass K; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School.
medRxiv ; 2024 Jan 10.
Article in En | MEDLINE | ID: mdl-38260473
ABSTRACT
Chronic Obstructive Pulmonary Disease (COPD) is a complex, heterogeneous disease. Traditional subtyping methods generally focus on either the clinical manifestations or the molecular endotypes of the disease, resulting in classifications that do not fully capture the disease's complexity. Here, we bridge this gap by introducing a subtyping pipeline that integrates clinical and gene expression data with variational autoencoders. We apply this methodology to the COPDGene study, a large study of current and former smoking individuals with and without COPD. Our approach generates a set of vector embeddings, called Personalized Integrated Profiles (PIPs), that recapitulate the joint clinical and molecular state of the subjects in the study. Prediction experiments show that the PIPs have a predictive accuracy comparable to or better than other embedding approaches. Using trajectory learning approaches, we analyze the main trajectories of variation in the PIP space and identify five well-separated subtypes with distinct clinical phenotypes, expression signatures, and disease outcomes. Notably, these subtypes are more robust to data resampling compared to those identified using traditional clustering approaches. Overall, our findings provide new avenues to establish fine-grained associations between the clinical characteristics, molecular processes, and disease outcomes of COPD.

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies Language: En Journal: MedRxiv Year: 2024 Document type: Article Country of publication: Estados Unidos

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies Language: En Journal: MedRxiv Year: 2024 Document type: Article Country of publication: Estados Unidos