Your browser doesn't support javascript.
loading
HIP: a method for high-dimensional multi-view data integration and prediction accounting for subgroup heterogeneity.
Butts, Jessica; Verace, Leif; Wendt, Christine; Bowler, Russel P; Hersh, Craig P; Long, Qi; Eberly, Lynn; Safo, Sandra E.
Affiliation
  • Butts J; Division of Biostatistics and Health Data Science, University of Minnesota, Minneapolis, MN 55414, USA.
  • Verace L; Division of Biostatistics and Health Data Science, University of Minnesota, Minneapolis, MN 55414, USA.
  • Wendt C; Division of Pulmonary, Allergy and Critical Care, University of Minnesota, Minneapolis, MN 55455, USA.
  • Bowler RP; Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, National Jewish Health, Denver, CO 80206, USA.
  • Hersh CP; Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA.
  • Long Q; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • Eberly L; Division of Biostatistics and Health Data Science, University of Minnesota, Minneapolis, MN 55414, USA.
  • Safo SE; Division of Biostatistics and Health Data Science, University of Minnesota, Minneapolis, MN 55414, USA.
Brief Bioinform ; 25(6)2024 Sep 23.
Article in En | MEDLINE | ID: mdl-39344710
ABSTRACT
Epidemiologic and genetic studies in many complex diseases suggest subgroup disparities (e.g. by sex, race) in disease course and patient outcomes. We consider this from the standpoint of integrative analysis where we combine information from different views (e.g. genomics, proteomics, clinical data). Existing integrative analysis methods ignore the heterogeneity in subgroups, and stacking the views and accounting for subgroup heterogeneity does not model the association among the views. We propose Heterogeneity in Integration and Prediction (HIP), a statistical approach for joint association and prediction that leverages the strengths in each view to identify molecular signatures that are shared by and specific to a subgroup. We apply HIP to proteomics and gene expression data pertaining to chronic obstructive pulmonary disease (COPD) to identify proteins and genes shared by, and unique to, males and females, contributing to the variation in COPD, measured by airway wall thickness. Our COPD findings have identified proteins, genes, and pathways that are common across and specific to males and females, some implicated in COPD, while others could lead to new insights into sex differences in COPD mechanisms. HIP accounts for subgroup heterogeneity in multi-view data, ranks variables based on importance, is applicable to univariate or multivariate continuous outcomes, and incorporates covariate adjustment. With the efficient algorithms implemented using PyTorch, this method has many potential scientific applications and could enhance multiomics research in health disparities. HIP is available at https//github.com/lasandrall/HIP, a video tutorial at https//youtu.be/O6E2OLmeMDo and a Shiny Application at https//multi-viewlearn.shinyapps.io/HIP_ShinyApp/ for users with limited programming experience.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Pulmonary Disease, Chronic Obstructive Limits: Female / Humans / Male Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Pulmonary Disease, Chronic Obstructive Limits: Female / Humans / Male Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom