Search | VHL Regional Portal

1.

Connecting intermediate phenotypes to disease using multi-omics in heart failure.

Moore, Anni; Venkatesh, Rasika; Levin, Michael G; Damrauer, Scott M; Reza, Nosheen; Cappola, Thomas P; Ritchie, Marylyn D.

medRxiv ; 2024 Aug 07.

Article in English | MEDLINE | ID: mdl-39148828

ABSTRACT

Heart failure (HF) is one of the most common, complex, heterogeneous diseases in the world, with over 1-3% of the global population living with the condition. Progression of HF can be tracked via MRI measures of structural and functional changes to the heart, namely left ventricle (LV), including ejection fraction, mass, end-diastolic volume, and LV end-systolic volume. Moreover, while genome-wide association studies (GWAS) have been a useful tool to identify candidate variants involved in HF risk, they lack crucial tissue-specific and mechanistic information which can be gained from incorporating additional data modalities. This study addresses this gap by incorporating transcriptome-wide and proteome-wide association studies (TWAS and PWAS) to gain insights into genetically-regulated changes in gene expression and protein abundance in precursors to HF measured using MRI-derived cardiac measures as well as full-stage all-cause HF. We identified several gene and protein overlaps between LV ejection fraction and end-systolic volume measures. Many of the overlaps identified in MRI-derived measurements through TWAS and PWAS appear to be shared with all-cause HF. We implicate many putative pathways relevant in HF associated with these genes and proteins via gene-set enrichment and protein-protein interaction network approaches. The results of this study (1) highlight the benefit of using multi-omics to better understand genetics and (2) provide novel insights as to how changes in heart structure and function may relate to HF.

2.

Protein-truncating variant in APOL3 increases chronic kidney disease risk in epistasis with APOL1 risk alleles.

Zhang, David Y; Levin, Michael G; Duda, Jeffrey T; Landry, Latrice G; Witschey, Walter R; Damrauer, Scott M; Ritchie, Marylyn D; Rader, Daniel J.

JCI Insight ; 2024 Aug 20.

Article in English | MEDLINE | ID: mdl-39163132

ABSTRACT

BACKGROUND: Two coding alleles within the APOL1 gene, G1 and G2, found almost exclusively in individuals genetically similar to West African populations, contribute substantially to the pathogenesis of chronic kidney disease (CKD). The APOL gene cluster on chromosome 22 contains a total of six APOL genes that have arisen as a result of gene duplication. METHODS: Using a genome-first approach in the Penn Medicine Biobank, we identified 62 protein-altering variants in the six APOL genes with a minor allele frequency > 0.1% in a population of participants genetically similar to African reference populations and performed population-specific phenome-wide association studies. RESULTS: We identified rs1108978, a stop-gain variant in APOL3 (p.Q58*), to be significantly associated with increased CKD risk, even after conditioning on APOL1 G1/G2 carrier status. These findings were replicated in the Veterans Affairs Million Veteran Program and the All of Us Research Program. APOL3 p.Q58* was also significantly associated with a number of quantitative traits linked to CKD including decreased kidney volume. This truncating variant contributed the most risk for CKD in patients monoallelic for APOL1 G1/G2, suggesting an epistatic interaction and a potential protective effect of wild-type APOL3 against APOL1-induced kidney disease. CONCLUSION: This study demonstrates the utility of targeting population-specific variants in a genome-first approach, even in the context of well-studied gene-disease relationships. FUNDING: National Heart, Lung, and Blood Institute (F30HL172382, R01HL169378, R01HL169458), Doris Duke Foundation (grant 2023-0224), National Institute of Biomedical Imaging and Bioengineering (P41EB029460), National Center for Advancing Translational Sciences (UL1-TR-001878).

3.

Protocol for a type 3 hybrid implementation cluster randomized clinical trial to evaluate the effect of patient and clinician nudges to advance the use of genomic medicine across a diverse health system.

Raper, Anna C; Weathers, Benita L; Drivas, Theodore G; Ellis, Colin A; Kripke, Colleen Morse; Oyer, Randall A; Owens, Anjali T; Verma, Anurag; Wileyto, Paul E; Wollack, Colin C; Zhou, Wenting; Ritchie, Marylyn D; Schnoll, Robert A; Nathanson, Katherine L.

Implement Sci ; 19(1): 61, 2024 Aug 19.

Article in English | MEDLINE | ID: mdl-39160614

ABSTRACT

BACKGROUND: Germline genetic testing is recommended for an increasing number of conditions with underlying genetic etiologies, the results of which impact medical management. However, genetic testing is underutilized in clinics due to system, clinician, and patient level barriers. Behavioral economics provides a framework to create implementation strategies, such as nudges, to address these multi-level barriers and increase the uptake of genetic testing for conditions where the results impact medical management. METHODS: Patients meeting eligibility for germline genetic testing for a group of conditions will be identified using electronic phenotyping algorithms. A pragmatic, type 3 hybrid cluster randomization study will test nudges to patients and/or clinicians, or neither. Clinicians who receive nudges will be prompted to either refer their patient to genetics or order genetic testing themselves. We will use rapid cycle approaches informed by clinician and patient experiences, health equity, and behavioral economics to optimize these nudges before trial initiation. The primary implementation outcome is uptake of germline genetic testing for the pre-selected health conditions. Patient data collected through the electronic health record (e.g. demographics, geocoded address) will be examined as moderators of the effect of nudges. DISCUSSION: This study will be one of the first randomized trials to examine the effects of patient- and clinician-directed nudges informed by behavioral economics on uptake of genetic testing. The pragmatic design will facilitate a large and diverse patient sample, allow for the assessment of genetic testing uptake, and provide comparison of the effect of different nudge combinations. This trial also involves optimization of patient identification, test selection, ordering, and result reporting in an electronic health record-based infrastructure to further address clinician-level barriers to utilizing genomic medicine. The findings may help determine the impact of low-cost, sustainable implementation strategies that can be integrated into health care systems to improve the use of genomic medicine. TRIAL REGISTRATION: ClinicalTrials.gov. NCT06377033. Registered on March 31, 2024. https://clinicaltrials.gov/study/NCT06377033?term=NCT06377033&rank=1.

Subject(s)

Genetic Testing , Genomics , Humans , Genetic Testing/methods , Genomics/methods , Electronic Health Records , Economics, Behavioral , Implementation Science

4.

Genetic risk factors for COVID-19 and influenza are largely distinct.

Kosmicki, Jack A; Marcketta, Anthony; Sharma, Deepika; Di Gioia, Silvio Alessandro; Batista, Samantha; Yang, Xiao-Man; Tzoneva, Gannie; Martinez, Hector; Sidore, Carlo; Kessler, Michael D; Horowitz, Julie E; Roberts, Genevieve H L; Justice, Anne E; Banerjee, Nilanjana; Coignet, Marie V; Leader, Joseph B; Park, Danny S; Lanche, Rouel; Maxwell, Evan; Knight, Spencer C; Bai, Xiaodong; Guturu, Harendra; Baltzell, Asher; Girshick, Ahna R; McCurdy, Shannon R; Partha, Raghavendran; Mansfield, Adam J; Turissini, David A; Zhang, Miao; Mbatchou, Joelle; Watanabe, Kyoko; Verma, Anurag; Sirugo, Giorgio; Ritchie, Marylyn D; Salerno, William J; Shuldiner, Alan R; Rader, Daniel J; Mirshahi, Tooraj; Marchini, Jonathan; Overton, John D; Carey, David J; Habegger, Lukas; Reid, Jeffrey G; Economides, Aris; Kyratsous, Christos; Karalis, Katia; Baum, Alina; Cantor, Michael N; Rand, Kristin A; Hong, Eurie L.

Nat Genet ; 56(8): 1592-1596, 2024 Aug.

Article in English | MEDLINE | ID: mdl-39103650

ABSTRACT

Coronavirus disease 2019 (COVID-19) and influenza are respiratory illnesses caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and influenza viruses, respectively. Both diseases share symptoms and clinical risk factors1, but the extent to which these conditions have a common genetic etiology is unknown. This is partly because host genetic risk factors are well characterized for COVID-19 but not for influenza, with the largest published genome-wide association studies for these conditions including >2 million individuals2 and about 1,000 individuals3-6, respectively. Shared genetic risk factors could point to targets to prevent or treat both infections. Through a genetic study of 18,334 cases with a positive test for influenza and 276,295 controls, we show that published COVID-19 risk variants are not associated with influenza. Furthermore, we discovered and replicated an association between influenza infection and noncoding variants in B3GALT5 and ST6GAL1, neither of which was associated with COVID-19. In vitro small interfering RNA knockdown of ST6GAL1-an enzyme that adds sialic acid to the cell surface, which is used for viral entry-reduced influenza infectivity by 57%. These results mirror the observation that variants that downregulate ACE2, the SARS-CoV-2 receptor, protect against COVID-19 (ref. 7). Collectively, these findings highlight downregulation of key cell surface receptors used for viral entry as treatment opportunities to prevent COVID-19 and influenza.

Subject(s)

COVID-19 , Genetic Predisposition to Disease , Genome-Wide Association Study , Influenza, Human , SARS-CoV-2 , Humans , Influenza, Human/genetics , Influenza, Human/epidemiology , Influenza, Human/virology , COVID-19/genetics , COVID-19/virology , Risk Factors , SARS-CoV-2/genetics , Male , Female , Polymorphism, Single Nucleotide , Case-Control Studies , Middle Aged

5.

Multi-organ imaging-derived polygenic indexes for brain and body health.

Yang, Xiaochen; Sullivan, Patrick F; Li, Bingxuan; Fan, Zirui; Ding, Dezheng; Shu, Juan; Guo, Yuxin; Paschou, Peristera; Bao, Jingxuan; Shen, Li; Ritchie, Marylyn D; Nave, Gideon; Platt, Michael L; Li, Tengfei; Zhu, Hongtu; Zhao, Bingxin.

medRxiv ; 2024 Jun 03.

Article in English | MEDLINE | ID: mdl-38883759

ABSTRACT

The UK Biobank (UKB) imaging project is a crucial resource for biomedical research, but is limited to 100,000 participants due to cost and accessibility barriers. Here we used genetic data to predict heritable imaging-derived phenotypes (IDPs) for a larger cohort. We developed and evaluated 4,375 IDP genetic scores (IGS) derived from UKB brain and body images. When applied to UKB participants who were not imaged, IGS revealed links to numerous phenotypes and stratified participants at increased risk for both brain and somatic diseases. For example, IGS identified individuals at higher risk for Alzheimer's disease and multiple sclerosis, offering additional insights beyond traditional polygenic risk scores of these diseases. When applied to independent external cohorts, IGS also stratified those at high disease risk in the All of Us Research Program and the Alzheimer's Disease Neuroimaging Initiative study. Our results demonstrate that, while the UKB imaging cohort is largely healthy and may not be the most enriched for disease risk management, it holds immense potential for stratifying the risk of various brain and body diseases in broader external genetic cohorts.

6.

Interpretable deep clustering survival machines for Alzheimer's disease subtype discovery.

Hou, Bojian; Wen, Zixuan; Bao, Jingxuan; Zhang, Richard; Tong, Boning; Yang, Shu; Wen, Junhao; Cui, Yuhan; Moore, Jason H; Saykin, Andrew J; Huang, Heng; Thompson, Paul M; Ritchie, Marylyn D; Davatzikos, Christos; Shen, Li.

Med Image Anal ; 97: 103231, 2024 Jun 14.

Article in English | MEDLINE | ID: mdl-38941858

ABSTRACT

Alzheimer's disease (AD) is a complex neurodegenerative disorder that has impacted millions of people worldwide. The neuroanatomical heterogeneity of AD has made it challenging to fully understand the disease mechanism. Identifying AD subtypes during the prodromal stage and determining their genetic basis would be immensely valuable for drug discovery and subsequent clinical treatment. Previous studies that clustered subgroups typically used unsupervised learning techniques, neglecting the survival information and potentially limiting the insights gained. To address this problem, we propose an interpretable survival analysis method called Deep Clustering Survival Machines (DCSM), which combines both discriminative and generative mechanisms. Similar to mixture models, we assume that the timing information of survival data can be generatively described by a mixture of parametric distributions, referred to as expert distributions. We learn the weights of these expert distributions for individual instances in a discriminative manner by leveraging their features. This allows us to characterize the survival information of each instance through a weighted combination of the learned expert distributions. We demonstrate the superiority of the DCSM method by applying this approach to cluster patients with mild cognitive impairment (MCI) into subgroups with different risks of converting to AD. Conventional clustering measurements for survival analysis along with genetic association studies successfully validate the effectiveness of the proposed method and characterize our clustering findings.

7.

SABER: Statistical Identification of Loci of Interest in GWAS Summary Statistics using a Bayesian Gaussian Mixture Model.

Kumar, Rachit; Venkatesh, Rasika; Ritchie, Marylyn D.

AMIA Jt Summits Transl Sci Proc ; 2024: 575-583, 2024.

Article in English | MEDLINE | ID: mdl-38827044

ABSTRACT

Genome-wide association studies (GWAS) remain a popular method for identifying novel genetic associations with human phenotypes and have provided many insights into the etiology of many diseases. However, GWAS provide limited support for how a genetic association might contribute to disease due to inherent limitations, such as linkage disequilibrium. As such, many methods that operate on GWAS summary statistics have been developed to generate evidence for functional pathways or for variants of interest, but they require defining the genomic region bounds for loci of interest. At present, there are limited methods for determining these bounds in a rigorous, reproducible way. We present a novel statistical method, Statistical Analysis for Bayesian Estimation of Regions (SABER), that uses Bayesian Gaussian mixture models to reproducibly generate ratios that quantify whether particular genomic positions represent the bounds of loci of interest and can be used to delineate genomic regions for downstream analyses.

8.

PFERM: A Fair Empirical Risk Minimization Approach with Prior Knowledge.

Hou, Bojian; Mondragón, Andrés; Tarzanagh, Davoud Ataee; Zhou, Zhuoping; Saykin, Andrew J; Moore, Jason H; Ritchie, Marylyn D; Long, Qi; Shen, Li.

AMIA Jt Summits Transl Sci Proc ; 2024: 211-220, 2024.

Article in English | MEDLINE | ID: mdl-38827072

ABSTRACT

Fairness is crucial in machine learning to prevent bias based on sensitive attributes in classifier predictions. However, the pursuit of strict fairness often sacrifices accuracy, particularly when significant prevalence disparities exist among groups, making classifiers less practical. For example, Alzheimer's disease (AD) is more prevalent in women than men, making equal treatment inequitable for females. Accounting for prevalence ratios among groups is essential for fair decision-making. In this paper, we introduce prior knowledge for fairness, which incorporates prevalence ratio information into the fairness constraint within the Empirical Risk Minimization (ERM) framework. We develop the Prior-knowledge-guided Fair ERM (PFERM) framework, aiming to minimize expected risk within a specified function class while adhering to a prior-knowledge-guided fairness constraint. This approach strikes a flexible balance between accuracy and fairness. Empirical results confirm its effectiveness in preserving fairness without compromising accuracy.

9.

Employing Informatics Strategies in Alzheimer's Disease Research: A Review from Genetics, Multiomics, and Biomarkers to Clinical Outcomes.

Bao, Jingxuan; Lee, Brian N; Wen, Junhao; Kim, Mansu; Mu, Shizhuo; Yang, Shu; Davatzikos, Christos; Long, Qi; Ritchie, Marylyn D; Shen, Li.

Annu Rev Biomed Data Sci ; 2024 Jun 07.

Article in English | MEDLINE | ID: mdl-38848574

ABSTRACT

Alzheimer's disease (AD) is a critical national concern, affecting 5.8 million people and costing more than 250 billion annually. However, there is no available cure. Thus, effective strategies are in urgent need to discover AD biomarkers for disease early detection and drug development. In this review, we study AD from a biomedical data scientist perspective to discuss the four fundamental components in AD research: genetics (G), molecular multiomics (M), multimodal imaging biomarkers (B), and clinical outcomes (O) (collectively referred to as the GMBO framework). We provide a comprehensive review of common statistical and informatics methodologies for each component within the GMBO framework, accompanied by the major findings from landmark AD studies. Our review highlights the potential of multimodal biobank data in addressing key challenges in AD, such as early diagnosis, disease heterogeneity, and therapeutic development. We identify major hurdles in AD research, including data scarcity and complexity, and advocate for enhanced collaboration, data harmonization, and advanced modeling techniques. This review aims to be an essential guide for understanding current biomedical data science strategies in AD research, emphasizing the need for integrated, multidisciplinary approaches to advance our understanding and management of AD.

10.

Cross-phenotype associations between Alzheimer's Disease and its comorbidities may provide clues to progression.

Moore, Anni; Ritchie, Marylyn D.

AMIA Jt Summits Transl Sci Proc ; 2024: 623-631, 2024.

Article in English | MEDLINE | ID: mdl-38827078

ABSTRACT

Alzheimer's disease (AD) is the most prevalent neurodegenerative disease worldwide, with one in nine people over the age of 65 living with the disease in 2023. In this study, we used a phenome wide association study (PheWAS) approach to identify cross-phenotype between previously identified genetic associations for AD and electronic health record (EHR) diagnoses from the UK Biobank (UKBB) (n=361,194 of European ancestry) and the eMERGE Network (n=105,108 of diverse ancestry). Based on 497 previously identified AD-associated variants from the Alzheimer's Disease Variant Portal (ADVP), we found significant associations primarily in immune and cardiac related diseases in our PheWAS. Replicating variants have widespread impacts on immune genes in diverse tissue types. This study demonstrates the potential of using the PheWAS strategy to improve our understanding of AD progression as well as identify potential drug repurposing opportunities for new treatment and disease prevention strategies.

11.

Risk factors affecting polygenic score performance across diverse cohorts.

Hui, Daniel; Dudek, Scott; Kiryluk, Krzysztof; Walunas, Theresa L; Kullo, Iftikhar J; Wei, Wei-Qi; Tiwari, Hemant K; Peterson, Josh F; Chung, Wendy K; Davis, Brittney; Khan, Atlas; Kottyan, Leah; Limdi, Nita A; Feng, Qiping; Puckelwartz, Megan J; Weng, Chunhua; Smith, Johanna L; Karlson, Elizabeth W; Jarvik, Gail P; Ritchie, Marylyn D.

medRxiv ; 2024 Apr 10.

Article in English | MEDLINE | ID: mdl-38645167

ABSTRACT

Apart from ancestry, personal or environmental covariates may contribute to differences in polygenic score (PGS) performance. We analyzed effects of covariate stratification and interaction on body mass index (BMI) PGS (PGSBMI) across four cohorts of European (N=491,111) and African (N=21,612) ancestry. Stratifying on binary covariates and quintiles for continuous covariates, 18/62 covariates had significant and replicable R2 differences among strata. Covariates with the largest differences included age, sex, blood lipids, physical activity, and alcohol consumption, with R2 being nearly double between best and worst performing quintiles for certain covariates. 28 covariates had significant PGSBMI-covariate interaction effects, modifying PGSBMI effects by nearly 20% per standard deviation change. We observed overlap between covariates that had significant R2 differences among strata and interaction effects - across all covariates, their main effects on BMI were correlated with their maximum R2 differences and interaction effects (0.56 and 0.58, respectively), suggesting high-PGSBMI individuals have highest R2 and increase in PGS effect. Using quantile regression, we show the effect of PGSBMI increases as BMI itself increases, and that these differences in effects are directly related to differences in R2 when stratifying by different covariates. Given significant and replicable evidence for context-specific PGSBMI performance and effects, we investigated ways to increase model performance taking into account non-linear effects. Machine learning models (neural networks) increased relative model R2 (mean 23%) across datasets. Finally, creating PGSBMI directly from GxAge GWAS effects increased relative R2 by 7.8%. These results demonstrate that certain covariates, especially those most associated with BMI, significantly affect both PGSBMI performance and effects across diverse cohorts and ancestries, and we provide avenues to improve model performance that consider these effects.

12.

The Alzheimer's Knowledge Base: A Knowledge Graph for Alzheimer Disease Research.

Romano, Joseph D; Truong, Van; Kumar, Rachit; Venkatesan, Mythreye; Graham, Britney E; Hao, Yun; Matsumoto, Nick; Li, Xi; Wang, Zhiping; Ritchie, Marylyn D; Shen, Li; Moore, Jason H.

J Med Internet Res ; 26: e46777, 2024 Apr 18.

Article in English | MEDLINE | ID: mdl-38635981

ABSTRACT

BACKGROUND: As global populations age and become susceptible to neurodegenerative illnesses, new therapies for Alzheimer disease (AD) are urgently needed. Existing data resources for drug discovery and repurposing fail to capture relationships central to the disease's etiology and response to drugs. OBJECTIVE: We designed the Alzheimer's Knowledge Base (AlzKB) to alleviate this need by providing a comprehensive knowledge representation of AD etiology and candidate therapeutics. METHODS: We designed the AlzKB as a large, heterogeneous graph knowledge base assembled using 22 diverse external data sources describing biological and pharmaceutical entities at different levels of organization (eg, chemicals, genes, anatomy, and diseases). AlzKB uses a Web Ontology Language 2 ontology to enforce semantic consistency and allow for ontological inference. We provide a public version of AlzKB and allow users to run and modify local versions of the knowledge base. RESULTS: AlzKB is freely available on the web and currently contains 118,902 entities with 1,309,527 relationships between those entities. To demonstrate its value, we used graph data science and machine learning to (1) propose new therapeutic targets based on similarities of AD to Parkinson disease and (2) repurpose existing drugs that may treat AD. For each use case, AlzKB recovers known therapeutic associations while proposing biologically plausible new ones. CONCLUSIONS: AlzKB is a new, publicly available knowledge resource that enables researchers to discover complex translational associations for AD drug discovery. Through 2 use cases, we show that it is a valuable tool for proposing novel therapeutic hypotheses based on public biomedical knowledge.

Subject(s)

Alzheimer Disease , Humans , Alzheimer Disease/drug therapy , Alzheimer Disease/genetics , Pattern Recognition, Automated , Knowledge Bases , Machine Learning , Knowledge

13.

The genetic architecture of multimodal human brain age.

Wen, Junhao; Zhao, Bingxin; Yang, Zhijian; Erus, Guray; Skampardoni, Ioanna; Mamourian, Elizabeth; Cui, Yuhan; Hwang, Gyujoon; Bao, Jingxuan; Boquet-Pujadas, Aleix; Zhou, Zhen; Veturi, Yogasudha; Ritchie, Marylyn D; Shou, Haochang; Thompson, Paul M; Shen, Li; Toga, Arthur W; Davatzikos, Christos.

Nat Commun ; 15(1): 2604, 2024 Mar 23.

Article in English | MEDLINE | ID: mdl-38521789

ABSTRACT

The complex biological mechanisms underlying human brain aging remain incompletely understood. This study investigated the genetic architecture of three brain age gaps (BAG) derived from gray matter volume (GM-BAG), white matter microstructure (WM-BAG), and functional connectivity (FC-BAG). We identified sixteen genomic loci that reached genome-wide significance (P-value < 5×10-8). A gene-drug-disease network highlighted genes linked to GM-BAG for treating neurodegenerative and neuropsychiatric disorders and WM-BAG genes for cancer therapy. GM-BAG displayed the most pronounced heritability enrichment in genetic variants within conserved regions. Oligodendrocytes and astrocytes, but not neurons, exhibited notable heritability enrichment in WM and FC-BAG, respectively. Mendelian randomization identified potential causal effects of several chronic diseases on brain aging, such as type 2 diabetes on GM-BAG and AD on WM-BAG. Our results provide insights into the genetics of human brain aging, with clinical implications for potential lifestyle and therapeutic interventions. All results are publicly available at https://labs.loni.usc.edu/medicine .

Subject(s)

Diabetes Mellitus, Type 2 , White Matter , Humans , Brain , Gray Matter , Magnetic Resonance Imaging/methods , White Matter/physiology , Mendelian Randomization Analysis

14.

Strategies for Implementing Machine Learning Algorithms in the Clinical Practice of Radiology.

Chae, Allison; Yao, Michael S; Sagreiya, Hersh; Goldberg, Ari D; Chatterjee, Neil; MacLean, Matthew T; Duda, Jeffrey; Elahi, Ameena; Borthakur, Arijitt; Ritchie, Marylyn D; Rader, Daniel; Kahn, Charles E; Witschey, Walter R; Gee, James C.

Radiology ; 310(1): e223170, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38259208

ABSTRACT

Despite recent advancements in machine learning (ML) applications in health care, there have been few benefits and improvements to clinical medicine in the hospital setting. To facilitate clinical adaptation of methods in ML, this review proposes a standardized framework for the step-by-step implementation of artificial intelligence into the clinical practice of radiology that focuses on three key components: problem identification, stakeholder alignment, and pipeline integration. A review of the recent literature and empirical evidence in radiologic imaging applications justifies this approach and offers a discussion on structuring implementation efforts to help other hospital practices leverage ML to improve patient care. Clinical trial registration no. 04242667 © RSNA, 2024 Supplemental material is available for this article.

Subject(s)

Artificial Intelligence , Radiology , Humans , Radiography , Algorithms , Machine Learning

15.

mixWAS: An efficient distributed algorithm for mixed-outcomes genome-wide association studies.

Li, Ruowang; Benz, Luke; Duan, Rui; Denny, Joshua C; Hakonarson, Hakon; Mosley, Jonathan D; Smoller, Jordan W; Wei, Wei-Qi; Ritchie, Marylyn D; Moore, Jason H; Chen, Yong.

medRxiv ; 2024 Jan 10.

Article in English | MEDLINE | ID: mdl-38260403

ABSTRACT

Genome-wide association studies (GWAS) have been instrumental in identifying genetic associations for various diseases and traits. However, uncovering genetic underpinnings among traits beyond univariate phenotype associations remains a challenge. Multi-phenotype associations (MPA), or genetic pleiotropy, offer important insights into shared genes and pathways among traits, enhancing our understanding of genetic architectures of complex diseases. GWAS of biobank-linked electronic health record (EHR) data are increasingly being utilized to identify MPA among various traits and diseases. However, methodologies that can efficiently take advantage of distributed EHR to detect MPA are still lacking. Here, we introduce mixWAS, a novel algorithm that efficiently and losslessly integrates multiple EHRs via summary statistics, allowing the detection of MPA among mixed phenotypes while accounting for heterogeneities across EHRs. Simulations demonstrate that mixWAS outperforms the widely used MPA detection method, Phenome-wide association study (PheWAS), across diverse scenarios. Applying mixWAS to data from seven EHRs in the US, we identified 4,534 MPA among blood lipids, BMI, and circulatory diseases. Validation in an independent EHR data from UK confirmed 97.7% of the associations. mixWAS fundamentally improves the detection of MPA and is available as a free, open-source software.

16.

Penn Healthy Diet survey: pilot validation and scoring.

Compher, Charlene W; Quinn, Ryan; Haslam, Richard; Bader, Elizabeth; Weaver, Joellen; Dudek, Scott; Ritchie, Marylyn D; Lewis, James D; Wu, Gary D.

Br J Nutr ; 131(1): 156-162, 2024 01 14.

Article in English | MEDLINE | ID: mdl-37519237

ABSTRACT

Though diet quality is widely recognised as linked to risk of chronic disease, health systems have been challenged to find a user-friendly, efficient way to obtain information about diet. The Penn Healthy Diet (PHD) survey was designed to fill this void. The purposes of this pilot project were to assess the patient experience with the PHD, to validate the accuracy of the PHD against related items in a diet recall and to explore scoring algorithms with relationship to the Healthy Eating Index (HEI)-2015 computed from the recall data. A convenience sample of participants in the Penn Health BioBank was surveyed with the PHD, the Automated Self-Administered 24-hour recall (ASA24) and experience questions. Kappa scores and Spearman correlations were used to compare related questions in the PHD to the ASA24. Numerical scoring, regression tree and weighted regressions were computed for scoring. Participants assessed the PHD as easy to use and were willing to repeat the survey at least annually. The three scoring algorithms were strongly associated with HEI-2015 scores using National Health and Nutrition Examination Survey 2017-2018 data from which the PHD was developed and moderately associated with the pilot replication data. The PHD is acceptable to participants and at least moderately correlated with the HEI-2015. Further validation in a larger sample will enable the selection of the strongest scoring approach.

Subject(s)

Diet, Healthy , Diet , Humans , Nutrition Surveys , Pilot Projects , Diet Surveys

17.

Polygenic risk scores for cardiometabolic traits demonstrate importance of ancestry for predictive precision medicine.

Kember, Rachel L; Verma, Shefali S; Verma, Anurag; Xiao, Brenda; Lucas, Anastasia; Kripke, Colleen M; Judy, Renae; Chen, Jinbo; Damrauer, Scott M; Rader, Daniel J; Ritchie, Marylyn D.

Pac Symp Biocomput ; 29: 611-626, 2024.

Article in English | MEDLINE | ID: mdl-38160310

ABSTRACT

Polygenic risk scores (PRS) have predominantly been derived from genome-wide association studies (GWAS) conducted in European ancestry (EUR) individuals. In this study, we present an in-depth evaluation of PRS based on multi-ancestry GWAS for five cardiometabolic phenotypes in the Penn Medicine BioBank (PMBB) followed by a phenome-wide association study (PheWAS). We examine the PRS performance across all individuals and separately in African ancestry (AFR) and EUR ancestry groups. For AFR individuals, PRS derived using the multi-ancestry LD panel showed a higher effect size for four out of five PRSs (DBP, SBP, T2D, and BMI) than those derived from the AFR LD panel. In contrast, for EUR individuals, the multi-ancestry LD panel PRS demonstrated a higher effect size for two out of five PRSs (SBP and T2D) compared to the EUR LD panel. These findings underscore the potential benefits of utilizing a multi-ancestry LD panel for PRS derivation in diverse genetic backgrounds and demonstrate overall robustness in all individuals. Our results also revealed significant associations between PRS and various phenotypic categories. For instance, CAD PRS was linked with 18 phenotypes in AFR and 82 in EUR, while T2D PRS correlated with 84 phenotypes in AFR and 78 in EUR. Notably, associations like hyperlipidemia, renal failure, atrial fibrillation, coronary atherosclerosis, obesity, and hypertension were observed across different PRSs in both AFR and EUR groups, with varying effect sizes and significance levels. However, in AFR individuals, the strength and number of PRS associations with other phenotypes were generally reduced compared to EUR individuals. Our study underscores the need for future research to prioritize 1) conducting GWAS in diverse ancestry groups and 2) creating a cosmopolitan PRS methodology that is universally applicable across all genetic backgrounds. Such advances will foster a more equitable and personalized approach to precision medicine.

Subject(s)

Diabetes Mellitus, Type 2 , Hypertension , Humans , Genetic Risk Score , Genome-Wide Association Study/methods , Genetic Predisposition to Disease , Precision Medicine , Multifactorial Inheritance , Computational Biology , Phenotype , Hypertension/genetics , Diabetes Mellitus, Type 2/genetics , Risk Factors

18.

Lymphocyte Count Derived Polygenic Score and Interindividual Variability in CD4 T-cell Recovery in Response to Antiretroviral Therapy.

Cardone, Kathleen M; Dudek, Scott; Keat, Karl; Bradford, Yuki; Cindi, Zinhle; Daar, Eric S; Gulick, Roy; Riddler, Sharon A; Lennox, Jeffrey L; Sinxadi, Phumla; Haas, David W; Ritchie, Marylyn D.

Pac Symp Biocomput ; 29: 594-610, 2024.

Article in English | MEDLINE | ID: mdl-38160309

ABSTRACT

Access to safe and effective antiretroviral therapy (ART) is a cornerstone in the global response to the HIV pandemic. Among people living with HIV, there is considerable interindividual variability in absolute CD4 T-cell recovery following initiation of virally suppressive ART. The contribution of host genetics to this variability is not well understood. We explored the contribution of a polygenic score which was derived from large, publicly available summary statistics for absolute lymphocyte count from individuals in the general population (PGSlymph) due to a lack of publicly available summary statistics for CD4 T-cell count. We explored associations with baseline CD4 T-cell count prior to ART initiation (n=4959) and change from baseline to week 48 on ART (n=3274) among treatment-naïve participants in prospective, randomized ART studies of the AIDS Clinical Trials Group. We separately examined an African-ancestry-derived and a European-ancestry-derived PGSlymph, and evaluated their performance across all participants, and also in the African and European ancestral groups separately. Multivariate models that included PGSlymph, baseline plasma HIV-1 RNA, age, sex, and 15 principal components (PCs) of genetic similarity explained â¼26-27% of variability in baseline CD4 T-cell count, but PGSlymph accounted for <1% of this variability. Models that also included baseline CD4 T-cell count explained â¼7-9% of variability in CD4 T-cell count increase on ART, but PGSlymph accounted for <1% of this variability. In univariate analyses, PGSlymph was not significantly associated with baseline or change in CD4 T-cell count. Among individuals of African ancestry, the African PGSlymph term in the multivariate model was significantly associated with change in CD4 T-cell count while not significant in the univariate model. When applied to lymphocyte count in a general medical biobank population (Penn Medicine BioBank), PGSlymph explained â¼6-10% of variability in multivariate models (including age, sex, and PCs) but only â¼1% in univariate models. In summary, a lymphocyte count PGS derived from the general population was not consistently associated with CD4 T-cell recovery on ART. Nonetheless, adjusting for clinical covariates is quite important when estimating such polygenic effects.

Subject(s)

Anti-HIV Agents , HIV Infections , Humans , CD4-Positive T-Lymphocytes , Prospective Studies , Anti-HIV Agents/therapeutic use , Computational Biology , HIV Infections/drug therapy , HIV Infections/genetics , CD4 Lymphocyte Count , Viral Load

19.

Genomic loci influence patterns of structural covariance in the human brain.

Wen, Junhao; Nasrallah, Ilya M; Abdulkadir, Ahmed; Satterthwaite, Theodore D; Yang, Zhijian; Erus, Guray; Robert-Fitzgerald, Timothy; Singh, Ashish; Sotiras, Aristeidis; Boquet-Pujadas, Aleix; Mamourian, Elizabeth; Doshi, Jimit; Cui, Yuhan; Srinivasan, Dhivya; Skampardoni, Ioanna; Chen, Jiong; Hwang, Gyujoon; Bergman, Mark; Bao, Jingxuan; Veturi, Yogasudha; Zhou, Zhen; Yang, Shu; Dazzan, Paola; Kahn, Rene S; Schnack, Hugo G; Zanetti, Marcus V; Meisenzahl, Eva; Busatto, Geraldo F; Crespo-Facorro, Benedicto; Pantelis, Christos; Wood, Stephen J; Zhuo, Chuanjun; Shinohara, Russell T; Gur, Ruben C; Gur, Raquel E; Koutsouleris, Nikolaos; Wolf, Daniel H; Saykin, Andrew J; Ritchie, Marylyn D; Shen, Li; Thompson, Paul M; Colliot, Olivier; Wittfeld, Katharina; Grabe, Hans J; Tosun, Duygu; Bilgel, Murat; An, Yang; Marcus, Daniel S; LaMontagne, Pamela; Heckbert, Susan R.

Proc Natl Acad Sci U S A ; 120(52): e2300842120, 2023 Dec 26.

Article in English | MEDLINE | ID: mdl-38127979

ABSTRACT

Normal and pathologic neurobiological processes influence brain morphology in coordinated ways that give rise to patterns of structural covariance (PSC) across brain regions and individuals during brain aging and diseases. The genetic underpinnings of these patterns remain largely unknown. We apply a stochastic multivariate factorization method to a diverse population of 50,699 individuals (12 studies and 130 sites) and derive data-driven, multi-scale PSCs of regional brain size. PSCs were significantly correlated with 915 genomic loci in the discovery set, 617 of which are newly identified, and 72% were independently replicated. Key pathways influencing PSCs involve reelin signaling, apoptosis, neurogenesis, and appendage development, while pathways of breast cancer indicate potential interplays between brain metastasis and PSCs associated with neurodegeneration and dementia. Using support vector machines, multi-scale PSCs effectively derive imaging signatures of several brain diseases. Our results elucidate genetic and biological underpinnings that influence structural covariance patterns in the human brain.

Subject(s)

Brain Neoplasms , Magnetic Resonance Imaging , Humans , Magnetic Resonance Imaging/methods , Brain/pathology , Brain Mapping/methods , Genomics , Brain Neoplasms/pathology

20.

Cross-phenotype associations between Alzheimer's Disease and its comorbidities may provide clues to progression.

Moore, Anni; Ritchie, Marylyn D.

medRxiv ; 2023 Nov 07.

Article in English | MEDLINE | ID: mdl-37986758

ABSTRACT

Alzheimer's disease (AD) is the most prevalent neurodegenerative disease worldwide, with one in nine people over the age of 65 living with the disease in 2023. In this study, we used a phenome wide association study (PheWAS) approach to identify cross-phenotype associations between previously identified genetic AD and for electronic health record (EHR) diagnoses from the UK Biobank (UKBB) (n=361,194 of European ancestry) and the eMERGE Network (n=105,108 of diverse ancestry). Based on 497 previously identified AD-associated variants from the Alzheimer's Disease Variant Portal (ADVP), we found significant associations primarily in immune and cardiac related diseases in our PheWAS. Replicating variants have widespread impacts on immune genes in diverse tissue types. This study demonstrates the potential of using the PheWAS strategy to improve our understanding of AD progression as well as identify potential drug repurposing opportunities for new treatment and disease prevention strategies.

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL