Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 120
Filter
1.
J Am Med Inform Assoc ; 31(7): 1479-1492, 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38742457

ABSTRACT

OBJECTIVES: To develop recommendations regarding the use of weights to reduce selection bias for commonly performed analyses using electronic health record (EHR)-linked biobank data. MATERIALS AND METHODS: We mapped diagnosis (ICD code) data to standardized phecodes from 3 EHR-linked biobanks with varying recruitment strategies: All of Us (AOU; n = 244 071), Michigan Genomics Initiative (MGI; n = 81 243), and UK Biobank (UKB; n = 401 167). Using 2019 National Health Interview Survey data, we constructed selection weights for AOU and MGI to represent the US adult population more. We used weights previously developed for UKB to represent the UKB-eligible population. We conducted 4 common analyses comparing unweighted and weighted results. RESULTS: For AOU and MGI, estimated phecode prevalences decreased after weighting (weighted-unweighted median phecode prevalence ratio [MPR]: 0.82 and 0.61), while UKB estimates increased (MPR: 1.06). Weighting minimally impacted latent phenome dimensionality estimation. Comparing weighted versus unweighted phenome-wide association study for colorectal cancer, the strongest associations remained unaltered, with considerable overlap in significant hits. Weighting affected the estimated log-odds ratio for sex and colorectal cancer to align more closely with national registry-based estimates. DISCUSSION: Weighting had a limited impact on dimensionality estimation and large-scale hypothesis testing but impacted prevalence and association estimation. When interested in estimating effect size, specific signals from untargeted association analyses should be followed up by weighted analysis. CONCLUSION: EHR-linked biobanks should report recruitment and selection mechanisms and provide selection weights with defined target populations. Researchers should consider their intended estimands, specify source and target populations, and weight EHR-linked biobank analyses accordingly.


Subject(s)
Biological Specimen Banks , Electronic Health Records , Humans , Selection Bias , Female , Male , Adult , Middle Aged , Medical Record Linkage , United States , Aged , United Kingdom , Michigan
2.
Eur Urol Oncol ; 2024 May 10.
Article in English | MEDLINE | ID: mdl-38734542

ABSTRACT

BACKGROUND AND OBJECTIVE: Polygenic risk scores (PRSs) have been developed to identify men with the highest risk of prostate cancer. Our aim was to compare the performance of 16 PRSs in identifying men at risk of developing prostate cancer and then to evaluate the performance of the top-performing PRSs in differentiating individuals at risk of aggressive prostate cancer. METHODS: For this case-control study we downloaded 16 published PRSs from the Polygenic Score Catalog on May 28, 2021 and applied them to Michigan Genomics Initiative (MGI) patients. Cases were matched to the Michigan Urological Surgery Improvement Collaborative (MUSIC) registry to obtain granular clinical and pathological data. MGI prospectively enrolls patients undergoing surgery at the University of Michigan, and MUSIC is a multi-institutional registry that prospectively tracks demographic, treatment, and clinical variables. The predictive performance of each PRS was evaluated using the area under the covariate-adjusted receiver operating characteristic curve (aAUC), and the association between PRS and disease aggressiveness according to prostate biopsy data was measured using logistic regression. KEY FINDINGS AND LIMITATIONS: We included 18 050 patients in the analysis, of whom 15 310 were control subjects and 2740 were prostate cancer cases. The median age was 66.1 yr (interquartile range 59.9-71.6) for cases and 56.6 yr (interquartile range 42.6-66.7) for control subjects. The PRS performance in predicting the risk of developing prostate cancer according to aAUC ranged from 0.51 (95% confidence interval 0.51-0.53) to 0.67 (95% confidence interval 0.66-0.68). By contrast, there was no association between PRS and disease aggressiveness. CONCLUSIONS AND CLINICAL IMPLICATIONS: Prostate cancer PRSs have modest real-world performance in identifying patients at higher risk of developing prostate cancer; however, they are limited in distinguishing patients with indolent versus aggressive disease. PATIENT SUMMARY: Risk scores using data for multiple genes (called polygenic risk scores) can identify men at higher risk of developing prostate cancer. However, these scores need to be refined to be able to identify men with the highest risk for clinically significant prostate cancer.

3.
Commun Biol ; 7(1): 432, 2024 Apr 09.
Article in English | MEDLINE | ID: mdl-38594418

ABSTRACT

Trace elements are important for human health but may exert toxic or adverse effects. Mechanisms of uptake, distribution, metabolism, and excretion are partly under genetic control but have not yet been extensively mapped. Here we report a comprehensive multi-element genome-wide association study of 57 essential and non-essential trace elements. We perform genome-wide association meta-analyses of 14 trace elements in up to 6564 Scandinavian whole blood samples, and genome-wide association studies of 43 trace elements in up to 2819 samples measured only in the Trøndelag Health Study (HUNT). We identify 11 novel genetic loci associated with blood concentrations of arsenic, cadmium, manganese, selenium, and zinc in genome-wide association meta-analyses. In HUNT, several genome-wide significant loci are also indicated for other trace elements. Using two-sample Mendelian randomization, we find several indications of weak to moderate effects on health outcomes, the most precise being a weak harmful effect of increased zinc on prostate cancer. However, independent validation is needed. Our current understanding of trace element-associated genetic variants may help establish consequences of trace elements on human health.


Subject(s)
Selenium , Trace Elements , Male , Humans , Trace Elements/metabolism , Genome-Wide Association Study , Zinc , Selenium/analysis , Manganese
4.
medRxiv ; 2024 Mar 30.
Article in English | MEDLINE | ID: mdl-38585910

ABSTRACT

Background and Objectives: Amyotrophic lateral sclerosis (ALS) causes profound impairments in neurological function and a cure for this devastating disease remains elusive. Early detection and risk stratification are crucial for timely intervention and improving patient outcomes. This study aimed to identify predisposing genetic, phenotypic, and exposure-related factors for Amyotrophic lateral sclerosis using multi-modal data and assess their joint predictive potential. Methods: Utilizing data from the UK Biobank, we analyzed an unrelated set of 292 ALS cases and 408,831 controls of European descent. Two polygenic risk scores (PRS) are constructed: "GWAS Hits PRS" and "PRS-CS," reflecting oligogenic and polygenic ALS risk profiles, respectively. Time-restricted phenome-wide association studies (PheWAS) were performed to identify pre-existing conditions increasing ALS risk, integrated into phenotypic risk scores (PheRS). A poly-exposure score ("PXS") captures the influence of environmental exposures measured through survey questionnaires. We evaluate the performance of these scores for predicting ALS incidence and stratifying risk, adjusting for baseline demographic covariates. Results: Both PRSs modestly predicted ALS diagnosis, but with increased predictive power when combined (covariate-adjusted receiver operating characteristic [AAUC] = 0.584 [0.525, 0.639]). PheRS incorporated diagnoses 1 year before ALS onset (PheRS1) modestly discriminated cases from controls (AAUC = 0.515 [0.472, 0.564]). The "PXS" did not significantly predict ALS. However, a model incorporating PRSs and PheRS1 improved prediction of ALS (AAUC = 0.604 [0.547, 0.667]), outperforming a model combining all risk scores. This combined risk score identified the top 10% of risk score distribution with a 4-fold higher ALS risk (95% CI: [2.04, 7.73]) versus those in the 40%-60% range. Discussions: By leveraging UK Biobank data, our study uncovers predisposing ALS factors, highlighting the improved effectiveness of multi-factorial prediction models to identify individuals at highest risk for ALS.

5.
BMC Bioinformatics ; 25(1): 65, 2024 Feb 09.
Article in English | MEDLINE | ID: mdl-38336614

ABSTRACT

BACKGROUND: Genetic variants can contribute differently to trait heritability by their functional categories, and recent studies have shown that incorporating functional annotation can improve the predictive performance of polygenic risk scores (PRSs). In addition, when only a small proportion of variants are causal variants, PRS methods that employ a Bayesian framework with shrinkage can account for such sparsity. It is possible that the annotation group level effect is also sparse. However, the number of PRS methods that incorporate both annotation information and shrinkage on effect sizes is limited. We propose a PRS method, PRSbils, which utilizes the functional annotation information with a bilevel continuous shrinkage prior to accommodate the varying genetic architectures both on the variant-specific level and on the functional annotation level. RESULTS: We conducted simulation studies and investigated the predictive performance in settings with different genetic architectures. Results indicated that when there was a relatively large variability of group-wise heritability contribution, the gain in prediction performance from the proposed method was on average 8.0% higher AUC compared to the benchmark method PRS-CS. The proposed method also yielded higher predictive performance compared to PRS-CS in settings with different overlapping patterns of annotation groups and obtained on average 6.4% higher AUC. We applied PRSbils to binary and quantitative traits in three real world data sources (the UK Biobank, the Michigan Genomics Initiative (MGI), and the Korean Genome and Epidemiology Study (KoGES)), and two sources of annotations: ANNOVAR, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG), and demonstrated that the proposed method holds the potential for improving predictive performance by incorporating functional annotations. CONCLUSIONS: By utilizing a bilevel shrinkage framework, PRSbils enables the incorporation of both overlapping and non-overlapping annotations into PRS construction to improve the performance of genetic risk prediction. The software is available at https://github.com/styvon/PRSbils .


Subject(s)
Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Bayes Theorem , Genome-Wide Association Study/methods , Multifactorial Inheritance , Software , Risk Factors
6.
medRxiv ; 2024 Feb 13.
Article in English | MEDLINE | ID: mdl-38405832

ABSTRACT

Objective: To explore the role of selection bias adjustment by weighting electronic health record (EHR)-linked biobank data for commonly performed analyses. Materials and methods: We mapped diagnosis (ICD code) data to standardized phecodes from three EHR-linked biobanks with varying recruitment strategies: All of Us (AOU; n=244,071), Michigan Genomics Initiative (MGI; n=81,243), and UK Biobank (UKB; n=401,167). Using 2019 National Health Interview Survey data, we constructed selection weights for AOU and MGI to be more representative of the US adult population. We used weights previously developed for UKB to represent the UKB-eligible population. We conducted four common descriptive and analytic tasks comparing unweighted and weighted results. Results: For AOU and MGI, estimated phecode prevalences decreased after weighting (weighted-unweighted median phecode prevalence ratio [MPR]: 0.82 and 0.61), while UKB's estimates increased (MPR: 1.06). Weighting minimally impacted latent phenome dimensionality estimation. Comparing weighted versus unweighted PheWAS for colorectal cancer, the strongest associations remained unaltered and there was large overlap in significant hits. Weighting affected the estimated log-odds ratio for sex and colorectal cancer to align more closely with national registry-based estimates. Discussion: Weighting had limited impact on dimensionality estimation and large-scale hypothesis testing but impacted prevalence and association estimation more. Results from untargeted association analyses should be followed by weighted analysis when effect size estimation is of interest for specific signals. Conclusion: EHR-linked biobanks should report recruitment and selection mechanisms and provide selection weights with defined target populations. Researchers should consider their intended estimands, specify source and target populations, and weight EHR-linked biobank analyses accordingly.

7.
Hum Mol Genet ; 33(4): 374-385, 2024 Feb 01.
Article in English | MEDLINE | ID: mdl-37934784

ABSTRACT

Genome-wide association studies have contributed extensively to the discovery of disease-associated common variants. However, the genetic contribution to complex traits is still largely difficult to interpret. We report a genome-wide association study of 2394 cases and 2393 controls for age-related macular degeneration (AMD) via whole-genome sequencing, with 46.9 million genetic variants. Our study reveals significant single-variant association signals at four loci and independent gene-based signals in CFH, C2, C3, and NRTN. Using data from the Exome Aggregation Consortium (ExAC) for a gene-based test, we demonstrate an enrichment of predicted rare loss-of-function variants in CFH, CFI, and an as-yet unreported gene in AMD, ORMDL2. Our method of using a large variant list without individual-level genotypes as an external reference provides a flexible and convenient approach to leverage the publicly available variant datasets to augment the search for rare variant associations, which can explain additional disease risk in AMD.


Subject(s)
Genome-Wide Association Study , Macular Degeneration , Humans , Genome-Wide Association Study/methods , Macular Degeneration/genetics , Genotype , Genetic Testing , Whole Genome Sequencing , Polymorphism, Single Nucleotide/genetics , Genetic Predisposition to Disease , Complement Factor H/genetics
8.
JAMA Psychiatry ; 81(1): 34-44, 2024 Jan 01.
Article in English | MEDLINE | ID: mdl-37910111

ABSTRACT

Importance: Posttraumatic stress disorder (PTSD) has been reported to be a risk factor for several physical and somatic symptoms. However, the genetics of PTSD and its potential association with medical outcomes remain unclear. Objective: To examine disease categories and laboratory tests from electronic health records (EHRs) that are associated with PTSD polygenic scores. Design, Setting, and Participants: This genetic association study was conducted from July 15, 2021, to January 24, 2023, using EHR data from participants across 4 biobanks. The polygenic scores of PTSD symptom severity (PGS-PTSD) were tested with all available phecodes in Vanderbilt University Medical Center's biobank (BioVU), Mass General Brigham (MGB), Michigan Genomics Initiative (MGI), and UK Biobank (UKBB). The significant medical outcomes were tested for overrepresented disease categories and subsequently tested for genetic correlation and 2-sample mendelian randomization (MR) to determine genetically informed associations. Multivariable MR was conducted to assess whether PTSD associations with health outcomes were independent of the genetic effect of body mass index and tobacco smoking. Exposures: Polygenic score of PTSD symptom severity. Main Outcomes and Measures: A total of 1680 phecodes (ie, International Classification of Diseases, Ninth Revision- and Tenth Revision-based phenotypic definitions of health outcomes) across 4 biobanks and 490 laboratory tests across 2 biobanks (BioVU and MGB). Results: In this study including a total of 496 317 individuals (mean [SD] age, 56.8 [8.0] years; 263 048 female [53%]) across the 4 EHR sites, meta-analyzing associations of PGS-PTSD with 1680 phecodes from 496 317 individuals showed significant associations to be overrepresented from mental health disorders (fold enrichment = 3.15; P = 5.81 × 10-6), circulatory system (fold enrichment = 3.32; P = 6.39 × 10-12), digestive (fold enrichment = 2.42; P = 2.16 × 10-7), and respiratory outcomes (fold enrichment = 2.51; P = 8.28 × 10-5). The laboratory measures scan with PGS-PTSD in BioVU and MGB biobanks revealed top associations in metabolic and immune domains. MR identified genetic liability to PTSD symptom severity as an associated risk factor for 12 health outcomes, including alcoholism (ß = 0.023; P = 1.49 × 10-4), tachycardia (ß = 0.045; P = 8.30 × 10-5), cardiac dysrhythmias (ß = 0.016, P = 3.09 × 10-5), and acute pancreatitis (ß = 0.049, P = 4.48 × 10-4). Several of these associations were robust to genetic effects of body mass index and smoking. We observed a bidirectional association between PTSD symptoms and nonspecific chest pain and C-reactive protein. Conclusions and Relevance: Results of this study suggest the broad health repercussions associated with the genetic liability to PTSD across 4 biobanks. The circulatory and respiratory systems association was observed to be overrepresented in all 4 biobanks.


Subject(s)
Cardiovascular Diseases , Pancreatitis , Stress Disorders, Post-Traumatic , Humans , Female , Middle Aged , Stress Disorders, Post-Traumatic/genetics , Stress Disorders, Post-Traumatic/psychology , Acute Disease , Risk Factors , Genome-Wide Association Study
9.
J Clin Med ; 12(23)2023 Nov 25.
Article in English | MEDLINE | ID: mdl-38068365

ABSTRACT

BACKGROUND: Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses. METHODS: We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model outputs into individual and composite risk scores and evaluated their predictive performance. RESULTS: Our analysis identified several factors predictive of diagnoses of PASC, including being overweight/obese and the use of HMG CoA reductase inhibitors prior to COVID-19 infection, and respiratory system symptoms during COVID-19 infection. We developed a composite risk score with a moderate discriminatory ability for PASC (covariate-adjusted AUC (95% confidence interval): 0.66 (0.63, 0.69)) by combining the risk scores based on phenotype and medication records. The combined risk score could identify 10% of individuals with a 2.2-fold increased risk for PASC. CONCLUSIONS: We identified several factors predictive of diagnoses of PASC and integrated the information into a composite risk score for PASC prediction, which could contribute to the identification of individuals at higher risk for PASC and inform preventive efforts.

10.
Sci Adv ; 9(51): eadj3747, 2023 Dec 22.
Article in English | MEDLINE | ID: mdl-38117882

ABSTRACT

We investigated the design and analysis of observational booster vaccine effectiveness (VE) studies by performing a scoping review of booster VE literature with a focus on study design and analytic choices. We then applied 20 different approaches, including those found in the literature, to a single dataset from Michigan Medicine. We identified 80 studies in our review, including over 150 million observations in total. We found that while protection against infection is variable and dependent on several factors including the study population and time period, both monovalent boosters and particularly the bivalent booster offer strong protection against severe COVID-19. In addition, VE analyses with a severe disease outcome (hospitalization, intensive care unit admission, or death) appear to be more robust to design and analytic choices than an infection endpoint. In terms of design choices, we found that test-negative designs and their variants may offer advantages in statistical efficiency compared to cohort designs.


Subject(s)
COVID-19 , Humans , COVID-19/epidemiology , COVID-19/prevention & control , Hospitalization , Intensive Care Units , Michigan/epidemiology , Observational Studies as Topic
11.
PLoS Genet ; 19(12): e1010907, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38113267

ABSTRACT

OBJECTIVE: To overcome the limitations associated with the collection and curation of COVID-19 outcome data in biobanks, this study proposes the use of polygenic risk scores (PRS) as reliable proxies of COVID-19 severity across three large biobanks: the Michigan Genomics Initiative (MGI), UK Biobank (UKB), and NIH All of Us. The goal is to identify associations between pre-existing conditions and COVID-19 severity. METHODS: Drawing on a sample of more than 500,000 individuals from the three biobanks, we conducted a phenome-wide association study (PheWAS) to identify associations between a PRS for COVID-19 severity, derived from a genome-wide association study on COVID-19 hospitalization, and clinical pre-existing, pre-pandemic phenotypes. We performed cohort-specific PRS PheWAS and a subsequent fixed-effects meta-analysis. RESULTS: The current study uncovered 23 pre-existing conditions significantly associated with the COVID-19 severity PRS in cohort-specific analyses, of which 21 were observed in the UKB cohort and two in the MGI cohort. The meta-analysis yielded 27 significant phenotypes predominantly related to obesity, metabolic disorders, and cardiovascular conditions. After adjusting for body mass index, several clinical phenotypes, such as hypercholesterolemia and gastrointestinal disorders, remained associated with an increased risk of hospitalization following COVID-19 infection. CONCLUSION: By employing PRS as a proxy for COVID-19 severity, we corroborated known risk factors and identified novel associations between pre-existing clinical phenotypes and COVID-19 severity. Our study highlights the potential value of using PRS when actual outcome data may be limited or inadequate for robust analyses.


Subject(s)
COVID-19 , Population Health , Humans , Genome-Wide Association Study , Genetic Risk Score , COVID-19/genetics , Biological Specimen Banks , Preexisting Condition Coverage , Risk Factors , Genetic Predisposition to Disease
12.
Epidemiol Health ; 45: e2023074, 2023.
Article in English | MEDLINE | ID: mdl-37591787

ABSTRACT

The Epidemiologic Questionnaire (EPI-Q) was established to collect broad, uniform, self-reported health data to supplement electronic health record (EHR) and genotype information from participants in the University of Michigan (UM) Precision Health cohorts. Recruitment of EPI-Q participants, who were already enrolled in 1 of 3 ongoing UM Precision Health cohorts-the Michigan Genomics Initiative, Mental Health Biobank, and Metabolism, Endocrinology, and Diabetes cohorts-began in March 2020. Of 54,043 retrospective invitations, 5,577 individuals enrolled, representing a 10.3% response rate. Of these, 3,502 (63.7%) were female, and the average age was 56.1 years (standard deviation, 15.4). The baseline survey comprises 11 modules on topics including personal and family health history, lifestyle, and cancer screening and history. Additionally, 11 optional modules cover topics including financial toxicity, occupational exposure, and life meaning. The questions are based on standardized and validated instruments used in other cohorts, and we share resources to expedite development of similar surveys. Data are collected via the MyDataHelps platform, which enables current and future participants to share non-Michigan Medicine EHR data. Recruitment is ongoing. Cohort data are available to those with institutional review board approval; for details, contact the Data Office for Clinical and Translational Research (DataOffice@umich.edu).


Subject(s)
Electronic Health Records , Mobile Applications , Humans , Female , Middle Aged , Male , Retrospective Studies , Genotype , Surveys and Questionnaires , Health Surveys
13.
Transl Vis Sci Technol ; 12(8): 21, 2023 08 01.
Article in English | MEDLINE | ID: mdl-37624605

ABSTRACT

Purpose: To investigate and compare novel volumetric microperimetry (MP)-derived metrics in intermediate age-related macular degeneration (iAMD), as current MP metrics show high variability and low sensitivity. Methods: This is a cross-sectional analysis of microperimetry baseline data from the multicenter, prospective PINNACLE study (ClinicalTrials.gov NCT04269304). The Visual Field Modeling and Analysis (VFMA) software and an open-source implementation (OSI) were applied to calculate MP-derived hill-of-vison (HOV) surface plots and the total volume (VTOT) beneath the plots. Bland-Altman plots were used for methodologic comparison, and the association of retinal sensitivity metrics with explanatory variables was tested with mixed-effects models. Results: In total, 247 eyes of 189 participants (75 ± 7.3 years) were included in the analysis. The VTOT output of VFMA and OSI exhibited a significant difference (P < 0.0001). VFMA yielded slightly higher coefficients of determination than OSI and mean sensitivity (MS) in univariable and multivariable modeling, for example, in association with low-luminance visual acuity (LLVA) (marginal R2/conditional R2: VFMA 0.171/0.771, OSI 0.162/0.765, MS 0.133/0.755). In the multivariable analysis, LLVA was the only demonstrable predictor of VFMA VTOT (t-value, P-value: -7.5, <0.001) and MS (-6.5, <0.001). Conclusions: The HOV-derived metric of VTOT exhibits favorable characteristics compared to MS in evaluating retinal sensitivity. The output of VFMA and OSI is not exactly interchangeable in this cross-sectional analysis. Longitudinal analysis is necessary to assess their performance in ability-to-detect change. Translational Relevance: This study explores new volumetric MP endpoints for future application in therapeutic trials in iAMD and reports specific characteristics of the available HOV software applications.


Subject(s)
Benchmarking , Macular Degeneration , Humans , Cross-Sectional Studies , Prospective Studies , Visual Field Tests , Macular Degeneration/diagnosis , Retina/diagnostic imaging
14.
medRxiv ; 2023 Jun 28.
Article in English | MEDLINE | ID: mdl-37425863

ABSTRACT

Background: Observational vaccine effectiveness (VE) studies based on real-world data are a crucial supplement to initial randomized clinical trials of Coronavirus Disease 2019 (COVID-19) vaccines. However, there exists substantial heterogeneity in study designs and statistical methods for estimating VE. The impact of such heterogeneity on VE estimates is not clear. Methods: We conducted a two-step literature review of booster VE: a literature search for first or second monovalent boosters on January 1, 2023, and a rapid search for bivalent boosters on March 28, 2023. For each study identified, study design, methods, and VE estimates for infection, hospitalization, and/or death were extracted and summarized via forest plots. We then applied methods identified in the literature to a single dataset from Michigan Medicine (MM), providing a comparison of the impact of different statistical methodologies on the same dataset. Results: We identified 53 studies estimating VE of the first booster, 16 for the second booster. Of these studies, 2 were case-control, 17 were test-negative, and 50 were cohort studies. Together, they included nearly 130 million people worldwide. VE for all outcomes was very high (around 90%) in earlier studies (i.e., in 2021), but became attenuated and more heterogeneous over time (around 40%-50% for infection, 60%-90% for hospitalization, and 50%-90% for death). VE compared to the previous dose was lower for the second booster (10-30% for infection, 30-60% against hospitalization, and 50-90% against death). We also identified 11 bivalent booster studies including over 20 million people. Early studies of the bivalent booster showed increased effectiveness compared to the monovalent booster (VE around 50-80% for hospitalization and death).Our primary analysis with MM data using a cohort design included 186,495 individuals overall (including 153,811 boosted and 32,684 with only a primary series vaccination), and a secondary test-negative design included 65,992 individuals tested for SARS-CoV-2. When different statistical designs and methods were applied to MM data, VE estimates for hospitalization and death were robust to analytic choices, with test-negative designs leading to narrower confidence intervals. Adjusting either for the propensity of getting boosted or directly adjusting for covariates reduced the heterogeneity across VE estimates for the infection outcome. Conclusion: While the advantage of the second monovalent booster is not obvious from the literature review, the first monovalent booster and the bivalent booster appear to offer strong protection against severe COVID-19. Based on both the literature view and data analysis, VE analyses with a severe disease outcome (hospitalization, ICU admission, or death) appear to be more robust to design and analytic choices than an infection endpoint. Test-negative designs can extend to severe disease outcomes and may offer advantages in statistical efficiency when used properly.

15.
Res Sq ; 2023 Apr 12.
Article in English | MEDLINE | ID: mdl-37090583

ABSTRACT

Background: Genetic variants can contribute differently to trait heritability by their functional categories, and recent studies have shown that incorporating functional annotation can improve the predictive performance of polygenic risk scores (PRSs). In addition, when only a small proportion of variants are causal variants, PRS methods that employ a Bayesian framework with shrinkage can account for such sparsity. It is possible that the annotation group level effect is also sparse. However, the number of PRS methods that incorporate both annotation information and shrinkage on effect sizes is limited. We propose a PRS method, PRSbils, which utilizes the functional annotation information with a bilevel continuous shrinkage prior to accommodate the varying genetic architectures both on the variant-specific level and on the functional annotation level. Results: We conducted simulation studies and investigated the predictive performance in settings with different genetic architectures. Results indicated that when there was a relatively large variability of group-wise heritability contribution, the gain in prediction performance from the proposed method was on average 8.0% higher AUC compared to the benchmark method PRS-CS. The proposed method also yielded higher predictive performance compared to PRS-CS in settings with different overlapping patterns of annotation groups and obtained on average 6.4% higher AUC. We applied PRSbils to binary and quantitative traits in three real world data sources (the UK Biobank, the Michigan Genomics Initiative (MGI), and the Korean Genome and Epidemiology Study (KoGES)), and two sources of annotations: ANNOVAR, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG), and demonstrated that the proposed method holds the potential for improving predictive performance by incorporating functional annotations. Conclusions: By utilizing a bilevel shrinkage framework, PRSbils enables the incorporation of both overlapping and non-overlapping annotations into PRS construction to improve the performance of genetic risk prediction. The software is available at https://github.com/styvon/PRSbils.

16.
Ophthalmol Sci ; 3(3): 100294, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37113474

ABSTRACT

Purpose: To study the individual course of retinal changes caused by healthy aging using deep learning. Design: Retrospective analysis of a large data set of retinal OCT images. Participants: A total of 85 709 adults between the age of 40 and 75 years of whom OCT images were acquired in the scope of the UK Biobank population study. Methods: We created a counterfactual generative adversarial network (GAN), a type of neural network that learns from cross-sectional, retrospective data. It then synthesizes high-resolution counterfactual OCT images and longitudinal time series. These counterfactuals allow visualization and analysis of hypothetical scenarios in which certain characteristics of the imaged subject, such as age or sex, are altered, whereas other attributes, crucially the subject's identity and image acquisition settings, remain fixed. Main Outcome Measures: Using our counterfactual GAN, we investigated subject-specific changes in the retinal layer structure as a function of age and sex. In particular, we measured changes in the retinal nerve fiber layer (RNFL), combined ganglion cell layer plus inner plexiform layer (GCIPL), inner nuclear layer to the inner boundary of the retinal pigment epithelium (INL-RPE), and retinal pigment epithelium (RPE). Results: Our counterfactual GAN is able to smoothly visualize the individual course of retinal aging. Across all counterfactual images, the RNFL, GCIPL, INL-RPE, and RPE changed by -0.1 µm ± 0.1 µm, -0.5 µm ± 0.2 µm, -0.2 µm ± 0.1 µm, and 0.1 µm ± 0.1 µm, respectively, per decade of age. These results agree well with previous studies based on the same cohort from the UK Biobank population study. Beyond population-wide average measures, our counterfactual GAN allows us to explore whether the retinal layers of a given eye will increase in thickness, decrease in thickness, or stagnate as a subject ages. Conclusion: This study demonstrates how counterfactual GANs can aid research into retinal aging by generating high-resolution, high-fidelity OCT images, and longitudinal time series. Ultimately, we envision that they will enable clinical experts to derive and explore hypotheses for potential imaging biomarkers for healthy and pathologic aging that can be refined and tested in prospective clinical trials. Financial Disclosures: Proprietary or commercial disclosure may be found after the references.

17.
Genet Epidemiol ; 47(3): 231-248, 2023 04.
Article in English | MEDLINE | ID: mdl-36739617

ABSTRACT

Linkage analysis, a class of methods for detecting co-segregation of genomic segments and traits in families, was used to map disease-causing genes for decades before genotyping arrays and dense SNP genotyping enabled genome-wide association studies in population samples. Population samples often contain related individuals, but the segregation of alleles within families is rarely used because traditional linkage methods are computationally inefficient for larger datasets. Here, we describe Population Linkage, a novel application of Haseman-Elston regression as a method of moments estimator of variance components and their standard errors. We achieve additional computational efficiency by using modern methods for detection of IBD segments and variance component estimation, efficient preprocessing of input data, and minimizing redundant numerical calculations. We also refined variance component models to account for the biases in population-scale methods for IBD segment detection. We ran Population Linkage on four blood lipid traits in over 70,000 individuals from the HUNT and SardiNIA studies, successfully detecting 25 known genetic signals. One notable linkage signal that appeared in both was for low-density lipoprotein (LDL) cholesterol levels in the region near the gene APOE (LOD = 29.3, variance explained = 4.1%). This is the region where the missense variants rs7412 and rs429358, which together make up the ε2, ε3, and ε4 alleles each account for 2.4% and 0.8% of variation in circulating LDL cholesterol. Our results show the potential for linkage analysis and other large-scale applications of method of moments variance components estimation.


Subject(s)
Genome-Wide Association Study , Models, Genetic , Humans , Phenotype , Cholesterol, LDL/genetics , Genetic Linkage , Apolipoproteins E/genetics
18.
J Clin Med ; 12(4)2023 Feb 07.
Article in English | MEDLINE | ID: mdl-36835863

ABSTRACT

BACKGROUND: A growing number of Coronavirus Disease-2019 (COVID-19) survivors are affected by post-acute sequelae of SARS CoV-2 infection (PACS). Using electronic health record data, we aimed to characterize PASC-associated diagnoses and develop risk prediction models. METHODS: In our cohort of 63,675 patients with a history of COVID-19, 1724 (2.7%) had a recorded PASC diagnosis. We used a case-control study design and phenome-wide scans to characterize PASC-associated phenotypes of the pre-, acute-, and post-COVID-19 periods. We also integrated PASC-associated phenotypes into phenotype risk scores (PheRSs) and evaluated their predictive performance. RESULTS: In the post-COVID-19 period, known PASC symptoms (e.g., shortness of breath, malaise/fatigue) and musculoskeletal, infectious, and digestive disorders were enriched among PASC cases. We found seven phenotypes in the pre-COVID-19 period (e.g., irritable bowel syndrome, concussion, nausea/vomiting) and sixty-nine phenotypes in the acute-COVID-19 period (predominantly respiratory, circulatory, neurological) associated with PASC. The derived pre- and acute-COVID-19 PheRSs stratified risk well, e.g., the combined PheRSs identified a quarter of the cohort with a history of COVID-19 with a 3.5-fold increased risk (95% CI: 2.19, 5.55) for PASC compared to the bottom 50%. CONCLUSIONS: The uncovered PASC-associated diagnoses across categories highlighted a complex arrangement of presenting and likely predisposing features, some with potential for risk stratification approaches.

19.
Cell Genom ; 3(2): 100257, 2023 Feb 08.
Article in English | MEDLINE | ID: mdl-36819667

ABSTRACT

Biobanks of linked clinical patient histories and biological samples are an efficient strategy to generate large cohorts for modern genetics research. Biobank recruitment varies by factors such as geographic catchment and sampling strategy, which affect biobank demographics and research utility. Here, we describe the Michigan Genomics Initiative (MGI), a single-health-system biobank currently consisting of >91,000 participants recruited primarily during surgical encounters at Michigan Medicine. The surgical enrollment results in a biobank enriched for many diseases and ideally suited for a disease genetics cohort. Compared with the much larger population-based UK Biobank, MGI has higher prevalence for nearly all diagnosis-code-based phenotypes and larger absolute case counts for many phenotypes. Genome-wide association study (GWAS) results replicate known findings, thereby validating the genetic and clinical data. Our results illustrate that opportunistic biobank sampling within single health systems provides a unique and complementary resource for exploring the genetics of complex diseases.

20.
Cancer Epidemiol Biomarkers Prev ; 32(6): 748-759, 2023 06 01.
Article in English | MEDLINE | ID: mdl-36626383

ABSTRACT

BACKGROUND: Studies have shown an increased risk of severe SARS-CoV-2-related (COVID-19) disease outcome and mortality for patients with cancer, but it is not well understood whether associations vary by cancer site, cancer treatment, and vaccination status. METHODS: Using electronic health record data from an academic medical center, we identified a retrospective cohort of 260,757 individuals tested for or diagnosed with COVID-19 from March 10, 2020, to August 1, 2022. Of these, 52,019 tested positive for COVID-19 of whom 13,752 had a cancer diagnosis. We conducted Firth-corrected logistic regression to assess the association between cancer status, site, treatment, vaccination, and four COVID-19 outcomes: hospitalization, intensive care unit admission, mortality, and a composite "severe COVID" outcome. RESULTS: Cancer diagnosis was significantly associated with higher rates of severe COVID, hospitalization, and mortality. These associations were driven by patients whose most recent initial cancer diagnosis was within the past 3 years. Chemotherapy receipt, colorectal cancer, hematologic malignancies, kidney cancer, and lung cancer were significantly associated with higher rates of worse COVID-19 outcomes. Vaccinations were significantly associated with lower rates of worse COVID-19 outcomes regardless of cancer status. CONCLUSIONS: Patients with colorectal cancer, hematologic malignancies, kidney cancer, or lung cancer or who receive chemotherapy for treatment should be cautious because of their increased risk of worse COVID-19 outcomes, even after vaccination. IMPACT: Additional COVID-19 precautions are warranted for people with certain cancer types and treatments. Significant benefit from vaccination is noted for both cancer and cancer-free patients.


Subject(s)
COVID-19 , Colorectal Neoplasms , Hematologic Neoplasms , Kidney Neoplasms , Lung Neoplasms , Humans , COVID-19/epidemiology , SARS-CoV-2 , Retrospective Studies , Hospitalization , Vaccination
SELECTION OF CITATIONS
SEARCH DETAIL
...