Your browser doesn't support javascript.
loading
Dissecting heritability, environmental risk, and air pollution causal effects using > 50 million individuals in MarketScan.
McGuire, Daniel; Markus, Havell; Yang, Lina; Xu, Jingyu; Montgomery, Austin; Berg, Arthur; Li, Qunhua; Carrel, Laura; Liu, Dajiang J; Jiang, Bibo.
Affiliation
  • McGuire D; Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Markus H; MD/PhD Program, Penn State College of Medicine of Medicine, Hershey, PA, 17033, USA.
  • Yang L; Bioinformatics and Genomics PhD Program, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Xu J; Institute for Personalized Medicine, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Montgomery A; Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Berg A; Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Li Q; MD/PhD Program, Penn State College of Medicine of Medicine, Hershey, PA, 17033, USA.
  • Carrel L; Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, 17033, USA.
  • Liu DJ; Department of Statistics, Penn State University, University Park, PA, USA.
  • Jiang B; Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA, 17033, USA.
Nat Commun ; 15(1): 5357, 2024 Jun 25.
Article in En | MEDLINE | ID: mdl-38918381
ABSTRACT
Large national-level electronic health record (EHR) datasets offer new opportunities for disentangling the role of genes and environment through deep phenotype information and approximate pedigree structures. Here we use the approximate geographical locations of patients as a proxy for spatially correlated community-level environmental risk factors. We develop a spatial mixed linear effect (SMILE) model that incorporates both genetics and environmental contribution. We extract EHR and geographical locations from 257,620 nuclear families and compile 1083 disease outcome measurements from the MarketScan dataset. We augment the EHR with publicly available environmental data, including levels of particulate matter 2.5 (PM2.5), nitrogen dioxide (NO2), climate, and sociodemographic data. We refine the estimates of genetic heritability and quantify community-level environmental contributions. We also use wind speed and direction as instrumental variables to assess the causal effects of air pollution. In total, we find PM2.5 or NO2 have statistically significant causal effects on 135 diseases, including respiratory, musculoskeletal, digestive, metabolic, and sleep disorders, where PM2.5 and NO2 tend to affect biologically distinct disease categories. These analyses showcase several robust strategies for jointly modeling genetic and environmental effects on disease risk using large EHR datasets and will benefit upcoming biobank studies in the era of precision medicine.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Air Pollution / Particulate Matter / Nitrogen Dioxide Limits: Adult / Female / Humans / Male / Middle aged Language: En Journal: Nat Commun Journal subject: BIOLOGIA / CIENCIA Year: 2024 Document type: Article Affiliation country: Estados Unidos Country of publication: Reino Unido

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Air Pollution / Particulate Matter / Nitrogen Dioxide Limits: Adult / Female / Humans / Male / Middle aged Language: En Journal: Nat Commun Journal subject: BIOLOGIA / CIENCIA Year: 2024 Document type: Article Affiliation country: Estados Unidos Country of publication: Reino Unido