Your browser doesn't support javascript.
loading
Addressing Information Biases Within Electronic Health Record Data to Improve the Examination of Epidemiologic Associations With Diabetes Prevalence Among Young Adults: Cross-Sectional Study.
Conderino, Sarah; Anthopolos, Rebecca; Albrecht, Sandra S; Farley, Shannon M; Divers, Jasmin; Titus, Andrea R; Thorpe, Lorna E.
Affiliation
  • Conderino S; Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States.
  • Anthopolos R; Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States.
  • Albrecht SS; Department of Epidemiology, Mailman School of Public Health at Columbia University, New York, NY, United States.
  • Farley SM; ICAP at Columbia University, New York, NY, United States.
  • Divers J; Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States.
  • Titus AR; Department of Foundations of Medicine, New York University Long Island School of Medicine, Mineola, NY, United States.
  • Thorpe LE; Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States.
JMIR Med Inform ; 12: e58085, 2024 Oct 01.
Article in En | MEDLINE | ID: mdl-39353204
ABSTRACT

Background:

Electronic health records (EHRs) are increasingly used for epidemiologic research to advance public health practice. However, key variables are susceptible to missing data or misclassification within EHRs, including demographic information or disease status, which could affect the estimation of disease prevalence or risk factor associations.

Objective:

In this paper, we applied methods from the literature on missing data and causal inference to assess whether we could mitigate information biases when estimating measures of association between potential risk factors and diabetes among a patient population of New York City young adults.

Methods:

We estimated the odds ratio (OR) for diabetes by race or ethnicity and asthma status using EHR data from NYU Langone Health. Methods from the missing data and causal inference literature were then applied to assess the ability to control for misclassification of health outcomes in the EHR data. We compared EHR-based associations with associations observed from 2 national health surveys, the Behavioral Risk Factor Surveillance System (BRFSS) and the National Health and Nutrition Examination Survey, representing traditional public health surveillance systems.

Results:

Observed EHR-based associations between race or ethnicity and diabetes were comparable to health survey-based estimates, but the association between asthma and diabetes was significantly overestimated (OREHR 3.01, 95% CI 2.86-3.18 vs ORBRFSS 1.23, 95% CI 1.09-1.40). Missing data and causal inference methods reduced information biases in these estimates, yielding relative differences from traditional estimates below 50% (ORMissingData 1.79, 95% CI 1.67-1.92 and ORCausal 1.42, 95% CI 1.34-1.51).

Conclusions:

Findings suggest that without bias adjustment, EHR analyses may yield biased measures of association, driven in part by subgroup differences in health care use. However, applying missing data or causal inference frameworks can help control for and, importantly, characterize residual information biases in these estimates.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Diabetes Mellitus / Electronic Health Records Limits: Adolescent / Adult / Female / Humans / Male Country/Region as subject: America do norte Language: En Journal: JMIR Med Inform Year: 2024 Document type: Article Affiliation country: Country of publication:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Diabetes Mellitus / Electronic Health Records Limits: Adolescent / Adult / Female / Humans / Male Country/Region as subject: America do norte Language: En Journal: JMIR Med Inform Year: 2024 Document type: Article Affiliation country: Country of publication: