Your browser doesn't support javascript.
loading
Identifying erroneous height and weight values from adult electronic health records in the All of Us research program.
Guide, Andrew; Sulieman, Lina; Garbett, Shawn; Cronin, Robert M; Spotnitz, Matthew; Natarajan, Karthik; Carroll, Robert J; Harris, Paul; Chen, Qingxia.
Afiliación
  • Guide A; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, United States.
  • Sulieman L; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States.
  • Garbett S; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, United States.
  • Cronin RM; Department of Internal Medicine, The Ohio State University, Columbus, OH, United States.
  • Spotnitz M; Department of Biomedical Informatics, Columbia University, New York, NY, United States.
  • Natarajan K; Department of Biomedical Informatics, Columbia University, New York, NY, United States.
  • Carroll RJ; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States.
  • Harris P; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States.
  • Chen Q; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, United States; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States. Electronic address: cindy.chen@vumc.org.
J Biomed Inform ; 155: 104660, 2024 Jul.
Article en En | MEDLINE | ID: mdl-38788889
ABSTRACT

INTRODUCTION:

Electronic Health Records (EHR) are a useful data source for research, but their usability is hindered by measurement errors. This study investigated an automatic error detection algorithm for adult height and weight measurements in EHR for the All of Us Research Program (All of Us).

METHODS:

We developed reference charts for adult heights and weights that were stratified on participant sex. Our analysis included 4,076,534 height and 5,207,328 wt measurements from âˆ¼ 150,000 participants. Errors were identified using modified standard deviation scores, differences from their expected values, and significant changes between consecutive measurements. We evaluated our method with chart-reviewed heights (8,092) and weights (9,039) from 250 randomly selected participants and compared it with the current cleaning algorithm in All of Us.

RESULTS:

The proposed algorithm classified 1.4 % of height and 1.5 % of weight errors in the full cohort. Sensitivity was 90.4 % (95 % CI 79.0-96.8 %) for heights and 65.9 % (95 % CI 56.9-74.1 %) for weights. Precision was 73.4 % (95 % CI 60.9-83.7 %) for heights and 62.9 (95 % CI 54.0-71.1 %) for weights. In comparison, the current cleaning algorithm has inferior performance in sensitivity (55.8 %) and precision (16.5 %) for height errors while having higher precision (94.0 %) and lower sensitivity (61.9 %) for weight errors.

DISCUSSION:

Our proposed algorithm outperformed in detecting height errors compared to weights. It can serve as a valuable addition to the current All of Us cleaning algorithm for identifying erroneous height values.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Estatura / Peso Corporal / Algoritmos / Registros Electrónicos de Salud Límite: Adult / Aged / Female / Humans / Male / Middle aged País/Región como asunto: America do norte Idioma: En Revista: J Biomed Inform Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Estatura / Peso Corporal / Algoritmos / Registros Electrónicos de Salud Límite: Adult / Aged / Female / Humans / Male / Middle aged País/Región como asunto: America do norte Idioma: En Revista: J Biomed Inform Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos