Characterizing the limitations of using diagnosis codes in the context of machine learning for healthcare.

Guo, Lin Lawrence; Morse, Keith E; Aftandilian, Catherine; Steinberg, Ethan; Fries, Jason; Posada, Jose; Fleming, Scott Lanyon; Lemmon, Joshua; Jessa, Karim; Shah, Nigam; Sung, Lillian

Guo, Lin Lawrence; Morse, Keith E; Aftandilian, Catherine; Steinberg, Ethan; Fries, Jason; Posada, Jose; Fleming, Scott Lanyon; Lemmon, Joshua; Jessa, Karim; Shah, Nigam; Sung, Lillian.

Afiliación

Guo LL; Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, ON, Canada.
Morse KE; Division of Pediatric Hospital Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, USA.
Aftandilian C; Division of Hematology/Oncology, Department of Pediatrics, Stanford University, Palo Alto, CA, USA.
Steinberg E; Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA, USA.
Fries J; Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA, USA.
Posada J; Universidad del Norte, Barranquilla, Colombia.
Fleming SL; Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA, USA.
Lemmon J; Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, ON, Canada.
Jessa K; Information Services, The Hospital for Sick Children, Toronto, ON, Canada.
Shah N; Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA, USA.
Sung L; Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, ON, Canada. Lillian.sung@sickkids.ca.

BMC Med Inform Decis Mak ; 24(1): 51, 2024 Feb 14.

Article en En | MEDLINE | ID: mdl-38355486

ABSTRACT

ABSTRACT

BACKGROUND:

Diagnostic codes are commonly used as inputs for clinical prediction models, to create labels for prediction tasks, and to identify cohorts for multicenter network studies. However, the coverage rates of diagnostic codes and their variability across institutions are underexplored. The primary objective was to describe lab- and diagnosis-based labels for 7 selected outcomes at three institutions. Secondary objectives were to describe agreement, sensitivity, and specificity of diagnosis-based labels against lab-based labels.

METHODS:

This study included three cohorts SickKids from The Hospital for Sick Children, and StanfordPeds and StanfordAdults from Stanford Medicine. We included seven clinical outcomes with lab-based definitions acute kidney injury, hyperkalemia, hypoglycemia, hyponatremia, anemia, neutropenia and thrombocytopenia. For each outcome, we created four lab-based labels (abnormal, mild, moderate and severe) based on test result and one diagnosis-based label. Proportion of admissions with a positive label were presented for each outcome stratified by cohort. Using lab-based labels as the gold standard, agreement using Cohen's Kappa, sensitivity and specificity were calculated for each lab-based severity level.

RESULTS:

The number of admissions included were SickKids (n = 59,298), StanfordPeds (n = 24,639) and StanfordAdults (n = 159,985). The proportion of admissions with a positive diagnosis-based label was significantly higher for StanfordPeds compared to SickKids across all outcomes, with odds ratio (99.9% confidence interval) for abnormal diagnosis-based label ranging from 2.2 (1.7-2.7) for neutropenia to 18.4 (10.1-33.4) for hyperkalemia. Lab-based labels were more similar by institution. When using lab-based labels as the gold standard, Cohen's Kappa and sensitivity were lower at SickKids for all severity levels compared to StanfordPeds.

CONCLUSIONS:

Across multiple outcomes, diagnosis codes were consistently different between the two pediatric institutions. This difference was not explained by differences in test results. These results may have implications for machine learning model development and deployment.

Asunto(s)

Hiperpotasemia; Neutropenia; Humanos; Atención a la Salud; Aprendizaje Automático; Sensibilidad y Especificidad

Palabras clave

Cohort identification; Diagnostic coding practice; Electronic health records; Machine learning for health; Outcome identification

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Hiperpotasemia / Neutropenia Tipo de estudio: Clinical_trials / Diagnostic_studies / Prognostic_studies Límite: Humans Idioma: En Revista: BMC Med Inform Decis Mak Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: Canadá

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google