Your browser doesn't support javascript.
loading
Subcategorizing EHR diagnosis codes to improve clinical application of machine learning models.
Reimer, Andrew P; Dai, Wei; Smith, Benjamin; Schiltz, Nicholas K; Sun, Jiayang; Koroukian, Siran M.
Afiliación
  • Reimer AP; Frances Payne Bolton School of Nursing, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, United States; Critical Care Transport, Cleveland Clinic, 9800 Euclid Ave, Cleveland, OH, United States. Electronic address: axr62@cwru.edu.
  • Dai W; Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, United States.
  • Smith B; Department of Mathematics, Applied Mathematics and Statistics, College of Arts and Sciences, Case Western Reserve University, Cleveland, OH, United States.
  • Schiltz NK; Frances Payne Bolton School of Nursing, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, United States.
  • Sun J; Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, United States.
  • Koroukian SM; Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, United States.
Int J Med Inform ; 156: 104588, 2021 12.
Article en En | MEDLINE | ID: mdl-34607290
ABSTRACT

BACKGROUND:

Electronic health record (EHR) data is commonly used for secondary purposes such as research and clinical decision support. However, reuse of EHR data presents several challenges including but not limited to identifying all diagnoses associated with a patient's clinical encounter. The purpose of this study was to assess the feasibility of developing a schema to identify and subclassify all structured diagnosis codes for a patient encounter.

METHODS:

To develop a subclassification schema we used EHR data from an interhospital transport data repository that contained complete hospital encounter level data. Eight discrete data sources containing structured diagnosis codes were identified. Diagnosis codes were normalized using the Unified Medical Language System and additional EHR data were combined with standardized terminologies to create and validate the subcategories. We then employed random forest to assess the usefulness of the new subcategorized diagnoses to predict post-interhospital transfer mortality by building 2 models, one using standard diagnosis codes, and one using the new subcategorized diagnosis codes.

RESULTS:

Six subcategories of diagnoses were identified and validated. The subcategories included primary or admitting diagnoses (10%), past medical, surgical or social history (9%), problem list (20%), comorbidity (24%), discharge diagnoses (6%), and unmapped diagnoses (31%). The subcategorized model outperformed the standard model, achieving a training AUROC of 0.97 versus 0.95 and testing model AUROC of 0.81 versus 0.46.

DISCUSSION:

Our work demonstrates that merging structured diagnosis codes with additional EHR data and secondary data sources provides additional information to understand the role of diagnosis throughout a clinical encounter and improves predictive model performance. Further work is necessary to assess if subcategorizing produces benefits in interpreting the results of prognostic models and/or operationalizing the results in clinical decision support applications.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Registros Electrónicos de Salud / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Prognostic_studies / Qualitative_research Límite: Humans Idioma: En Revista: Int J Med Inform Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Registros Electrónicos de Salud / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Prognostic_studies / Qualitative_research Límite: Humans Idioma: En Revista: Int J Med Inform Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article