Your browser doesn't support javascript.
loading
Machine Learning Analysis to Identify Data Entry Errors in Prehospital Patient Care Reports: A Case Study of a National Out-of-Hospital Cardiac Arrest Registry.
Choi, Dong Hyun; Park, Jeong Ho; Choi, Young Ho; Song, Kyoung Jun; Kim, Sungwan; Shin, Sang Do.
Afiliación
  • Choi DH; Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, Republic of Korea.
  • Park JH; Laboratory of Emergency Medical Services, Seoul National University Hospital Biomedical Research Institute, Seoul, Republic of Korea.
  • Choi YH; Laboratory of Emergency Medical Services, Seoul National University Hospital Biomedical Research Institute, Seoul, Republic of Korea.
  • Song KJ; Department of Emergency Medicine, Seoul National University College of Medicine and Hospital, Seoul, Republic of Korea.
  • Kim S; Laboratory of Emergency Medical Services, Seoul National University Hospital Biomedical Research Institute, Seoul, Republic of Korea.
  • Shin SD; Department of Emergency Medicine, Seoul National University Bundang Hospital, Bundang, Republic of Korea.
Prehosp Emerg Care ; : 1-9, 2022 Nov 16.
Article en En | MEDLINE | ID: mdl-36256618
ABSTRACT

Background:

The objective of this study was to develop and validate machine learning models for data entry error detection in a national out-of-hospital cardiac arrest (OHCA) prehospital patient care report database.

Methods:

Adult OHCAs of presumed cardiac etiology were included. Data entry errors were defined as discrepancies between the coded data and the free-text note documenting the intervention or event; for example, information that was recorded as "absent" in the coded data but "present" in the free-text note. Machine learning models using the extreme gradient boosting, logistic regression, extreme gradient boosting outlier detection, and K-nearest neighbor outlier detection algorithms for error detection within nine core variables were developed and then validated for each variable.

Results:

Among 12,100 OHCAs, the proportion of cases with at least one error type was 16.2%. The area under the receiver operating characteristic curve (AUC) of the best-performing model (model with the highest AUC for each outcome variable) was 0.71-0.95. Machine learning models detected errors most efficiently for outcome place and initial rhythm errors; 82.6% of place errors and 93.8% of initial rhythm errors could be detected while checking 11 and 35% of data, respectively, compared to the strategy of checking all data.

Conclusion:

Machine learning models can detect data entry errors in care reports of emergency medical services (EMS) clinicians with acceptable performance and likely can improve the efficiency of the process of data quality control. EMS organizations that provide more prehospital interventions for OHCA patients could have higher error rates and may benefit from the adoption of error-detection models.

Texto completo: 1 Bases de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Prehosp Emerg Care Asunto de la revista: MEDICINA DE EMERGENCIA Año: 2022 Tipo del documento: Article

Texto completo: 1 Bases de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Prehosp Emerg Care Asunto de la revista: MEDICINA DE EMERGENCIA Año: 2022 Tipo del documento: Article