Your browser doesn't support javascript.
loading
Handwork vs machine: a comparison of rheumatoid arthritis patient populations as identified from EHR free-text by diagnosis extraction through machine-learning or traditional criteria-based chart review.
Maarseveen, T D; Maurits, M P; Niemantsverdriet, E; van der Helm-van Mil, A H M; Huizinga, T W J; Knevel, R.
Afiliación
  • Maarseveen TD; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands.
  • Maurits MP; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands.
  • Niemantsverdriet E; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands.
  • van der Helm-van Mil AHM; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands.
  • Huizinga TWJ; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands.
  • Knevel R; Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands. R.Knevel@lumc.nl.
Arthritis Res Ther ; 23(1): 174, 2021 06 22.
Article en En | MEDLINE | ID: mdl-34158089
ABSTRACT

BACKGROUND:

Electronic health records (EHRs) offer a wealth of observational data. Machine-learning (ML) methods are efficient at data extraction, capable of processing the information-rich free-text physician notes in EHRs. The clinical diagnosis contained therein represents physician expert opinion and is more consistently recorded than classification criteria components.

OBJECTIVES:

To investigate the overlap and differences between rheumatoid arthritis patients as identified either from EHR free-text through the extraction of the rheumatologist diagnosis using machine-learning (ML) or through manual chart-review applying the 1987 and 2010 RA classification criteria.

METHODS:

Since EHR initiation, 17,662 patients have visited the Leiden rheumatology outpatient clinic. For ML, we used a support vector machine (SVM) model to identify those who were diagnosed with RA by their rheumatologist. We trained and validated the model on a random selection of 2000 patients, balancing PPV and sensitivity to define a cutoff, and assessed performance on a separate 1000 patients. We then deployed the model on our entire patient selection (including the 3000). Of those, 1127 patients had both a 1987 and 2010 EULAR/ACR criteria status at 1 year after inclusion into the local prospective arthritis cohort. In these 1127 patients, we compared the patient characteristics of RA cases identified with ML and those fulfilling the classification criteria.

RESULTS:

The ML model performed very well in the independent test set (sensitivity=0.85, specificity=0.99, PPV=0.86, NPV=0.99). In our selection of patients with both EHR and classification information, 373 were recognized as RA by ML and 357 and 426 fulfilled the 1987 or 2010 criteria, respectively. Eighty percent of the ML-identified cases fulfilled at least one of the criteria sets. Both demographic and clinical parameters did not differ between the ML extracted cases and those identified with EULAR/ACR classification criteria.

CONCLUSIONS:

With ML methods, we enable fast patient extraction from the huge EHR resource. Our ML algorithm accurately identifies patients diagnosed with RA by their rheumatologist. This resulting group of RA patients had a strong overlap with patients identified using the 1987 or 2010 classification criteria and the baseline (disease) characteristics were comparable. ML-assisted case labeling enables high-throughput creation of inclusive patient selections for research purposes.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Artritis Reumatoide Tipo de estudio: Diagnostic_studies / Etiology_studies / Guideline / Incidence_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Arthritis Res Ther Asunto de la revista: REUMATOLOGIA Año: 2021 Tipo del documento: Article País de afiliación: Países Bajos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Artritis Reumatoide Tipo de estudio: Diagnostic_studies / Etiology_studies / Guideline / Incidence_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Arthritis Res Ther Asunto de la revista: REUMATOLOGIA Año: 2021 Tipo del documento: Article País de afiliación: Países Bajos