The effects of misclassification in routine healthcare databases on the accuracy of prognostic prediction models: a case study of the CHA2DS2-VASc score in atrial fibrillation.

van Doorn, S; Brakenhoff, T B; Moons, K G M; Rutten, F H; Hoes, A W; Groenwold, R H H; Geersing, G J

van Doorn, S; Brakenhoff, T B; Moons, K G M; Rutten, F H; Hoes, A W; Groenwold, R H H; Geersing, G J.

Afiliação

van Doorn S; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Brakenhoff TB; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Moons KGM; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Rutten FH; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Hoes AW; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Groenwold RHH; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.
Geersing GJ; Julius Center for Health Sciences and Primary care, University Medical Center Utrecht, PO box 85500, 3508 AB Utrecht, The Netherlands.

Diagn Progn Res ; 1: 18, 2017.

Article em En | MEDLINE | ID: mdl-31093547

RESUMO

BACKGROUND: Research on prognostic prediction models frequently uses data from routine healthcare. However, potential misclassification of predictors when using such data may strongly affect the studied associations. There is no doubt that such misclassification could lead to the derivation of suboptimal prediction models. The extent to which misclassification affects the validation of existing prediction models is currently unclear.We aimed to quantify the amount of misclassification in routine care data and its effect on the validation of the existing risk prediction model. As an illustrative example, we validated the CHA2DS2-VASc prediction rule for predicting mortality in patients with atrial fibrillation (AF). METHODS: In a prospective cohort in general practice in the Netherlands, we used computerized retrieved data from the electronic medical records of patients known with AF as index predictors. Additionally, manually collected data after scrutinizing all complete medical files were used as reference predictors. Comparing the index with the reference predictors, we assessed misclassification in individual predictors by calculating Cohen's kappas and other diagnostic test accuracy measures. Predictive performance was quantified by the c-statistic and by determining calibration of multivariable models. RESULTS: In total, 2363 AF patients were included. After a median follow-up of 2.7 (IQR 2.3-3.0) years, 368 patients died (incidence rate 6.2 deaths per 100 person-years). Misclassification in individual predictors ranged from substantial (Cohen's kappa 0.56 for prior history of heart failure) to minor (kappa 0.90 for a history of type 2 diabetes). The overall model performance was not affected when using either index or reference predictors, with a c-statistic of 0.684 and 0.681, respectively, and similar calibration. CONCLUSION: In a case study validating the CHA2DS2-VASc prediction model, we found substantial predictor misclassification in routine healthcare data with only limited effect on overall model performance. Our study should be repeated for other often applied prediction models to further evaluate the usefulness of routinely available healthcare data for validating prognostic models in the presence of predictor misclassification.

Palavras-chave

Atrial fibrillation; CHA2DS2-VASc; Misclassification; Prediction model; Routine care data; Validation

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2017 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2017 Tipo de documento: Article