Accommodating population differences when validating risk prediction models.

Pfeiffer, Ruth M; Chen, Yiyao; Gail, Mitchell H; Ankerst, Donna P

Affiliation

Pfeiffer RM; Biostatistics Branch, National Cancer Institute, Bethesda, Maryland, USA.
Chen Y; Departments of Mathematics and Life Science Systems, Technical University of Munich, Garching, Germany.
Gail MH; Biostatistics Branch, National Cancer Institute, Bethesda, Maryland, USA.
Ankerst DP; Departments of Mathematics and Life Science Systems, Technical University of Munich, Garching, Germany.

Stat Med; 41(24): 4756-4780, 2022 Oct 30.

Article in En | MEDLINE | ID: mdl-36224712

ABSTRACT

Validation of risk prediction models in independent data provides a more rigorous assessment of model performance than internal assessment, for example, done by cross-validation in the data used for model development. However, several differences between the populations that gave rise to the training and the validation data can lead to seemingly poor performance of a risk model. In this paper we formalize the notions of "similarity" or "relatedness" of the training and validation data, and define reproducibility and transportability. We address the impact of different distributions of model predictors and differences in verifying the disease status or outcome on measures of calibration, accuracy and discrimination of a model. When individual level information from both the training and validation data sets is available, we propose and study weighted versions of the validation metrics that adjust for differences in the risk factor distributions and in outcome verification between the training and validation data to provide a more comprehensive assessment of model performance. We provide conditions on the risk model and the populations that gave rise to the training and validation data that ensure a model's reproducibility or transportability, and show how to check these conditions using weighted and unweighted performance measures. We illustrate the method by developing and validating a model that predicts the risk of developing prostate cancer using data from two large prostate cancer screening trials.
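As a rough illustration of what a weighted validation metric can look like, the Python sketch below reweights validation subjects so that one categorical risk factor has the same distribution as in the training data, then computes a weighted observed-to-expected (O/E) calibration ratio. It is not the authors' estimator: the function names, the stratum labels, the toy data, and the use of a simple ratio of stratum shares as the weight are all assumptions made for illustration. The adjustment for differences in outcome verification mentioned in the abstract is not covered.

```python
from collections import Counter


def stratum_weights(train_strata, valid_strata):
    """Return a weight per validation stratum: training share / validation share.

    Hypothetical helper: assumes a single categorical risk factor whose
    levels are given as one label per subject in each data set.
    """
    n_train = len(train_strata)
    n_valid = len(valid_strata)
    train_counts = Counter(train_strata)
    valid_counts = Counter(valid_strata)
    # A stratum seen only in the validation data gets weight 0.
    return {
        s: (train_counts.get(s, 0) / n_train) / (valid_counts[s] / n_valid)
        for s in valid_counts
    }


def weighted_oe_ratio(outcomes, predicted_risks, weights):
    """Weighted observed-to-expected ratio; equals plain O/E when all weights are 1."""
    observed = sum(w * y for w, y in zip(weights, outcomes))
    expected = sum(w * p for w, p in zip(weights, predicted_risks))
    return observed / expected


# Toy data (invented for illustration only).
train_strata = ["young", "young", "young", "old"]
valid_strata = ["young", "young", "old", "old"]
valid_outcomes = [0, 1, 1, 1]          # 1 = event observed
valid_risks = [0.2, 0.2, 0.6, 0.6]     # model-predicted risks

w_by_stratum = stratum_weights(train_strata, valid_strata)
w = [w_by_stratum[s] for s in valid_strata]

unweighted = weighted_oe_ratio(valid_outcomes, valid_risks, [1.0] * len(valid_outcomes))
weighted = weighted_oe_ratio(valid_outcomes, valid_risks, w)
print("unweighted O/E:", unweighted)
print("weighted O/E:  ", weighted)
```

Comparing the two ratios gives a sense of how much of an apparent calibration difference could be attributed to a shift in the risk factor distribution rather than to the model itself. With several or continuous predictors, the weights would usually come from a fitted model of data-set membership rather than from stratum counts.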

Subjects

Early Detection of Cancer; Prostatic Neoplasms; Humans; Male; Prognosis; Prostate-Specific Antigen; Prostatic Neoplasms/diagnosis; Reproducibility of Results; Risk Assessment

Keywords

population differences; risk factor heterogeneity; risk model performance; selection; verification

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Prostatic Neoplasms / Early Detection of Cancer | Type of study: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies / Screening_studies | Limits: Humans / Male | Language: En | Year of publication: 2022 | Document type: Article
