Your browser doesn't support javascript.
loading
A constrained maximum likelihood approach to developing well-calibrated models for predicting binary outcomes.
Cao, Yaqi; Ma, Weidong; Zhao, Ge; McCarthy, Anne Marie; Chen, Jinbo.
Afiliación
  • Cao Y; Department of Statistics, School of Science, Minzu University of China, Beijing, China.
  • Ma W; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
  • Zhao G; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
  • McCarthy AM; Department of Mathematics and Statistics, Portland State University, Portland, PA, 97201, USA.
  • Chen J; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
Lifetime Data Anal ; 30(3): 624-648, 2024 Jul.
Article en En | MEDLINE | ID: mdl-38717617
ABSTRACT
The added value of candidate predictors for risk modeling is routinely evaluated by comparing the performance of models with or without including candidate predictors. Such comparison is most meaningful when the estimated risk by the two models are both unbiased in the target population. Very often data for candidate predictors are sourced from nonrepresentative convenience samples. Updating the base model using the study data without acknowledging the discrepancy between the underlying distribution of the study data and that in the target population can lead to biased risk estimates and therefore an unfair evaluation of candidate predictors. To address this issue assuming access to a well-calibrated base model, we propose a semiparametric method for model fitting that enforces good calibration. The central idea is to calibrate the fitted model against the base model by enforcing suitable constraints in maximizing the likelihood function. This approach enables unbiased assessment of model improvement offered by candidate predictors without requiring a representative sample from the target population, thus overcoming a significant practical challenge. We study theoretical properties for model parameter estimates, and demonstrate improvement in model calibration via extensive simulation studies. Finally, we apply the proposed method to data extracted from Penn Medicine Biobank to inform the added value of breast density for breast cancer risk assessment in the Caucasian woman population.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Neoplasias de la Mama / Modelos Estadísticos Límite: Female / Humans Idioma: En Revista: Lifetime Data Anal Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Neoplasias de la Mama / Modelos Estadísticos Límite: Female / Humans Idioma: En Revista: Lifetime Data Anal Año: 2024 Tipo del documento: Article País de afiliación: China
...