Your browser doesn't support javascript.
loading
Towards optimal model evaluation: enhancing active testing with actively improved estimators.
Lee, JooChul; Kolla, Likhitha; Chen, Jinbo.
Afiliación
  • Lee J; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA. joochul.lee@pennmedicine.upenn.edu.
  • Kolla L; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
  • Chen J; Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
Sci Rep ; 14(1): 10690, 2024 May 09.
Article en En | MEDLINE | ID: mdl-38724626
ABSTRACT
With rapid advancements in machine learning and statistical models, ensuring the reliability of these models through accurate evaluation has become imperative. Traditional evaluation methods often rely on fully labeled test data, a requirement that is becoming increasingly impractical due to the growing size of datasets. In this work, we address this issue by extending existing work on active testing (AT) methods which are designed to sequentially sample and label data for evaluating pre-trained models. We propose two novel estimators the Actively Improved Levelled Unbiased Risk (AILUR) and the Actively Improved Inverse Probability Weighting (AIIPW) estimators which are derived from nonparametric smoothing estimation. In addition, a model recalibration process is designed for the AIIPW estimator to optimize the sampling probability within the AT framework. We evaluate the proposed estimators on four real-world datasets and demonstrate that they consistently outperform existing AT methods. Our study also shows that the proposed methods are robust to changes in subsample sizes, and effective at reducing labeling costs.

Texto completo: 1 Bases de datos: MEDLINE Idioma: En Revista: Sci Rep Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Bases de datos: MEDLINE Idioma: En Revista: Sci Rep Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos