GUESS: projecting machine learning scores to well-calibrated probability estimates for clinical decision-making.
Bioinformatics
; 35(14): 2458-2465, 2019 07 15.
Article
em En
| MEDLINE
| ID: mdl-30496351
MOTIVATION: Clinical decision support systems have been applied in numerous fields, ranging from cancer survival toward drug resistance prediction. Nevertheless, clinical decision support systems typically have a caveat: many of them are perceived as black-boxes by non-experts and, unfortunately, the obtained scores cannot usually be interpreted as class probability estimates. In probability-focused medical applications, it is not sufficient to perform well with regards to discrimination and, consequently, various calibration methods have been developed to enable probabilistic interpretation. The aims of this study were (i) to develop a tool for fast and comparative analysis of different calibration methods, (ii) to demonstrate their limitations for the use on clinical data and (iii) to introduce our novel method GUESS. RESULTS: We compared the performances of two different state-of-the-art calibration methods, namely histogram binning and Bayesian Binning in Quantiles, as well as our novel method GUESS on both, simulated and real-world datasets. GUESS demonstrated calibration performance comparable to the state-of-the-art methods and always retained accurate class discrimination. GUESS showed superior calibration performance in small datasets and therefore may be an optimal calibration method for typical clinical datasets. Moreover, we provide a framework (CalibratR) for R, which can be used to identify the most suitable calibration method for novel datasets in a timely and efficient manner. Using calibrated probability estimates instead of original classifier scores will contribute to the acceptance and dissemination of machine learning based classification models in cost-sensitive applications, such as clinical research. AVAILABILITY AND IMPLEMENTATION: GUESS as part of CalibratR can be downloaded at CRAN.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Tomada de Decisão Clínica
/
Aprendizado de Máquina
Tipo de estudo:
Prognostic_studies
Idioma:
En
Revista:
Bioinformatics
Ano de publicação:
2019
Tipo de documento:
Article