Your browser doesn't support javascript.
loading
A framework for understanding label leakage in machine learning for health care.
Davis, Sharon E; Matheny, Michael E; Balu, Suresh; Sendak, Mark P.
Afiliação
  • Davis SE; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, United States.
  • Matheny ME; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, United States.
  • Balu S; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37232, United States.
  • Sendak MP; Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.
J Am Med Inform Assoc ; 31(1): 274-280, 2023 12 22.
Article em En | MEDLINE | ID: mdl-37669138
ABSTRACT

INTRODUCTION:

The pitfalls of label leakage, contamination of model input features with outcome information, are well established. Unfortunately, avoiding label leakage in clinical prediction models requires more nuance than the common advice of applying "no time machine rule." FRAMEWORK We provide a framework for contemplating whether and when model features pose leakage concerns by considering the cadence, perspective, and applicability of predictions. To ground these concepts, we use real-world clinical models to highlight examples of appropriate and inappropriate label leakage in practice.

RECOMMENDATIONS:

Finally, we provide recommendations to support clinical and technical stakeholders as they evaluate the leakage tradeoffs associated with model design, development, and implementation decisions. By providing common language and dimensions to consider when designing models, we hope the clinical prediction community will be better prepared to develop statistically valid and clinically useful machine learning models.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Instalações de Saúde / Idioma Tipo de estudo: Guideline / Prognostic_studies Idioma: En Revista: J Am Med Inform Assoc Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Instalações de Saúde / Idioma Tipo de estudo: Guideline / Prognostic_studies Idioma: En Revista: J Am Med Inform Assoc Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos