Your browser doesn't support javascript.
loading
Shortcut learning in medical AI hinders generalization: method for estimating AI model generalization without external data.
Ong Ly, Cathy; Unnikrishnan, Balagopal; Tadic, Tony; Patel, Tirth; Duhamel, Joe; Kandel, Sonja; Moayedi, Yasbanoo; Brudno, Michael; Hope, Andrew; Ross, Heather; McIntosh, Chris.
Afiliación
  • Ong Ly C; Peter Munk Cardiac Centre and Ted Rogers Centre for Heart Research, University Health Network, Toronto, ON, Canada.
  • Unnikrishnan B; Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
  • Tadic T; Toronto General Hospital Research Institute, University Health Network, Toronto, ON, Canada.
  • Patel T; Toronto General Hospital Research Institute, University Health Network, Toronto, ON, Canada.
  • Duhamel J; Department of Computer Science, University of Toronto, Toronto, ON, Canada.
  • Kandel S; Joint Department of Medical Imaging, University Health Network, Toronto, ON, Canada.
  • Moayedi Y; Vector Institute, Toronto, ON, Canada.
  • Brudno M; Radiation Medicine Program, Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
  • Hope A; Department of Radiation Oncology, University of Toronto, Toronto, ON, Canada.
  • Ross H; Radiation Medicine Program, Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
  • McIntosh C; Peter Munk Cardiac Centre and Ted Rogers Centre for Heart Research, University Health Network, Toronto, ON, Canada.
NPJ Digit Med ; 7(1): 124, 2024 May 14.
Article en En | MEDLINE | ID: mdl-38744921
ABSTRACT
Healthcare datasets are becoming larger and more complex, necessitating the development of accurate and generalizable AI models for medical applications. Unstructured datasets, including medical imaging, electrocardiograms, and natural language data, are gaining attention with advancements in deep convolutional neural networks and large language models. However, estimating the generalizability of these models to new healthcare settings without extensive validation on external data remains challenging. In experiments across 13 datasets including X-rays, CTs, ECGs, clinical discharge summaries, and lung auscultation data, our results demonstrate that model performance is frequently overestimated by up to 20% on average due to shortcut learning of hidden data acquisition biases (DAB). Shortcut learning refers to a phenomenon in which an AI model learns to solve a task based on spurious correlations present in the data as opposed to features directly related to the task itself. We propose an open source, bias-corrected external accuracy estimate, PEst, that better estimates external accuracy to within 4% on average by measuring and calibrating for DAB-induced shortcut learning.

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: NPJ Digit Med Año: 2024 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: NPJ Digit Med Año: 2024 Tipo del documento: Article País de afiliación: Canadá
...