CheXclusion: Fairness gaps in deep chest X-ray classifiers.

Seyyed-Kalantari, Laleh; Liu, Guanxiong; McDermott, Matthew; Chen, Irene Y; Ghassemi, Marzyeh

Seyyed-Kalantari, Laleh; Liu, Guanxiong; McDermott, Matthew; Chen, Irene Y; Ghassemi, Marzyeh.

Afiliação

Seyyed-Kalantari L; Computer Science, University of Toronto, Toronto, Ontario, Canada2Vector Institute, Toronto, Ontario, Canada* Corresponding author, laleh@cs.toronto.edu.

Pac Symp Biocomput ; 26: 232-243, 2021.

Article em En | MEDLINE | ID: mdl-33691020

RESUMO

Machine learning systems have received much attention recently for their ability to achieve expert-level performance on clinical tasks, particularly in medical imaging. Here, we examine the extent to which state-of-the-art deep learning classifiers trained to yield diagnostic labels from X-ray images are biased with respect to protected attributes. We train convolution neural networks to predict 14 diagnostic labels in 3 prominent public chest X-ray datasets: MIMIC-CXR, Chest-Xray8, CheXpert, as well as a multi-site aggregation of all those datasets. We evaluate the TPR disparity - the difference in true positive rates (TPR) - among different protected attributes such as patient sex, age, race, and insurance type as a proxy for socioeconomic status. We demonstrate that TPR disparities exist in the state-of-the-art classifiers in all datasets, for all clinical tasks, and all subgroups. A multi-source dataset corresponds to the smallest disparities, suggesting one way to reduce bias. We find that TPR disparities are not significantly correlated with a subgroup's proportional disease burden. As clinical models move from papers to products, we encourage clinical decision makers to carefully audit for algorithmic disparities prior to deployment. Our supplementary materials can be found at, http://www.marzyehghassemi.com/chexclusion-supp-3/.

Assuntos

Biologia Computacional; Redes Neurais de Computação; Humanos; Aprendizado de Máquina; Raios X

Buscar no Google

Imprimir

XML

PubMed Links

Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Biologia Computacional Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Pac Symp Biocomput Assunto da revista: BIOTECNOLOGIA / INFORMATICA MEDICA Ano de publicação: 2021 Tipo de documento: Article

Buscar no Google

Imprimir

XML

PubMed Links