Your browser doesn't support javascript.
loading
Evaluating diagnostic content of AI-generated chest radiography: A multi-center visual Turing test.
Myong, Youho; Yoon, Dan; Kim, Byeong Soo; Kim, Young Gyun; Sim, Yongsik; Lee, Suji; Yoon, Jiyoung; Cho, Minwoo; Kim, Sungwan.
Affiliation
  • Myong Y; Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, Republic of Korea.
  • Yoon D; Department of Rehabilitation Medicine, Seoul National University Hospital, Seoul, Republic of Korea.
  • Kim BS; Interdisciplinary Program in Bioengineering, Seoul National University Graduate School, Seoul, Republic of Korea.
  • Kim YG; Interdisciplinary Program in Bioengineering, Seoul National University Graduate School, Seoul, Republic of Korea.
  • Sim Y; Interdisciplinary Program in Bioengineering, Seoul National University Graduate School, Seoul, Republic of Korea.
  • Lee S; Department of Radiology, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea.
  • Yoon J; Department of Radiology, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea.
  • Cho M; Department of Radiology and Research Institute of Radiological Science, Yonsei University College of Medicine, Seoul, Republic of Korea.
  • Kim S; Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Republic of Korea.
PLoS One ; 18(4): e0279349, 2023.
Article in En | MEDLINE | ID: mdl-37043456
ABSTRACT

BACKGROUND:

Accurate interpretation of chest radiographs requires years of medical training, and many countries face a shortage of medical professionals to meet such requirements. Recent advancements in artificial intelligence (AI) have aided diagnoses; however, their performance is often limited due to data imbalance. The aim of this study was to augment imbalanced medical data using generative adversarial networks (GANs) and evaluate the clinical quality of the generated images via a multi-center visual Turing test.

METHODS:

Using six chest radiograph datasets, (MIMIC, CheXPert, CXR8, JSRT, VBD, and OpenI), starGAN v2 generated chest radiographs with specific pathologies. Five board-certified radiologists from three university hospitals, each with at least five years of clinical experience, evaluated the image quality through a visual Turing test. Further evaluations were performed to investigate whether GAN augmentation enhanced the convolutional neural network (CNN) classifier performances.

RESULTS:

In terms of identifying GAN images as artificial, there was no significant difference in the sensitivity between radiologists and random guessing (result of radiologists 147/275 (53.5%) vs result of random guessing 137.5/275, (50%); p = .284). GAN augmentation enhanced CNN classifier performance by 11.7%.

CONCLUSION:

Radiologists effectively classified chest pathologies with synthesized radiographs, suggesting that the images contained adequate clinical information. Furthermore, GAN augmentation enhanced CNN performance, providing a bypass to overcome data imbalance in medical AI training. CNN based methods rely on the amount and quality of training data; the present study showed that GAN augmentation could effectively augment training data for medical AI.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Artificial Intelligence / Neural Networks, Computer Type of study: Clinical_trials / Diagnostic_studies Limits: Humans Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2023 Type: Article

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Artificial Intelligence / Neural Networks, Computer Type of study: Clinical_trials / Diagnostic_studies Limits: Humans Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2023 Type: Article