Búsqueda | Portal Regional de la BVS

Synthetically enhanced: unveiling synthetic data's potential in medical imaging research.

Khosravi, Bardia; Li, Frank; Dapamede, Theo; Rouzrokh, Pouria; Gamble, Cooper U; Trivedi, Hari M; Wyles, Cody C; Sellergren, Andrew B; Purkayastha, Saptarshi; Erickson, Bradley J; Gichoya, Judy W.

EBioMedicine ; 104: 105174, 2024 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-38821021

RESUMEN

BACKGROUND: Chest X-rays (CXR) are essential for diagnosing a variety of conditions, but when used on new populations, model generalizability issues limit their efficacy. Generative AI, particularly denoising diffusion probabilistic models (DDPMs), offers a promising approach to generating synthetic images, enhancing dataset diversity. This study investigates the impact of synthetic data supplementation on the performance and generalizability of medical imaging research. METHODS: The study employed DDPMs to create synthetic CXRs conditioned on demographic and pathological characteristics from the CheXpert dataset. These synthetic images were used to supplement training datasets for pathology classifiers, with the aim of improving their performance. The evaluation involved three datasets (CheXpert, MIMIC-CXR, and Emory Chest X-ray) and various experiments, including supplementing real data with synthetic data, training with purely synthetic data, and mixing synthetic data with external datasets. Performance was assessed using the area under the receiver operating curve (AUROC). FINDINGS: Adding synthetic data to real datasets resulted in a notable increase in AUROC values (up to 0.02 in internal and external test sets with 1000% supplementation, p-value <0.01 in all instances). When classifiers were trained exclusively on synthetic data, they achieved performance levels comparable to those trained on real data with 200%-300% data supplementation. The combination of real and synthetic data from different sources demonstrated enhanced model generalizability, increasing model AUROC from 0.76 to 0.80 on the internal test set (p-value <0.01). INTERPRETATION: Synthetic data supplementation significantly improves the performance and generalizability of pathology classifiers in medical imaging. FUNDING: Dr. Gichoya is a 2022 Robert Wood Johnson Foundation Harold Amos Medical Faculty Development Program and declares support from RSNA Health Disparities grant (#EIHD2204), Lacuna Fund (#67), Gordon and Betty Moore Foundation, NIH (NIBIB) MIDRC grant under contracts 75N92020C00008 and 75N92020C00021, and NHLBI Award Number R01HL167811.

Asunto(s)

Diagnóstico por Imagen , Curva ROC , Humanos , Diagnóstico por Imagen/métodos , Algoritmos , Radiografía Torácica/métodos , Procesamiento de Imagen Asistido por Computador/métodos , Bases de Datos Factuales , Área Bajo la Curva , Modelos Estadísticos

Understanding Radiological Journal Views and Policies on Large Language Models in Academic Writing.

Lee, Tai-Lin; Ding, Julia; Trivedi, Hari M; Gichoya, Judy W; Moon, John T; Li, Hanzhou.

J Am Coll Radiol ; 21(4): 678-682, 2024 Apr.

Artículo en Inglés | MEDLINE | ID: mdl-37558108

Asunto(s)

Lenguaje , Radiología , Escritura , Políticas

Reply.

Li, Hanzhou; Moon, John T; Trivedi, Hari M; Gichoya, Judy W.

J Am Coll Radiol ; 21(7): 988, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38122882

Decoding radiology reports: Potential application of OpenAI ChatGPT to enhance patient understanding of diagnostic reports.

Li, Hanzhou; Moon, John T; Iyer, Deepak; Balthazar, Patricia; Krupinski, Elizabeth A; Bercu, Zachary L; Newsome, Janice M; Banerjee, Imon; Gichoya, Judy W; Trivedi, Hari M.

Clin Imaging ; 101: 137-141, 2023 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-37336169

RESUMEN

PURPOSE: To evaluate the complexity of diagnostic radiology reports across major imaging modalities and the ability of ChatGPT (Early March 2023 Version, OpenAI, California, USA) to simplify these reports to the 8th grade reading level of the average U.S. adult. METHODS: We randomly sampled 100 radiographs (XR), 100 ultrasound (US), 100 CT, and 100 MRI radiology reports from our institution's database dated between 2022 and 2023 (N = 400). These were processed by ChatGPT using the prompt "Explain this radiology report to a patient in layman's terms in second person: ". Mean report length, Flesch reading ease score (FRES), and Flesch-Kincaid reading level (FKRL) were calculated for each report and ChatGPT output. T-tests were used to determine significance. RESULTS: Mean report length was 164 ± 117 words, FRES was 38.0 ± 11.8, and FKRL was 10.4 ± 1.9. FKRL was significantly higher for CT and MRI than for US and XR. Only 60/400 (15%) had a FKRL <8.5. The mean simplified ChatGPT output length was 103 ± 36 words, FRES was 83.5 ± 5.6, and FKRL was 5.8 ± 1.1. This reflects a mean decrease of 61 words (p < 0.01), increase in FRES of 45.5 (p < 0.01), and decrease in FKRL of 4.6 (p < 0.01). All simplified outputs had FKRL <8.5. DISCUSSION: Our study demonstrates the effective use of ChatGPT when tasked with simplifying radiology reports to below the 8th grade reading level. We report significant improvements in FRES, FKRL, and word count, the last of which requires modality-specific context.

Asunto(s)

Comprensión , Radiología , Adulto , Humanos , Radiografía , Imagen por Resonancia Magnética , Bases de Datos Factuales

Large Scale Semi-Automated Labeling of Routine Free-Text Clinical Records for Deep Learning.

Trivedi, Hari M; Panahiazar, Maryam; Liang, April; Lituiev, Dmytro; Chang, Peter; Sohn, Jae Ho; Chen, Yunn-Yi; Franc, Benjamin L; Joe, Bonnie; Hadley, Dexter.

J Digit Imaging ; 32(1): 30-37, 2019 02.

Artículo en Inglés | MEDLINE | ID: mdl-30128778

RESUMEN

Breast cancer is a leading cause of cancer death among women in the USA. Screening mammography is effective in reducing mortality, but has a high rate of unnecessary recalls and biopsies. While deep learning can be applied to mammography, large-scale labeled datasets, which are difficult to obtain, are required. We aim to remove many barriers of dataset development by automatically harvesting data from existing clinical records using a hybrid framework combining traditional NLP and IBM Watson. An expert reviewer manually annotated 3521 breast pathology reports with one of four outcomes: left positive, right positive, bilateral positive, negative. Traditional NLP techniques using seven different machine learning classifiers were compared to IBM Watson's automated natural language classifier. Techniques were evaluated using precision, recall, and F-measure. Logistic regression outperformed all other traditional machine learning classifiers and was used for subsequent comparisons. Both traditional NLP and Watson's NLC performed well for cases under 1024 characters with weighted average F-measures above 0.96 across all classes. Performance of traditional NLP was lower for cases over 1024 characters with an F-measure of 0.83. We demonstrate a hybrid framework using traditional NLP techniques combined with IBM Watson to annotate over 10,000 breast pathology reports for development of a large-scale database to be used for deep learning in mammography. Our work shows that traditional NLP and IBM Watson perform extremely well for cases under 1024 characters and can accelerate the rate of data annotation.

Asunto(s)

Neoplasias de la Mama/diagnóstico por imagen , Aprendizaje Profundo/estadística & datos numéricos , Registros Electrónicos de Salud/estadística & datos numéricos , Interpretación de Imagen Asistida por Computador/métodos , Mamografía/métodos , Mama/diagnóstico por imagen , Bases de Datos Factuales , Femenino , Humanos , Persona de Mediana Edad

RESUMEN

Asunto(s)

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA