Generative Pre-trained Transformer 4 makes cardiovascular magnetic resonance reports easy to understand.

Salam, Babak; Kravchenko, Dmitrij; Nowak, Sebastian; Sprinkart, Alois M; Weinhold, Leonie; Odenthal, Anna; Mesropyan, Narine; Bischoff, Leon M; Attenberger, Ulrike; Kuetting, Daniel L; Luetkens, Julian A; Isaak, Alexander

Affiliations

Salam B; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Kravchenko D; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Nowak S; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Sprinkart AM; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Weinhold L; University Hospital Bonn, Department of Medical Biometry, Informatics, and Epidemiology, Venusberg-Campus 1, 53127 Bonn, Germany.
Odenthal A; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Mesropyan N; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Bischoff LM; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Attenberger U; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Kuetting DL; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Luetkens JA; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.
Isaak A; Department of Diagnostic and Interventional Radiology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Quantitative Imaging Lab Bonn (QILaB), University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany. Electronic address: alexander.isaak@ukbonn.de.

J Cardiovasc Magn Reson; 26(1): 101035, 2024.

Article in English | MEDLINE | ID: mdl-38460841

ABSTRACT

BACKGROUND:

Patients are increasingly using Generative Pre-trained Transformer 4 (GPT-4) to better understand their own radiology findings.

PURPOSE:

To evaluate the performance of GPT-4 in transforming cardiovascular magnetic resonance (CMR) reports into text that is comprehensible to medical laypersons.

METHODS:

ChatGPT with the GPT-4 architecture was used to generate three explained versions of each of 20 different CMR reports (n = 60), always with the same prompt: "Explain the radiology report in a language understandable to a medical layperson". Two cardiovascular radiologists rated the GPT-4 reports for understandability, factual correctness, completeness of relevant findings, and lack of potential harm, while 13 medical laypersons rated the understandability of both the original and the GPT-4 reports. All ratings used a Likert scale (1 "strongly disagree", 5 "strongly agree"). Readability was measured using the Automated Readability Index (ARI). Linear mixed-effects models and the intraclass correlation coefficient (ICC) were used for statistical analysis; values are given as median [interquartile range].
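For orientation, the ARI can be sketched in a few lines of Python using the standard published formula, 4.71 × (characters per word) + 0.5 × (words per sentence) − 21.43. The abstract does not state which tool or tokenization rules the authors used, so the function name, the whitespace-based word splitting, and the punctuation-based sentence splitting below are illustrative assumptions and may not reproduce the reported scores.

```python
import re


def automated_readability_index(text: str) -> float:
    """Approximate ARI for a plain-text passage.

    Assumptions (not taken from the study): words are whitespace-separated
    tokens, characters are letters and digits only, and sentences end at
    runs of '.', '!' or '?'. Decimal numbers such as "0.5" would therefore
    be counted as a sentence break; a real tool handles this more carefully.
    """
    words = text.split()
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")

    # Count only alphanumeric characters so punctuation does not inflate the score.
    characters = sum(ch.isalnum() for word in words for ch in word)

    return (
        4.71 * characters / len(words)
        + 0.5 * len(words) / len(sentences)
        - 21.43
    )


if __name__ == "__main__":
    # Hypothetical example sentences, not taken from the study reports.
    technical = (
        "Late gadolinium enhancement demonstrates subepicardial "
        "hyperenhancement of the basal inferolateral myocardium."
    )
    simplified = "The heart looks normal. No scar was seen."
    print(round(automated_readability_index(technical), 2))
    print(round(automated_readability_index(simplified), 2))
```

Higher scores correspond to higher school grade levels, so under these assumptions the technical sentence should score well above the simplified one.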

RESULTS:

GPT-4 reports were generated in 52 ± 13 s on average. GPT-4 reports achieved a lower ARI score than the original reports (original 10 [9-12] vs GPT-4 5 [4-6]; p < 0.001) and were subjectively easier for laypersons to understand (original 1 [1] vs GPT-4 4 [4, 5]; p < 0.001). Eighteen of 20 (90%) standard CMR reports and 2/60 (3%) GPT-generated reports had an ARI score corresponding to the 8th grade level or higher. Radiologists' ratings of the GPT-4 reports reached high levels for correctness (5 [4, 5]), completeness (5 [5]), and lack of potential harm (5 [5]), with "strong agreement" for factual correctness in 94% (113/120) and for completeness of relevant findings in 81% (97/120) of ratings. Test-retest agreement for layperson understandability ratings between the three simplified reports generated from the same original report was substantial (ICC 0.62; p < 0.001). Interrater agreement between radiologists was almost perfect for lack of potential harm (ICC 0.93; p < 0.001) and moderate to substantial for completeness (ICC 0.76; p < 0.001) and factual correctness (ICC 0.55; p < 0.001).

CONCLUSION:

GPT-4 can reliably transform complex CMR reports into more understandable, layperson-friendly language while largely maintaining factual correctness and completeness, and can thus help convey patient-relevant radiology information in an easy-to-understand manner.

Subjects

Comprehension; Magnetic Resonance Imaging; Predictive Value of Tests; Humans; Reproducibility of Results; Observer Variation; Health Literacy; Patient Education as Topic; Cardiovascular Diseases/diagnostic imaging; Female; Male

Keywords

Artificial intelligence; Cardiovascular magnetic resonance; Generative Pre-trained Transformers; Large language models; Text simplification

Full text: 1 Database: MEDLINE Main subject: Magnetic Resonance Imaging / Predictive Value of Tests / Comprehension Limits: Female / Humans / Male Language: En Publication year: 2024 Document type: Article
