Qualifying Certainty in Radiology Reports through Deep Learning-Based Natural Language Processing.

Liu, F; Zhou, P; Baccei, S J; Masciocchi, M J; Amornsiripanitch, N; Kiefe, C I; Rosen, M P

Liu, F; Zhou, P; Baccei, S J; Masciocchi, M J; Amornsiripanitch, N; Kiefe, C I; Rosen, M P.

Afiliação

Liu F; From the Department of Population and Quantitative Health Sciences (F.L., C.I.K.), University of Massachusetts Medical School, Worcester, Massachusetts.
Zhou P; Department of Radiology (F.L., P.Z., S.J.B., M.J.M., N.A., M.P.R.), University of Massachusetts Medical School, Worcester, Massachusetts.
Baccei SJ; Department of Radiology (F.L., P.Z., S.J.B., M.J.M., N.A., M.P.R.), University of Massachusetts Medical School, Worcester, Massachusetts.
Masciocchi MJ; Department of Radiology (F.L., P.Z., S.J.B., M.J.M., N.A., M.P.R.), University of Massachusetts Medical School, Worcester, Massachusetts.
Amornsiripanitch N; Department of Radiology (S.J.B., M.J.M., N.A., M.P.R.), UMass Memorial Medical Center, Worcester, Massachusetts.
Kiefe CI; Department of Radiology (F.L., P.Z., S.J.B., M.J.M., N.A., M.P.R.), University of Massachusetts Medical School, Worcester, Massachusetts.
Rosen MP; Department of Radiology (S.J.B., M.J.M., N.A., M.P.R.), UMass Memorial Medical Center, Worcester, Massachusetts.

AJNR Am J Neuroradiol ; 42(10): 1755-1761, 2021 10.

Article em En | MEDLINE | ID: mdl-34413062

ABSTRACT

ABSTRACT

BACKGROUND AND

PURPOSE:

Communication gaps exist between radiologists and referring physicians in conveying diagnostic certainty. We aimed to explore deep learning-based bidirectional contextual language models for automatically assessing diagnostic certainty expressed in the radiology reports to facilitate the precision of communication. MATERIALS AND

METHODS:

We randomly sampled 594 head MR imaging reports from an academic medical center. We asked 3 board-certified radiologists to read sentences from the Impression section and assign each sentence 1 of the 4 certainty categories "Non-Definitive," "Definitive-Mild," "Definitive-Strong," "Other." Using the annotated 2352 sentences, we developed and validated a natural language-processing system based on the start-of-the-art bidirectional encoder representations from transformers (BERT), which can capture contextual uncertainty semantics beyond the lexicon level. Finally, we evaluated 3 BERT variant models and reported standard metrics including sensitivity, specificity, and area under the curve.

RESULTS:

A κ score of 0.74 was achieved for interannotator agreement on uncertainty interpretations among 3 radiologists. For the 3 BERT variant models, the biomedical variant (BioBERT) achieved the best macro-average area under the curve of 0.931 (compared with 0.928 for the BERT-base and 0.925 for the clinical variant [ClinicalBERT]) on the validation data. All 3 models yielded high macro-average specificity (93.13%-93.65%), while the BERT-base obtained the highest macro-average sensitivity of 79.46% (compared with 79.08% for BioBERT and 78.52% for ClinicalBERT). The BioBERT model showed great generalizability on the heldout test data with a macro-average sensitivity of 77.29%, specificity of 92.89%, and area under the curve of 0.93.

CONCLUSIONS:

A deep transfer learning model can be developed to reliably assess the level of uncertainty communicated in a radiology report.

Assuntos

Aprendizado Profundo; Radiologia; Humanos; Idioma; Processamento de Linguagem Natural; Radiografia

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Radiologia / Aprendizado Profundo Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Humans Idioma: En Revista: AJNR Am J Neuroradiol Ano de publicação: 2021 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google