Natural Language Processing of Large-Scale Structured Radiology Reports to Identify Oncologic Patients With or Without Splenomegaly Over a 10-Year Period.

Sun, Simon; Lupton, Kaelan; Batch, Karen; Nguyen, Huy; Gazit, Lior; Gangai, Natalie; Cho, Jessica; Nicholas, Kevin; Zulkernine, Farhana; Sevilimedu, Varadan; Simpson, Amber; Do, Richard K G

Sun, Simon; Lupton, Kaelan; Batch, Karen; Nguyen, Huy; Gazit, Lior; Gangai, Natalie; Cho, Jessica; Nicholas, Kevin; Zulkernine, Farhana; Sevilimedu, Varadan; Simpson, Amber; Do, Richard K G.

Afiliação

Sun S; Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY.
Lupton K; School of Computing, Queen's University, Kingston, Ontario, Canada.
Batch K; School of Computing, Queen's University, Kingston, Ontario, Canada.
Nguyen H; Strategy and Innovation, Memorial Sloan Kettering Cancer Center, New York, NY.
Gazit L; Strategy and Innovation, Memorial Sloan Kettering Cancer Center, New York, NY.
Gangai N; Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY.
Cho J; Strategy and Innovation, Memorial Sloan Kettering Cancer Center, New York, NY.
Nicholas K; Strategy and Innovation, Memorial Sloan Kettering Cancer Center, New York, NY.
Zulkernine F; School of Computing, Queen's University, Kingston, Ontario, Canada.
Sevilimedu V; Biostatistics Service, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY.
Simpson A; School of Computing, Queen's University, Kingston, Ontario, Canada.
Do RKG; Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY.

JCO Clin Cancer Inform ; 6: e2100104, 2022 01.

Article em En | MEDLINE | ID: mdl-34990210

ABSTRACT

ABSTRACT

PURPOSE:

To assess the accuracy of a natural language processing (NLP) model in extracting splenomegaly described in patients with cancer in structured computed tomography radiology reports.

METHODS:

In this retrospective study between July 2009 and April 2019, 3,87,359 consecutive structured radiology reports for computed tomography scans of the chest, abdomen, and pelvis from 91,665 patients spanning 30 types of cancer were included. A randomized sample of 2,022 reports from patients with colorectal cancer, hepatobiliary cancer (HB), leukemia, Hodgkin lymphoma (HL), and non-HL patients was manually annotated as positive or negative for splenomegaly. NLP model training/testing was performed on 1,617/405 reports, and a new validation set of 400 reports from all cancer subtypes was used to test NLP model accuracy, precision, and recall. Overall survival was compared between the patient groups (with and without splenomegaly) using Kaplan-Meier curves.

RESULTS:

The final cohort included 3,87,359 reports from 91,665 patients (mean age 60.8 years; 51.2% women). In the testing set, the model achieved accuracy of 92.1%, precision of 92.2%, and recall of 92.1% for splenomegaly. In the validation set, accuracy, precision, and recall were 93.8%, 92.9%, and 86.7%, respectively. In the entire cohort, splenomegaly was most frequent in patients with leukemia (32.5%), HB (17.4%), non-HL (9.1%), colorectal cancer (8.5%), and HL (5.6%). A splenomegaly label was associated with an increased risk of mortality in the entire cohort (hazard ratio 2.10; 95% CI, 1.98 to 2.22; P < .001).

CONCLUSION:

Automated splenomegaly labeling by NLP of radiology report demonstrates good accuracy, precision, and recall. Splenomegaly is most frequently reported in patients with leukemia, followed by patients with HB.

Assuntos

Neoplasias Colorretais; Leucemia; Radiologia; Feminino; Humanos; Masculino; Pessoa de Meia-Idade; Processamento de Linguagem Natural; Estudos Retrospectivos; Esplenomegalia/diagnóstico por imagem; Esplenomegalia/etiologia

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Radiologia / Neoplasias Colorretais / Leucemia Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Radiologia / Neoplasias Colorretais / Leucemia Idioma: En Ano de publicação: 2022 Tipo de documento: Article