Your browser doesn't support javascript.
loading
Rule-based natural language processing for automation of stroke data extraction: a validation study.
Gunter, Dane; Puac-Polanco, Paulo; Miguel, Olivier; Thornhill, Rebecca E; Yu, Amy Y X; Liu, Zhongyu A; Mamdani, Muhammad; Pou-Prom, Chloe; Aviv, Richard I.
Afiliação
  • Gunter D; The Ottawa Hospital Research Institute, Ottawa, ON, Canada.
  • Puac-Polanco P; Department of Radiology, Radiation Oncology and Medical Physics, University of Ottawa, The Ottawa Hospital Civic Campus Room C110, 1053 Carling Ave, Ottawa, ON, ON K1Y 4E9, Canada.
  • Miguel O; Department of Radiology, Radiation Oncology and Medical Physics, University of Ottawa, The Ottawa Hospital Civic Campus Room C110, 1053 Carling Ave, Ottawa, ON, ON K1Y 4E9, Canada.
  • Thornhill RE; Division of Medical Physics, Department of Radiology, Radiation Oncology and Medical Physics, University of Ottawa, Ottawa, ON, Canada.
  • Yu AYX; Department of Medicine (Neurology), University of Toronto, Sunnybrook Health Sciences Centre, Toronto, ON, Canada.
  • Liu ZA; Department of Medicine (Neurology), University of Toronto, Sunnybrook Health Sciences Centre, Toronto, ON, Canada.
  • Mamdani M; Department of Medicine, Unity Health Toronto, University of Toronto, Toronto, ON, Canada.
  • Pou-Prom C; Unity Health Toronto, Toronto, ON, Canada.
  • Aviv RI; The Ottawa Hospital Research Institute, Ottawa, ON, Canada. raviv@toh.ca.
Neuroradiology ; 64(12): 2357-2362, 2022 Dec.
Article em En | MEDLINE | ID: mdl-35913525
ABSTRACT

PURPOSE:

Data extraction from radiology free-text reports is time consuming when performed manually. Recently, more automated extraction methods using natural language processing (NLP) are proposed. A previously developed rule-based NLP algorithm showed promise in its ability to extract stroke-related data from radiology reports. We aimed to externally validate the accuracy of CHARTextract, a rule-based NLP algorithm, to extract stroke-related data from free-text radiology reports.

METHODS:

Free-text reports of CT angiography (CTA) and perfusion (CTP) studies of consecutive patients with acute ischemic stroke admitted to a regional stroke center for endovascular thrombectomy were analyzed from January 2015 to 2021. Stroke-related variables were manually extracted as reference standard from clinical reports, including proximal and distal anterior circulation occlusion, posterior circulation occlusion, presence of ischemia or hemorrhage, Alberta stroke program early CT score (ASPECTS), and collateral status. These variables were simultaneously extracted using a rule-based NLP algorithm. The NLP algorithm's accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV) were assessed.

RESULTS:

The NLP algorithm's accuracy was > 90% for identifying distal anterior occlusion, posterior circulation occlusion, hemorrhage, and ASPECTS. Accuracy was 85%, 74%, and 79% for proximal anterior circulation occlusion, presence of ischemia, and collateral status respectively. The algorithm confirmed the absence of variables from radiology reports with an 87-100% accuracy.

CONCLUSIONS:

Rule-based NLP has a moderate to good performance for stroke-related data extraction from free-text imaging reports. The algorithm's accuracy was affected by inconsistent report styles and lexicon among reporting radiologists.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Acidente Vascular Cerebral / AVC Isquêmico Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Neuroradiology Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Acidente Vascular Cerebral / AVC Isquêmico Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Neuroradiology Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Canadá