Your browser doesn't support javascript.
loading
Successful Development of a Natural Language Processing Algorithm for Pancreatic Neoplasms and Associated Histologic Features.
Harrison, Jon Michael; Yala, Adam; Mikhael, Peter; Roldan, Jorge; Ciprani, Debora; Michelakos, Theodoros; Bolm, Louisa; Qadan, Motaz; Ferrone, Cristina; Fernandez-Del Castillo, Carlos; Lillemoe, Keith Douglas; Santus, Enrico; Hughes, Kevin.
Affiliation
  • Harrison JM; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Yala A; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Mass.
  • Mikhael P; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Mass.
  • Roldan J; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Ciprani D; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Michelakos T; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Bolm L; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Qadan M; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Ferrone C; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Fernandez-Del Castillo C; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Lillemoe KD; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
  • Santus E; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Mass.
  • Hughes K; From the Department of GI and General Surgery, Massachusetts General Hospital, Boston.
Pancreas ; 52(4): e219-e223, 2023 Apr 01.
Article in En | MEDLINE | ID: mdl-37716007
ABSTRACT

OBJECTIVES:

Natural language processing (NLP) algorithms can interpret unstructured text for commonly used terms and phrases. Pancreatic pathologies are diverse and include benign and malignant entities with associated histologic features. Creating a pancreas NLP algorithm can aid in electronic health record coding as well as large database creation and curation.

METHODS:

Text-based pancreatic anatomic and cytopathologic reports for pancreatic cancer, pancreatic ductal adenocarcinoma, neuroendocrine tumor, intraductal papillary neoplasm, tumor dysplasia, and suspicious findings were collected. This dataset was split 80/20 for model training and development. A separate set was held out for testing purposes. We trained using convolutional neural network to predict each heading.

RESULTS:

Over 14,000 reports were obtained from the Mass General Brigham Healthcare System electronic record. Of these, 1252 reports were used for algorithm development. Final accuracy and F1 scores relative to the test set ranged from 95% and 98% for each queried pathology. To understand the dependence of our results to training set size, we also generated learning curves. Scoring metrics improved as more reports were submitted for training; however, some queries had high index performance.

CONCLUSIONS:

Natural language processing algorithms can be used for pancreatic pathologies. Increased training volume, nonoverlapping terminology, and conserved text structure improve NLP algorithm performance.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Pancreatic Neoplasms / Natural Language Processing Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Pancreas Journal subject: GASTROENTEROLOGIA Year: 2023 Document type: Article

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Pancreatic Neoplasms / Natural Language Processing Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Pancreas Journal subject: GASTROENTEROLOGIA Year: 2023 Document type: Article