Your browser doesn't support javascript.
loading
Development of message passing-based graph convolutional networks for classifying cancer pathology reports.
Yoon, Hong-Jun; Klasky, Hilda B; Blanchard, Andrew E; Christian, J Blair; Durbin, Eric B; Wu, Xiao-Cheng; Stroup, Antoinette; Doherty, Jennifer; Coyle, Linda; Penberthy, Lynne; Tourassi, Georgia D.
Afiliação
  • Yoon HJ; Computational Sciences and Engineering Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, Tennessee, 37830, USA. yoonh@ornl.gov.
  • Klasky HB; Computational Sciences and Engineering Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, Tennessee, 37830, USA.
  • Blanchard AE; Computational Sciences and Engineering Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, Tennessee, 37830, USA.
  • Christian JB; Computational Sciences and Engineering Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, Tennessee, 37830, USA.
  • Durbin EB; College of Medicine, University of Kentucky, Lexington, Kentucky, 24105, USA.
  • Wu XC; Louisiana Tumor Registry, Louisiana State University Health Sciences Center, School of Public Health, New Orleans, Louisiana, 70112, USA.
  • Stroup A; New Jersey State Cancer Registry, Rutgers Cancer Institute of New Jersey, New Brunswick, New Jersey, 08901, USA.
  • Doherty J; Utah Cancer Registry, Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah, 84132, USA.
  • Coyle L; Information Management Services, Inc., Calverton, Maryland, 20705, USA.
  • Penberthy L; Surveillance Research Program, Division of Cancer Control and Population Sciences National Cancer Institute, Bethesda, MD, 20814, USA.
  • Tourassi GD; National Center for Computational Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee, 37830, USA.
BMC Med Inform Decis Mak ; 24(Suppl 5): 262, 2024 Sep 17.
Article em En | MEDLINE | ID: mdl-39289714
ABSTRACT

BACKGROUND:

Applying graph convolutional networks (GCN) to the classification of free-form natural language texts leveraged by graph-of-words features (TextGCN) was studied and confirmed to be an effective means of describing complex natural language texts. However, the text classification models based on the TextGCN possess weaknesses in terms of memory consumption and model dissemination and distribution. In this paper, we present a fast message passing network (FastMPN), implementing a GCN with message passing architecture that provides versatility and flexibility by allowing trainable node embedding and edge weights, helping the GCN model find the better solution. We applied the FastMPN model to the task of clinical information extraction from cancer pathology reports, extracting the following six properties main site, subsite, laterality, histology, behavior, and grade.

RESULTS:

We evaluated the clinical task performance of the FastMPN models in terms of micro- and macro-averaged F1 scores. A comparison was performed with the multi-task convolutional neural network (MT-CNN) model. Results show that the FastMPN model is equivalent to or better than the MT-CNN.

CONCLUSIONS:

Our implementation revealed that our FastMPN model, which is based on the PyTorch platform, can train a large corpus (667,290 training samples) with 202,373 unique words in less than 3 minutes per epoch using one NVIDIA V100 hardware accelerator. Our experiments demonstrated that using this implementation, the clinical task performance scores of information extraction related to tumors from cancer pathology reports were highly competitive.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Redes Neurais de Computação / Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Redes Neurais de Computação / Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article