Your browser doesn't support javascript.
loading
Extracting Systemic Anticancer Therapy and Response Information From Clinical Notes Following the RECIST Definition.
Zuo, Xu; Kumar, Ashok; Shen, Shuhan; Li, Jianfu; Cong, Grace; Jin, Edward; Chen, Qingxia; Warner, Jeremy L; Yang, Ping; Xu, Hua.
Afiliación
  • Zuo X; University of Texas Health Science Center, Houston, TX.
  • Kumar A; Mayo Clinic, Scottsdale, AZ.
  • Shen S; Mayo Clinic, Scottsdale, AZ.
  • Li J; University of Texas Health Science Center, Houston, TX.
  • Cong G; University of Maryland, College Park, College Park, MD.
  • Jin E; University of Southern California, Los Angeles, CA.
  • Chen Q; Vanderbilt University Medical Center, Nashville, TN.
  • Warner JL; Vanderbilt University Medical Center, Nashville, TN.
  • Yang P; Legorreta Cancer Center at Brown University, Providence, RI.
  • Xu H; Lifespan Cancer Institute, Providence, RI.
JCO Clin Cancer Inform ; 8: e2300166, 2024 Jun.
Article en En | MEDLINE | ID: mdl-38885475
ABSTRACT

PURPOSE:

The RECIST guidelines provide a standardized approach for evaluating the response of cancer to treatment, allowing for consistent comparison of treatment efficacy across different therapies and patients. However, collecting such information from electronic health records manually can be extremely labor-intensive and time-consuming because of the complexity and volume of clinical notes. The aim of this study is to apply natural language processing (NLP) techniques to automate this process, minimizing manual data collection efforts, and improving the consistency and reliability of the results.

METHODS:

We proposed a complex, hybrid NLP system that automates the process of extracting, linking, and summarizing anticancer therapy and associated RECIST-like responses from narrative clinical text. The system consists of multiple machine learning-/deep learning-based and rule-based modules for diverse NLP tasks such as named entity recognition, assertion classification, relation extraction, and text normalization, to address different challenges associated with anticancer therapy and response information extraction. We then evaluated the system performances on two independent test sets from different institutions to demonstrate its effectiveness and generalizability.

RESULTS:

The system used domain-specific language models, BioBERT and BioClinicalBERT, for high-performance therapy mentions identification and RECIST responses extraction and categorization. The best-performing model achieved a 0.66 score in linking therapy and RECIST response mentions, with end-to-end performance peaking at 0.74 after relation normalization, indicating substantial efficacy with room for improvement.

CONCLUSION:

We developed, implemented, and tested an information extraction system from clinical notes for cancer treatment and efficacy assessment information. We expect this system will support future cancer research, particularly oncologic studies that focus on efficiently assessing the effectiveness and reliability of cancer therapeutics.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Registros Electrónicos de Salud / Criterios de Evaluación de Respuesta en Tumores Sólidos / Neoplasias Límite: Humans Idioma: En Revista: JCO Clin Cancer Inform Año: 2024 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Registros Electrónicos de Salud / Criterios de Evaluación de Respuesta en Tumores Sólidos / Neoplasias Límite: Humans Idioma: En Revista: JCO Clin Cancer Inform Año: 2024 Tipo del documento: Article