Your browser doesn't support javascript.
loading
A span-based model for extracting overlapping PICO entities from randomized controlled trial publications.
Zhang, Gongbo; Zhou, Yiliang; Hu, Yan; Xu, Hua; Weng, Chunhua; Peng, Yifan.
Afiliação
  • Zhang G; Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States.
  • Zhou Y; Department of Population Health Sciences, Weill Cornell Medicine, New York, NY 10065, United States.
  • Hu Y; McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, United States.
  • Xu H; Section of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, CT 06510, United States.
  • Weng C; Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States.
  • Peng Y; Department of Population Health Sciences, Weill Cornell Medicine, New York, NY 10065, United States.
J Am Med Inform Assoc ; 31(5): 1163-1171, 2024 Apr 19.
Article em En | MEDLINE | ID: mdl-38471120
ABSTRACT

OBJECTIVES:

Extracting PICO (Populations, Interventions, Comparison, and Outcomes) entities is fundamental to evidence retrieval. We present a novel method, PICOX, to extract overlapping PICO entities. MATERIALS AND

METHODS:

PICOX first identifies entities by assessing whether a word marks the beginning or conclusion of an entity. Then, it uses a multi-label classifier to assign one or more PICO labels to a span candidate. PICOX was evaluated using 1 of the best-performing baselines, EBM-NLP, and 3 more datasets, ie, PICO-Corpus and randomized controlled trial publications on Alzheimer's Disease (AD) or COVID-19, using entity-level precision, recall, and F1 scores.

RESULTS:

PICOX achieved superior precision, recall, and F1 scores across the board, with the micro F1 score improving from 45.05 to 50.87 (P ≪.01). On the PICO-Corpus, PICOX obtained higher recall and F1 scores than the baseline and improved the micro recall score from 56.66 to 67.33. On the COVID-19 dataset, PICOX also outperformed the baseline and improved the micro F1 score from 77.10 to 80.32. On the AD dataset, PICOX demonstrated comparable F1 scores with higher precision when compared to the baseline.

CONCLUSION:

PICOX excels in identifying overlapping entities and consistently surpasses a leading baseline across multiple datasets. Ablation studies reveal that its data augmentation strategy effectively minimizes false positives and improves precision.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Contexto em Saúde: 2_ODS3 / 4_TD Base de dados: MEDLINE Assunto principal: Doença de Alzheimer / COVID-19 Limite: Humans Idioma: En Revista: J Am Med Inform Assoc Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Contexto em Saúde: 2_ODS3 / 4_TD Base de dados: MEDLINE Assunto principal: Doença de Alzheimer / COVID-19 Limite: Humans Idioma: En Revista: J Am Med Inform Assoc Ano de publicação: 2024 Tipo de documento: Article