Your browser doesn't support javascript.
loading
A Multi-Label Text Classifier at Publication Level Based on "PubMedBERT + TextRNN" for Cancer Literature.
Ying, Zhang; Guanghui, Xia; Xiaoying, Li; Shishi, Tang.
Afiliação
  • Ying Z; Institute of Medical Information, Chinese Academy of Medical Sciences, China.
  • Guanghui X; Institute of Medical Information, Chinese Academy of Medical Sciences, China.
  • Xiaoying L; Institute of Medical Information, Chinese Academy of Medical Sciences, China.
  • Shishi T; Institute of Medical Information, Chinese Academy of Medical Sciences, China.
Stud Health Technol Inform ; 316: 374-375, 2024 Aug 22.
Article em En | MEDLINE | ID: mdl-39176755
ABSTRACT
There is a rapid growth in the volume of data in the cancer field and fine-grained classification is in high demand especially for interdisciplinary and collaborative research. There is thus a need to establish a multi-label classifier with higher resolution to reduce the burden of screening articles for clinical relevance. This research trains a multi-label classifier with scalability for classifying literature on cancer research directly at the publication level. Firstly, a corpus was divided into a training set and a testing set at a ratio of 73. Secondly, we compared the performance of classifiers developed by "PubMedBERT + TextRNN" and "BioBERT + TextRNN" with ICRP CT. Finally, the classifier was obtained based on the optimal combination "PubMedBERT + TextRNN", with P= 0.952014, R=0.936696, F1=0.931664. The quantitative comparisons demonstrate that the resulting classifier is fit for high-resolution classification of cancer literature at the publication level to support accurate retrieving and academic statistics.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article