FSN: Joint Entity and Relation Extraction Based on Filter Separator Network.

Dai, Qicai; Yang, Wenzhong; Wei, Fuyuan; He, Liang; Liao, Yuanyuan

Dai, Qicai; Yang, Wenzhong; Wei, Fuyuan; He, Liang; Liao, Yuanyuan.

Afiliação

Dai Q; School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.
Yang W; Xinjiang Key Laboratory of Multilingual Information Technology, Xinjiang University, Urumqi 830017, China.
Wei F; School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.
He L; Xinjiang Key Laboratory of Multilingual Information Technology, Xinjiang University, Urumqi 830017, China.
Liao Y; School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Entropy (Basel) ; 26(2)2024 Feb 12.

Article em En | MEDLINE | ID: mdl-38392417

ABSTRACT

ABSTRACT

Joint entity and relation extraction methods have attracted an increasing amount of attention recently due to their capacity to extract relational triples from intricate texts. However, most of the existing methods ignore the association and difference between the Named Entity Recognition (NER) subtask features and the Relation Extraction (RE) subtask features, which leads to an imbalance in the interaction between these two subtasks. To solve the above problems, we propose a new joint entity and relation extraction method, FSN. It contains a Filter Separator Network (FSN) module that employs a two-direction LSTM to filter and separate the information contained in a sentence and merges similar features through a splicing operation, thus solving the problem of the interaction imbalance between subtasks. In order to better extract the local feature information for each subtask, we designed a Named Entity Recognition Generation (NERG) module and a Relation Extraction Generation (REG) module by adopting the design idea of the decoder in Transformer and average pooling operations to better capture the entity boundary information in the sentence and the entity pair boundary information for each relation in the relational triple, respectively. Additionally, we propose a dynamic loss function that dynamically adjusts the learning weights of each subtask in each epoch according to the proportionality between each subtask, thus narrowing down the difference between the ideal and realistic results. We thoroughly evaluated our model on the SciERC dataset and the ACE2005 dataset. The experimental results demonstrate that our model achieves satisfactory results compared to the baseline model.

Palavras-chave

BERT; dynamic loss function; local feature extraction; subtask interaction balance

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Revista: Entropy (Basel) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Revista: Entropy (Basel) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China