Your browser doesn't support javascript.
loading
Named entity recognition of rice genes and phenotypes based on BiGRU neural networks.
Wu, Kangjie; Xu, Liqian; Li, Xinxiang; Zhang, Youhua; Yue, Zhenyu; Gao, Yujia; Chen, Yiqiong.
Afiliação
  • Wu K; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Xu L; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Li X; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Zhang Y; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Yue Z; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Gao Y; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
  • Chen Y; School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China. Electronic address: ch_cyq@163.com.
Comput Biol Chem ; 108: 107977, 2024 Feb.
Article em En | MEDLINE | ID: mdl-37995493
ABSTRACT
Named Entity Recognition (NER) is a fundamental but crucial task in natural language processing (NLP) and big data analysis, with wide application range. NER for rice genes and phenotypes is a technique to identify genes and phenotypes from a large amount of text. NER for rice genes and phenotypes can facilitate the acquisition of information in the field of crops and provide references for our research on higher quality crops. At the same time, named entity recognition still faces many challenges. In this paper, we propose an improved bidirectional gated recurrent unit neural network (BI-GRU) method, which is used to automatically identify the required entities (i.e. gene names, rice phenotypes) from relevant rice literature and patents. The neural network model is combined with the Softmax function to directly output the probabilities of labels, forming the BI-GRU-SF model. With the ability of deep learning methods, the semantic information in the context can be learned without the need for feature engineering. Finally, we conducted experiments, and the results showed that our proposed model provided better performance compared to other models. All datasets and resource codes of BI-GRU-SF are available at https//github.com/qqeeqq/NER for academic use.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Oryza Idioma: En Revista: Comput Biol Chem Assunto da revista: BIOLOGIA / INFORMATICA MEDICA / QUIMICA Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Oryza Idioma: En Revista: Comput Biol Chem Assunto da revista: BIOLOGIA / INFORMATICA MEDICA / QUIMICA Ano de publicação: 2024 Tipo de documento: Article