Cross-type biomedical named entity recognition with deep multi-task learning.

Wang, Xuan; Zhang, Yu; Ren, Xiang; Zhang, Yuhao; Zitnik, Marinka; Shang, Jingbo; Langlotz, Curtis; Han, Jiawei

Wang, Xuan; Zhang, Yu; Ren, Xiang; Zhang, Yuhao; Zitnik, Marinka; Shang, Jingbo; Langlotz, Curtis; Han, Jiawei.

Afiliação

Wang X; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
Zhang Y; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
Ren X; Department of Computer Science, University of Southern California, Los Angeles, CA, USA.
Zhang Y; Department of Radiology, School of Medicine, Stanford University, Stanford, CA, USA.
Zitnik M; Department of Computer Science, Stanford University, Stanford, CA, USA.
Shang J; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
Langlotz C; Department of Radiology, School of Medicine, Stanford University, Stanford, CA, USA.
Han J; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Bioinformatics ; 35(10): 1745-1752, 2019 05 15.

Article em En | MEDLINE | ID: mdl-30307536

ABSTRACT

ABSTRACT

MOTIVATION State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases. Although recent studies explored using neural network models for BioNER to free experts from manual feature engineering, the performance remains limited by the available training data for each entity type.

RESULTS:

We propose a multi-task learning framework for BioNER to collectively use the training data of different types of entities and improve the performance on each of them. In experiments on 15 benchmark BioNER datasets, our multi-task model achieves substantially better performance compared with state-of-the-art BioNER systems and baseline neural sequence labeling models. Further analysis shows that the large performance gains come from sharing character- and word-level information among relevant biomedical entities across differently labeled corpora. AVAILABILITY AND IMPLEMENTATION Our source code is available at https//github.com/yuzhimanhua/lm-lstm-crf. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Assuntos

Aprendizado Profundo; Redes Neurais de Computação; Benchmarking; Software

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Aprendizado Profundo Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google