Your browser doesn't support javascript.
loading
Automatic extraction of drug indications from FDA drug labels.
Khare, Ritu; Wei, Chih-Hsuan; Lu, Zhiyong.
Afiliação
  • Khare R; National Center for Biotechnology Information (NCBI), NIH, Bethesda, MD 20894.
  • Wei CH; National Center for Biotechnology Information (NCBI), NIH, Bethesda, MD 20894.
  • Lu Z; National Center for Biotechnology Information (NCBI), NIH, Bethesda, MD 20894.
AMIA Annu Symp Proc ; 2014: 787-94, 2014.
Article em En | MEDLINE | ID: mdl-25954385
ABSTRACT
Extracting computable indications, i.e. drug-disease treatment relationships, from narrative drug resources is the key for building a gold standard drug indication repository. The two steps to the extraction problem are disease named-entity recognition (NER) to identify disease mentions from a free-text description and disease classification to distinguish indications from other disease mentions in the description. While there exist many tools for disease NER, disease classification is mostly achieved through human annotations. For example, we recently resorted to human annotations to prepare a corpus, LabeledIn, capturing structured indications from the drug labels submitted to FDA by pharmaceutical companies. In this study, we present an automatic end-to-end framework to extract structured and normalized indications from FDA drug labels. In addition to automatic disease NER, a key component of our framework is a machine learning method that is trained on the LabeledIn corpus to classify the NER-computed disease mentions as "indication vs. non-indication." Through experiments with 500 drug labels, our end-to-end system delivered 86.3% F1-measure in drug indication extraction, with 17% improvement over baseline. Further analysis shows that the indication classifier delivers a performance comparable to human experts and that the remaining errors are mostly due to disease NER (more than 50%). Given its performance, we conclude that our end-to-end approach has the potential to significantly reduce human annotation costs.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Doença / Armazenamento e Recuperação da Informação / Rotulagem de Medicamentos Limite: Humans País/Região como assunto: America do norte Idioma: En Ano de publicação: 2014 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Doença / Armazenamento e Recuperação da Informação / Rotulagem de Medicamentos Limite: Humans País/Região como assunto: America do norte Idioma: En Ano de publicação: 2014 Tipo de documento: Article