Your browser doesn't support javascript.
loading
Exploring species-based strategies for gene normalization.
Verspoor, Karin; Roeder, Christophe; Johnson, Helen L; Cohen, K Bretonnel; Baumgartner, William A; Hunter, Lawrence E.
Afiliación
  • Verspoor K; Center for Computational Pharmacology, University of Colorado Denver, Aurora, CO 80045, USA. karin.verspoor@ucdenver.edu
Article en En | MEDLINE | ID: mdl-20671318
ABSTRACT
We introduce a system developed for the BioCreative II.5 community evaluation of information extraction of proteins and protein interactions. The paper focuses primarily on the gene normalization task of recognizing protein mentions in text and mapping them to the appropriate database identifiers based on contextual clues. We outline a ""fuzzy" dictionary lookup approach to protein mention detection that matches regularized text to similarly regularized dictionary entries. We describe several different strategies for gene normalization that focus on species or organism mentions in the text, both globally throughout the document and locally in the immediate vicinity of a protein mention, and present the results of experimentation with a series of system variations that explore the effectiveness of the various normalization strategies, as well as the role of external knowledge sources. While our system was neither the best nor the worst performing system in the evaluation, the gene normalization strategies show promise and the system affords the opportunity to explore some of the variables affecting performance on the BCII.5 tasks.
Asunto(s)

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Reconocimiento de Normas Patrones Automatizadas / Biología Computacional / Mapeo de Interacción de Proteínas / Minería de Datos / Genes Tipo de estudio: Prognostic_studies Idioma: En Revista: ACM Trans Comput Biol Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2010 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Reconocimiento de Normas Patrones Automatizadas / Biología Computacional / Mapeo de Interacción de Proteínas / Minería de Datos / Genes Tipo de estudio: Prognostic_studies Idioma: En Revista: ACM Trans Comput Biol Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2010 Tipo del documento: Article País de afiliación: Estados Unidos