Improving the inter-corpora compatibility for protein annotations.
J Bioinform Comput Biol; 8(5): 901-16, 2010 Oct.
Article | En | MEDLINE | ID: mdl-20981894
ABSTRACT
Although there are several corpora with protein annotation, incompatibility between the annotations in different corpora remains a problem that hinders the progress of automatic recognition of protein names in biomedical literature. Here, we report on our efforts to find a solution to the incompatibility issue, and to improve the compatibility between two representative protein-annotated corpora: the GENIA corpus and the GENETAG corpus. In a comparative study, we improve our insight into the two corpora, and a series of experimental results show that most of the incompatibility can be removed.
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Proteins
/
Data Mining
Language:
En
Journal:
J Bioinform Comput Biol
Journal subject:
BIOLOGY
/
MEDICAL INFORMATICS
Year:
2010
Document type:
Article
Affiliation country:
Japan