Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Más filtros

Banco de datos
Tipo del documento
Asunto de la revista
País de afiliación
Intervalo de año de publicación
1.
BMC Bioinformatics ; 13: 161, 2012 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-22776079

RESUMEN

BACKGROUND: Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. RESULTS: This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. CONCLUSIONS: As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.


Asunto(s)
Minería de Datos , Procesamiento de Lenguaje Natural , Vocabulario Controlado , Biología Computacional/métodos , Bases de Datos Factuales , Almacenamiento y Recuperación de la Información/métodos , Semántica
2.
EMBO J ; 23(7): 1422-32, 2004 Apr 07.
Artículo en Inglés | MEDLINE | ID: mdl-15029249

RESUMEN

With their ability to depolymerize microtubules (MTs), KinI kinesins are the rogue members of the kinesin family. Here we present the 1.6 A crystal structure of a KinI motor core from Plasmodium falciparum, which is sufficient for depolymerization in vitro. Unlike all published kinesin structures to date, nucleotide is not present, and there are noticeable differences in loop regions L6 and L10 (the plus-end tip), L2 and L8 and in switch II (L11 and helix4); otherwise, the pKinI structure is very similar to previous kinesin structures. KinI-conserved amino acids were mutated to alanine, and studied for their effects on depolymerization and ATP hydrolysis. Notably, mutation of three residues in L2 appears to primarily affect depolymerization, rather than general MT binding or ATP hydrolysis. The results of this study confirm the suspected importance of loop 2 for KinI function, and provide evidence that KinI is specialized to hydrolyze ATP after initiating depolymerization.


Asunto(s)
Cinesinas/química , Microtúbulos/metabolismo , Proteínas Motoras Moleculares/química , Proteínas Motoras Moleculares/metabolismo , Plasmodium falciparum/química , Animales , Sitios de Unión , Cristalografía por Rayos X , Humanos , Cinesinas/genética , Cinesinas/metabolismo , Cinesinas/ultraestructura , Microtúbulos/ultraestructura , Modelos Moleculares , Proteínas Motoras Moleculares/ultraestructura , Datos de Secuencia Molecular , Plasmodium falciparum/citología , Plasmodium falciparum/metabolismo , Estructura Terciaria de Proteína
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA