SETH detects and normalizes genetic variants in text.
Bioinformatics
; 32(18): 2883-5, 2016 09 15.
Article
en En
| MEDLINE
| ID: mdl-27256315
ABSTRACT
UNLABELLED Descriptions of genetic variations and their effect are widely spread across the biomedical literature. However, finding all mentions of a specific variation, or all mentions of variations in a specific gene, is difficult to achieve due to the many ways such variations are described. Here, we describe SETH, a tool for the recognition of variations from text and their subsequent normalization to dbSNP or UniProt. SETH achieves high precision and recall on several evaluation corpora of PubMed abstracts. It is freely available and encompasses stand-alone scripts for isolated application and evaluation as well as a thorough documentation for integration into other applications. AVAILABILITY AND IMPLEMENTATION SETH is released under the Apache 2.0 license and can be downloaded from http//rockt.github.io/SETH/ CONTACT thomas@informatik.hu-berlin.de or leser@informatik.hu-berlin.de.
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Asunto principal:
Variación Genética
/
Minería de Datos
/
Curaduría de Datos
Límite:
Humans
Idioma:
En
Revista:
Bioinformatics
Asunto de la revista:
INFORMATICA MEDICA
Año:
2016
Tipo del documento:
Article
País de afiliación:
Alemania