Unification of functional annotation descriptions using text mining.
Queirós, Pedro; Novikova, Polina; Wilmes, Paul; May, Patrick.
Affiliation
  • Queirós P; Systems Ecology, Esch-sur-Alzette, Luxembourg.
  • Novikova P; Systems Ecology, Esch-sur-Alzette, Luxembourg.
  • Wilmes P; Systems Ecology, Esch-sur-Alzette, Luxembourg.
  • May P; Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 4362, Esch-sur-Alzette, Luxembourg.
Biol Chem; 402(8): 983-990, 2021-07-27.
Article in En | MEDLINE | ID: mdl-33984880
ABSTRACT
A common approach to genome annotation uses homology-based tools to predict the functional role of proteins. The quality of functional annotations depends on the reference data used; as such, choosing the appropriate sources is crucial. Unfortunately, no single reference data source can be universally considered the gold standard, so using multiple references could potentially increase annotation quality and coverage. However, this comes with challenges, particularly due to the introduction of redundant and exclusive annotations. Through text mining, it is possible to identify highly similar functional descriptions, thus strengthening confidence in the final protein functional annotation and providing a redundancy-free output. Here we present UniFunc, a text mining approach that detects similar functional descriptions with high precision. UniFunc was built as a small module and can be used independently or integrated into protein function annotation pipelines. By removing the need to individually analyse and compare annotation results, UniFunc streamlines the complementary use of multiple reference datasets.
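
The general idea of detecting similar functional descriptions can be illustrated with a minimal sketch. This is not UniFunc's actual algorithm or API, which the abstract does not describe; it is a generic bag-of-words cosine similarity, and the function names (`tokenize`, `cosine_similarity`, `group_similar`), the stop-word list, and the 0.8 threshold are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list for illustration; not taken from UniFunc.
STOP_WORDS = {"the", "of", "and", "a", "an", "in", "to", "for", "with"}


def tokenize(description):
    """Lowercase a description, split it into alphanumeric tokens, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", description.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def cosine_similarity(desc_a, desc_b):
    """Cosine similarity between the token-count vectors of two descriptions."""
    counts_a = Counter(tokenize(desc_a))
    counts_b = Counter(tokenize(desc_b))
    shared = set(counts_a) & set(counts_b)
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty description is similar to nothing
    return dot / (norm_a * norm_b)


def group_similar(descriptions, threshold=0.8):
    """Greedy grouping: a description joins the first group whose
    representative (first member) meets the assumed similarity threshold."""
    groups = []
    for desc in descriptions:
        for group in groups:
            if cosine_similarity(desc, group[0]) >= threshold:
                group.append(desc)
                break
        else:
            groups.append([desc])
    return groups


if __name__ == "__main__":
    annotations = [
        "ATP synthase subunit alpha",   # e.g. from one reference source
        "ATP synthase alpha subunit",   # same function, different word order
        "DNA polymerase III",
    ]
    for group in group_similar(annotations):
        print(group)
```

In this sketch, descriptions that land in the same group would be treated as redundant annotations of one function, while singletons correspond to annotations exclusive to one source. A realistic approach would likely also need to handle identifiers, synonyms, and term weighting, which this example ignores.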
Full text: 1 Database: MEDLINE Main subject: Data Mining Language: En Journal: Biol Chem Journal subject: BIOCHEMISTRY Year of publication: 2021 Document type: Article Country of affiliation: Luxembourg