Your browser doesn't support javascript.
loading
Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty.
Yang, Haixuan; Nepusz, Tamás; Paccanaro, Alberto.
Afiliação
  • Yang H; Department of Computer Science and Centre for Systems and Synthetic Biology, Royal Holloway, University of London, Egham, TW20 0EX, UK.
Bioinformatics ; 28(10): 1383-9, 2012 May 15.
Article em En | MEDLINE | ID: mdl-22522134
ABSTRACT
MOTIVATION Several measures have been recently proposed for quantifying the functional similarity between gene products according to well-structured controlled vocabularies where biological terms are organized in a tree or in a directed acyclic graph (DAG) structure. However, existing semantic similarity measures ignore two important facts. First, when calculating the similarity between two terms, they disregard the descendants of these terms. While this makes no difference when the ontology is a tree, we shall show that it has important consequences when the ontology is a DAG-this is the case, for example, with the Gene Ontology (GO). Second, existing similarity measures do not model the inherent uncertainty which comes from the fact that our current knowledge of the gene annotation and of the ontology structure is incomplete. Here, we propose a novel approach based on downward random walks that can be used to improve any of the existing similarity measures to exhibit these two properties. The approach is computationally efficient-random walks do not need to be simulated as we provide formulas to calculate their stationary distributions.

RESULTS:

To show that our approach can potentially improve any semantic similarity measure, we test it on six different semantic similarity

measures:

three commonly used measures by Resnik (1999), Lin (1998), and Jiang and Conrath (1997); and three recently proposed

measures:

simUI, simGIC by Pesquita et al. (2008); GraSM by Couto et al. (2007); and Couto and Silva (2011). We applied these improved measures to the GO annotations of the yeast Saccharomyces cerevisiae, and tested how they correlate with sequence similarity, mRNA co-expression and protein-protein interaction data. Our results consistently show that the use of downward random walks leads to more reliable similarity measures.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Semântica / Algoritmos / Vocabulário Controlado Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2012 Tipo de documento: Article País de afiliação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Semântica / Algoritmos / Vocabulário Controlado Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2012 Tipo de documento: Article País de afiliação: Reino Unido
...