Your browser doesn't support javascript.
loading
Extending bicluster analysis to annotate unclassified ORFs and predict novel functional modules using expression data.
Bryan, Kenneth; Cunningham, Pádraig.
Afiliação
  • Bryan K; Complex and Adaptive Systems Laboratory, University College Dublin, Belfield, Dublin 4, Ireland. kenneth.bryan@ucd.ie
BMC Genomics ; 9 Suppl 2: S20, 2008 Sep 16.
Article em En | MEDLINE | ID: mdl-18831786
ABSTRACT

BACKGROUND:

Microarrays have the capacity to measure the expressions of thousands of genes in parallel over many experimental samples. The unsupervised classification technique of bicluster analysis has been employed previously to uncover gene expression correlations over subsets of samples with the aim of providing a more accurate model of the natural gene functional classes. This approach also has the potential to aid functional annotation of unclassified open reading frames (ORFs). Until now this aspect of biclustering has been under-explored. In this work we illustrate how bicluster analysis may be extended into a 'semi-supervised' ORF annotation approach referred to as BALBOA.

RESULTS:

The efficacy of the BALBOA ORF classification technique is first assessed via cross validation and compared to a multi-class k-Nearest Neighbour (kNN) benchmark across three independent gene expression datasets. BALBOA is then used to assign putative functional annotations to unclassified yeast ORFs. These predictions are evaluated using existing experimental and protein sequence information. Lastly, we employ a related semi-supervised method to predict the presence of novel functional modules within yeast.

CONCLUSION:

In this paper we demonstrate how unsupervised classification methods, such as bicluster analysis, may be extended using of available annotations to form semi-supervised approaches within the gene expression analysis domain. We show that such methods have the potential to improve upon supervised approaches and shed new light on the functions of unclassified ORFs and their co-regulation.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Modelos Estatísticos / Fases de Leitura Aberta / Perfilação da Expressão Gênica Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2008 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Modelos Estatísticos / Fases de Leitura Aberta / Perfilação da Expressão Gênica Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2008 Tipo de documento: Article