Biclustering of gene expression data by an extension of mixtures of factor analyzers.
Int J Biostat
; 4(1): Article 3, 2008.
Article
em En
| MEDLINE
| ID: mdl-22462105
ABSTRACT
A challenge in microarray data analysis concerns discovering local structures composed by sets of genes that show homogeneous expression patterns across subsets of conditions. We present an extension of the mixture of factor analyzers model (MFA) allowing for simultaneous clustering of genes and conditions. The proposed model is rather flexible since it models the density of high-dimensional data assuming a mixture of Gaussian distributions with a particular omponent-specific covariance structure. Specifically, a binary and row stochastic matrix representing tissue membership is used to cluster tissues (experimental conditions), whereas the traditional mixture approach is used to define the gene clustering. An alternating expectation conditional maximization (AECM) algorithm is proposed for parameter estimation; experiments on simulated and real data show the efficiency of our method as a general approach to biclustering. The Matlab code of the algorithm is available upon request from authors.
Buscar no Google
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Análise Fatorial
/
Perfilação da Expressão Gênica
Tipo de estudo:
Prognostic_studies
Limite:
Adult
/
Aged
/
Female
/
Humans
/
Male
/
Middle aged
Idioma:
En
Revista:
Int J Biostat
Ano de publicação:
2008
Tipo de documento:
Article