Discovering significant and interpretable patterns from multifactorial DNA microarray data with poor replication.
J Biomed Inform
; 37(4): 260-8, 2004 Aug.
Article
in En
| MEDLINE
| ID: mdl-15465479
MOTIVATION: Multivariate analyses are advantageous for the simultaneous testing of the separate and combined effects of many variables and of their interactions. In factorial designs with many factors and/or levels, however, sufficient replication is often prohibitively costly. Furthermore, complicated statements are often required for the biological interpretation of the higher-order interactions determined by standard statistical techniques like analysis of variance. RESULTS: Because we are usually interested in finding factor-specific effects or their interactions, we assumed that the observed expression profile of a gene is a manifestation of an underlying factor-specific generative pattern (FSGP) combined with noise. Thus, a genetic algorithm was created to find the nearest FSGP for each expression profile. We then measured the distance between each profile and the corresponding nearest FSGP. Permutation testing for the distance measures successfully identified those genes with statistically significant profiles, thus yielding straightforward biological interpretations. Association networks of genes, drugs, and cell lines were created as tripartite graphs, representing significant and interpretable relations, by using a microarray experiment of gastric-cancer cell lines with a factorial design and no replication. The proposed method may benefit the combined analysis of heterogeneous expression data from the growing public repositories.
Search on Google
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Stomach Neoplasms
/
Algorithms
/
Pattern Recognition, Automated
/
Artificial Intelligence
/
Oligonucleotide Array Sequence Analysis
/
Gene Expression Profiling
Type of study:
Diagnostic_studies
/
Evaluation_studies
/
Prognostic_studies
Limits:
Humans
Language:
En
Journal:
J Biomed Inform
Journal subject:
INFORMATICA MEDICA
Year:
2004
Document type:
Article
Country of publication:
United States