Density of points clustering, application to transcriptomic data analysis.

Wicker, Nicolas; Dembele, Doulaye; Raffelsberger, Wolfgang; Poch, Olivier

Wicker, Nicolas; Dembele, Doulaye; Raffelsberger, Wolfgang; Poch, Olivier.

Afiliação

Wicker N; LSIIT-ICPS (AXE E), UPRES-A CNRS 70005 Université Louis Pasteur, 67400 Illkirch, France.

Nucleic Acids Res ; 30(18): 3992-4000, 2002 Sep 15.

Article em En | MEDLINE | ID: mdl-12235383

RESUMO

With the increasing amount of data produced by high-throughput technologies in many fields of science, clustering has become an integral step in exploratory data analysis in order to group similar elements into classes. However, many clustering algorithms can only work properly if aided by human expertise. For example, one parameter which is crucial and often manually set is the number of clusters present in the analyzed set. We present a novel stopping rule to find the optimal number of clusters based on the comparison of the density of points inside the clusters and between them. The method is evaluated on synthetic as well as on real transcriptomic data and compared with two current methods. Finally, we illustrate its usefulness in the analysis of the expression profiles of promyelocytic cells before and after treatment with all-trans retinoic acid. Simultaneous clustering for gene regulation and absolute initial expression levels allowed the identification of numerous genes associated with signal transduction revealing the complexity of retinoic acid signaling.

Assuntos

Análise por Conglomerados; Transcrição Gênica/genética; Algoritmos; Perfilação da Expressão Gênica; Humanos; Leucemia/genética; Leucemia/patologia; Saccharomyces cerevisiae/genética

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Transcrição Gênica / Análise por Conglomerados Limite: Humans Idioma: En Ano de publicação: 2002 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google