RESUMO
Immuno-precipitation of protein-DNA complexes followed by microarray hybridization is a powerful and cost-effective technology for discovering protein-DNA binding events at the genome scale. It is still an unresolved challenge to comprehensively, accurately and sensitively extract binding event information from the produced data. We have developed a novel strategy composed of an information-preserving signal-smoothing procedure, higher order derivative analysis and application of the principle of maximum entropy to address this challenge. Importantly, our method does not require any input parameters to be specified by the user. Using genome-scale binding data of two Escherichia coli global transcription regulators for which a relatively large number of experimentally supported sites are known, we show that â¼90% of known sites were resolved to within four probes, or â¼88 bp. Over half of the sites were resolved to within two probes, or â¼38 bp. Furthermore, we demonstrate that our strategy delivers significant quantitative and qualitative performance gains over available methods. Such accurate and sensitive binding site resolution has important consequences for accurately reconstructing transcriptional regulatory networks, for motif discovery, for furthering our understanding of local and non-local factors in protein-DNA interactions and for extending the usefulness horizon of the ChIP-chip platform.
Assuntos
Algoritmos , Imunoprecipitação da Cromatina , Proteínas de Ligação a DNA/análise , Análise de Sequência com Séries de Oligonucleotídeos , Sítios de Ligação , Proteínas de Escherichia coli/análise , Fator Proteico para Inversão de Estimulação/análise , Proteína Reguladora de Resposta a Leucina/análise , Sensibilidade e EspecificidadeRESUMO
Chromosomes contain DNA covered with proteins performing functions such as architectural organization and transcriptional regulation. The ability to count the number of proteins bound to various regions of the genome is essential for understanding both architectural and regulatory functions. We present a straightforward method of counting gfp-conjugated proteins bound to an individual duplex DNA molecule by calibrating to a commercially available fluorescence standard using wide-field fluorescence microscopy. We demonstrate our method using the E. coli nucleoid-associated protein Fis.