Your browser doesn't support javascript.
loading
ChIP-Enrich: gene set enrichment testing for ChIP-seq data.
Welch, Ryan P; Lee, Chee; Imbriano, Paul M; Patil, Snehal; Weymouth, Terry E; Smith, R Alex; Scott, Laura J; Sartor, Maureen A.
Afiliação
  • Welch RP; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA Biostatistics Department, University of Michigan, Ann Arbor, MI 48109, USA.
  • Lee C; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Imbriano PM; Biostatistics Department, University of Michigan, Ann Arbor, MI 48109, USA.
  • Patil S; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Weymouth TE; Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Smith RA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Scott LJ; Biostatistics Department, University of Michigan, Ann Arbor, MI 48109, USA Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA ljst@umich.edu.
  • Sartor MA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA Biostatistics Department, University of Michigan, Ann Arbor, MI 48109, USA Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA sartorma@umich.e
Nucleic Acids Res ; 42(13): e105, 2014 Jul.
Article em En | MEDLINE | ID: mdl-24878920
ABSTRACT
Gene set enrichment testing can enhance the biological interpretation of ChIP-seq data. Here, we develop a method, ChIP-Enrich, for this analysis which empirically adjusts for gene locus length (the length of the gene body and its surrounding non-coding sequence). Adjustment for gene locus length is necessary because it is often positively associated with the presence of one or more peaks and because many biologically defined gene sets have an excess of genes with longer or shorter gene locus lengths. Unlike alternative methods, ChIP-Enrich can account for the wide range of gene locus length-to-peak presence relationships (observed in ENCODE ChIP-seq data sets). We show that ChIP-Enrich has a well-calibrated type I error rate using permuted ENCODE ChIP-seq data sets; in contrast, two commonly used gene set enrichment methods, Fisher's exact test and the binomial test implemented in Genomic Regions Enrichment of Annotations Tool (GREAT), can have highly inflated type I error rates and biases in ranking. We identify DNA-binding proteins, including CTCF, JunD and glucocorticoid receptor α (GRα), that show different enrichment patterns for peaks closer to versus further from transcription start sites. We also identify known and potential new biological functions of GRα. ChIP-Enrich is available as a web interface (http//chip-enrich.med.umich.edu) and Bioconductor package.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise de Sequência de DNA / Imunoprecipitação da Cromatina / Loci Gênicos / Genes Tipo de estudo: Risk_factors_studies Idioma: En Ano de publicação: 2014 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise de Sequência de DNA / Imunoprecipitação da Cromatina / Loci Gênicos / Genes Tipo de estudo: Risk_factors_studies Idioma: En Ano de publicação: 2014 Tipo de documento: Article