Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 21
1.
Clin Cancer Res ; 30(8): 1655-1668, 2024 Apr 15.
Article En | MEDLINE | ID: mdl-38277235

PURPOSE: Identifying molecular and immune features to guide immune checkpoint inhibitor (ICI)-based regimens remains an unmet clinical need. EXPERIMENTAL DESIGN: Tissue and longitudinal blood specimens from phase III trial S1400I in patients with metastatic squamous non-small cell carcinoma (SqNSCLC) treated with nivolumab monotherapy (nivo) or nivolumab plus ipilimumab (nivo+ipi) were subjected to multi-omics analyses including multiplex immunofluorescence (mIF), nCounter PanCancer Immune Profiling Panel, whole-exome sequencing, and Olink. RESULTS: Higher immune scores from immune gene expression profiling or immune cell infiltration by mIF were associated with response to ICIs and improved survival, except regulatory T cells, which were associated with worse overall survival (OS) for patients receiving nivo+ipi. Immune cell density and closer proximity of CD8+GZB+ T cells to malignant cells were associated with superior progression-free survival and OS. The cold immune landscape of NSCLC was associated with a higher level of chromosomal copy-number variation (CNV) burden. Patients with LRP1B-mutant tumors had a shorter survival than patients with LRP1B-wild-type tumors. Olink assays revealed soluble proteins such as LAMP3 increased in responders while IL6 and CXCL13 increased in nonresponders. Upregulation of serum CXCL13, MMP12, CSF-1, and IL8 were associated with worse survival before radiologic progression. CONCLUSIONS: The frequency, distribution, and clustering of immune cells relative to malignant ones can impact ICI efficacy in patients with SqNSCLC. High CNV burden may contribute to the cold immune microenvironment. Soluble inflammation/immune-related proteins in the blood have the potential to monitor therapeutic benefit from ICI treatment in patients with SqNSCLC.


Carcinoma, Non-Small-Cell Lung , Carcinoma, Squamous Cell , Lung Neoplasms , Humans , Nivolumab , Lung Neoplasms/drug therapy , Lung Neoplasms/genetics , Multiomics , Antineoplastic Combined Chemotherapy Protocols/adverse effects , Carcinoma, Non-Small-Cell Lung/drug therapy , Carcinoma, Non-Small-Cell Lung/genetics , Carcinoma, Squamous Cell/drug therapy , Carcinoma, Squamous Cell/genetics , Immunotherapy , Lung/pathology , Epithelial Cells/pathology , Ipilimumab/therapeutic use , Tumor Microenvironment
3.
Nucleic Acids Res ; 52(D1): D61-D66, 2024 Jan 05.
Article En | MEDLINE | ID: mdl-37971305

The Cistrome Data Browser is a resource of ChIP-seq, ATAC-seq and DNase-seq data from humans and mice. It provides maps of the genome-wide locations of transcription factors, cofactors, chromatin remodelers, histone post-translational modifications and regions of chromatin accessible to endonuclease activity. Cistrome DB v3.0 contains approximately 45 000 human and 44 000 mouse samples with about 32 000 newly collected datasets compared to the previous release. The Cistrome DB v3.0 user interface is implemented as a single page application that unifies menu driven and data driven search functions and provides an embedded genome browser, which allows users to find and visualize data more effectively. Users can find informative chromatin profiles through keyword, menu, and data-driven search tools. Browser search functions can predict the regulators of query genes as well as the cell type and factor dependent functionality of potential cis-regulatory elements. Cistrome DB v3.0 expands the display of quality control statistics, incorporates sequence logos into motif enrichment displays and includes more expansive sample metadata. Cistrome DB v3.0 is available at http://db3.cistrome.org/browser.


Chromatin , Databases, Protein , Genomics , Software , Animals , Humans , Mice , Chromatin/genetics , Histones/genetics , Histones/metabolism , Sequence Analysis, DNA , Transcription Factors/genetics , Transcription Factors/metabolism , Data Visualization , Internet , Genomics/methods
4.
Nat Med ; 29(11): 2737-2741, 2023 Nov.
Article En | MEDLINE | ID: mdl-37865722

Although circulating tumor DNA (ctDNA) assays are increasingly used to inform clinical decisions in cancer care, they have limited ability to identify the transcriptional programs that govern cancer phenotypes and their dynamic changes during the course of disease. To address these limitations, we developed a method for comprehensive epigenomic profiling of cancer from 1 ml of patient plasma. Using an immunoprecipitation-based approach targeting histone modifications and DNA methylation, we measured 1,268 epigenomic profiles in plasma from 433 individuals with one of 15 cancers. Our assay provided a robust proxy for transcriptional activity, allowing us to infer the expression levels of diagnostic markers and drug targets, measure the activity of therapeutically targetable transcription factors and detect epigenetic mechanisms of resistance. This proof-of-concept study in advanced cancers shows how plasma epigenomic profiling has the potential to unlock clinically actionable information that is currently accessible only via direct tissue sampling.


Circulating Tumor DNA , Neoplasms , Humans , Epigenomics , Biomarkers, Tumor/genetics , Neoplasms/genetics , Circulating Tumor DNA/genetics , Liquid Biopsy/methods , Mutation
5.
Nat Protoc ; 18(8): 2404-2414, 2023 08.
Article En | MEDLINE | ID: mdl-37391666

RNA-sequencing (RNA-seq) has become an increasingly cost-effective technique for molecular profiling and immune characterization of tumors. In the past decade, many computational tools have been developed to characterize tumor immunity from gene expression data. However, the analysis of large-scale RNA-seq data requires bioinformatics proficiency, large computational resources and cancer genomics and immunology knowledge. In this tutorial, we provide an overview of computational analysis of bulk RNA-seq data for immune characterization of tumors and introduce commonly used computational tools with relevance to cancer immunology and immunotherapy. These tools have diverse functions such as evaluation of expression signatures, estimation of immune infiltration, inference of the immune repertoire, prediction of immunotherapy response, neoantigen detection and microbiome quantification. We describe the RNA-seq IMmune Analysis (RIMA) pipeline integrating many of these tools to streamline RNA-seq analysis. We also developed a comprehensive and user-friendly guide in the form of a GitBook with text and video demos to assist users in analyzing bulk RNA-seq data for immune characterization at both individual sample and cohort levels by using RIMA.


Neoplasms , RNA , Humans , Software , Computational Biology/methods , Neoplasms/genetics , Sequence Analysis, RNA/methods , Gene Expression Profiling/methods
6.
Blood ; 141(15): 1817-1830, 2023 04 13.
Article En | MEDLINE | ID: mdl-36706355

The challenge of eradicating leukemia in patients with acute myelogenous leukemia (AML) after initial cytoreduction has motivated modern efforts to combine synergistic active modalities including immunotherapy. Recently, the ETCTN/CTEP 10026 study tested the combination of the DNA methyltransferase inhibitor decitabine together with the immune checkpoint inhibitor ipilimumab for AML/myelodysplastic syndrome (MDS) either after allogeneic hematopoietic stem cell transplantation (HSCT) or in the HSCT-naïve setting. Integrative transcriptome-based analysis of 304 961 individual marrow-infiltrating cells for 18 of 48 subjects treated on study revealed the strong association of response with a high baseline ratio of T to AML cells. Clinical responses were predominantly driven by decitabine-induced cytoreduction. Evidence of immune activation was only apparent after ipilimumab exposure, which altered CD4+ T-cell gene expression, in line with ongoing T-cell differentiation and increased frequency of marrow-infiltrating regulatory T cells. For post-HSCT samples, relapse could be attributed to insufficient clearing of malignant clones in progenitor cell populations. In contrast to AML/MDS bone marrow, the transcriptomes of leukemia cutis samples from patients with durable remission after ipilimumab monotherapy showed evidence of increased infiltration with antigen-experienced resident memory T cells and higher expression of CTLA-4 and FOXP3. Altogether, activity of combined decitabine and ipilimumab is impacted by cellular expression states within the microenvironmental niche of leukemic cells. The inadequate elimination of leukemic progenitors mandates urgent development of novel approaches for targeting these cell populations to generate long-lasting responses. This trial was registered at www.clinicaltrials.gov as #NCT02890329.


Hematopoietic Stem Cell Transplantation , Leukemia, Myeloid, Acute , Myelodysplastic Syndromes , Humans , Ipilimumab/therapeutic use , Decitabine/therapeutic use , Myelodysplastic Syndromes/genetics , Leukemia, Myeloid, Acute/drug therapy , Leukemia, Myeloid, Acute/genetics , Leukemia, Myeloid, Acute/pathology , Recurrence
7.
Bioinformatics ; 39(2)2023 02 03.
Article En | MEDLINE | ID: mdl-36688700

SUMMARY: The regulation of genes by cis-regulatory elements (CREs) is complex and differs between cell types. Visual analysis of large collections of chromatin profiles across diverse cell types, integrated with computational methods, can reveal meaningful biological insights. We developed Cistrome Explorer, a web-based interactive visual analytics tool for exploring thousands of chromatin profiles in diverse cell types. Integrated with the Cistrome Data Browser database which contains thousands of ChIP-seq, DNase-seq and ATAC-seq samples, Cistrome Explorer enables the discovery of patterns of CREs across cell types and the identification of transcription factor binding underlying these patterns. AVAILABILITY AND IMPLEMENTATION: Cistrome Explorer and its source code are available at http://cisvis.gehlenborglab.org/ and released under the MIT License. Documentation can be accessed via http://cisvis.gehlenborglab.org/docs/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Chromatin , Epigenomics , Sequence Analysis, DNA , Chromatin Immunoprecipitation Sequencing , Software , Databases, Genetic
8.
Genomics Proteomics Bioinformatics ; 19(4): 652-661, 2021 08.
Article En | MEDLINE | ID: mdl-34284136

Chromatin immunoprecipitation sequencing (ChIP-seq) and the Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) have become essential technologies to effectively measure protein-DNA interactions and chromatin accessibility. However, there is a need for a scalable and reproducible pipeline that incorporates proper normalization between samples, correction of copy number variations, and integration of new downstream analysis tools. Here we present Containerized Bioinformatics workflow for Reproducible ChIP/ATAC-seq Analysis (CoBRA), a modularized computational workflow which quantifies ChIP-seq and ATAC-seq peak regions and performs unsupervised and supervised analyses. CoBRA provides a comprehensive state-of-the-art ChIP-seq and ATAC-seq analysis pipeline that can be used by scientists with limited computational experience. This enables researchers to gain rapid insight into protein-DNA interactions and chromatin accessibility through sample clustering, differential peak calling, motif enrichment, comparison of sites to a reference database, and pathway analysis. CoBRA is publicly available online at https://bitbucket.org/cfce/cobra.


Chromatin Immunoprecipitation Sequencing , Computational Biology , Chromatin/genetics , DNA Copy Number Variations , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA , Workflow
9.
Nat Cancer ; 2(1): 34-48, 2021 01.
Article En | MEDLINE | ID: mdl-33997789

Pharmacologic inhibitors of cyclin-dependent kinases 4 and 6 (CDK4/6) were designed to induce cancer cell cycle arrest. Recent studies have suggested that these agents also exert other effects, influencing cancer cell immunogenicity, apoptotic responses, and differentiation. Using cell-based and mouse models of breast cancer together with clinical specimens, we show that CDK4/6 inhibitors induce remodeling of cancer cell chromatin characterized by widespread enhancer activation, and that this explains many of these effects. The newly activated enhancers include classical super-enhancers that drive luminal differentiation and apoptotic evasion, as well as a set of enhancers overlying endogenous retroviral elements that is enriched for proximity to interferon-driven genes. Mechanistically, CDK4/6 inhibition increases the level of several Activator Protein-1 (AP-1) transcription factor proteins, which are in turn implicated in the activity of many of the new enhancers. Our findings offer insights into CDK4/6 pathway biology and should inform the future development of CDK4/6 inhibitors.


Breast Neoplasms , Transcription Factor AP-1 , Animals , Breast Neoplasms/drug therapy , Cell Cycle Checkpoints , Cyclin-Dependent Kinase 4/genetics , Female , Genes, cdc , Humans , Mice , Transcription Factor AP-1/genetics
10.
Clin Cancer Res ; 27(18): 5049-5061, 2021 09 15.
Article En | MEDLINE | ID: mdl-33323402

PURPOSE: Whole-exome (WES) and RNA sequencing (RNA-seq) are key components of cancer immunogenomic analyses. To evaluate the consistency of tumor WES and RNA-seq profiling platforms across different centers, the Cancer Immune Monitoring and Analysis Centers (CIMAC) and the Cancer Immunologic Data Commons (CIDC) conducted a systematic harmonization study. EXPERIMENTAL DESIGN: DNA and RNA were centrally extracted from fresh frozen and formalin-fixed paraffin-embedded non-small cell lung carcinoma tumors and distributed to three centers for WES and RNA-seq profiling. In addition, two 10-plex HapMap cell line pools with known mutations were used to evaluate the accuracy of the WES platforms. RESULTS: The WES platforms achieved high precision (> 0.98) and recall (> 0.87) on the HapMap pools when evaluated on loci using > 50× common coverage. Nonsynonymous mutations clustered by tumor sample, achieving an index of specific agreement above 0.67 among replicates, centers, and sample processing. A DV200 > 24% for RNA, as a putative presequencing RNA quality control (QC) metric, was found to be a reliable threshold for generating consistent expression readouts in RNA-seq and NanoString data. MedTIN > 30 was likewise assessed as a reliable RNA-seq QC metric, above which samples from the same tumor across replicates, centers, and sample processing runs could be robustly clustered and HLA typing, immune infiltration, and immune repertoire inference could be performed. CONCLUSIONS: The CIMAC collaborating laboratory platforms effectively generated consistent WES and RNA-seq data and enable robust cross-trial comparisons and meta-analyses of highly complex immuno-oncology biomarker data across the NCI CIMAC-CIDC Network.


Base Sequence , DNA, Neoplasm/analysis , Exome Sequencing , Neoplasms/genetics , RNA, Neoplasm/analysis , Humans , Monitoring, Immunologic , Neoplasms/immunology
11.
Mod Pathol ; 34(2): 264-279, 2021 02.
Article En | MEDLINE | ID: mdl-33051600

Subependymal giant-cell astrocytomas (SEGAs) are slow-growing brain tumors that are a hallmark feature seen in 5-10% of patients with Tuberous Sclerosis Complex (TSC). Though histologically benign, they can cause serious neurologic symptoms, leading to death if untreated. SEGAs consistently show biallelic loss of TSC1 or TSC2. Herein, we aimed to define other somatic events beyond TSC1/TSC2 loss and identify potential transcriptional drivers that contribute to SEGA formation. Paired tumor-normal whole-exome sequencing was performed on 21 resected SEGAs from 20 TSC patients. Pathogenic variants in TSC1/TSC2 were identified in 19/21 (90%) SEGAs. Copy neutral loss of heterozygosity (size range: 2.2-46 Mb) was seen in 76% (16/21) of SEGAs (44% chr9q and 56% chr16p). An average of 1.4 other somatic variants (range 0-7) per tumor were identified, unlikely of pathogenic significance. Whole transcriptome RNA-sequencing analyses revealed 190 common differentially expressed genes in SEGA (n = 16, 13 from a prior study) in pairwise comparison to each of: low grade diffuse gliomas (n = 530) and glioblastoma (n = 171) from The Cancer Genome Atlas (TCGA) consortium, ganglioglioma (n = 10), TSC cortical tubers (n = 15), and multiple normal tissues. Among these, homeobox transcription factors (TFs) HMX3, HMX2, VAX1, SIX3; and TFs IRF6 and EOMES were all expressed >12-fold higher in SEGAs (FDR/q-value < 0.05). Immunohistochemistry supported the specificity of IRF6, VAX1, SIX3 for SEGAs in comparison to other tumor entities and normal brain. We conclude that SEGAs have an extremely low somatic mutation rate, suggesting that TSC1/TSC2 loss is sufficient to drive tumor growth. The unique and highly expressed SEGA-specific TFs likely reflect the neuroepithelial cell of origin, and may also contribute to the transcriptional and epigenetic state that enables SEGA growth following two-hit loss of TSC1 or TSC2 and mTORC1 activation.


Astrocytoma/genetics , Brain Neoplasms/genetics , Mechanistic Target of Rapamycin Complex 1/metabolism , Tuberous Sclerosis Complex 1 Protein/genetics , Tuberous Sclerosis Complex 2 Protein/genetics , Adolescent , Astrocytoma/metabolism , Brain Neoplasms/metabolism , Child , Child, Preschool , Female , Humans , Infant , Male , Middle Aged , Mutation Rate , Transcriptome , Young Adult
12.
Nat Protoc ; 15(8): 2503-2518, 2020 08.
Article En | MEDLINE | ID: mdl-32591768

Fixed-tissue ChIP-seq for H3K27 acetylation (H3K27ac) profiling (FiTAc-seq) is an epigenetic method for profiling active enhancers and promoters in formalin-fixed, paraffin-embedded (FFPE) tissues. We previously developed a modified ChIP-seq protocol (FiT-seq) for chromatin profiling in FFPE. FiT-seq produces high-quality chromatin profiles particularly for methylated histone marks but is not optimized for H3K27ac profiling. FiTAc-seq is a modified protocol that replaces the proteinase K digestion applied in FiT-seq with extended heating at 65 °C in a higher concentration of detergent and a minimized sonication step, to produce robust genome-wide H3K27ac maps from clinical samples. FiTAc-seq generates high-quality enhancer landscapes and super-enhancer (SE) annotation in numerous archived FFPE samples from distinct tumor types. This approach will be of great interest for both basic and clinical researchers. The entire protocol from FFPE blocks to sequence-ready library can be accomplished within 4 d.


Chromatin Immunoprecipitation Sequencing/methods , Histones/chemistry , Histones/metabolism , Lysine/metabolism , Paraffin Embedding , Tissue Fixation , Acetylation , Animals , Liver/cytology , Mice
13.
BMC Bioinformatics ; 19(1): 135, 2018 04 12.
Article En | MEDLINE | ID: mdl-29649993

BACKGROUND: RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. CONCLUSIONS: VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.


High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Software , Workflow , Base Sequence , Cluster Analysis , Down-Regulation/genetics , Gene Expression Profiling , Gene Ontology , RNA, Messenger/genetics , RNA, Messenger/metabolism , Sequence Alignment , Signal Transduction/genetics , Up-Regulation/genetics
14.
Sci Signal ; 10(477)2017 May 02.
Article En | MEDLINE | ID: mdl-28465412

Notch transcription complexes (NTCs) drive target gene expression by binding to two distinct types of genomic response elements, NTC monomer-binding sites and sequence-paired sites (SPSs) that bind NTC dimers. SPSs are conserved and have been linked to the Notch responsiveness of a few genes. To assess the overall contribution of SPSs to Notch-dependent gene regulation, we determined the DNA sequence requirements for NTC dimerization using a fluorescence resonance energy transfer (FRET) assay and applied insights from these in vitro studies to Notch-"addicted" T cell acute lymphoblastic leukemia (T-ALL) cells. We found that SPSs contributed to the regulation of about a third of direct Notch target genes. Although originally described in promoters, SPSs are present mainly in long-range enhancers, including an enhancer containing a newly described SPS that regulates HES5 expression. Our work provides a general method for identifying SPSs in genome-wide data sets and highlights the widespread role of NTC dimerization in Notch-transformed leukemia cells.


Enhancer Elements, Genetic , Gene Expression Regulation, Leukemic , Genome, Human , Precursor T-Cell Lymphoblastic Leukemia-Lymphoma/genetics , Receptors, Notch/metabolism , Transcription Factors/metabolism , Animals , Gene Expression Profiling , Humans , Mice , Precursor T-Cell Lymphoblastic Leukemia-Lymphoma/metabolism , Promoter Regions, Genetic , Protein Binding , Receptors, Notch/genetics , Signal Transduction , Transcription Factors/genetics , Tumor Cells, Cultured
15.
Cancer Res ; 77(3): 766-779, 2017 02 01.
Article En | MEDLINE | ID: mdl-27899379

Cancer cells exhibit dramatic alterations of chromatin organization at cis-regulatory elements, but the molecular basis, extent, and impact of these alterations are still being unraveled. Here, we identify extensive genome-wide modification of sites bearing the active histone mark H3K4me2 in primary human colorectal cancers, as compared with corresponding benign precursor adenomas. Modification of certain colorectal cancer sites highlighted the activity of the transcription factor CNOT3, which is known to control self-renewal of embryonic stem cells (ESC). In primary colorectal cancer cells, we observed a scattered pattern of CNOT3 expression, as might be expected for a tumor-initiating cell marker. Colorectal cancer cells exhibited nuclear and cytoplasmic expression of CNOT3, suggesting possible roles in both transcription and mRNA stability. We found that CNOT3 was bound primarily to genes whose expression was affected by CNOT3 loss, and also at sites modulated in certain types of colorectal cancers. These target genes were implicated in ESC and cancer self-renewal and fell into two distinct groups: those dependent on CNOT3 and MYC for optimal transcription and those repressed by CNOT3 binding and promoter hypermethylation. Silencing CNOT3 in colorectal cancer cells resulted in replication arrest. In clinical specimens, early-stage tumors that included >5% CNOT3+ cells exhibited a correlation to worse clinical outcomes compared with tumors with little to no CNOT3 expression. Together, our findings implicate CNOT3 in the coordination of colonic epithelial cell self-renewal, suggesting this factor as a new biomarker for molecular and prognostic classification of early-stage colorectal cancer. Cancer Res; 77(3); 766-79. ©2016 AACR.


Colorectal Neoplasms/pathology , Gene Expression Regulation, Neoplastic/physiology , Neoplastic Stem Cells/pathology , Transcription Factors/metabolism , Animals , Biomarkers, Tumor/analysis , Chromatin Immunoprecipitation , Colorectal Neoplasms/metabolism , Colorectal Neoplasms/mortality , Embryonic Stem Cells/metabolism , Embryonic Stem Cells/pathology , Gene Expression Profiling , Heterografts , Humans , Immunohistochemistry , Kaplan-Meier Estimate , Mice , Mice, Inbred NOD , Neoplastic Stem Cells/metabolism , Oligonucleotide Array Sequence Analysis , Reverse Transcriptase Polymerase Chain Reaction , Tissue Array Analysis , Transcriptome
16.
Nucleic Acids Res ; 45(D1): D658-D662, 2017 01 04.
Article En | MEDLINE | ID: mdl-27789702

Chromatin immunoprecipitation, DNase I hypersensitivity and transposase-accessibility assays combined with high-throughput sequencing enable the genome-wide study of chromatin dynamics, transcription factor binding and gene regulation. Although rapidly accumulating publicly available ChIP-seq, DNase-seq and ATAC-seq data are a valuable resource for the systematic investigation of gene regulation processes, a lack of standardized curation, quality control and analysis procedures have hindered extensive reuse of these data. To overcome this challenge, we built the Cistrome database, a collection of ChIP-seq and chromatin accessibility data (DNase-seq and ATAC-seq) published before January 1, 2016, including 13 366 human and 9953 mouse samples. All the data have been carefully curated and processed with a streamlined analysis pipeline and evaluated with comprehensive quality control metrics. We have also created a user-friendly web server for data query, exploration and visualization. The resulting Cistrome DB (Cistrome Data Browser), available online at http://cistrome.org/db, is expected to become a valuable resource for transcriptional and epigenetic regulation studies.


Chromatin Assembly and Disassembly , Chromatin Immunoprecipitation , Databases, Genetic , High-Throughput Nucleotide Sequencing , Web Browser , Animals , Epigenesis, Genetic , Epigenomics/methods , Gene Expression Regulation , Genomics/methods , Humans , Mice
17.
BMC Bioinformatics ; 17(1): 404, 2016 Oct 03.
Article En | MEDLINE | ID: mdl-27716038

BACKGROUND: Transcription factor binding, histone modification, and chromatin accessibility studies are important approaches to understanding the biology of gene regulation. ChIP-seq and DNase-seq have become the standard techniques for studying protein-DNA interactions and chromatin accessibility respectively, and comprehensive quality control (QC) and analysis tools are critical to extracting the most value from these assay types. Although many analysis and QC tools have been reported, few combine ChIP-seq and DNase-seq data analysis and quality control in a unified framework with a comprehensive and unbiased reference of data quality metrics. RESULTS: ChiLin is a computational pipeline that automates the quality control and data analyses of ChIP-seq and DNase-seq data. It is developed using a flexible and modular software framework that can be easily extended and modified. ChiLin is ideal for batch processing of many datasets and is well suited for large collaborative projects involving ChIP-seq and DNase-seq from different designs. ChiLin generates comprehensive quality control reports that include comparisons with historical data derived from over 23,677 public ChIP-seq and DNase-seq samples (11,265 datasets) from eight literature-based classified categories. To the best of our knowledge, this atlas represents the most comprehensive ChIP-seq and DNase-seq related quality metric resource currently available. These historical metrics provide useful heuristic quality references for experiment across all commonly used assay types. Using representative datasets, we demonstrate the versatility of the pipeline by applying it to different assay types of ChIP-seq data. The pipeline software is available open source at https://github.com/cfce/chilin . CONCLUSION: ChiLin is a scalable and powerful tool to process large batches of ChIP-seq and DNase-seq datasets. The analysis output and quality metrics have been structured into user-friendly directories and reports. We have successfully compiled 23,677 profiles into a comprehensive quality atlas with fine classification for users.


Chromatin Immunoprecipitation/methods , Deoxyribonucleases/genetics , Gene Expression Regulation , High-Throughput Nucleotide Sequencing/methods , Quality Control , Sequence Analysis, DNA/methods , Software , Chromosome Mapping , Data Interpretation, Statistical , Databases, Genetic , Deoxyribonucleases/metabolism , Humans
18.
Proc Natl Acad Sci U S A ; 111(2): 705-10, 2014 Jan 14.
Article En | MEDLINE | ID: mdl-24374627

The main oncogenic driver in T-lymphoblastic leukemia is NOTCH1, which activates genes by forming chromatin-associated Notch transcription complexes. Gamma-secretase-inhibitor treatment prevents NOTCH1 nuclear localization, but most genes with NOTCH1-binding sites are insensitive to gamma-secretase inhibitors. Here, we demonstrate that fewer than 10% of NOTCH1-binding sites show dynamic changes in NOTCH1 occupancy when T-lymphoblastic leukemia cells are toggled between the Notch-on and -off states with gamma-secretase inhibiters. Dynamic NOTCH1 sites are functional, being highly associated with Notch target genes, are located mainly in distal enhancers, and frequently overlap with RUNX1 binding. In line with the latter association, we show that expression of IL7R, a gene with key roles in normal T-cell development and in T-lymphoblastic leukemia, is coordinately regulated by Runx factors and dynamic NOTCH1 binding to distal enhancers. Like IL7R, most Notch target genes and associated dynamic NOTCH1-binding sites cooccupy chromatin domains defined by constitutive binding of CCCTC binding factor, which appears to restrict the regulatory potential of dynamic NOTCH1 sites. More remarkably, the majority of dynamic NOTCH1 sites lie in superenhancers, distal elements with exceptionally broad and high levels of H3K27ac. Changes in Notch occupancy produces dynamic alterations in H3K27ac levels across the entire breadth of superenhancers and in the promoters of Notch target genes. These findings link regulation of superenhancer function to NOTCH1, a master regulatory factor and potent oncoprotein in the context of immature T cells, and delineate a generally applicable roadmap for identifying functional Notch sites in cellular genomes.


Enhancer Elements, Genetic/genetics , Gene Expression Regulation/physiology , Immunoglobulin J Recombination Signal Sequence-Binding Protein/metabolism , Multiprotein Complexes/metabolism , Receptor, Notch1/metabolism , Signal Transduction/physiology , Binding Sites/genetics , Cell Line, Tumor , Chromatin Immunoprecipitation , DNA Primers/genetics , Gene Expression Regulation/genetics , Humans , Luciferases , Plasmids/genetics , Promoter Regions, Genetic/genetics , Real-Time Polymerase Chain Reaction , Receptors, Interleukin-7/metabolism , Signal Transduction/genetics
19.
Bioinformatics ; 29(10): 1352-4, 2013 May 15.
Article En | MEDLINE | ID: mdl-23508969

SUMMARY: Chromatin immunoprecipitation and DNase I hypersensitivity assays with high-throughput sequencing have greatly accelerated the understanding of transcriptional and epigenetic regulation, although data reuse for the community of experimental biologists has been challenging. We created a data portal CistromeFinder that can help query, evaluate and visualize publicly available Chromatin immunoprecipitation and DNase I hypersensitivity assays with high-throughput sequencing data in human and mouse. The database currently contains 6378 samples over 4391 datasets, 313 factors and 102 cell lines or cell populations. Each dataset has gone through a consistent analysis and quality control pipeline; therefore, users could evaluate the overall quality of each dataset before examining binding sites near their genes of interest. CistromeFinder is integrated with UCSC genome browser for visualization, Primer3Plus for ChIP-qPCR primer design and CistromeMap for submitting newly available datasets. It also allows users to leave comments to facilitate data evaluation and update. AVAILABILITY: http://cistrome.org/finder. CONTACT: xsliu@jimmy.harvard.edu or henry_long@dfci.harvard.edu.


Chromatin Immunoprecipitation , Databases, Genetic , High-Throughput Nucleotide Sequencing , Information Storage and Retrieval , Animals , Cell Line , Deoxyribonuclease I/metabolism , Epigenesis, Genetic , Humans , Mice , Oligonucleotide Array Sequence Analysis , Software
20.
Bioinformatics ; 28(10): 1411-2, 2012 May 15.
Article En | MEDLINE | ID: mdl-22495751

SUMMARY: Transcription and chromatin regulators, and histone modifications play essential roles in gene expression regulation. We have created CistromeMap as a web server to provide a comprehensive knowledgebase of all of the publicly available ChIP-Seq and DNase-Seq data in mouse and human. We have also manually curated metadata to ensure annotation consistency, and developed a user-friendly display matrix for quick navigation and retrieval of data for specific factors, cells and papers. Finally, we provide users with summary statistics of ChIP-Seq and DNase-Seq studies.


Databases, Genetic , Knowledge Bases , Animals , Chromatin Immunoprecipitation , Gene Expression Regulation , Humans , Mice , Oligonucleotide Array Sequence Analysis , Software
...