RESUMEN
Inflammation can trigger lasting phenotypes in immune and non-immune cells. Whether and how human infections and associated inflammation can form innate immune memory in hematopoietic stem and progenitor cells (HSPC) has remained unclear. We found that circulating HSPC, enriched from peripheral blood, captured the diversity of bone marrow HSPC, enabling investigation of their epigenomic reprogramming following coronavirus disease 2019 (COVID-19). Alterations in innate immune phenotypes and epigenetic programs of HSPC persisted for months to 1 year following severe COVID-19 and were associated with distinct transcription factor (TF) activities, altered regulation of inflammatory programs, and durable increases in myelopoiesis. HSPC epigenomic alterations were conveyed, through differentiation, to progeny innate immune cells. Early activity of IL-6 contributed to these persistent phenotypes in human COVID-19 and a mouse coronavirus infection model. Epigenetic reprogramming of HSPC may underlie altered immune function following infection and be broadly relevant, especially for millions of COVID-19 survivors.
Asunto(s)
COVID-19 , Memoria Epigenética , Síndrome Post Agudo de COVID-19 , Animales , Humanos , Ratones , Diferenciación Celular , COVID-19/inmunología , Modelos Animales de Enfermedad , Células Madre Hematopoyéticas , Inflamación/genética , Inmunidad Entrenada , Monocitos/inmunología , Síndrome Post Agudo de COVID-19/genética , Síndrome Post Agudo de COVID-19/inmunología , Síndrome Post Agudo de COVID-19/patologíaRESUMEN
Cis-Regulatory elements (cis-REs) include promoters, enhancers, and insulators that regulate gene expression programs via binding of transcription factors. ATAC-seq technology effectively identifies active cis-REs in a given cell type (including from single cells) by mapping accessible chromatin at base-pair resolution. However, these maps are not immediately useful for inferring specific functions of cis-REs. For this purpose, we developed a deep learning framework (CoRE-ATAC) with novel data encoders that integrate DNA sequence (reference or personal genotypes) with ATAC-seq cut sites and read pileups. CoRE-ATAC was trained on 4 cell types (n = 6 samples/replicates) and accurately predicted known cis-RE functions from 7 cell types (n = 40 samples) that were not used in model training (mean average precision = 0.80, mean F1 score = 0.70). CoRE-ATAC enhancer predictions from 19 human islet samples coincided with genetically modulated gain/loss of enhancer activity, which was confirmed by massively parallel reporter assays (MPRAs). Finally, CoRE-ATAC effectively inferred cis-RE function from aggregate single nucleus ATAC-seq (snATAC) data from human blood-derived immune cells that overlapped with known functional annotations in sorted immune cells, which established the efficacy of these models to study cis-RE functions of rare cells without the need for cell sorting. ATAC-seq maps from primary human cells reveal individual- and cell-specific variation in cis-RE activity. CoRE-ATAC increases the functional resolution of these maps, a critical step for studying regulatory disruptions behind diseases.
Asunto(s)
Secuenciación de Inmunoprecipitación de Cromatina/métodos , Aprendizaje Profundo , Secuencias Reguladoras de Ácidos Nucleicos/genética , Análisis de la Célula Individual/métodos , Células Cultivadas , Biología Computacional , ADN/análisis , ADN/genética , Humanos , Islotes Pancreáticos/citología , Monocitos/citologíaRESUMEN
SUMMARY: Current approaches for pathway analyses focus on representing gene expression levels on graph representations of pathways and conducting pathway enrichment among differentially expressed genes. However, gene expression levels by themselves do not reflect the overall picture as non-coding factors play an important role to regulate gene expression. To incorporate these non-coding factors into pathway analyses and to systematically prioritize genes in a pathway we introduce a new software: Triangulation of Perturbation Origins and Identification of Non-Coding Targets. Triangulation of Perturbation Origins and Identification of Non-Coding Targets is a pathway analysis tool, implemented in Java that identifies the significance of a gene under a condition (e.g. a disease phenotype) by studying graph representations of pathways, analyzing upstream and downstream gene interactions and integrating non-coding regions that may be regulating gene expression levels. AVAILABILITY AND IMPLEMENTATION: The TriPOINT open source software is freely available at https://github.uconn.edu/ajt06004/TriPOINT under the GPL v3.0 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Programas Informáticos , Expresión GénicaRESUMEN
UNLABELLED: Recent studies of the human genome have indicated that regulatory elements (e.g. promoters and enhancers) at distal genomic locations can interact with each other via chromatin folding and affect gene expression levels. Genomic technologies for mapping interactions between DNA regions, e.g., ChIA-PET and HiC, can generate genome-wide maps of interactions between regulatory elements. These interaction datasets are important resources to infer distal gene targets of non-coding regulatory elements and to facilitate prioritization of critical loci for important cellular functions. With the increasing diversity and complexity of genomic information and public ontologies, making sense of these datasets demands integrative and easy-to-use software tools. Moreover, network representation of chromatin interaction maps enables effective data visualization, integration, and mining. Currently, there is no software that can take full advantage of network theory approaches for the analysis of chromatin interaction datasets. To fill this gap, we developed a web-based application, QuIN, which enables: 1) building and visualizing chromatin interaction networks, 2) annotating networks with user-provided private and publicly available functional genomics and interaction datasets, 3) querying network components based on gene name or chromosome location, and 4) utilizing network based measures to identify and prioritize critical regulatory targets and their direct and indirect interactions. AVAILABILITY: QuIN's web server is available at http://quin.jax.org QuIN is developed in Java and JavaScript, utilizing an Apache Tomcat web server and MySQL database and the source code is available under the GPLV3 license available on GitHub: https://github.com/UcarLab/QuIN/.
Asunto(s)
Cromatina/genética , Minería de Datos/métodos , Mapeo de Interacción de Proteínas/métodos , Transducción de Señal/genética , Programas Informáticos , Interfaz Usuario-Computador , Sitios de Unión , Epigénesis Genética/genética , Internet , Polimorfismo de Nucleótido Simple/genética , Unión Proteica , Elementos Reguladores de la TranscripciónRESUMEN
Endoplasmic reticulum (ER) and inflammatory stress responses contribute to islet dysfunction in type 2 diabetes (T2D). Comprehensive genomic understanding of these human islet stress responses and whether T2D-associated genetic variants modulate them is lacking. Here, comparative transcriptome and epigenome analyses of human islets exposed ex vivo to these stressors revealed 30% of expressed genes and 14% of islet cis-regulatory elements (CREs) as stress responsive, modulated largely in an ER- or cytokine-specific fashion. T2D variants overlapped 86 stress-responsive CREs, including 21 induced by ER stress. We linked the rs6917676-T T2D risk allele to increased islet ER-stress-responsive CRE accessibility and allele-specific ß cell nuclear factor binding. MAP3K5, the ER-stress-responsive putative rs6917676 T2D effector gene, promoted stress-induced ß cell apoptosis. Supporting its pro-diabetogenic role, MAP3K5 expression correlated inversely with human islet ß cell abundance and was elevated in T2D ß cells. This study provides genome-wide insights into human islet stress responses and context-specific T2D variant effects.
RESUMEN
Monocytes can differentiate into macrophages (Mo-Macs) or dendritic cells (Mo-DCs). The cytokine granulocyte-macrophage colony-stimulating factor (GM-CSF) induces the differentiation of monocytes into Mo-Macs, while the combination of GM-CSF/interleukin (IL)-4 is widely used to generate Mo-DCs for clinical applications and to study human DC biology. Here, we report that pharmacological inhibition of the nuclear receptor peroxisome proliferator-activated receptor gamma (PPARγ) in the presence of GM-CSF and the absence of IL-4 induces monocyte differentiation into Mo-DCs. Remarkably, we find that simultaneous inhibition of PPARγ and the nutrient sensor mammalian target of rapamycin complex 1 (mTORC1) induces the differentiation of Mo-DCs with stronger phenotypic stability, superior immunogenicity, and a transcriptional profile characterized by a strong type I interferon (IFN) signature, a lower expression of a large set of tolerogenic genes, and the differential expression of several transcription factors compared with GM-CSF/IL-4 Mo-DCs. Our findings uncover a pathway that tailors Mo-DC differentiation with potential implications in the fields of DC vaccination and cancer immunotherapy.
Asunto(s)
Factor Estimulante de Colonias de Granulocitos y Macrófagos , Monocitos , Humanos , Monocitos/metabolismo , Factor Estimulante de Colonias de Granulocitos y Macrófagos/farmacología , Factor Estimulante de Colonias de Granulocitos y Macrófagos/metabolismo , PPAR gamma/metabolismo , Interleucina-4/farmacología , Interleucina-4/metabolismo , Células Dendríticas/metabolismo , Diferenciación Celular/fisiología , Células CultivadasRESUMEN
Detecting multiplets in single nucleus (sn)ATAC-seq data is challenging due to data sparsity and limited dynamic range. AMULET (ATAC-seq MULtiplet Estimation Tool) enumerates regions with greater than two uniquely aligned reads across the genome to effectively detect multiplets. We evaluate the method by generating snATAC-seq data in the human blood and pancreatic islet samples. AMULET has high precision, estimated via donor-based multiplexing, and high recall, estimated via simulated multiplets, compared to alternatives and identifies multiplets most effectively when a certain read depth of 25K median valid reads per nucleus is achieved.
Asunto(s)
Secuenciación de Inmunoprecipitación de Cromatina , Programas Informáticos , Anciano , ADN/genética , Humanos , Leucocitos Mononucleares/metabolismo , Funciones de Verosimilitud , Transposasas/metabolismoRESUMEN
EndoC-ßH1 is emerging as a critical human ß cell model to study the genetic and environmental etiologies of ß cell (dys)function and diabetes. Comprehensive knowledge of its molecular landscape is lacking, yet required, for effective use of this model. Here, we report chromosomal (spectral karyotyping), genetic (genotyping), epigenomic (ChIP-seq and ATAC-seq), chromatin interaction (Hi-C and Pol2 ChIA-PET), and transcriptomic (RNA-seq and miRNA-seq) maps of EndoC-ßH1. Analyses of these maps define known (e.g., PDX1 and ISL1) and putative (e.g., PCSK1 and mir-375) ß cell-specific transcriptional cis-regulatory networks and identify allelic effects on cis-regulatory element use. Importantly, comparison with maps generated in primary human islets and/or ß cells indicates preservation of chromatin looping but also highlights chromosomal aberrations and fetal genomic signatures in EndoC-ßH1. Together, these maps, and a web application we created for their exploration, provide important tools for the design of experiments to probe and manipulate the genetic programs governing ß cell identity and (dys)function in diabetes.
Asunto(s)
Redes Reguladoras de Genes/genética , Células Secretoras de Insulina/metabolismo , Línea Celular , HumanosRESUMEN
Enhancers are cis-acting sequences that regulate transcription rates of their target genes in a cell-specific manner and harbor disease-associated sequence variants in cognate cell types. Many complex diseases are associated with enhancer malfunction, necessitating the discovery and study of enhancers from clinical samples. Assay for Transposase Accessible Chromatin (ATAC-seq) technology can interrogate chromatin accessibility from small cell numbers and facilitate studying enhancers in pathologies. However, on average, ~35% of open chromatin regions (OCRs) from ATAC-seq samples map to enhancers. We developed a neural network-based model, Predicting Enhancers from ATAC-Seq data (PEAS), to effectively infer enhancers from clinical ATAC-seq samples by extracting ATAC-seq data features and integrating these with sequence-related features (e.g., GC ratio). PEAS recapitulated ChromHMM-defined enhancers in CD14+ monocytes, CD4+ T cells, GM12878, peripheral blood mononuclear cells, and pancreatic islets. PEAS models trained on these 5 cell types effectively predicted enhancers in four cell types that are not used in model training (EndoC-ßH1, naïve CD8+ T, MCF7, and K562 cells). Finally, PEAS inferred individual-specific enhancers from 19 islet ATAC-seq samples and revealed variability in enhancer activity across individuals, including those driven by genetic differences. PEAS is an easy-to-use tool developed to study enhancers in pathologies by taking advantage of the increasing number of clinical epigenomes.
Asunto(s)
Sitios de Unión , Elementos de Facilitación Genéticos , Redes Neurales de la Computación , Transposasas/metabolismo , Línea Celular , Biología Computacional/métodos , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Curva ROC , Sensibilidad y Especificidad , Análisis de Secuencia de ADN , Transcriptoma , Transposasas/químicaRESUMEN
Broad domain promoters and super enhancers are regulatory elements that govern cell-specific functions and harbor disease-associated sequence variants. These elements are characterized by distinct epigenomic profiles, such as expanded deposition of histone marks H3K27ac for super enhancers and H3K4me3 for broad domains, however little is known about how they interact with each other and the rest of the genome in three-dimensional chromatin space. Using network theory methods, we studied chromatin interactions between broad domains and super enhancers in three ENCODE cell lines (K562, MCF7, GM12878) obtained via ChIA-PET, Hi-C, and Hi-CHIP assays. In these networks, broad domains and super enhancers interact more frequently with each other compared to their typical counterparts. Network measures and graphlets revealed distinct connectivity patterns associated with these regulatory elements that are robust across cell types and alternative assays. Machine learning models showed that these connectivity patterns could effectively discriminate broad domains from typical promoters and super enhancers from typical enhancers. Finally, targets of broad domains in these networks were enriched in disease-causing SNPs of cognate cell types. Taken together these results suggest a robust and unique organization of the chromatin around broad domains and super enhancers: loci critical for pathologies and cell-specific functions.