Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Res Sq ; 2023 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-37503119

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

2.
bioRxiv ; 2023 Apr 06.
Artículo en Inglés | MEDLINE | ID: mdl-37066421

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

3.
Blood ; 138(17): 1540-1553, 2021 10 28.
Artículo en Inglés | MEDLINE | ID: mdl-34086867

RESUMEN

Thalassemia or sickle cell patients with hereditary persistence of fetal hemoglobin (HbF) have an ameliorated clinical phenotype and, in some cases, can achieve transfusion independence. Inactivation via genome editing of γ-globin developmental suppressors, such as BCL11A or LRF/ZBTB7A, or of their binding sites, have been shown to significantly increase expression of endogenous HbF. To broaden the therapeutic window beyond a single-editing approach, we have explored combinations of cis- and trans-editing targets to enhance HbF reactivation. Multiplex mutagenesis in adult CD34+ cells was well tolerated and did not lead to any detectable defect in the cells' proliferation and differentiation, either in vitro or in vivo. The combination of 1 trans and 1 cis mutation resulted in high editing retention in vivo, coupled with almost pancellular HbF expression in NBSGW mice. The greater in vivo performance of this combination was also recapitulated using a novel helper-dependent adenoviral-CRISPR vector (HD-Ad-dualCRISPR) in CD34+ cells from ß-thalassemia patients transplanted to NBSGW mice. A pronounced increase in HbF expression was observed in human red blood cells in mice with established predominant ß0/ß0-thalassemic hemopoiesis after in vivo injection of the HD-Ad-dualCRISPR vector. Collectively, our data suggest that the combination of cis and trans fetal globin reactivation mutations has the potential to significantly increase HbF both totally and on a per cell basis over single editing and could thus provide significant clinical benefit to patients with severe ß-globin phenotype.


Asunto(s)
Antígenos CD34/genética , Hemoglobina Fetal/genética , Mutagénesis , Talasemia beta/genética , Adulto , Animales , Sistemas CRISPR-Cas , Células Cultivadas , Edición Génica , Terapia Genética , Humanos , Ratones , Talasemia beta/terapia , gamma-Globinas/genética
4.
Nature ; 584(7820): 244-251, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32728217

RESUMEN

DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA1-5 and contain genetic variations associated with diseases and phenotypic traits6-8. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation.


Asunto(s)
Cromatina/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Anotación de Secuencia Molecular , Cromatina/química , Cromatina/metabolismo , ADN/química , ADN/genética , Regulación de la Expresión Génica , Genes/genética , Genoma Humano/genética , Humanos , Regiones Promotoras Genéticas/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética
5.
Nature ; 583(7818): 729-736, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728250

RESUMEN

Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.


Asunto(s)
Huella de ADN/normas , Genoma Humano/genética , Factores de Transcripción/metabolismo , Secuencia de Consenso , ADN/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Genética de Población , Estudio de Asociación del Genoma Completo , Humanos , Modelos Moleculares , Polimorfismo de Nucleótido Simple , Secuencias Reguladoras de Ácidos Nucleicos/genética
7.
Cell Rep ; 31(8): 107676, 2020 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-32460018

RESUMEN

The human genome encodes millions of regulatory elements, of which only a small fraction are active within a given cell type. Little is known about the global impact of chromatin remodelers on regulatory DNA landscapes and how this translates to gene expression. We use precision genome engineering to reawaken homozygously inactivated SMARCA4, a central ATPase of the human SWI/SNF chromatin remodeling complex, in lung adenocarcinoma cells. Here, we combine DNase I hypersensitivity, histone modification, and transcriptional profiling to show that SMARCA4 dramatically increases both the number and magnitude of accessible chromatin sites genome-wide, chiefly by unmasking sites of low regulatory factor occupancy. By contrast, transcriptional changes are concentrated within well-demarcated remodeling domains wherein expression of specific genes is gated by both distal element activation and promoter chromatin configuration. Our results provide a perspective on how global chromatin remodeling activity is translated to gene expression via regulatory DNA.


Asunto(s)
Ensamble y Desensamble de Cromatina/genética , ADN Helicasas/metabolismo , ADN/genética , Expresión Génica/genética , Proteínas Nucleares/metabolismo , Factores de Transcripción/metabolismo , Humanos
8.
EBioMedicine ; 41: 427-442, 2019 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-30827930

RESUMEN

BACKGROUND: Transcriptional dysregulation drives cancer formation but the underlying mechanisms are still poorly understood. Renal cell carcinoma (RCC) is the most common malignant kidney tumor which canonically activates the hypoxia-inducible transcription factor (HIF) pathway. Despite intensive study, novel therapeutic strategies to target RCC have been difficult to develop. Since the RCC epigenome is relatively understudied, we sought to elucidate key mechanisms underpinning the tumor phenotype and its clinical behavior. METHODS: We performed genome-wide chromatin accessibility (DNase-seq) and transcriptome profiling (RNA-seq) on paired tumor/normal samples from 3 patients undergoing nephrectomy for removal of RCC. We incorporated publicly available data on HIF binding (ChIP-seq) in a RCC cell line. We performed integrated analyses of these high-resolution, genome-scale datasets together with larger transcriptomic data available through The Cancer Genome Atlas (TCGA). FINDINGS: Though HIF transcription factors play a cardinal role in RCC oncogenesis, we found that numerous transcription factors with a RCC-selective expression pattern also demonstrated evidence of HIF binding near their gene body. Examination of chromatin accessibility profiles revealed that some of these transcription factors influenced the tumor's regulatory landscape, notably the stem cell transcription factor POU5F1 (OCT4). Elevated POU5F1 transcript levels were correlated with advanced tumor stage and poorer overall survival in RCC patients. Unexpectedly, we discovered a HIF-pathway-responsive promoter embedded within a endogenous retroviral long terminal repeat (LTR) element at the transcriptional start site of the PSOR1C3 long non-coding RNA gene upstream of POU5F1. RNA transcripts are induced from this promoter and read through PSOR1C3 into POU5F1 producing a novel POU5F1 transcript isoform. Rather than being unique to the POU5F1 locus, we found that HIF binds to several other transcriptionally active LTR elements genome-wide correlating with broad gene expression changes in RCC. INTERPRETATION: Integrated transcriptomic and epigenomic analysis of matched tumor and normal tissues from even a small number of primary patient samples revealed remarkably convergent shared regulatory landscapes. Several transcription factors appear to act downstream of HIF including the potent stem cell transcription factor POU5F1. Dysregulated expression of POU5F1 is part of a larger pattern of gene expression changes in RCC that may be induced by HIF-dependent reactivation of dormant promoters embedded within endogenous retroviral LTRs.


Asunto(s)
Retrovirus Endógenos/genética , Epigenómica , Factores de Transcripción con Motivo Hélice-Asa-Hélice Básico/genética , Sitios de Unión , Carcinoma de Células Renales/genética , Carcinoma de Células Renales/mortalidad , Carcinoma de Células Renales/patología , Línea Celular Tumoral , Reductasas del Citocromo/genética , Retrovirus Endógenos/fisiología , Regulación Neoplásica de la Expresión Génica , Humanos , Factor 1 Inducible por Hipoxia/genética , Neoplasias Renales/genética , Neoplasias Renales/mortalidad , Neoplasias Renales/patología , Factor 3 de Transcripción de Unión a Octámeros/genética , Factor 3 de Transcripción de Unión a Octámeros/metabolismo , Oxidorreductasas actuantes sobre Donantes de Grupos Sulfuro , Hidrolasas Diéster Fosfóricas/genética , Regiones Promotoras Genéticas , Proteínas/genética , Pirofosfatasas/genética , ARN Largo no Codificante , Tasa de Supervivencia , Secuencias Repetidas Terminales/genética , Enzimas Ubiquitina-Conjugadoras/genética
9.
J Am Soc Nephrol ; 30(3): 421-441, 2019 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-30760496

RESUMEN

BACKGROUND: Linking genetic risk loci identified by genome-wide association studies (GWAS) to their causal genes remains a major challenge. Disease-associated genetic variants are concentrated in regions containing regulatory DNA elements, such as promoters and enhancers. Although researchers have previously published DNA maps of these regulatory regions for kidney tubule cells and glomerular endothelial cells, maps for podocytes and mesangial cells have not been available. METHODS: We generated regulatory DNA maps (DNase-seq) and paired gene expression profiles (RNA-seq) from primary outgrowth cultures of human glomeruli that were composed mainly of podocytes and mesangial cells. We generated similar datasets from renal cortex cultures, to compare with those of the glomerular cultures. Because regulatory DNA elements can act on target genes across large genomic distances, we also generated a chromatin conformation map from freshly isolated human glomeruli. RESULTS: We identified thousands of unique regulatory DNA elements, many located close to transcription factor genes, which the glomerular and cortex samples expressed at different levels. We found that genetic variants associated with kidney diseases (GWAS) and kidney expression quantitative trait loci were enriched in regulatory DNA regions. By combining GWAS, epigenomic, and chromatin conformation data, we functionally annotated 46 kidney disease genes. CONCLUSIONS: We demonstrate a powerful approach to functionally connect kidney disease-/trait-associated loci to their target genes by leveraging unique regulatory DNA maps and integrated epigenomic and genetic analysis. This process can be applied to other kidney cell types and will enhance our understanding of genome regulation and its effects on gene expression in kidney disease.

10.
PLoS One ; 10(12): e0144310, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26637170

RESUMEN

Increased incidence of infections due to Klebsiella pneumoniae carbapenemase (KPC)-producing Klebsiella pneumoniae (KPC-Kp) was noted among patients undergoing endoscopic retrograde cholangiopancreatography (ERCP) at a single hospital. An epidemiologic investigation identified KPC-Kp and non-KPC-producing, extended-spectrum ß-lactamase (ESBL)-producing Kp in cultures from 2 endoscopes. Genotyping was performed on patient and endoscope isolates to characterize the microbial genomics of the outbreak. Genetic similarity of 51 Kp isolates from 37 patients and 3 endoscopes was assessed by pulsed-field gel electrophoresis (PFGE) and multi-locus sequence typing (MLST). Five patient and 2 endoscope isolates underwent whole genome sequencing (WGS). Two KPC-encoding plasmids were characterized by single molecule, real-time sequencing. Plasmid diversity was assessed by endonuclease digestion. Genomic and epidemiologic data were used in conjunction to investigate the outbreak source. Two clusters of Kp patient isolates were genetically related to endoscope isolates by PFGE. A subset of patient isolates were collected post-ERCP, suggesting ERCP endoscopes as a possible source. A phylogeny of 7 Kp genomes from patient and endoscope isolates supported ERCP as a potential source of transmission. Differences in gene content defined 5 ST258 subclades and identified 2 of the subclades as outbreak-associated. A novel KPC-encoding plasmid, pKp28 helped define and track one endoscope-associated ST258 subclade. WGS demonstrated high genetic relatedness of patient and ERCP endoscope isolates suggesting ERCP-associated transmission of ST258 KPC-Kp. Gene and plasmid content discriminated the outbreak from endemic ST258 populations and assisted with the molecular epidemiologic investigation of an extended KPC-Kp outbreak.


Asunto(s)
Proteínas Bacterianas , Colangiopancreatografia Retrógrada Endoscópica/efectos adversos , Brotes de Enfermedades , Genoma Bacteriano , Infecciones por Klebsiella , Klebsiella pneumoniae , Filogenia , beta-Lactamasas , Proteínas Bacterianas/biosíntesis , Proteínas Bacterianas/genética , Femenino , Humanos , Infecciones por Klebsiella/enzimología , Infecciones por Klebsiella/epidemiología , Infecciones por Klebsiella/etiología , Infecciones por Klebsiella/genética , Klebsiella pneumoniae/genética , Klebsiella pneumoniae/aislamiento & purificación , Masculino , Plásmidos/genética , beta-Lactamasas/biosíntesis , beta-Lactamasas/genética
11.
J Clin Microbiol ; 52(7): 2406-9, 2014 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-24759727

RESUMEN

Previous studies suggested that 7 to 15% of healthy adults are colonized with toxigenic Clostridium difficile. To investigate the epidemiology, genetic diversity, and duration of C. difficile colonization in asymptomatic persons, we recruited healthy adults from the general population in Allegheny County, Pennsylvania. Participants provided epidemiological and dietary intake data and submitted stool specimens. The presence of C. difficile in stool specimens was determined by anaerobic culture. Stool specimens yielding C. difficile underwent nucleic acid testing of the tcdA gene segment with a commercial assay; tcdC genotyping was performed on C. difficile isolates. Subjects positive for C. difficile by toxigenic anaerobic culture were asked to submit additional specimens. One hundred six (81%) of 130 subjects submitted specimens, and 7 (6.6%) of those subjects were colonized with C. difficile. Seven distinct tcdC genotypes were observed among the 7 C. difficile-colonized individuals, including tcdC genotype 20, which has been found in uncooked ground pork in this region. Two (33%) out of 6 C. difficile-colonized subjects who submitted additional specimens tested positive for identical C. difficile strains on successive occasions, 1 month apart. The prevalence of C. difficile carriage in this healthy cohort is concordant with prior estimates. C. difficile-colonized individuals may be important reservoirs for C. difficile and may falsely test positive for infections due to C. difficile when evaluated for community-acquired diarrhea caused by other enteric pathogens.


Asunto(s)
Portador Sano/epidemiología , Clostridioides difficile/aislamiento & purificación , Infecciones por Clostridium/epidemiología , Voluntarios Sanos , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Proteínas Bacterianas/genética , Toxinas Bacterianas/genética , Portador Sano/microbiología , Infecciones por Clostridium/microbiología , Enterotoxinas/genética , Heces/microbiología , Conducta Alimentaria , Femenino , Variación Genética , Genotipo , Técnicas de Genotipaje , Humanos , Estudios Longitudinales , Masculino , Persona de Mediana Edad , Pennsylvania/epidemiología , Prevalencia , Proteínas Represoras/genética , Encuestas y Cuestionarios , Factores de Tiempo , Adulto Joven
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...