Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 152
Filtrar
1.
Genome Biol ; 25(1): 3, 2024 Jan 02.
Artículo en Inglés | MEDLINE | ID: mdl-38167104

RESUMEN

The majority of disease-associated variants identified through genome-wide association studies are located outside of protein-coding regions. Prioritizing candidate regulatory variants and gene targets to identify potential biological mechanisms for further functional experiments can be challenging. To address this challenge, we developed FORGEdb ( https://forgedb.cancer.gov/ ; https://forge2.altiusinstitute.org/files/forgedb.html ; and https://doi.org/10.5281/zenodo.10067458 ), a standalone and web-based tool that integrates multiple datasets, delivering information on associated regulatory elements, transcription factor binding sites, and target genes for over 37 million variants. FORGEdb scores provide researchers with a quantitative assessment of the relative importance of each variant for targeted functional experiments.


Asunto(s)
Estudio de Asociación del Genoma Completo , Secuencias Reguladoras de Ácidos Nucleicos , Unión Proteica , Polimorfismo de Nucleótido Simple
2.
bioRxiv ; 2023 Nov 13.
Artículo en Inglés | MEDLINE | ID: mdl-38014075

RESUMEN

Identifying transcriptional enhancers and their target genes is essential for understanding gene regulation and the impact of human genetic variation on disease1-6. Here we create and evaluate a resource of >13 million enhancer-gene regulatory interactions across 352 cell types and tissues, by integrating predictive models, measurements of chromatin state and 3D contacts, and largescale genetic perturbations generated by the ENCODE Consortium7. We first create a systematic benchmarking pipeline to compare predictive models, assembling a dataset of 10,411 elementgene pairs measured in CRISPR perturbation experiments, >30,000 fine-mapped eQTLs, and 569 fine-mapped GWAS variants linked to a likely causal gene. Using this framework, we develop a new predictive model, ENCODE-rE2G, that achieves state-of-the-art performance across multiple prediction tasks, demonstrating a strategy involving iterative perturbations and supervised machine learning to build increasingly accurate predictive models of enhancer regulation. Using the ENCODE-rE2G model, we build an encyclopedia of enhancer-gene regulatory interactions in the human genome, which reveals global properties of enhancer networks, identifies differences in the functions of genes that have more or less complex regulatory landscapes, and improves analyses to link noncoding variants to target genes and cell types for common, complex diseases. By interpreting the model, we find evidence that, beyond enhancer activity and 3D enhancer-promoter contacts, additional features guide enhancerpromoter communication including promoter class and enhancer-enhancer synergy. Altogether, these genome-wide maps of enhancer-gene regulatory interactions, benchmarking software, predictive models, and insights about enhancer function provide a valuable resource for future studies of gene regulation and human genetics.

3.
BMC Genom Data ; 24(1): 70, 2023 11 20.
Artículo en Inglés | MEDLINE | ID: mdl-37986041

RESUMEN

Complex disorders are caused by a combination of genetic, environmental and lifestyle factors, and their prevalence can vary greatly across different populations. The extent to which genetic risk, as identified by Genome Wide Association Study (GWAS), correlates to disease prevalence in different populations has not been investigated systematically. Here, we studied 14 different complex disorders and explored whether polygenic risk scores (PRS) based on current GWAS correlate to disease prevalence within Europe and around the world. A clear variation in GWAS-based genetic risk was observed based on ancestry and we identified populations that have a higher genetic liability for developing certain disorders. We found that for four out of the 14 studied disorders, PRS significantly correlates to disease prevalence within Europe. We also found significant correlations between worldwide disease prevalence and PRS for eight of the studied disorders with Multiple Sclerosis genetic risk having the highest correlation to disease prevalence. Based on current GWAS results, the across population differences in genetic risk for certain disorders can potentially be used to understand differences in disease prevalence and identify populations with the highest genetic liability. The study highlights both the limitations of PRS based on current GWAS but also the fact that in some cases, PRS may already have high predictive power. This could be due to the genetic architecture of specific disorders or increased GWAS power in some cases.


Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Prevalencia , Factores de Riesgo , Herencia Multifactorial/genética
4.
Biol Psychiatry ; 2023 Feb 02.
Artículo en Inglés | MEDLINE | ID: mdl-36738982

RESUMEN

BACKGROUND: Tourette syndrome (TS) is a childhood-onset neurodevelopmental disorder of complex genetic architecture and is characterized by multiple motor tics and at least one vocal tic persisting for more than 1 year. METHODS: We performed a genome-wide meta-analysis integrating a novel TS cohort with previously published data, resulting in a sample size of 6133 individuals with TS and 13,565 ancestry-matched control participants. RESULTS: We identified a genome-wide significant locus on chromosome 5q15. Integration of expression quantitative trait locus, Hi-C (high-throughput chromosome conformation capture), and genome-wide association study data implicated the NR2F1 gene and associated long noncoding RNAs within the 5q15 locus. Heritability partitioning identified statistically significant enrichment in brain tissue histone marks, while polygenic risk scoring of brain volume data identified statistically significant associations with right and left thalamus volumes and right putamen volume. CONCLUSIONS: Our work presents novel insights into the neurobiology of TS, thereby opening up new directions for future studies.

5.
PLoS Genet ; 18(12): e1010557, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36574455

RESUMEN

Genetic association studies of many heritable traits resulting from physiological testing often have modest sample sizes due to the cost and burden of the required phenotyping. This reduces statistical power and limits discovery of multiple genetic associations. We present a strategy to leverage pleiotropy between traits to both discover new loci and to provide mechanistic hypotheses of the underlying pathophysiology. Specifically, we combine a colocalization test with a locus-level test of pleiotropy. In simulations, we show that this approach is highly selective for identifying true pleiotropy driven by the same causative variant, thereby improves the chance to replicate the associations in underpowered validation cohorts and leads to higher interpretability. Here, as an exemplar, we use Obstructive Sleep Apnea (OSA), a common disorder diagnosed using overnight multi-channel physiological testing. We leverage pleiotropy with relevant cellular and cardio-metabolic phenotypes and gene expression traits to map new risk loci in an underpowered OSA GWAS. We identify several pleiotropic loci harboring suggestive associations to OSA and genome-wide significant associations to other traits, and show that their OSA association replicates in independent cohorts of diverse ancestries. By investigating pleiotropic loci, our strategy allows proposing new hypotheses about OSA pathobiology across many physiological layers. For example, we identify and replicate the pleiotropy across the plateletcrit, OSA and an eQTL of DNA primase subunit 1 (PRIM1) in immune cells. We find suggestive links between OSA, a measure of lung function (FEV1/FVC), and an eQTL of matrix metallopeptidase 15 (MMP15) in lung tissue. We also link a previously known genome-wide significant peak for OSA in the hexokinase 1 (HK1) locus to hematocrit and other red blood cell related traits. Thus, the analysis of pleiotropic associations has the potential to assemble diverse phenotypes into a chain of mechanistic hypotheses that provide insight into the pathogenesis of complex human diseases.


Asunto(s)
Estudio de Asociación del Genoma Completo , Apnea Obstructiva del Sueño , Humanos , Estudio de Asociación del Genoma Completo/métodos , Fenotipo , Estudios de Asociación Genética , Sueño , Pleiotropía Genética , Polimorfismo de Nucleótido Simple , ADN Primasa
6.
Elife ; 112022 12 14.
Artículo en Inglés | MEDLINE | ID: mdl-36515579

RESUMEN

The genetic basis of most traits is highly polygenic and dominated by non-coding alleles. It is widely assumed that such alleles exert small regulatory effects on the expression of cis-linked genes. However, despite the availability of gene expression and epigenomic datasets, few variant-to-gene links have emerged. It is unclear whether these sparse results are due to limitations in available data and methods, or to deficiencies in the underlying assumed model. To better distinguish between these possibilities, we identified 220 gene-trait pairs in which protein-coding variants influence a complex trait or its Mendelian cognate. Despite the presence of expression quantitative trait loci near most GWAS associations, by applying a gene-based approach we found limited evidence that the baseline expression of trait-related genes explains GWAS associations, whether using colocalization methods (8% of genes implicated), transcription-wide association (2% of genes implicated), or a combination of regulatory annotations and distance (4% of genes implicated). These results contradict the hypothesis that most complex trait-associated variants coincide with homeostatic expression QTLs, suggesting that better models are needed. The field must confront this deficit and pursue this 'missing regulation.'


Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Humanos , Estudio de Asociación del Genoma Completo/métodos , Fenotipo , Herencia Multifactorial/genética , Epigenómica , Polimorfismo de Nucleótido Simple , Predisposición Genética a la Enfermedad
7.
Proc Natl Acad Sci U S A ; 119(41): e2205272119, 2022 10 11.
Artículo en Inglés | MEDLINE | ID: mdl-36191217

RESUMEN

Trade and colonization caused an unprecedented increase in Mediterranean human mobility in the first millennium BCE. Often seen as a dividing force, warfare is in fact another catalyst of culture contact. We provide insight into the demographic dynamics of ancient warfare by reporting genome-wide data from fifth-century soldiers who fought for the army of the Greek Sicilian colony of Himera, along with representatives of the civilian population, nearby indigenous settlements, and 96 present-day individuals from Italy and Greece. Unlike the rest of the sample, many soldiers had ancestral origins in northern Europe, the Steppe, and the Caucasus. Integrating genetic, archaeological, isotopic, and historical data, these results illustrate the significant role mercenaries played in ancient Greek armies and highlight how participation in war contributed to continental-scale human mobility in the Classical world.


Asunto(s)
Arqueología , Personal Militar , Arqueología/métodos , Europa (Continente) , Grecia , Historia Antigua , Humanos , Guerra
8.
Clin Infect Dis ; 75(1): e536-e544, 2022 08 24.
Artículo en Inglés | MEDLINE | ID: mdl-35412591

RESUMEN

BACKGROUND: The coronavirus disease 2019 (COVID-19) pandemic is dominated by variant viruses; the resulting impact on disease severity remains unclear. Using a retrospective cohort study, we assessed the hospitalization risk following infection with 7 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants. METHODS: Our study includes individuals with positive SARS-CoV-2 reverse transcription polymerase chain reaction (RT-PCR) in the Washington Disease Reporting System with available viral genome data, from 1 December 2020 to 14 January 2022. The analysis was restricted to cases with specimens collected through sentinel surveillance. Using a Cox proportional hazards model with mixed effects, we estimated hazard ratios (HR) for hospitalization risk following infection with a variant, adjusting for age, sex, calendar week, and vaccination. RESULTS: In total, 58 848 cases were sequenced through sentinel surveillance, of which 1705 (2.9%) were hospitalized due to COVID-19. Higher hospitalization risk was found for infections with Gamma (HR 3.20, 95% confidence interval [CI] 2.40-4.26), Beta (HR 2.85, 95% CI 1.56-5.23), Delta (HR 2.28 95% CI 1.56-3.34), or Alpha (HR 1.64, 95% CI 1.29-2.07) compared to infections with ancestral lineages; Omicron (HR 0.92, 95% CI .56-1.52) showed no significant difference in risk. Following Alpha, Gamma, or Delta infection, unvaccinated patients show higher hospitalization risk, while vaccinated patients show no significant difference in risk, both compared to unvaccinated, ancestral lineage cases. Hospitalization risk following Omicron infection is lower with vaccination. CONCLUSIONS: Infection with Alpha, Gamma, or Delta results in a higher hospitalization risk, with vaccination attenuating that risk. Our findings support hospital preparedness, vaccination, and genomic surveillance.


Asunto(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiología , Hospitalización , Humanos , Estudios Retrospectivos , SARS-CoV-2/genética , Washingtón/epidemiología
10.
Biophys J ; 121(6): 977-990, 2022 03 15.
Artículo en Inglés | MEDLINE | ID: mdl-35150617

RESUMEN

Methodological advances in conformation capture techniques have fundamentally changed our understanding of chromatin architecture. However, the nanoscale organization of chromatin and its cell-to-cell variance are less studied. Analyzing genome-wide data from 733 human cell and tissue samples, we identified 2 prototypical regions that exhibit high or absent hypersensitivity to deoxyribonuclease I, respectively. These regulatory active or inactive regions were examined in the lymphoblast cell line K562 by using high-throughput super-resolution microscopy. In both regions, we systematically measured the physical distance of 2 fluorescence in situ hybridization spots spaced by only 5 kb of DNA. Unexpectedly, the resulting distance distributions range from very compact to almost elongated configurations of more than 200-nm length for both the active and inactive regions. Monte Carlo simulations of a coarse-grained model of these chromatin regions based on published data of nucleosome occupancy in K562 cells were performed to understand the underlying mechanisms. There was no parameter set for the simulation model that can explain the microscopically measured distance distributions. Obviously, the chromatin state given by the strength of internucleosomal interaction, nucleosome occupancy, or amount of histone H1 differs from cell to cell, which results in the observed broad distance distributions. This large variability was not expected, especially in inactive regions. The results for the mechanisms for different distance distributions on this scale are important for understanding the contacts that mediate gene regulation. Microscopic measurements show that the inactive region investigated here is expected to be embedded in a more compact chromatin environment. The simulation results of this region require an increase in the strength of internucleosomal interactions. It may be speculated that the higher density of chromatin is caused by the increased internucleosomal interaction strength.


Asunto(s)
Cromatina , Nucleosomas , ADN/genética , Humanos , Hibridación Fluorescente in Situ/métodos , Conformación Molecular
11.
Genome Biol ; 23(1): 13, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34996498

RESUMEN

BACKGROUND: Genome-wide association study (GWAS) single nucleotide polymorphisms (SNPs) are known to preferentially co-locate to active regulatory elements in tissues and cell types relevant to disease aetiology. Further characterisation of associated cell type-specific regulation can broaden our understanding of how GWAS signals may contribute to disease risk. RESULTS: To gain insight into potential functional mechanisms underlying GWAS associations, we developed FORGE2 ( https://forge2.altiusinstitute.org/ ), which is an updated version of the FORGE web tool. FORGE2 uses an expanded atlas of cell type-specific regulatory element annotations, including DNase I hotspots, five histone mark categories and 15 hidden Markov model (HMM) chromatin states, to identify tissue- and cell type-specific signals. An analysis of 3,604 GWAS from the NHGRI-EBI GWAS catalogue yielded at least one significant disease/trait-tissue association for 2,057 GWAS, including > 400 associations specific to epigenomic marks in immune tissues and cell types, > 30 associations specific to heart tissue, and > 60 associations specific to brain tissue, highlighting the key potential of tissue- and cell type-specific regulatory elements. Importantly, we demonstrate that FORGE2 analysis can separate previously observed accessible chromatin enrichments into different chromatin states, such as enhancers or active transcription start sites, providing a greater understanding of underlying regulatory mechanisms. Interestingly, tissue-specific enrichments for repressive chromatin states and histone marks were also detected, suggesting a role for tissue-specific repressed regions in GWAS-mediated disease aetiology. CONCLUSION: In summary, we demonstrate that FORGE2 has the potential to uncover previously unreported disease-tissue associations and identify new candidate mechanisms. FORGE2 is a transparent, user-friendly web tool for the integrative analysis of loci discovered from GWAS.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Epigenómica , Predisposición Genética a la Enfermedad , Humanos , Fenotipo , Secuencias Reguladoras de Ácidos Nucleicos
12.
medRxiv ; 2022 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-34729567

RESUMEN

BACKGROUND: The COVID-19 pandemic is dominated by variant viruses; the resulting impact on disease severity remains unclear. Using a retrospective cohort study, we assessed the hospitalization risk following infection with seven SARS-CoV-2 variants. METHODS: Our study includes individuals with positive SARS-CoV-2 RT-PCR in the Washington Disease Reporting System with available viral genome data, from December 1, 2020 to January 14, 2022. The analysis was restricted to cases with specimens collected through sentinel surveillance. Using a Cox proportional hazards model with mixed effects, we estimated hazard ratios (HR) for hospitalization risk following infection with a variant, adjusting for age, sex, calendar week, and vaccination. FINDINGS: 58,848 cases were sequenced through sentinel surveillance, of which 1705 (2.9%) were hospitalized due to COVID-19. Higher hospitalization risk was found for infections with Gamma (HR 3.20, 95%CI 2.40-4.26), Beta (HR 2.85, 95%CI 1.56-5.23), Delta (HR 2.28 95%CI 1.56-3.34) or Alpha (HR 1.64, 95%CI 1.29-2.07) compared to infections with ancestral lineages; Omicron (HR 0.92, 95%CI 0.56-1.52) showed no significant difference in risk. Following Alpha, Gamma, or Delta infection, unvaccinated patients show higher hospitalization risk, while vaccinated patients show no significant difference in risk, both compared to unvaccinated, ancestral lineage cases. Hospitalization risk following Omicron infection is lower with vaccination. CONCLUSION: Infection with Alpha, Gamma, or Delta results in a higher hospitalization risk, with vaccination attenuating that risk. Our findings support hospital preparedness, vaccination, and genomic surveillance. SUMMARY: Hospitalization risk following infection with SARS-CoV-2 variant remains unclear. We find a higher hospitalization risk in cases infected with Alpha, Beta, Gamma, and Delta, but not Omicron, with vaccination lowering risk. Our findings support hospital preparedness, vaccination, and genomic surveillance.

13.
J Med Genet ; 59(8): 801-809, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-34400559

RESUMEN

BACKGROUND: Myasthenia gravis (MG) is a rare autoimmune disorder affecting the neuromuscular junction (NMJ). Here, we investigate the genetic architecture of MG via a genome-wide association study (GWAS) of the largest MG data set analysed to date. METHODS: We performed GWAS meta-analysis integrating three different data sets (total of 1401 cases and 3508 controls). We carried out human leucocyte antigen (HLA) fine-mapping, gene-based and tissue enrichment analyses and investigated genetic correlation with 13 other autoimmune disorders as well as pleiotropy across MG and correlated disorders. RESULTS: We confirmed the previously reported MG association with TNFRSF11A (rs4369774; p=1.09×10-13, OR=1.4). Furthermore, gene-based analysis revealed AGRN as a novel MG susceptibility gene. HLA fine-mapping pointed to two independent MG loci: HLA-DRB1 and HLA-B. MG onset-specific analysis reveals differences in the genetic architecture of early-onset MG (EOMG) versus late-onset MG (LOMG). Furthermore, we find MG to be genetically correlated with type 1 diabetes (T1D), rheumatoid arthritis (RA), late-onset vitiligo and autoimmune thyroid disease (ATD). Cross-disorder meta-analysis reveals multiple risk loci that appear pleiotropic across MG and correlated disorders. DISCUSSION: Our gene-based analysis identifies AGRN as a novel MG susceptibility gene, implicating for the first time a locus encoding a protein (agrin) that is directly relevant to NMJ activation. Mutations in AGRN have been found to underlie congenital myasthenic syndrome. Our results are also consistent with previous studies highlighting the role of HLA and TNFRSF11A in MG aetiology and the different risk genes in EOMG versus LOMG. Finally, we uncover the genetic correlation of MG with T1D, RA, ATD and late-onset vitiligo, pointing to shared underlying genetic mechanisms.


Asunto(s)
Artritis Reumatoide , Diabetes Mellitus Tipo 1 , Miastenia Gravis , Vitíligo , Edad de Inicio , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Miastenia Gravis/genética
14.
Nat Commun ; 12(1): 6790, 2021 11 23.
Artículo en Inglés | MEDLINE | ID: mdl-34815405

RESUMEN

Lineage commitment and differentiation is driven by the concerted action of master transcriptional regulators at their target chromatin sites. Multiple efforts have characterized the key transcription factors (TFs) that determine the various hematopoietic lineages. However, the temporal interactions between individual TFs and their chromatin targets during differentiation and how these interactions dictate lineage commitment remains poorly understood. Here we perform dense, daily, temporal profiling of chromatin accessibility (DNase I-seq) and gene expression changes (total RNA-seq) along ex vivo human erythropoiesis to comprehensively define developmentally regulated DNase I hypersensitive sites (DHSs) and transcripts. We link both distal DHSs to their target gene promoters and individual TFs to their target DHSs, revealing that the regulatory landscape is organized in distinct sequential regulatory modules that regulate lineage restriction and maturation. Finally, direct comparison of transcriptional dynamics (bulk and single-cell) and lineage potential between erythropoiesis and megakaryopoiesis uncovers differential fate commitment dynamics between the two lineages as they exit the stem and progenitor stage. Collectively, these data provide insights into the temporally regulated synergy of the cis- and the trans-regulatory components underlying hematopoietic lineage commitment and differentiation.


Asunto(s)
Linaje de la Célula/genética , Cromatina/genética , Regulación del Desarrollo de la Expresión Génica , Hematopoyesis/genética , Células Madre Hematopoyéticas/fisiología , Línea Celular , Cromatina/metabolismo , Ensayo de Unidades Formadoras de Colonias , Desoxirribonucleasa I/metabolismo , Humanos , Leucocitos Mononucleares , Cultivo Primario de Células , Regiones Promotoras Genéticas , RNA-Seq , Análisis de la Célula Individual , Factores de Transcripción/metabolismo
15.
Nat Commun ; 12(1): 2850, 2021 05 14.
Artículo en Inglés | MEDLINE | ID: mdl-33990600

RESUMEN

Functional assessment of disease-associated sequence variation at non-coding regulatory elements is complicated by their high degree of context sensitivity to both the local chromatin and nuclear environments. Allelic profiling of DNA accessibility across individuals has shown that only a select minority of sequence variation affects transcription factor (TF) occupancy, yet low sequence diversity in human populations means that no experimental assessment is available for the majority of disease-associated variants. Here we describe high-resolution in vivo maps of allelic DNA accessibility in liver, kidney, lung and B cells from 5 increasingly diverged strains of F1 hybrid mice. The high density of heterozygous sites in these hybrids enables precise quantification of effect size and cell-type specificity for hundreds of thousands of variants throughout the mouse genome. We show that chromatin-altering variants delineate characteristic sensitivity profiles for hundreds of TF motifs. We develop a compendium of TF-specific sensitivity profiles accounting for genomic context effects. Finally, we link maps of allelic accessibility to allelic transcript levels in the same samples. This work provides a foundation for quantitative prediction of cell-type specific effects of non-coding variation on TF activity, which will facilitate both fine-mapping and systems-level analyses of common disease-associated variation in human genomes.


Asunto(s)
ADN/genética , Alelos , Animales , Sitios de Unión/genética , Cromatina/genética , Cromatina/metabolismo , Mapeo Cromosómico , ADN/metabolismo , Femenino , Regulación de la Expresión Génica , Variación Genética , Genoma Humano , Humanos , Hibridación Genética , Masculino , Ratones , Ratones de la Cepa 129 , Ratones Endogámicos C3H , Ratones Endogámicos C57BL , Especificidad de Órganos/genética , Penetrancia , Secuencias Reguladoras de Ácidos Nucleicos , Factores de Transcripción/metabolismo
16.
Stem Cell Reports ; 16(4): 717-726, 2021 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-33770495

RESUMEN

T cell development is restricted to the thymus and is dependent on high levels of Notch signaling induced within the thymic microenvironment. To understand Notch function in thymic restriction, we investigated the basis for target gene selectivity in response to quantitative differences in Notch signal strength, focusing on the chromatin architecture of genes essential for T cell differentiation. We find that high Notch signal strength is required to activate promoters of known targets essential for T cell commitment, including Il2ra, Cd3ε, and Rag1, which feature low CpG content (LCG) and DNA inaccessibility in hematopoietic stem progenitor cells. Our findings suggest that promoter DNA inaccessibility at LCG T lineage genes provides robust protection against stochastic activation in inappropriate Notch signaling contexts, limiting T cell development to the thymus.


Asunto(s)
Islas de CpG/genética , Regiones Promotoras Genéticas/genética , Receptores Notch/metabolismo , Transducción de Señal , Linfocitos T/metabolismo , Animales , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Ratones Endogámicos C57BL
17.
Nature ; 584(7820): 244-251, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32728217

RESUMEN

DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA1-5 and contain genetic variations associated with diseases and phenotypic traits6-8. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation.


Asunto(s)
Cromatina/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Anotación de Secuencia Molecular , Cromatina/química , Cromatina/metabolismo , ADN/química , ADN/genética , Regulación de la Expresión Génica , Genes/genética , Genoma Humano/genética , Humanos , Regiones Promotoras Genéticas/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética
18.
Nature ; 583(7818): 729-736, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728250

RESUMEN

Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.


Asunto(s)
Huella de ADN/normas , Genoma Humano/genética , Factores de Transcripción/metabolismo , Secuencia de Consenso , ADN/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Genética de Población , Estudio de Asociación del Genoma Completo , Humanos , Modelos Moleculares , Polimorfismo de Nucleótido Simple , Secuencias Reguladoras de Ácidos Nucleicos/genética
19.
Nature ; 583(7818): 693-698, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728248

RESUMEN

The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated with gene regulation (for example, transcription factor binding sites, open chromatin, and histone marks) and transcript isoforms. The marks serve as sites for candidate cis-regulatory elements (cCREs) that may serve functional roles in regulating gene expression1. The project has been extended to model organisms, particularly the mouse. In the third phase of ENCODE, nearly a million and more than 300,000 cCRE annotations have been generated for human and mouse, respectively, and these have provided a valuable resource for the scientific community.


Asunto(s)
Bases de Datos Genéticas , Genoma/genética , Genómica , Anotación de Secuencia Molecular , Animales , Sitios de Unión , Cromatina/genética , Cromatina/metabolismo , Metilación de ADN , Bases de Datos Genéticas/normas , Bases de Datos Genéticas/tendencias , Regulación de la Expresión Génica/genética , Genoma Humano/genética , Genómica/normas , Genómica/tendencias , Histonas/metabolismo , Humanos , Ratones , Anotación de Secuencia Molecular/normas , Control de Calidad , Secuencias Reguladoras de Ácidos Nucleicos/genética , Factores de Transcripción/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...