Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 63
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nature ; 624(7992): 621-629, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38049589

RESUMO

Type 2 diabetes mellitus (T2D), a major cause of worldwide morbidity and mortality, is characterized by dysfunction of insulin-producing pancreatic islet ß cells1,2. T2D genome-wide association studies (GWAS) have identified hundreds of signals in non-coding and ß cell regulatory genomic regions, but deciphering their biological mechanisms remains challenging3-5. Here, to identify early disease-driving events, we performed traditional and multiplexed pancreatic tissue imaging, sorted-islet cell transcriptomics and islet functional analysis of early-stage T2D and control donors. By integrating diverse modalities, we show that early-stage T2D is characterized by ß cell-intrinsic defects that can be proportioned into gene regulatory modules with enrichment in signals of genetic risk. After identifying the ß cell hub gene and transcription factor RFX6 within one such module, we demonstrated multiple layers of genetic risk that converge on an RFX6-mediated network to reduce insulin secretion by ß cells. RFX6 perturbation in primary human islet cells alters ß cell chromatin architecture at regions enriched for T2D GWAS signals, and population-scale genetic analyses causally link genetically predicted reduced RFX6 expression with increased T2D risk. Understanding the molecular mechanisms of complex, systemic diseases necessitates integration of signals from multiple molecules, cells, organs and individuals, and thus we anticipate that this approach will be a useful template to identify and validate key regulatory networks and master hub genes for other diseases or traits using GWAS data.


Assuntos
Diabetes Mellitus Tipo 2 , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Predisposição Genética para Doença , Ilhotas Pancreáticas , Humanos , Estudos de Casos e Controles , Separação Celular , Cromatina/metabolismo , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Diabetes Mellitus Tipo 2/patologia , Diabetes Mellitus Tipo 2/fisiopatologia , Redes Reguladoras de Genes/genética , Estudo de Associação Genômica Ampla , Secreção de Insulina , Ilhotas Pancreáticas/metabolismo , Ilhotas Pancreáticas/patologia , Reprodutibilidade dos Testes
2.
Am J Hum Genet ; 111(10): 2107-2116, 2024 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-39255798

RESUMO

Same-sex sexual behavior has long interested genetics researchers in part because, while there is evidence of heritability, the trait as typically defined is associated with fewer offspring. Investigations of this phenomenon began in the 1990s with linkage studies and continue today with the advent of genome-wide association studies. As this body of research grows, so does critical scientific and ethical review of it. Here, we provide a targeted overview of existing genetics studies on same-sex sexual behavior, highlight the ethical and scientific considerations of this nascent field, and provide recommendations developed by the authors to enhance social and ethical responsibility.


Assuntos
Estudo de Associação Genômica Ampla , Comportamento Sexual , Humanos , Masculino , Feminino , Homossexualidade
3.
PLoS Comput Biol ; 20(8): e1012344, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39196899

RESUMO

Recent studies show that cellular neighborhoods play an important role in evolving biological events such as cancer and diabetes. Therefore, it is critical to accurately and efficiently identify cellular neighborhoods from spatially-resolved single-cell transcriptomic data or single-cell resolution tissue imaging data. In this work, we develop CNTools, a computational toolbox for end-to-end cellular neighborhood analysis on annotated cell images, comprising both the identification and analysis steps. It includes state-of-the-art cellular neighborhood identification methods and post-identification smoothing techniques, with our newly proposed Cellular Neighbor Embedding (CNE) method and Naive Smoothing technique, as well as several established downstream analysis approaches. We applied CNTools on three real-world CODEX datasets and evaluated identification methods with smoothing techniques quantitatively and qualitatively. It shows that CNE with Naive Smoothing overall outperformed other methods and revealed more convincing biological insights. We also provided suggestions on how to choose proper identification methods and smoothing techniques according to input data.


Assuntos
Biologia Computacional , Processamento de Imagem Assistida por Computador , Biologia Computacional/métodos , Humanos , Processamento de Imagem Assistida por Computador/métodos , Análise de Célula Única/métodos , Algoritmos , Software
4.
Am J Hum Genet ; 108(7): 1169-1189, 2021 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-34038741

RESUMO

Identifying the molecular mechanisms by which genome-wide association study (GWAS) loci influence traits remains challenging. Chromatin accessibility quantitative trait loci (caQTLs) help identify GWAS loci that may alter GWAS traits by modulating chromatin structure, but caQTLs have been identified in a limited set of human tissues. Here we mapped caQTLs in human liver tissue in 20 liver samples and identified 3,123 caQTLs. The caQTL variants are enriched in liver tissue promoter and enhancer states and frequently disrupt binding motifs of transcription factors expressed in liver. We predicted target genes for 861 caQTL peaks using proximity, chromatin interactions, correlation with promoter accessibility or gene expression, and colocalization with expression QTLs. Using GWAS signals for 19 liver function and/or cardiometabolic traits, we identified 110 colocalized caQTLs and GWAS signals, 56 of which contained a predicted caPeak target gene. At the LITAF LDL-cholesterol GWAS locus, we validated that a caQTL variant showed allelic differences in protein binding and transcriptional activity. These caQTLs contribute to the epigenomic characterization of human liver and help identify molecular mechanisms and genes at GWAS loci.


Assuntos
Cromatina/metabolismo , Fígado/metabolismo , Locos de Características Quantitativas , Motivos de Aminoácidos , Sítios de Ligação , Montagem e Desmontagem da Cromatina , Elementos Facilitadores Genéticos , Variação Genética , Estudo de Associação Genômica Ampla , Humanos , Regiões Promotoras Genéticas , Ligação Proteica , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Transcriptoma
5.
Genome Res ; 31(12): 2258-2275, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34815310

RESUMO

Skeletal muscle accounts for the largest proportion of human body mass, on average, and is a key tissue in complex diseases and mobility. It is composed of several different cell and muscle fiber types. Here, we optimize single-nucleus ATAC-seq (snATAC-seq) to map skeletal muscle cell-specific chromatin accessibility landscapes in frozen human and rat samples, and single-nucleus RNA-seq (snRNA-seq) to map cell-specific transcriptomes in human. We additionally perform multi-omics profiling (gene expression and chromatin accessibility) on human and rat muscle samples. We capture type I and type II muscle fiber signatures, which are generally missed by existing single-cell RNA-seq methods. We perform cross-modality and cross-species integrative analyses on 33,862 nuclei and identify seven cell types ranging in abundance from 59.6% to 1.0% of all nuclei. We introduce a regression-based approach to infer cell types by comparing transcription start site-distal ATAC-seq peaks to reference enhancer maps and show consistency with RNA-based marker gene cell type assignments. We find heterogeneity in enrichment of genetic variants linked to complex phenotypes from the UK Biobank and diabetes genome-wide association studies in cell-specific ATAC-seq peaks, with the most striking enrichment patterns in muscle mesenchymal stem cells (∼3.5% of nuclei). Finally, we overlay these chromatin accessibility maps on GWAS data to nominate causal cell types, SNPs, transcription factor motifs, and target genes for type 2 diabetes signals. These chromatin accessibility profiles for human and rat skeletal muscle cell types are a useful resource for nominating causal GWAS SNPs and cell types.

6.
Nature ; 536(7614): 41-47, 2016 08 04.
Artigo em Inglês | MEDLINE | ID: mdl-27398621

RESUMO

The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.


Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença/genética , Variação Genética/genética , Alelos , Análise Mutacional de DNA , Europa (Continente)/etnologia , Exoma , Estudo de Associação Genômica Ampla , Técnicas de Genotipagem , Humanos , Tamanho da Amostra
7.
Proc Natl Acad Sci U S A ; 116(22): 10883-10888, 2019 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-31076557

RESUMO

We integrate comeasured gene expression and DNA methylation (DNAme) in 265 human skeletal muscle biopsies from the FUSION study with >7 million genetic variants and eight physiological traits: height, waist, weight, waist-hip ratio, body mass index, fasting serum insulin, fasting plasma glucose, and type 2 diabetes. We find hundreds of genes and DNAme sites associated with fasting insulin, waist, and body mass index, as well as thousands of DNAme sites associated with gene expression (eQTM). We find that controlling for heterogeneity in tissue/muscle fiber type reduces the number of physiological trait associations, and that long-range eQTMs (>1 Mb) are reduced when controlling for tissue/muscle fiber type or latent factors. We map genetic regulators (quantitative trait loci; QTLs) of expression (eQTLs) and DNAme (mQTLs). Using Mendelian randomization (MR) and mediation techniques, we leverage these genetic maps to predict 213 causal relationships between expression and DNAme, approximately two-thirds of which predict methylation to causally influence expression. We use MR to integrate FUSION mQTLs, FUSION eQTLs, and GTEx eQTLs for 48 tissues with genetic associations for 534 diseases and quantitative traits. We identify hundreds of genes and thousands of DNAme sites that may drive the reported disease/quantitative trait genetic associations. We identify 300 gene expression MR associations that are present in both FUSION and GTEx skeletal muscle and that show stronger evidence of MR association in skeletal muscle than other tissues, which may partially reflect differences in power across tissues. As one example, we find that increased RXRA muscle expression may decrease lean tissue mass.


Assuntos
Metilação de DNA/genética , Expressão Gênica/genética , Músculo Esquelético , Glicemia/análise , Pesos e Medidas Corporais , Diabetes Mellitus Tipo 2 , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Insulina/análise , Músculo Esquelético/química , Músculo Esquelético/fisiologia , Locos de Características Quantitativas/genética
8.
BMC Biol ; 19(1): 76, 2021 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-33858413

RESUMO

BACKGROUND: The pituitary gland is a neuroendocrine organ containing diverse cell types specialized in secreting hormones that regulate physiology. Pituitary thyrotropes produce thyroid-stimulating hormone (TSH), a critical factor for growth and maintenance of metabolism. The transcription factors POU1F1 and GATA2 have been implicated in thyrotrope fate, but the transcriptomic and epigenomic landscapes of these neuroendocrine cells have not been characterized. The goal of this work was to discover transcriptional regulatory elements that drive thyrotrope fate. RESULTS: We identified the transcription factors and epigenomic changes in chromatin that are associated with differentiation of POU1F1-expressing progenitors into thyrotropes using cell lines that represent an undifferentiated Pou1f1 lineage progenitor (GHF-T1) and a committed thyrotrope line that produces TSH (TαT1). We compared RNA-seq, ATAC-seq, histone modification (H3K27Ac, H3K4Me1, and H3K27Me3), and POU1F1 binding in these cell lines. POU1F1 binding sites are commonly associated with bZIP transcription factor consensus binding sites in GHF-T1 cells and Helix-Turn-Helix (HTH) or basic Helix-Loop-Helix (bHLH) factors in TαT1 cells, suggesting that these classes of transcription factors may recruit or cooperate with POU1F1 binding at unique sites. We validated enhancer function of novel elements we mapped near Cga, Pitx1, Gata2, and Tshb by transfection in TαT1 cells. Finally, we confirmed that an enhancer element near Tshb can drive expression in thyrotropes of transgenic mice, and we demonstrate that GATA2 enhances Tshb expression through this element. CONCLUSION: These results extend the ENCODE multi-omic profiling approach to the pituitary gland, which should be valuable for understanding pituitary development and disease pathogenesis.


Assuntos
Hipófise , Animais , Camundongos , Hipófise/metabolismo , Sequências Reguladoras de Ácido Nucleico , Tireotropina/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transfecção
9.
Hum Mol Genet ; 28(5): 736-750, 2019 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-30380057

RESUMO

Danforth's short tail (Sd) mice provide an excellent model for investigating the underlying etiology of human caudal birth defects, which affect 1 in 10 000 live births. Sd animals exhibit aberrant axial skeleton, urogenital and gastrointestinal development similar to human caudal malformation syndromes including urorectal septum malformation, caudal regression, vertebral-anal-cardiac-tracheo-esophageal fistula-renal-limb (VACTERL) association and persistent cloaca. Previous studies have shown that the Sd mutation results from an endogenous retroviral (ERV) insertion upstream of the Ptf1a gene resulting in its ectopic expression at E9.5. Though the genetic lesion has been determined, the resulting epigenomic and transcriptomic changes driving the phenotype have not been investigated. Here, we performed ATAC-seq experiments on isolated E9.5 tailbud tissue, which revealed minimal changes in chromatin accessibility in Sd/Sd mutant embryos. Interestingly, chromatin changes were localized to a small interval adjacent to the Sd ERV insertion overlapping a known Ptf1a enhancer region, which is conserved in mice and humans. Furthermore, mRNA-seq experiments revealed increased transcription of Ptf1a target genes and, importantly, downregulation of hedgehog pathway genes. Reduced sonic hedgehog (SHH) signaling was confirmed by in situ hybridization and immunofluorescence suggesting that the Sd phenotype results, in part, from downregulated SHH signaling. Taken together, these data demonstrate substantial transcriptome changes in the Sd mouse, and indicate that the effect of the ERV insertion on Ptf1a expression may be mediated by increased chromatin accessibility at a conserved Ptf1a enhancer. We propose that human caudal dysgenesis disorders may result from dysregulation of hedgehog signaling pathways.


Assuntos
Montagem e Desmontagem da Cromatina , Cromatina/genética , Cromatina/metabolismo , Epigenoma , Proteínas Hedgehog/metabolismo , Transdução de Sinais , Transcriptoma , Animais , Biomarcadores , Biologia Computacional/métodos , Elementos Facilitadores Genéticos , Imunofluorescência , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Ontologia Genética , Camundongos , Mutação , Organogênese/genética , Fenótipo , Regiões Promotoras Genéticas
10.
Am J Hum Genet ; 102(4): 620-635, 2018 04 05.
Artigo em Inglês | MEDLINE | ID: mdl-29625024

RESUMO

Genome-wide association studies (GWASs) and functional genomics approaches implicate enhancer disruption in islet dysfunction and type 2 diabetes (T2D) risk. We applied genetic fine-mapping and functional (epi)genomic approaches to a T2D- and proinsulin-associated 15q22.2 locus to identify a most likely causal variant, determine its direction of effect, and elucidate plausible target genes. Fine-mapping and conditional analyses of proinsulin levels of 8,635 non-diabetic individuals from the METSIM study support a single association signal represented by a cluster of 16 strongly associated (p < 10-17) variants in high linkage disequilibrium (r2 > 0.8) with the GWAS index SNP rs7172432. These variants reside in an evolutionarily and functionally conserved islet and ß cell stretch or super enhancer; the most strongly associated variant (rs7163757, p = 3 × 10-19) overlaps a conserved islet open chromatin site. DNA sequence containing the rs7163757 risk allele displayed 2-fold higher enhancer activity than the non-risk allele in reporter assays (p < 0.01) and was differentially bound by ß cell nuclear extract proteins. Transcription factor NFAT specifically potentiated risk-allele enhancer activity and altered patterns of nuclear protein binding to the risk allele in vitro, suggesting that it could be a factor mediating risk-allele effects. Finally, the rs7163757 proinsulin-raising and T2D risk allele (C) was associated with increased expression of C2CD4B, and possibly C2CD4A, both of which were induced by inflammatory cytokines, in human islets. Together, these data suggest that rs7163757 contributes to genetic risk of islet dysfunction and T2D by increasing NFAT-mediated islet enhancer activity and modulating C2CD4B, and possibly C2CD4A, expression in (patho)physiologic states.


Assuntos
Proteínas de Ligação ao Cálcio/genética , Sequência Conservada , Elementos Facilitadores Genéticos/genética , Evolução Molecular , Ilhotas Pancreáticas/patologia , Mutação/genética , Proteínas Nucleares/genética , Fatores de Transcrição/genética , Idoso , Alelos , Animais , Sequência de Bases , Proteínas de Ligação ao Cálcio/metabolismo , Linhagem Celular , Cromatina/metabolismo , Cromossomos Humanos Par 15/genética , Citocinas/metabolismo , DNA Intergênico/genética , Humanos , Mediadores da Inflamação/metabolismo , Camundongos , Pessoa de Meia-Idade , Fatores de Transcrição NFATC/metabolismo , Mapeamento Físico do Cromossomo , Polimorfismo de Nucleotídeo Único/genética , Proinsulina/metabolismo , Ratos , Fatores de Risco
11.
Nature ; 520(7548): 558-62, 2015 Apr 23.
Artigo em Inglês | MEDLINE | ID: mdl-25686607

RESUMO

Enhancers regulate spatiotemporal gene expression and impart cell-specific transcriptional outputs that drive cell identity. Super-enhancers (SEs), also known as stretch-enhancers, are a subset of enhancers especially important for genes associated with cell identity and genetic risk of disease. CD4(+) T cells are critical for host defence and autoimmunity. Here we analysed maps of mouse T-cell SEs as a non-biased means of identifying key regulatory nodes involved in cell specification. We found that cytokines and cytokine receptors were the dominant class of genes exhibiting SE architecture in T cells. Nonetheless, the locus encoding Bach2, a key negative regulator of effector differentiation, emerged as the most prominent T-cell SE, revealing a network in which SE-associated genes critical for T-cell biology are repressed by BACH2. Disease-associated single-nucleotide polymorphisms for immune-mediated disorders, including rheumatoid arthritis, were highly enriched for T-cell SEs versus typical enhancers or SEs in other cell lineages. Intriguingly, treatment of T cells with the Janus kinase (JAK) inhibitor tofacitinib disproportionately altered the expression of rheumatoid arthritis risk genes with SE structures. Together, these results indicate that genes with SE architecture in T cells encompass a variety of cytokines and cytokine receptors but are controlled by a 'guardian' transcription factor, itself endowed with an SE. Thus, enumeration of SEs allows the unbiased determination of key regulatory nodes in T cells, which are preferentially modulated by pharmacological intervention.


Assuntos
Artrite Reumatoide/genética , Elementos Facilitadores Genéticos/genética , Linfócitos T Auxiliares-Indutores/metabolismo , Linfócitos T Auxiliares-Indutores/patologia , Animais , Artrite Reumatoide/imunologia , Artrite Reumatoide/patologia , Fatores de Transcrição de Zíper de Leucina Básica/metabolismo , Diferenciação Celular/genética , Linhagem da Célula/genética , Regulação da Expressão Gênica/genética , Predisposição Genética para Doença/genética , Janus Quinase 3/antagonistas & inibidores , Camundongos , Camundongos Endogâmicos C57BL , Piperidinas/farmacologia , Pirimidinas/farmacologia , Pirróis/farmacologia , RNA não Traduzido/genética , Linfócitos T Auxiliares-Indutores/imunologia , Transcrição Gênica/genética , Fatores de Transcrição de p300-CBP/metabolismo
12.
Nature ; 512(7515): 449-52, 2014 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-25164756

RESUMO

Genome function is dynamically regulated in part by chromatin, which consists of the histones, non-histone proteins and RNA molecules that package DNA. Studies in Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular mechanisms of genome function in humans, and have revealed conservation of chromatin components and mechanisms. Nevertheless, the three organisms have markedly different genome sizes, chromosome architecture and gene organization. On human and fly chromosomes, for example, pericentric heterochromatin flanks single centromeres, whereas worm chromosomes have dispersed heterochromatin-like regions enriched in the distal chromosomal 'arms', and centromeres distributed along their lengths. To systematically investigate chromatin organization and associated gene regulation across species, we generated and analysed a large collection of genome-wide chromatin data sets from cell lines and developmental stages in worm, fly and human. Here we present over 800 new data sets from our ENCODE and modENCODE consortia, bringing the total to over 1,400. Comparison of combinatorial patterns of histone modifications, nuclear lamina-associated domains, organization of large-scale topological domains, chromatin environment at promoters and enhancers, nucleosome positioning, and DNA replication patterns reveals many conserved features of chromatin organization among the three organisms. We also find notable differences in the composition and locations of repressive chromatin. These data sets and analyses provide a rich resource for comparative and species-specific investigations of chromatin composition, organization and function.


Assuntos
Caenorhabditis elegans/citologia , Caenorhabditis elegans/genética , Cromatina/genética , Cromatina/metabolismo , Drosophila melanogaster/citologia , Drosophila melanogaster/genética , Animais , Linhagem Celular , Centrômero/genética , Centrômero/metabolismo , Cromatina/química , Montagem e Desmontagem da Cromatina/genética , Replicação do DNA/genética , Elementos Facilitadores Genéticos/genética , Epigênese Genética , Heterocromatina/química , Heterocromatina/genética , Heterocromatina/metabolismo , Histonas/química , Histonas/metabolismo , Humanos , Anotação de Sequência Molecular , Lâmina Nuclear/metabolismo , Nucleossomos/química , Nucleossomos/genética , Nucleossomos/metabolismo , Regiões Promotoras Genéticas/genética , Especificidade da Espécie
13.
Proc Natl Acad Sci U S A ; 114(9): 2301-2306, 2017 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-28193859

RESUMO

Genome-wide association studies (GWAS) have identified >100 independent SNPs that modulate the risk of type 2 diabetes (T2D) and related traits. However, the pathogenic mechanisms of most of these SNPs remain elusive. Here, we examined genomic, epigenomic, and transcriptomic profiles in human pancreatic islets to understand the links between genetic variation, chromatin landscape, and gene expression in the context of T2D. We first integrated genome and transcriptome variation across 112 islet samples to produce dense cis-expression quantitative trait loci (cis-eQTL) maps. Additional integration with chromatin-state maps for islets and other diverse tissue types revealed that cis-eQTLs for islet-specific genes are specifically and significantly enriched in islet stretch enhancers. High-resolution chromatin accessibility profiling using assay for transposase-accessible chromatin sequencing (ATAC-seq) in two islet samples enabled us to identify specific transcription factor (TF) footprints embedded in active regulatory elements, which are highly enriched for islet cis-eQTL. Aggregate allelic bias signatures in TF footprints enabled us de novo to reconstruct TF binding affinities genetically, which support the high-quality nature of the TF footprint predictions. Interestingly, we found that T2D GWAS loci were strikingly and specifically enriched in islet Regulatory Factor X (RFX) footprints. Remarkably, within and across independent loci, T2D risk alleles that overlap with RFX footprints uniformly disrupt the RFX motifs at high-information content positions. Together, these results suggest that common regulatory variations have shaped islet TF footprints and the transcriptome and that a confluent RFX regulatory grammar plays a significant role in the genetic component of T2D predisposition.


Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Genoma Humano , Ilhotas Pancreáticas/metabolismo , Locos de Características Quantitativas , Transcriptoma , Alelos , Sequência de Bases , Sítios de Ligação , Cromatina/química , Cromatina/metabolismo , Diabetes Mellitus Tipo 2/metabolismo , Diabetes Mellitus Tipo 2/patologia , Epigênese Genética , Perfilação da Expressão Gênica , Variação Genética , Estudo de Associação Genômica Ampla , Impressão Genômica , Humanos , Ilhotas Pancreáticas/patologia , Polimorfismo de Nucleotídeo Único , Ligação Proteica , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Fatores de Transcrição de Fator Regulador X/genética , Fatores de Transcrição de Fator Regulador X/metabolismo
14.
Diabetologia ; 62(5): 735-743, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30756131

RESUMO

Variation in non-coding DNA, encompassing gene regulatory regions such as enhancers and promoters, contributes to risk for complex disorders, including type 2 diabetes. While genome-wide association studies have successfully identified hundreds of type 2 diabetes loci throughout the genome, the vast majority of these reside in non-coding DNA, which complicates the process of determining their functional significance and level of priority for further study. Here we review the methods used to experimentally annotate these non-coding variants, to nominate causal variants and to link them to diabetes pathophysiology. In recent years, chromatin profiling, massively parallel sequencing, high-throughput reporter assays and CRISPR gene editing technologies have rapidly become indispensable tools. Rather than treating individual variants in isolation, we discuss the importance of accounting for context, both genetic (such as flanking DNA sequence) and environmental (such as cellular state or environmental exposure). Incorporating these features shows promise in terms of revealing biologically convergent molecular signatures across distant and seemingly unrelated loci. Studying regulatory elements in the proper context will be crucial for interpreting the functional significance of disease-associated variants and applying the resulting knowledge to improve patient care.


Assuntos
Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/genética , Estudo de Associação Genômica Ampla , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Cromatina/química , Predisposição Genética para Doença , Variação Genética , Genoma Humano , Genômica , Histonas/química , Humanos , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico
15.
Bioinformatics ; 34(20): 3578-3580, 2018 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-29790915

RESUMO

Motivation: Motif discovery in large biopolymer sequence datasets can be computationally demanding, presenting significant challenges for discovery in omics research. MEME, arguably one of the most popular motif discovery software, takes quadratic time with respect to dataset size, leading to excessively long runtimes for large datasets. Therefore, there is a demand for fast programs that can generate results of the same quality as MEME. Results: Here we describe YAMDA, a highly scalable motif discovery software package. It is built on Pytorch, a tensor computation deep learning library with strong GPU acceleration that is highly optimized for tensor operations that are also useful for motifs. YAMDA takes linear time to find motifs as accurately as MEME, completing in seconds or minutes, which translates to speedups over a thousandfold. Availability and implementation: YAMDA is freely available on Github (https://github.com/daquang/YAMDA). Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Aprendizado Profundo , Software , Algoritmos , Gráficos por Computador , Fatores de Tempo
16.
BMC Genomics ; 19(1): 390, 2018 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-29792182

RESUMO

BACKGROUND: Bisulfite sequencing is widely employed to study the role of DNA methylation in disease; however, the data suffer from biases due to coverage depth variability. Imputation of methylation values at low-coverage sites may mitigate these biases while also identifying important genomic features associated with predictive power. RESULTS: Here we describe BoostMe, a method for imputing low-quality DNA methylation estimates within whole-genome bisulfite sequencing (WGBS) data. BoostMe uses a gradient boosting algorithm, XGBoost, and leverages information from multiple samples for prediction. We find that BoostMe outperforms existing algorithms in speed and accuracy when applied to WGBS of human tissues. Furthermore, we show that imputation improves concordance between WGBS and the MethylationEPIC array at low WGBS depth, suggesting improved WGBS accuracy after imputation. CONCLUSIONS: Our findings support the use of BoostMe as a preprocessing step for WGBS analysis.


Assuntos
Biologia Computacional/métodos , Metilação de DNA/efeitos dos fármacos , Sulfitos/farmacologia , Sequenciamento Completo do Genoma , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos
17.
Nucleic Acids Res ; 43(Database issue): D103-9, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25326329

RESUMO

Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species.


Assuntos
DNA/química , Bases de Dados de Ácidos Nucleicos , Genoma , Anotação de Sequência Molecular , Navegador , Animais , Sítios de Ligação , Humanos , Conformação de Ácido Nucleico , Nucleossomos/metabolismo , Sítio de Iniciação de Transcrição
18.
Proc Natl Acad Sci U S A ; 110(44): 17921-6, 2013 Oct 29.
Artigo em Inglês | MEDLINE | ID: mdl-24127591

RESUMO

Chromatin-based functional genomic analyses and genomewide association studies (GWASs) together implicate enhancers as critical elements influencing gene expression and risk for common diseases. Here, we performed systematic chromatin and transcriptome profiling in human pancreatic islets. Integrated analysis of islet data with those from nine cell types identified specific and significant enrichment of type 2 diabetes and related quantitative trait GWAS variants in islet enhancers. Our integrated chromatin maps reveal that most enhancers are short (median = 0.8 kb). Each cell type also contains a substantial number of more extended (≥ 3 kb) enhancers. Interestingly, these stretch enhancers are often tissue-specific and overlap locus control regions, suggesting that they are important chromatin regulatory beacons. Indeed, we show that (i) tissue specificity of enhancers and nearby gene expression increase with enhancer length; (ii) neighborhoods containing stretch enhancers are enriched for important cell type-specific genes; and (iii) GWAS variants associated with traits relevant to a particular cell type are more enriched in stretch enhancers compared with short enhancers. Reporter constructs containing stretch enhancer sequences exhibited tissue-specific activity in cell culture experiments and in transgenic mice. These results suggest that stretch enhancers are critical chromatin elements for coordinating cell type-specific regulatory programs and that sequence variation in stretch enhancers affects risk of major common human diseases.


Assuntos
Diferenciação Celular/fisiologia , Cromatina/fisiologia , Diabetes Mellitus Tipo 2/fisiopatologia , Elementos Facilitadores Genéticos/genética , Epigenômica/métodos , Regulação da Expressão Gênica/fisiologia , Células Secretoras de Insulina/metabolismo , Animais , Imunoprecipitação da Cromatina , Diabetes Mellitus Tipo 2/genética , Elementos Facilitadores Genéticos/fisiologia , Perfilação da Expressão Gênica , Regulação da Expressão Gênica/genética , Estudo de Associação Genômica Ampla , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Células Secretoras de Insulina/fisiologia , Luciferases , Camundongos , Camundongos Transgênicos
19.
Proc Natl Acad Sci U S A ; 110(33): 13481-6, 2013 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-23901115

RESUMO

Synonymous mutations, which do not alter the protein sequence, have been shown to affect protein function [Sauna ZE, Kimchi-Sarfaty C (2011) Nat Rev Genet 12(10):683-691]. However, synonymous mutations are rarely investigated in the cancer genomics field. We used whole-genome and -exome sequencing to identify somatic mutations in 29 melanoma samples. Validation of one synonymous somatic mutation in BCL2L12 in 285 samples identified 12 cases that harbored the recurrent F17F mutation. This mutation led to increased BCL2L12 mRNA and protein levels because of differential targeting of WT and mutant BCL2L12 by hsa-miR-671-5p. Protein made from mutant BCL2L12 transcript bound p53, inhibited UV-induced apoptosis more efficiently than WT BCL2L12, and reduced endogenous p53 target gene transcription. This report shows selection of a recurrent somatic synonymous mutation in cancer. Our data indicate that silent alterations have a role to play in human cancer, emphasizing the importance of their investigation in future cancer genome studies.


Assuntos
Apoptose/genética , Regulação da Expressão Gênica/genética , Genoma Humano/genética , Melanoma/genética , Proteínas Musculares/genética , Proteínas Proto-Oncogênicas c-bcl-2/genética , Sequência de Bases , Western Blotting , Primers do DNA/genética , Exoma/genética , Vetores Genéticos/genética , Células HEK293 , Humanos , Imunoprecipitação , Lentivirus , MicroRNAs/genética , Dados de Sequência Molecular , Proteínas Musculares/metabolismo , Mutação/genética , Polimorfismo de Nucleotídeo Único/genética , Proteínas Proto-Oncogênicas c-bcl-2/metabolismo , RNA Interferente Pequeno/genética , Reação em Cadeia da Polimerase em Tempo Real , Análise de Sequência de DNA , Proteína Supressora de Tumor p53/metabolismo
20.
PLoS Genet ; 8(6): e1002789, 2012 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-22761590

RESUMO

Understanding the molecular basis for phenotypic differences between humans and other primates remains an outstanding challenge. Mutations in non-coding regulatory DNA that alter gene expression have been hypothesized as a key driver of these phenotypic differences. This has been supported by differential gene expression analyses in general, but not by the identification of specific regulatory elements responsible for changes in transcription and phenotype. To identify the genetic source of regulatory differences, we mapped DNaseI hypersensitive (DHS) sites, which mark all types of active gene regulatory elements, genome-wide in the same cell type isolated from human, chimpanzee, and macaque. Most DHS sites were conserved among all three species, as expected based on their central role in regulating transcription. However, we found evidence that several hundred DHS sites were gained or lost on the lineages leading to modern human and chimpanzee. Species-specific DHS site gains are enriched near differentially expressed genes, are positively correlated with increased transcription, show evidence of branch-specific positive selection, and overlap with active chromatin marks. Species-specific sequence differences in transcription factor motifs found within these DHS sites are linked with species-specific changes in chromatin accessibility. Together, these indicate that the regulatory elements identified here are genetic contributors to transcriptional and phenotypic differences among primate species.


Assuntos
Desoxirribonuclease I/genética , Evolução Molecular , Primatas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Transcrição Gênica , Animais , Sítios de Ligação/genética , Linhagem Celular , Cromatina/genética , Regulação da Expressão Gênica , Genoma Humano , Humanos , Mutação , Motivos de Nucleotídeos , Fenótipo , Seleção Genética , Especificidade da Espécie , Fatores de Transcrição/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA