Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
1.
Cell ; 184(24): 5985-6001.e19, 2021 11 24.
Article in English | MEDLINE | ID: mdl-34774128

ABSTRACT

Current catalogs of regulatory sequences in the human genome are still incomplete and lack cell type resolution. To profile the activity of gene regulatory elements in diverse cell types and tissues in the human body, we applied single-cell chromatin accessibility assays to 30 adult human tissue types from multiple donors. We integrated these datasets with previous single-cell chromatin accessibility data from 15 fetal tissue types to reveal the status of open chromatin for ∼1.2 million candidate cis-regulatory elements (cCREs) in 222 distinct cell types comprised of >1.3 million nuclei. We used these chromatin accessibility maps to delineate cell-type-specificity of fetal and adult human cCREs and to systematically interpret the noncoding variants associated with complex human traits and diseases. This rich resource provides a foundation for the analysis of gene regulatory programs in human cell types across tissues, life stages, and organ systems.


Subject(s)
Chromatin/metabolism , Genome, Human , Single-Cell Analysis , Adult , Cluster Analysis , Fetus/metabolism , Genetic Variation , Genome-Wide Association Study , Humans , Organ Specificity , Phylogeny , Regulatory Sequences, Nucleic Acid/genetics , Risk Factors
2.
Nature ; 591(7848): 147-151, 2021 03.
Article in English | MEDLINE | ID: mdl-33505025

ABSTRACT

Many sequence variants have been linked to complex human traits and diseases1, but deciphering their biological functions remains challenging, as most of them reside in noncoding DNA. Here we have systematically assessed the binding of 270 human transcription factors to 95,886 noncoding variants in the human genome using an ultra-high-throughput multiplex protein-DNA binding assay, termed single-nucleotide polymorphism evaluation by systematic evolution of ligands by exponential enrichment (SNP-SELEX). The resulting 828 million measurements of transcription factor-DNA interactions enable estimation of the relative affinity of these transcription factors to each variant in vitro and evaluation of the current methods to predict the effects of noncoding variants on transcription factor binding. We show that the position weight matrices of most transcription factors lack sufficient predictive power, whereas the support vector machine combined with the gapped k-mer representation show much improved performance, when assessed on results from independent SNP-SELEX experiments involving a new set of 61,020 sequence variants. We report highly predictive models for 94 human transcription factors and demonstrate their utility in genome-wide association studies and understanding of the molecular pathways involved in diverse human traits and diseases.


Subject(s)
Polymorphism, Single Nucleotide/genetics , SELEX Aptamer Technique , Support Vector Machine , Transcription Factors/metabolism , Binding Sites/genetics , Disease/genetics , Genome, Human/genetics , Humans , Ligands , Protein Binding
3.
Nature ; 598(7879): 129-136, 2021 10.
Article in English | MEDLINE | ID: mdl-34616068

ABSTRACT

The mammalian cerebrum performs high-level sensory perception, motor control and cognitive functions through highly specialized cortical and subcortical structures1. Recent surveys of mouse and human brains with single-cell transcriptomics2-6 and high-throughput imaging technologies7,8 have uncovered hundreds of neural cell types distributed in different brain regions, but the transcriptional regulatory programs that are responsible for the unique identity and function of each cell type remain unknown. Here we probe the accessible chromatin in more than 800,000 individual nuclei from 45 regions that span the adult mouse isocortex, olfactory bulb, hippocampus and cerebral nuclei, and use the resulting data to map the state of 491,818 candidate cis-regulatory DNA elements in 160 distinct cell types. We find high specificity of spatial distribution for not only excitatory neurons, but also most classes of inhibitory neurons and a subset of glial cell types. We characterize the gene regulatory sequences associated with the regional specificity within these cell types. We further link a considerable fraction of the cis-regulatory elements to putative target genes expressed in diverse cerebral cell types and predict transcriptional regulators that are involved in a broad spectrum of molecular and cellular pathways in different neuronal and glial cell populations. Our results provide a foundation for comprehensive analysis of gene regulatory programs of the mammalian brain and assist in the interpretation of noncoding risk variants associated with various neurological diseases and traits in humans.


Subject(s)
Cerebrum/cytology , Cerebrum/metabolism , Regulatory Sequences, Nucleic Acid/genetics , Animals , Atlases as Topic , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Chromatin Assembly and Disassembly , Gene Expression Regulation , Genetic Predisposition to Disease/genetics , Humans , Male , Mice , Mice, Inbred C57BL , Nervous System Diseases/genetics , Neuroglia/classification , Neuroglia/metabolism , Neurons/classification , Neurons/metabolism , Sequence Analysis, DNA , Single-Cell Analysis
4.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38711367

ABSTRACT

Hi-C data are commonly normalized using single sample processing methods, with focus on comparisons between regions within a given contact map. Here, we aim to compare contact maps across different samples. We demonstrate that unwanted variation, of likely technical origin, is present in Hi-C data with replicates from different individuals, and that properties of this unwanted variation change across the contact map. We present band-wise normalization and batch correction, a method for normalization and batch correction of Hi-C data and show that it substantially improves comparisons across samples, including in a quantitative trait loci analysis as well as differential enrichment across cell types.


Subject(s)
Quantitative Trait Loci , Humans , Computational Biology
5.
Nature ; 583(7818): 744-751, 2020 07.
Article in English | MEDLINE | ID: mdl-32728240

ABSTRACT

The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.


Subject(s)
Chromatin/genetics , Chromatin/metabolism , Datasets as Topic , Fetal Development/genetics , Histones/metabolism , Molecular Sequence Annotation , Regulatory Sequences, Nucleic Acid/genetics , Animals , Chromatin/chemistry , Chromatin Immunoprecipitation Sequencing , Disease/genetics , Enhancer Elements, Genetic/genetics , Female , Gene Expression Regulation, Developmental/genetics , Genetic Variation , Histones/chemistry , Humans , Male , Mice , Mice, Inbred C57BL , Organ Specificity/genetics , Reproducibility of Results , Transposases/metabolism
7.
Nature ; 560(7720): 655-660, 2018 08.
Article in English | MEDLINE | ID: mdl-30135582

ABSTRACT

Mammalian cells are surrounded by neighbouring cells and extracellular matrix (ECM), which provide cells with structural support and mechanical cues that influence diverse biological processes1. The Hippo pathway effectors YAP (also known as YAP1) and TAZ (also known as WWTR1) are regulated by mechanical cues and mediate cellular responses to ECM stiffness2,3. Here we identified the Ras-related GTPase RAP2 as a key intracellular signal transducer that relays ECM rigidity signals to control mechanosensitive cellular activities through YAP and TAZ. RAP2 is activated by low ECM stiffness, and deletion of RAP2 blocks the regulation of YAP and TAZ by stiffness signals and promotes aberrant cell growth. Mechanistically, matrix stiffness acts through phospholipase Cγ1 (PLCγ1) to influence levels of phosphatidylinositol 4,5-bisphosphate and phosphatidic acid, which activates RAP2 through PDZGEF1 and PDZGEF2 (also known as RAPGEF2 and RAPGEF6). At low stiffness, active RAP2 binds to and stimulates MAP4K4, MAP4K6, MAP4K7 and ARHGAP29, resulting in activation of LATS1 and LATS2 and inhibition of YAP and TAZ. RAP2, YAP and TAZ have pivotal roles in mechanoregulated transcription, as deletion of YAP and TAZ abolishes the ECM stiffness-responsive transcriptome. Our findings show that RAP2 is a molecular switch in mechanotransduction, thereby defining a mechanosignalling pathway from ECM stiffness to the nucleus.


Subject(s)
Protein Serine-Threonine Kinases/metabolism , Signal Transduction , rap GTP-Binding Proteins/metabolism , Adaptor Proteins, Signal Transducing/metabolism , Animals , Cell Transformation, Neoplastic , Extracellular Matrix/chemistry , Extracellular Matrix/genetics , Extracellular Matrix/metabolism , Female , GTPase-Activating Proteins/metabolism , Germinal Center Kinases , Guanine Nucleotide Exchange Factors/metabolism , HEK293 Cells , Hippo Signaling Pathway , Humans , Intracellular Signaling Peptides and Proteins/metabolism , Mice , Mice, Inbred NOD , Mice, Nude , Mice, SCID , Nerve Tissue Proteins/metabolism , Phospholipase C gamma/metabolism , Phosphoproteins/metabolism , Trans-Activators , Transcription Factors , Transcriptional Coactivator with PDZ-Binding Motif Proteins , Transcriptome , YAP-Signaling Proteins , rap GTP-Binding Proteins/genetics
9.
BMC Genomics ; 22(1): 84, 2021 Jan 28.
Article in English | MEDLINE | ID: mdl-33509077

ABSTRACT

BACKGROUND: Co-localized combinations of histone modifications ("chromatin states") have been shown to correlate with promoter and enhancer activity. Changes in chromatin states over multiple time points ("chromatin state trajectories") have previously been analyzed at promoter and enhancers separately. With the advent of time series Hi-C data it is now possible to connect promoters and enhancers and to analyze chromatin state trajectories at promoter-enhancer pairs. RESULTS: We present TimelessFlex, a framework for investigating chromatin state trajectories at promoters and enhancers and at promoter-enhancer pairs based on Hi-C information. TimelessFlex extends our previous approach Timeless, a Bayesian network for clustering multiple histone modification data sets at promoter and enhancer feature regions. We utilize time series ATAC-seq data measuring open chromatin to define promoters and enhancer candidates. We developed an expectation-maximization algorithm to assign promoters and enhancers to each other based on Hi-C interactions and jointly cluster their feature regions into paired chromatin state trajectories. We find jointly clustered promoter-enhancer pairs showing the same activation patterns on both sides but with a stronger trend at the enhancer side. While the promoter side remains accessible across the time series, the enhancer side becomes dynamically more open towards the gene activation time point. Promoter cluster patterns show strong correlations with gene expression signals, whereas Hi-C signals get only slightly stronger towards activation. The code of the framework is available at https://github.com/henriettemiko/TimelessFlex . CONCLUSIONS: TimelessFlex clusters time series histone modifications at promoter-enhancer pairs based on Hi-C and it can identify distinct chromatin states at promoter and enhancer feature regions and their changes over time.


Subject(s)
Chromatin , Enhancer Elements, Genetic , Bayes Theorem , Chromatin/genetics , Chromosomes , Promoter Regions, Genetic
10.
Nature ; 518(7539): 350-354, 2015 Feb 19.
Article in English | MEDLINE | ID: mdl-25693566

ABSTRACT

Allelic differences between the two homologous chromosomes can affect the propensity of inheritance in humans; however, the extent of such differences in the human genome has yet to be fully explored. Here we delineate allelic chromatin modifications and transcriptomes among a broad set of human tissues, enabled by a chromosome-spanning haplotype reconstruction strategy. The resulting large collection of haplotype-resolved epigenomic maps reveals extensive allelic biases in both chromatin state and transcription, which show considerable variation across tissues and between individuals, and allow us to investigate cis-regulatory relationships between genes and their control sequences. Analyses of histone modification maps also uncover intriguing characteristics of cis-regulatory elements and tissue-restricted activities of repetitive elements. The rich data sets described here will enhance our understanding of the mechanisms by which cis-regulatory elements control gene expression programs.


Subject(s)
Alleles , Epigenesis, Genetic/genetics , Epigenomics , Haplotypes/genetics , Acetylation , Chromatin/genetics , Chromatin/metabolism , Chromosomes, Human/genetics , Datasets as Topic , Enhancer Elements, Genetic/genetics , Genetic Variation/genetics , Histones/metabolism , Humans , Nucleotide Motifs , Organ Specificity/genetics , Transcription, Genetic/genetics
11.
Nat Methods ; 14(6): 629-635, 2017 Jun.
Article in English | MEDLINE | ID: mdl-28417999

ABSTRACT

Millions of cis-regulatory elements are predicted to be present in the human genome, but direct evidence for their biological function is scarce. Here we report a high-throughput method, cis-regulatory element scan by tiling-deletion and sequencing (CREST-seq), for the unbiased discovery and functional assessment of cis-regulatory sequences in the genome. We used it to interrogate the 2-Mb POU5F1 locus in human embryonic stem cells, and identified 45 cis-regulatory elements. A majority of these elements have active chromatin marks, DNase hypersensitivity, and occupancy by multiple transcription factors, which confirms the utility of chromatin signatures in cis-element mapping. Notably, 17 of them are previously annotated promoters of functionally unrelated genes, and like typical enhancers, they form extensive spatial contacts with the POU5F1 promoter. These results point to the commonality of enhancer-like promoters in the human genome.


Subject(s)
Chromosome Mapping/methods , Genetic Testing/methods , Regulatory Sequences, Nucleic Acid/genetics , Algorithms , Cells, Cultured , Embryonic Stem Cells/physiology , Gene Expression Regulation/genetics , High-Throughput Nucleotide Sequencing , Humans , Sequence Analysis, DNA , Single-Cell Analysis
12.
PLoS Comput Biol ; 15(4): e1006982, 2019 04.
Article in English | MEDLINE | ID: mdl-30986246

ABSTRACT

Hi-C and chromatin immunoprecipitation (ChIP) have been combined to identify long-range chromatin interactions genome-wide at reduced cost and enhanced resolution, but extracting information from the resulting datasets has been challenging. Here we describe a computational method, MAPS, Model-based Analysis of PLAC-seq and HiChIP, to process the data from such experiments and identify long-range chromatin interactions. MAPS adopts a zero-truncated Poisson regression framework to explicitly remove systematic biases in the PLAC-seq and HiChIP datasets, and then uses the normalized chromatin contact frequencies to identify significant chromatin interactions anchored at genomic regions bound by the protein of interest. MAPS shows superior performance over existing software tools in the analysis of chromatin interactions from multiple PLAC-seq and HiChIP datasets centered on different transcriptional factors and histone marks. MAPS is freely available at https://github.com/ijuric/MAPS.


Subject(s)
Chromatin Assembly and Disassembly/physiology , Chromosome Mapping/methods , Computational Biology/methods , Chromatin/metabolism , Chromatin/physiology , Chromatin Immunoprecipitation/methods , Computer Simulation , Genome , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Histone Code , Humans , Sequence Analysis, DNA/methods , Software
13.
J Biol Chem ; 293(28): 11230-11240, 2018 07 13.
Article in English | MEDLINE | ID: mdl-29802201

ABSTRACT

The Hippo pathway plays an important role in regulating tissue homeostasis, and its effectors, the transcriptional co-activators Yes-associated protein (YAP) and WW domain-containing transcription regulator 1 (WWTR1 or TAZ), are responsible for mediating the vast majority of its physiological functions. Although YAP and TAZ are thought to be largely redundant and similarly regulated by Hippo signaling, they have developmental, structural, and physiological differences that suggest they may differ in their regulation and downstream functions. To better understand the functions of YAP and TAZ in the Hippo pathway, using CRISPR/Cas9, we generated YAP KO, TAZ KO, and YAP/TAZ KO cell lines in HEK293A cells. We evaluated them in response to many environmental conditions and stimuli and used RNA-Seq to compare their transcriptional profiles. We found that YAP inactivation has a greater effect on cellular physiology (namely, cell spreading, volume, granularity, glucose uptake, proliferation, and migration) than TAZ inactivation. However, functional redundancy between YAP and TAZ was also observed. In summary, our findings confirm that the Hippo pathway effectors YAP and TAZ are master regulators for multiple cellular processes but also reveal that YAP has a stronger influence than TAZ.


Subject(s)
Adaptor Proteins, Signal Transducing/metabolism , Cell Physiological Phenomena , Phosphoproteins/metabolism , Protein Serine-Threonine Kinases/metabolism , Signal Transduction , Transcription Factors/metabolism , Tumor Suppressor Proteins/metabolism , Acyltransferases , Adaptor Proteins, Signal Transducing/antagonists & inhibitors , Adaptor Proteins, Signal Transducing/genetics , CRISPR-Cas Systems , Gene Expression Profiling , HEK293 Cells , Hippo Signaling Pathway , Homeostasis , Humans , Phosphoproteins/antagonists & inhibitors , Phosphoproteins/genetics , Protein Serine-Threonine Kinases/antagonists & inhibitors , Protein Serine-Threonine Kinases/genetics , Transcription Factors/antagonists & inhibitors , Transcription Factors/genetics , Tumor Suppressor Proteins/antagonists & inhibitors , Tumor Suppressor Proteins/genetics , YAP-Signaling Proteins
14.
Nucleic Acids Res ; 43(1): 104-14, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25505163

ABSTRACT

To find signature features shared by various ncRNA sub-types and characterize novel ncRNAs, we have developed a method, RNAfeature, to investigate >600 sets of genomic and epigenomic data with various evolutionary and biophysical scores. RNAfeature utilizes a fine-tuned intra-species wrapper algorithm that is followed by a novel feature selection strategy across species. It considers long distance effect of certain features (e.g. histone modification at the promoter region). We finally narrow down on 10 informative features (including sequences, structures, expression profiles and epigenetic signals). These features are complementary to each other and as a whole can accurately distinguish canonical ncRNAs from CDSs and UTRs (accuracies: >92% in human, mouse, worm and fly). Moreover, the feature pattern is conserved across multiple species. For instance, the supervised 10-feature model derived from animal species can predict ncRNAs in Arabidopsis (accuracy: 82%). Subsequently, we integrate the 10 features to define a set of noncoding potential scores, which can identify, evaluate and characterize novel noncoding RNAs. The score covers all transcribed regions (including unconserved ncRNAs), without requiring assembly of the full-length transcripts. Importantly, the noncoding potential allows us to identify and characterize potential functional domains with feature patterns similar to canonical ncRNAs (e.g. tRNA, snRNA, miRNA, etc) on ∼70% of human long ncRNAs (lncRNAs).


Subject(s)
Genomics/methods , RNA, Untranslated/chemistry , RNA, Untranslated/genetics , Algorithms , Animals , Humans , Mice , Nucleic Acid Conformation , RNA, Long Noncoding/chemistry , RNA, Untranslated/metabolism
15.
bioRxiv ; 2023 Mar 12.
Article in English | MEDLINE | ID: mdl-36945429

ABSTRACT

Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3,550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.

16.
Nat Commun ; 14(1): 6711, 2023 10 23.
Article in English | MEDLINE | ID: mdl-37872149

ABSTRACT

Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.


Subject(s)
Polymorphism, Single Nucleotide , Tandem Repeat Sequences , Humans , Genotype , Whole Genome Sequencing
17.
Sci Adv ; 8(21): eabl9806, 2022 May 27.
Article in English | MEDLINE | ID: mdl-35613278

ABSTRACT

Semaphorins were originally identified as axonal guidance molecules, but they also control processes such as vascular development and tumorigenesis. The downstream signaling cascades of Semaphorins in these biological processes remain unclear. Here, we show that the class 3 Semaphorins (SEMA3s) activate the Hippo pathway to attenuate tissue growth, angiogenesis, and tumorigenesis. SEMA3B restoration in lung cancer cells with SEMA3B loss of heterozygosity suppresses cancer cell growth via activating the core Hippo kinases LATS1/2 (large tumor suppressor kinase 1/2). Furthermore, SEMA3 also acts through LATS1/2 to inhibit angiogenesis. We identified p190RhoGAPs as essential partners of the SEMA3A receptor PlexinA in Hippo regulation. Upon SEMA3 treatment, PlexinA interacts with the pseudo-guanosine triphosphatase (GTPase) domain of p190RhoGAP and simultaneously recruits RND GTPases to activate p190RhoGAP, which then stimulates LATS1/2. Disease-associated etiological factors, such as genetic lesions and oscillatory shear, diminish Hippo pathway regulation by SEMA3. Our study thus discovers a critical role of Hippo signaling in mediating SEMA3 physiological function.

18.
Cell Genom ; 2(12): 100214, 2022 Dec 14.
Article in English | MEDLINE | ID: mdl-36778047

ABSTRACT

We combined functional genomics and human genetics to investigate processes that affect type 1 diabetes (T1D) risk by mediating beta cell survival in response to proinflammatory cytokines. We mapped 38,931 cytokine-responsive candidate cis-regulatory elements (cCREs) in beta cells using ATAC-seq and snATAC-seq and linked them to target genes using co-accessibility and HiChIP. Using a genome-wide CRISPR screen in EndoC-ßH1 cells, we identified 867 genes affecting cytokine-induced survival, and genes promoting survival and up-regulated in cytokines were enriched at T1D risk loci. Using SNP-SELEX, we identified 2,229 variants in cytokine-responsive cCREs altering transcription factor (TF) binding, and variants altering binding of TFs regulating stress, inflammation, and apoptosis were enriched for T1D risk. At the 16p13 locus, a fine-mapped T1D variant altering TF binding in a cytokine-induced cCRE interacted with SOCS1, which promoted survival in cytokine exposure. Our findings reveal processes and genes acting in beta cells during inflammation that modulate T1D risk.

19.
Genome Med ; 14(1): 84, 2022 08 11.
Article in English | MEDLINE | ID: mdl-35948990

ABSTRACT

BACKGROUND: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads. RESULTS: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR. CONCLUSIONS: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook respectively.


Subject(s)
Amyotrophic Lateral Sclerosis , Tandem Repeat Sequences , Alleles , Amyotrophic Lateral Sclerosis/genetics , Exome , Fragile X Mental Retardation Protein/genetics , Haplotypes , High-Throughput Nucleotide Sequencing/methods , Humans
20.
Elife ; 102021 02 05.
Article in English | MEDLINE | ID: mdl-33544077

ABSTRACT

Genetic variants associated with type 2 diabetes (T2D) risk affect gene regulation in metabolically relevant tissues, such as pancreatic islets. Here, we investigated contributions of regulatory programs active during pancreatic development to T2D risk. Generation of chromatin maps from developmental precursors throughout pancreatic differentiation of human embryonic stem cells (hESCs) identifies enrichment of T2D variants in pancreatic progenitor-specific stretch enhancers that are not active in islets. Genes associated with progenitor-specific stretch enhancers are predicted to regulate developmental processes, most notably tissue morphogenesis. Through gene editing in hESCs, we demonstrate that progenitor-specific enhancers harboring T2D-associated variants regulate cell polarity genes LAMA1 and CRB2. Knockdown of lama1 or crb2 in zebrafish embryos causes a defect in pancreas morphogenesis and impairs islet cell development. Together, our findings reveal that a subset of T2D risk variants specifically affects pancreatic developmental programs, suggesting that dysregulation of developmental processes can predispose to T2D.


Subject(s)
Diabetes Mellitus, Type 2/genetics , Epigenome , Intracellular Signaling Peptides and Proteins/genetics , Transcription Factors/genetics , Zebrafish Proteins/genetics , Zebrafish/genetics , Animals , Humans , Intracellular Signaling Peptides and Proteins/metabolism , Transcription Factors/metabolism , Zebrafish/metabolism , Zebrafish Proteins/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL