Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
1.
Genes Dev ; 32(1): 26-41, 2018 01 01.
Article in English | MEDLINE | ID: mdl-29378787

ABSTRACT

Regulation by gene-distal enhancers is critical for cell type-specific and condition-specific patterns of gene expression. Thus, to understand the basis of gene activity in a given cell type or tissue, we must identify the precise locations of enhancers and functionally characterize their behaviors. Here, we demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. We dissect the mechanisms governing enhancer transcription and discover remarkable similarities to transcription at protein-coding genes. We show that RNA polymerase II (RNAPII) undergoes regulated pausing and release at enhancers. However, as compared with mRNA genes, RNAPII at enhancers is less stable and more prone to early termination. Furthermore, we found that the level of histone H3 Lys4 (H3K4) methylation at enhancers corresponds to transcriptional activity such that highly active enhancers display H3K4 trimethylation rather than the H3K4 monomethylation considered a hallmark of enhancers. Finally, our work provides insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to the loss of factors that stabilize paused RNAPII.


Subject(s)
Enhancer Elements, Genetic , Gene Expression Regulation , Transcription Elongation, Genetic , Animals , Chromatin/chemistry , Chromosomal Proteins, Non-Histone/physiology , Drosophila/genetics , Drosophila/metabolism , Drosophila Proteins/biosynthesis , Drosophila Proteins/physiology , Embryonic Stem Cells/metabolism , Histones/metabolism , Mice , Promoter Regions, Genetic , RNA Polymerase II/metabolism , RNA, Untranslated/biosynthesis , Transcription Initiation Site , Transcription, Genetic , Transcriptional Elongation Factors/physiology
2.
PLoS Comput Biol ; 16(1): e1007571, 2020 01.
Article in English | MEDLINE | ID: mdl-31978042

ABSTRACT

High-throughput sequencing has become ubiquitous in biomedical sciences. As new technologies emerge and sequencing costs decline, the diversity and volume of available data increases exponentially, and successfully navigating the data becomes more challenging. Though datasets are often hosted by public repositories, scientists must rely on inconsistent annotation to identify and interpret meaningful data. Moreover, the experimental heterogeneity and wide-ranging quality of high-throughput biological data means that even data with desired cell lines, tissue types, or molecular targets may not be readily interpretable or integrated. We have developed ORSO (Online Resource for Social Omics) as an easy-to-use web application to connect life scientists with genomics data. In ORSO, users interact within a data-driven social network, where they can favorite datasets and follow other users. In addition to more than 30,000 datasets hosted from major biomedical consortia, users may contribute their own data to ORSO, facilitating its discovery by other users. Leveraging user interactions, ORSO provides a novel recommendation system to automatically connect users with hosted data. In addition to social interactions, the recommendation system considers primary read coverage information and annotated metadata. Similarities used by the recommendation system are presented by ORSO in a graph display, allowing exploration of dataset associations. The topology of the network graph reflects established biology, with samples from related systems grouped together. We tested the recommendation system using an RNA-seq time course dataset from differentiation of embryonic stem cells to cardiomyocytes. The ORSO recommendation system correctly predicted early data point sources as embryonic stem cells and late data point sources as heart and muscle samples, resulting in recommendation of related datasets. By connecting scientists with relevant data, ORSO provides a critical new service that facilitates wide-ranging research interests.


Subject(s)
Database Management Systems , Databases, Genetic , Genomics , Online Social Networking , Research Personnel/organization & administration , Genomics/methods , Genomics/organization & administration , High-Throughput Nucleotide Sequencing , Humans , Social Media
3.
Nucleic Acids Res ; 46(16): 8153-8167, 2018 09 19.
Article in English | MEDLINE | ID: mdl-30107566

ABSTRACT

p53 transcriptional networks are well-characterized in many organisms. However, a global understanding of requirements for in vivo p53 interactions with DNA and relationships with transcription across human biological systems in response to various p53 activating situations remains limited. Using a common analysis pipeline, we analyzed 41 data sets from genome-wide ChIP-seq studies of which 16 have associated gene expression data, including our recent primary data with normal human lymphocytes. The resulting extensive analysis, accessible at p53 BAER hub via the UCSC browser, provides a robust platform to characterize p53 binding throughout the human genome including direct influence on gene expression and underlying mechanisms. We establish the impact of spacers and mismatches from consensus on p53 binding in vivo and propose that once bound, neither significantly influences the likelihood of expression. Our rigorous approach revealed a large p53 genome-wide cistrome composed of >900 genes directly targeted by p53. Importantly, we identify a core cistrome signature composed of genes appearing in over half the data sets, and we identify signatures that are treatment- or cell-specific, demonstrating new functions for p53 in cell biology. Our analysis reveals a broad homeostatic role for human p53 that is relevant to both basic and translational studies.


Subject(s)
DNA-Binding Proteins/genetics , Genome, Human/genetics , Transcription, Genetic , Tumor Suppressor Protein p53/genetics , DNA, Intergenic/genetics , Databases, Genetic , Gene Expression Regulation/genetics , Genes/genetics , Humans , Lymphocytes , Protein Biosynthesis
4.
Nucleic Acids Res ; 45(10): 5678-5690, 2017 Jun 02.
Article in English | MEDLINE | ID: mdl-28402545

ABSTRACT

Established and emerging next generation sequencing (NGS)-based technologies allow for genome-wide interrogation of diverse biological processes. However, accessibility of NGS data remains a problem, and few user-friendly resources exist for integrative analysis of NGS data from different sources and experimental techniques. Here, we present Online Resource for Integrative Omics (ORIO; https://orio.niehs.nih.gov/), a web-based resource with an intuitive user interface for rapid analysis and integration of NGS data. To use ORIO, the user specifies NGS data of interest along with a list of genomic coordinates. Genomic coordinates may be biologically relevant features from a variety of sources, such as ChIP-seq peaks for a given protein or transcription start sites from known gene models. ORIO first iteratively finds read coverage values at each genomic feature for each NGS dataset. Data are then integrated using clustering-based approaches, giving hierarchical relationships across NGS datasets and separating individual genomic features into groups. In focusing its analysis on read coverage, ORIO makes limited assumptions about the analyzed data; this allows the tool to be applied across data from a variety of experiments and techniques. Results from analysis are presented in dynamic displays alongside user-controlled statistical tests, supporting rapid statistical validation of observed results. We emphasize the versatility of ORIO through diverse examples, ranging from NGS data quality control to characterization of enhancer regions and integration of gene expression information. Easily accessible on a public web server, we anticipate wide use of ORIO in genome-wide investigations by life scientists.


Subject(s)
Genomics/statistics & numerical data , High-Throughput Nucleotide Sequencing/statistics & numerical data , Histones/genetics , Transcription Initiation Site , User-Computer Interface , Animals , Chromatin Immunoprecipitation , Data Interpretation, Statistical , Enhancer Elements, Genetic , Genomics/methods , Histones/metabolism , Humans , Internet , Mice , Sequence Analysis, DNA
5.
PLoS Genet ; 12(8): e1006224, 2016 08.
Article in English | MEDLINE | ID: mdl-27487356

ABSTRACT

Antisense transcription is a prevalent feature at mammalian promoters. Previous studies have primarily focused on antisense transcription initiating upstream of genes. Here, we characterize promoter-proximal antisense transcription downstream of gene transcription starts sites in human breast cancer cells, investigating the genomic context of downstream antisense transcription. We find extensive correlations between antisense transcription and features associated with the chromatin environment at gene promoters. Antisense transcription downstream of promoters is widespread, with antisense transcription initiation observed within 2 kb of 28% of gene transcription start sites. Antisense transcription initiates between nucleosomes regularly positioned downstream of these promoters. The nucleosomes between gene and downstream antisense transcription start sites carry histone modifications associated with active promoters, such as H3K4me3 and H3K27ac. This region is bound by chromatin remodeling and histone modifying complexes including SWI/SNF subunits and HDACs, suggesting that antisense transcription or resulting RNA transcripts contribute to the creation and maintenance of a promoter-associated chromatin environment. Downstream antisense transcription overlays additional regulatory features, such as transcription factor binding, DNA accessibility, and the downstream edge of promoter-associated CpG islands. These features suggest an important role for antisense transcription in the regulation of gene expression and the maintenance of a promoter-associated chromatin environment.


Subject(s)
Antisense Elements (Genetics)/genetics , Nuclear Proteins/genetics , Promoter Regions, Genetic , Transcription, Genetic , Antisense Elements (Genetics)/biosynthesis , Chromatin/genetics , CpG Islands/genetics , Gene Expression Regulation, Fungal , Genomics , Histone Code/genetics , Histones/genetics , Humans , Nuclear Proteins/biosynthesis , Nucleosomes/genetics , Protein Binding/genetics , Sequence Alignment
6.
BMC Genomics ; 19(1): 345, 2018 May 09.
Article in English | MEDLINE | ID: mdl-29743009

ABSTRACT

BACKGROUND: Identification of mutations from next-generation sequencing data typically requires a balance between sensitivity and accuracy. This is particularly true of DNA insertions and deletions (indels), that can impart significant phenotypic consequences on cells but are harder to call than substitution mutations from whole genome mutation accumulation experiments. To overcome these difficulties, we present muver, a computational framework that integrates established bioinformatics tools with novel analytical methods to generate mutation calls with the extremely low false positive rates and high sensitivity required for accurate mutation rate determination and comparison. RESULTS: Muver uses statistical comparison of ancestral and descendant allelic frequencies to identify variant loci and assigns genotypes with models that include per-sample assessments of sequencing errors by mutation type and repeat context. Muver identifies maximally parsimonious mutation pathways that connect these genotypes, differentiating potential allelic conversion events and delineating ambiguities in mutation location, type, and size. Benchmarking with a human gold standard father-son pair demonstrates muver's sensitivity and low false positive rates. In DNA mismatch repair (MMR) deficient Saccharomyces cerevisiae, muver detects multi-base deletions in homopolymers longer than the replicative polymerase footprint at rates greater than predicted for sequential single-base deletions, implying a novel multi-repeat-unit slippage mechanism. CONCLUSIONS: Benchmarking results demonstrate the high accuracy and sensitivity achieved with muver, particularly for indels, relative to available tools. Applied to an MMR-deficient Saccharomyces cerevisiae system, muver mutation calls facilitate mechanistic insights into DNA replication fidelity.


Subject(s)
Genome, Fungal , Mutation Accumulation , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae/genetics , Sequence Analysis, DNA/methods , Software , Computational Biology , Fathers , High-Throughput Nucleotide Sequencing , Humans , Male , Mutation Rate , Reference Standards
7.
Proc Natl Acad Sci U S A ; 112(8): 2425-30, 2015 Feb 24.
Article in English | MEDLINE | ID: mdl-25675474

ABSTRACT

It was shown decades ago that purified 30S ribosome subunits readily interconvert between "active" and "inactive" conformations in a switch that involves changes in the functionally important neck and decoding regions. However, the physiological significance of this conformational change had remained unknown. In exponentially growing Escherichia coli cells, RNA SHAPE probing revealed that 16S rRNA largely adopts the inactive conformation in stably assembled, mature 30S subunits and the active conformation in translating (70S) ribosomes. Inactive 30S subunits bind mRNA as efficiently as active subunits but initiate translation more slowly. Mutations that inhibited interconversion between states compromised translation in vivo. Binding by the small antibiotic paromomycin induced the inactive-to-active conversion, consistent with a low-energy barrier between the two states. Despite the small energetic barrier between states, but consistent with slow translation initiation and a functional role in vivo, interconversion involved large-scale changes in structure in the neck region that likely propagate across the 30S body via helix 44. These findings suggest the inactive state is a biologically relevant alternate conformation that regulates ribosome function as a conformational switch.


Subject(s)
Biochemistry/methods , Escherichia coli/cytology , RNA/metabolism , Ribosome Subunits, Small, Bacterial/metabolism , Acylation , Escherichia coli/metabolism , Models, Molecular , Nucleic Acid Conformation , Protein Biosynthesis , RNA, Messenger/metabolism , RNA, Ribosomal, 16S/chemistry , RNA, Ribosomal, 16S/metabolism , Ribosome Subunits, Small, Bacterial/chemistry
8.
Proc Natl Acad Sci U S A ; 111(38): 13858-63, 2014 Sep 23.
Article in English | MEDLINE | ID: mdl-25205807

ABSTRACT

Complex higher-order RNA structures play critical roles in all facets of gene expression; however, the through-space interaction networks that define tertiary structures and govern sampling of multiple conformations are poorly understood. Here we describe single-molecule RNA structure analysis in which multiple sites of chemical modification are identified in single RNA strands by massively parallel sequencing and then analyzed for correlated and clustered interactions. The strategy thus identifies RNA interaction groups by mutational profiling (RING-MaP) and makes possible two expansive applications. First, we identify through-space interactions, create 3D models for RNAs spanning 80-265 nucleotides, and characterize broad classes of intramolecular interactions that stabilize RNA. Second, we distinguish distinct conformations in solution ensembles and reveal previously undetected hidden states and large-scale structural reconfigurations that occur in unfolded RNAs relative to native states. RING-MaP single-molecule nucleic acid structure interrogation enables concise and facile analysis of the global architectures and multiple conformations that govern function in RNA.


Subject(s)
Escherichia coli/chemistry , Geobacillus stearothermophilus/chemistry , Models, Molecular , RNA, Bacterial/chemistry , RNA, Protozoan/chemistry , Tetrahymena/chemistry , Nucleic Acid Conformation
9.
PLoS Comput Biol ; 11(5): e1004230, 2015 May.
Article in English | MEDLINE | ID: mdl-25992893

ABSTRACT

HIV and related primate lentiviruses possess single-stranded RNA genomes. Multiple regions of these genomes participate in critical steps in the viral replication cycle, and the functions of many RNA elements are dependent on the formation of defined structures. The structures of these elements are still not fully understood, and additional functional elements likely exist that have not been identified. In this work, we compared three full-length HIV-related viral genomes: HIV-1NL4-3, SIVcpz, and SIVmac (the latter two strains are progenitors for all HIV-1 and HIV-2 strains, respectively). Model-free RNA structure comparisons were performed using whole-genome structure information experimentally derived from nucleotide-resolution SHAPE reactivities. Consensus secondary structures were constructed for strongly correlated regions by taking into account both SHAPE probing structural data and nucleotide covariation information from structure-based alignments. In these consensus models, all known functional RNA elements were recapitulated with high accuracy. In addition, we identified multiple previously unannotated structural elements in the HIV-1 genome likely to function in translation, splicing and other replication cycle processes; these are compelling targets for future functional analyses. The structure-informed alignment strategy developed here will be broadly useful for efficient RNA motif discovery.


Subject(s)
HIV-1/genetics , RNA, Viral/chemistry , RNA, Viral/genetics , Animals , Base Sequence , Conserved Sequence , Genetic Structures , Genome, Viral , Molecular Sequence Data , Nucleic Acid Conformation , Nucleotide Motifs , Sequence Alignment/methods , Structure-Activity Relationship
10.
PLoS Comput Biol ; 11(5): e1004126, 2015 May.
Article in English | MEDLINE | ID: mdl-25992778

ABSTRACT

Discovery and characterization of functional RNA structures remains challenging due to deficiencies in de novo secondary structure modeling. Here we describe a dynamic programming approach for model-free sequence comparison that incorporates high-throughput chemical probing data. Based on SHAPE probing data alone, ribosomal RNAs (rRNAs) from three diverse organisms--the eubacteria E. coli and C. difficile and the archeon H. volcanii--could be aligned with accuracies comparable to alignments based on actual sequence identity. When both base sequence identity and chemical probing reactivities were considered together, accuracies improved further. Derived sequence alignments and chemical probing data from protein-free RNAs were then used as pseudo-free energy constraints to model consensus secondary structures for the 16S and 23S rRNAs. There are critical differences between these experimentally-informed models and currently accepted models, including in the functionally important neck and decoding regions of the 16S rRNA. We infer that the 16S rRNA has evolved to undergo large-scale changes in base pairing as part of ribosome function. As high-quality RNA probing data become widely available, structurally-informed sequence alignment will become broadly useful for de novo motif and function discovery.


Subject(s)
Nucleic Acid Conformation , RNA, Ribosomal, 16S/chemistry , RNA, Ribosomal, 16S/genetics , Sequence Alignment/statistics & numerical data , Base Sequence , Clostridioides difficile/genetics , Computational Biology , Escherichia coli/genetics , Haloferax volcanii/genetics , High-Throughput Nucleotide Sequencing/statistics & numerical data , Models, Molecular , Molecular Sequence Data , RNA, Archaeal/chemistry , RNA, Archaeal/genetics , RNA, Bacterial/chemistry , RNA, Bacterial/genetics , Sequence Analysis, RNA/statistics & numerical data
11.
Nat Methods ; 9(6): 603-8, 2012 Apr 15.
Article in English | MEDLINE | ID: mdl-22504587

ABSTRACT

Molecular modeling guided by experimentally derived structural information is an attractive approach for three-dimensional structure determination of complex RNAs that are not amenable to study by high-resolution methods. Hydroxyl radical probing (HRP), which is performed routinely in many laboratories, provides a measure of solvent accessibility at individual nucleotides. HRP measurements have, to date, only been used to evaluate RNA models qualitatively. Here we report the development of a quantitative structure refinement approach using HRP measurements to drive discrete molecular dynamics simulations for RNAs ranging in size from 80 to 230 nucleotides. We first used HRP reactivities to identify RNAs that form extensive helical packing interactions. For these RNAs, we achieved highly significant structure predictions given the inputs of RNA sequence and base pairing. This HRP-directed tertiary structure refinement approach generates robust structural hypotheses that are useful for guiding explorations of structure-function inter-relationships in RNA.


Subject(s)
Hydroxyl Radical/chemistry , RNA/ultrastructure , Base Pairing , Base Sequence , Models, Molecular , Molecular Dynamics Simulation , Nucleic Acid Conformation , RNA/chemistry , RNA, Transfer/chemistry , Software
12.
RNA ; 18(4): 610-25, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22361291

ABSTRACT

We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises.


Subject(s)
Nucleic Acid Conformation , RNA/chemistry , Base Sequence , Dimerization , Models, Molecular , Molecular Sequence Data
13.
J Clin Endocrinol Metab ; 106(3): e1441-e1452, 2021 03 08.
Article in English | MEDLINE | ID: mdl-32870266

ABSTRACT

CONTEXT: Functional hypothalamic amenorrhea (HA) is a common, acquired form of hypogonadotropic hypogonadism that occurs in the setting of energy deficits and/or stress. Variability in individual susceptibility to these stressors, HA heritability, and previous identification of several rare sequence variants (RSVs) in genes associated with the rare disorder, isolated hypogonadotropic hypogonadism (IHH), in individuals with HA suggest a possible genetic contribution to HA susceptibility. OBJECTIVE: We sought to determine whether the burden of RSVs in IHH-related genes is greater in women with HA than controls. DESIGN: We compared patients with HA to control women. SETTING: The study was conducted at secondary referral centers. PATIENTS AND OTHER PARTICIPANTS: Women with HA (n = 106) and control women (ClinSeq study; n = 468). INTERVENTIONS: We performed exome sequencing in all patients and controls. MAIN OUTCOME MEASURE(S): The frequency of RSVs in 53 IHH-associated genes was determined using rare variant burden and association tests. RESULTS: RSVs were overrepresented in women with HA compared with controls (P = .007). Seventy-eight heterozygous RSVs in 33 genes were identified in 58 women with HA (36.8% of alleles) compared to 255 RSVs in 41 genes among 200 control women (27.2%). CONCLUSIONS: Women with HA are enriched for RSVs in genes that cause IHH, suggesting that variation in genes associated with gonadotropin-releasing hormone neuronal ontogeny and function may be a major determinant of individual susceptibility to developing HA in the face of diet, exercise, and/or stress.


Subject(s)
Amenorrhea/genetics , Gonadotropin-Releasing Hormone/metabolism , Hypothalamic Diseases/genetics , Adolescent , Adult , Aged , Amenorrhea/epidemiology , Amenorrhea/etiology , Case-Control Studies , Child , DNA Mutational Analysis , Female , Gene Frequency , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Hypogonadism/epidemiology , Hypogonadism/etiology , Hypogonadism/genetics , Hypothalamic Diseases/complications , Hypothalamic Diseases/epidemiology , Metabolic Networks and Pathways/genetics , Middle Aged , Mutation, Missense , Exome Sequencing , Young Adult
14.
Biochemistry ; 49(24): 4931-3, 2010 Jun 22.
Article in English | MEDLINE | ID: mdl-20545364

ABSTRACT

RNA function is dependent on its structure, yet three-dimensional folds for most biologically important RNAs are unknown. We develop a generic discrete molecular dynamics-based modeling system that uses long-range constraints inferred from diverse biochemical or bioinformatic analyses to create statistically significant (p < 0.01) nativelike folds for RNAs of known structure ranging from 45 to 158 nucleotides. We then predict the unknown structure of the hepatitis C virus internal ribosome entry site (IRES) pseudoknot domain. The resulting RNA model rationalizes independent solvent accessibility and cryo-electron microscopy structure information. The pseudoknot domain positions the AUG start codon near the mRNA channel and is tRNA-like, suggesting the IRES employs molecular mimicry as a functional strategy.


Subject(s)
Hepacivirus/genetics , Models, Molecular , RNA, Messenger/chemistry , RNA, Viral/chemistry , Codon, Initiator , Cryoelectron Microscopy , Molecular Dynamics Simulation , Nucleic Acid Conformation , Ribosomes/chemistry , Solvents/chemistry
15.
Genome Biol ; 21(1): 248, 2020 09 17.
Article in English | MEDLINE | ID: mdl-32943091

ABSTRACT

BACKGROUND: Acquired human mitochondrial genome (mtDNA) deletions are symptoms and drivers of focal mitochondrial respiratory deficiency, a pathological hallmark of aging and late-onset mitochondrial disease. RESULTS: To decipher connections between these processes, we create LostArc, an ultrasensitive method for quantifying deletions in circular mtDNA molecules. LostArc reveals 35 million deletions (~ 470,000 unique spans) in skeletal muscle from 22 individuals with and 19 individuals without pathogenic variants in POLG. This nuclear gene encodes the catalytic subunit of replicative mitochondrial DNA polymerase γ. Ablation, the deleted mtDNA fraction, suffices to explain skeletal muscle phenotypes of aging and POLG-derived disease. Unsupervised bioinformatic analyses reveal distinct age- and disease-correlated deletion patterns. CONCLUSIONS: These patterns implicate replication by DNA polymerase γ as the deletion driver and suggest little purifying selection against mtDNA deletions by mitophagy in postmitotic muscle fibers. Observed deletion patterns are best modeled as mtDNA deletions initiated by replication fork stalling during strand displacement mtDNA synthesis.


Subject(s)
DNA Polymerase gamma/genetics , DNA, Mitochondrial/analysis , Genetic Techniques , Mitochondrial Diseases/genetics , Sequence Deletion , Software , Adolescent , Adult , Aged , Aged, 80 and over , Aging/genetics , Aging/pathology , DNA Replication , DNA, Mitochondrial/metabolism , HEK293 Cells , Humans , Middle Aged , Quadriceps Muscle/chemistry , Quadriceps Muscle/pathology , Young Adult
16.
Nat Commun ; 10(1): 3072, 2019 07 11.
Article in English | MEDLINE | ID: mdl-31296853

ABSTRACT

Faithful transcription initiation is critical for accurate gene expression, yet the mechanisms underlying specific transcription start site (TSS) selection in mammals remain unclear. Here, we show that the histone-fold domain protein NF-Y, a ubiquitously expressed transcription factor, controls the fidelity of transcription initiation at gene promoters in mouse embryonic stem cells. We report that NF-Y maintains the region upstream of TSSs in a nucleosome-depleted state while simultaneously protecting this accessible region against aberrant and/or ectopic transcription initiation. We find that loss of NF-Y binding in mammalian cells disrupts the promoter chromatin landscape, leading to nucleosomal encroachment over the canonical TSS. Importantly, this chromatin rearrangement is accompanied by upstream relocation of the transcription pre-initiation complex and ectopic transcription initiation. Further, this phenomenon generates aberrant extended transcripts that undergo translation, disrupting gene expression profiles. These results suggest NF-Y is a central player in TSS selection in metazoans and highlight the deleterious consequences of inaccurate transcription initiation.


Subject(s)
CCAAT-Binding Factor/metabolism , Nucleosomes/metabolism , Transcription Initiation Site , Transcription Initiation, Genetic , Animals , CCAAT-Binding Factor/genetics , Cell Line , Chromatin/genetics , Chromatin/metabolism , Embryonic Stem Cells , Gene Knockdown Techniques , Mice , Nucleosomes/genetics , Promoter Regions, Genetic/genetics , RNA, Small Interfering/metabolism
17.
J R Soc Interface ; 15(141)2018 04.
Article in English | MEDLINE | ID: mdl-29618526

ABSTRACT

Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well suited to solve problems of these fields. We examine applications of deep learning to a variety of biomedical problems-patient classification, fundamental biological processes and treatment of patients-and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges. Following from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Even though improvements over previous baselines have been modest in general, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation. Though progress has been made linking a specific neural network's prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge. Furthermore, the limited amount of labelled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning enabling changes at both bench and bedside with the potential to transform several areas of biology and medicine.


Subject(s)
Biomedical Research/trends , Biomedical Technology/trends , Deep Learning/trends , Algorithms , Biomedical Research/methods , Decision Making , Delivery of Health Care/methods , Delivery of Health Care/trends , Disease/genetics , Drug Design , Electronic Health Records/trends , Humans , Terminology as Topic
18.
Elife ; 62017 03 27.
Article in English | MEDLINE | ID: mdl-28346137

ABSTRACT

Histone H3 lysine 36 methylation (H3K36me) is thought to participate in a host of co-transcriptional regulatory events. To study the function of this residue independent from the enzymes that modify it, we used a 'histone replacement' system in Drosophila to generate a non-modifiable H3K36 lysine-to-arginine (H3K36R) mutant. We observed global dysregulation of mRNA levels in H3K36R animals that correlates with the incidence of H3K36me3. Similar to previous studies, we found that mutation of H3K36 also resulted in H4 hyperacetylation. However, neither cryptic transcription initiation, nor alternative pre-mRNA splicing, contributed to the observed changes in expression, in contrast with previously reported roles for H3K36me. Interestingly, knockdown of the RNA surveillance nuclease, Xrn1, and members of the CCR4-Not deadenylase complex, restored mRNA levels for a class of downregulated, H3K36me3-rich genes. We propose a post-transcriptional role for modification of replication-dependent H3K36 in the control of metazoan gene expression.


Subject(s)
Gene Expression Regulation , Histones/metabolism , Protein Processing, Post-Translational , Transcription, Genetic , Amino Acid Substitution , Animals , Drosophila , Gene Expression Profiling , Histones/genetics , Methylation , Mutation, Missense
SELECTION OF CITATIONS
SEARCH DETAIL