Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Nature ; 583(7818): 711-719, 2020 07.
Article in English | MEDLINE | ID: mdl-32728246

ABSTRACT

Many proteins regulate the expression of genes by binding to specific regions encoded in the genome1. Here we introduce a new data set of RNA elements in the human genome that are recognized by RNA-binding proteins (RBPs), generated as part of the Encyclopedia of DNA Elements (ENCODE) project phase III. This class of regulatory elements functions only when transcribed into RNA, as they serve as the binding sites for RBPs that control post-transcriptional processes such as splicing, cleavage and polyadenylation, and the editing, localization, stability and translation of mRNAs. We describe the mapping and characterization of RNA elements recognized by a large collection of human RBPs in K562 and HepG2 cells. Integrative analyses using five assays identify RBP binding sites on RNA and chromatin in vivo, the in vitro binding preferences of RBPs, the function of RBP binding sites and the subcellular localization of RBPs, producing 1,223 replicated data sets for 356 RBPs. We describe the spectrum of RBP binding throughout the transcriptome and the connections between these interactions and various aspects of RNA biology, including RNA stability, splicing regulation and RNA localization. These data expand the catalogue of functional elements encoded in the human genome by the addition of a large set of elements that function at the RNA level by interacting with RBPs.


Subject(s)
RNA-Binding Proteins/chemistry , RNA-Binding Proteins/metabolism , Transcriptome/genetics , Alternative Splicing/genetics , Base Sequence , Binding Sites , Cell Line , Chromatin/genetics , Chromatin/metabolism , Databases, Genetic , Female , Gene Knockdown Techniques , Humans , Intracellular Space/genetics , Male , Protein Binding , RNA, Messenger/chemistry , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/genetics , Substrate Specificity
2.
Nature ; 583(7818): 699-710, 2020 07.
Article in English | MEDLINE | ID: mdl-32728249

ABSTRACT

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.


Subject(s)
DNA/genetics , Databases, Genetic , Genome/genetics , Genomics , Molecular Sequence Annotation , Registries , Regulatory Sequences, Nucleic Acid/genetics , Animals , Chromatin/genetics , Chromatin/metabolism , DNA/chemistry , DNA Footprinting , DNA Methylation/genetics , DNA Replication Timing , Deoxyribonuclease I/metabolism , Genome, Human , Histones/metabolism , Humans , Mice , Mice, Transgenic , RNA-Binding Proteins/genetics , Transcription, Genetic/genetics , Transposases/metabolism
3.
Mol Cell ; 70(5): 854-867.e9, 2018 06 07.
Article in English | MEDLINE | ID: mdl-29883606

ABSTRACT

RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif.


Subject(s)
RNA Recognition Motif Proteins/metabolism , RNA/metabolism , Base Sequence , Binding Sites , High-Throughput Nucleotide Sequencing , Humans , Nucleic Acid Conformation , Nucleotide Motifs , Protein Binding , RNA/chemistry , RNA/genetics , RNA Recognition Motif Proteins/chemistry , RNA Recognition Motif Proteins/genetics , Structure-Activity Relationship
4.
Am J Pathol ; 192(10): 1368-1378, 2022 10.
Article in English | MEDLINE | ID: mdl-35948080

ABSTRACT

Current imaging-based cancer screening approaches provide useful but limited prognostic information. Complementary to existing screening tests, cell-free DNA-based multicancer early detection (MCED) tests account for cancer biology [manifested through circulating tumor allele fraction (cTAF)], which could inform prognosis and help assess the cancer's clinical significance. This review discusses the factors affecting circulating tumor DNA (ctDNA) levels and cTAF, and their correlation with the cancer's clinical significance. Furthermore, it discusses the influence of cTAF on MCED test performance, which could help inform prognosis. Clinically significant cancers show higher ctDNA levels quantified by cTAF than indolent phenotype cancers within each stage. This is because more frequent mitosis and cell death combined with increased trafficking of cell-free DNA into circulation leads to greater vascularization and depth of tumor invasion. cTAF has been correlated with biomarkers for cancer aggressiveness and overall survival; cancers with lower cTAF had better survival when compared with cancers as determined by the higher cTAF and Surveillance, Epidemiology, and End Results-based survival for that cancer type at each stage. MCED-detected cancers in case-control studies had comparable survival to Surveillance, Epidemiology, and End Results-based survival at each stage. Because many MCED tests use ctDNA as an analyte, cTAF could provide a common metric to compare performance. The prognostic value of cTAF may allow MCED tests to preferentially detect clinically significant cancers at early stages when outcomes are favorable and this may avoid overdiagnosis.


Subject(s)
Circulating Tumor DNA , Neoplasms , Alleles , Biomarkers, Tumor/genetics , Circulating Tumor DNA/genetics , Early Detection of Cancer , Humans , Neoplasms/diagnosis , Neoplasms/genetics , Neoplasms/pathology , Prognosis
7.
Genome Res ; 25(6): 858-71, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25883322

ABSTRACT

RNA binding proteins of the conserved CUGBP1, Elav-like factor (CELF) family contribute to heart and skeletal muscle development and are implicated in myotonic dystrophy (DM). To understand their genome-wide functions, we analyzed the transcriptome dynamics following induction of CELF1 or CELF2 in adult mouse heart and of CELF1 in muscle by RNA-seq, complemented by crosslinking/immunoprecipitation-sequencing (CLIP-seq) analysis of mouse cells and tissues to distinguish direct from indirect regulatory targets. We identified hundreds of mRNAs bound in their 3' UTRs by both CELF1 and the developmentally induced MBNL1 protein, a threefold greater overlap in target messages than expected, including messages involved in development and cell differentiation. The extent of 3' UTR binding by CELF1 and MBNL1 predicted the degree of mRNA repression or stabilization, respectively, following CELF1 induction. However, CELF1's RNA binding specificity in vitro was not detectably altered by coincubation with recombinant MBNL1. These findings support a model in which CELF and MBNL proteins bind independently to mRNAs but functionally compete to specify down-regulation or localization/stabilization, respectively, of hundreds of mRNA targets. Expression of many alternative 3' UTR isoforms was altered following CELF1 induction, with 3' UTR binding associated with down-regulation of isoforms and genes. The splicing of hundreds of alternative exons was oppositely regulated by these proteins, confirming an additional layer of regulatory antagonism previously observed in a handful of cases. The regulatory relationships between CELFs and MBNLs in control of both mRNA abundance and splicing appear to have evolved to enhance developmental transitions in major classes of heart and muscle genes.


Subject(s)
CELF Proteins/genetics , Gene Expression Regulation , Nerve Tissue Proteins/genetics , RNA Splicing , RNA, Messenger/genetics , RNA-Binding Proteins/genetics , Animals , CELF Proteins/metabolism , CELF1 Protein/genetics , CELF1 Protein/metabolism , Down-Regulation , Exons , Gene Expression Regulation, Developmental , Heart/physiology , Humans , Mice , Mice, Transgenic , Muscle, Skeletal/metabolism , Myotonic Dystrophy/genetics , Myotonic Dystrophy/therapy , Nerve Tissue Proteins/metabolism , Protein Isoforms/metabolism , RNA-Binding Proteins/metabolism , Sequence Analysis, RNA , Transcriptome
8.
Biophys J ; 106(4): 944-54, 2014 Feb 18.
Article in English | MEDLINE | ID: mdl-24559997

ABSTRACT

Conjugation is the primary mechanism of horizontal gene transfer that spreads antibiotic resistance among bacteria. Although conjugation normally occurs in surface-associated growth (e.g., biofilms), it has been traditionally studied in well-mixed liquid cultures lacking spatial structure, which is known to affect many evolutionary and ecological processes. Here we visualize spatial patterns of gene transfer mediated by F plasmid conjugation in a colony of Escherichia coli growing on solid agar, and we develop a quantitative understanding by spatial extension of traditional mass-action models. We found that spatial structure suppresses conjugation in surface-associated growth because strong genetic drift leads to spatial isolation of donor and recipient cells, restricting conjugation to rare boundaries between donor and recipient strains. These results suggest that ecological strategies, such as enforcement of spatial structure and enhancement of genetic drift, could complement molecular strategies in slowing the spread of antibiotic resistance genes.


Subject(s)
Conjugation, Genetic , Escherichia coli/genetics , Genetic Drift , Models, Genetic , Escherichia coli/physiology , F Factor/genetics
9.
Cancers (Basel) ; 16(1)2023 Dec 23.
Article in English | MEDLINE | ID: mdl-38201510

ABSTRACT

Estimating the abundance of cell-free DNA (cfDNA) fragments shed from a tumor (i.e., circulating tumor DNA (ctDNA)) can approximate tumor burden, which has numerous clinical applications. We derived a novel, broadly applicable statistical method to quantify cancer-indicative methylation patterns within cfDNA to estimate ctDNA abundance, even at low levels. Our algorithm identified differentially methylated regions (DMRs) between a reference database of cancer tissue biopsy samples and cfDNA from individuals without cancer. Then, without utilizing matched tissue biopsy, counts of fragments matching the cancer-indicative hyper/hypo-methylated patterns within DMRs were used to determine a tumor methylated fraction (TMeF; a methylation-based quantification of the circulating tumor allele fraction and estimate of ctDNA abundance) for plasma samples. TMeF and small variant allele fraction (SVAF) estimates of the same cancer plasma samples were correlated (Spearman's correlation coefficient: 0.73), and synthetic dilutions to expected TMeF of 10-3 and 10-4 had estimated TMeF within two-fold for 95% and 77% of samples, respectively. TMeF increased with cancer stage and tumor size and inversely correlated with survival probability. Therefore, tumor-derived fragments in the cfDNA of patients with cancer can be leveraged to estimate ctDNA abundance without the need for a tumor biopsy, which may provide non-invasive clinical approximations of tumor burden.

10.
Cancer Cell ; 40(12): 1537-1549.e12, 2022 12 12.
Article in English | MEDLINE | ID: mdl-36400018

ABSTRACT

In the Circulating Cell-free Genome Atlas (NCT02889978) substudy 1, we evaluate several approaches for a circulating cell-free DNA (cfDNA)-based multi-cancer early detection (MCED) test by defining clinical limit of detection (LOD) based on circulating tumor allele fraction (cTAF), enabling performance comparisons. Among 10 machine-learning classifiers trained on the same samples and independently validated, when evaluated at 98% specificity, those using whole-genome (WG) methylation, single nucleotide variants with paired white blood cell background removal, and combined scores from classifiers evaluated in this study show the highest cancer signal detection sensitivities. Compared with clinical stage and tumor type, cTAF is a more significant predictor of classifier performance and may more closely reflect tumor biology. Clinical LODs mirror relative sensitivities for all approaches. The WG methylation feature best predicts cancer signal origin. WG methylation is the most promising technology for MCED and informs development of a targeted methylation MCED test.


Subject(s)
Cell-Free Nucleic Acids , Neoplasms , Humans , Cell-Free Nucleic Acids/genetics , Early Detection of Cancer , Neoplasms/diagnosis , Neoplasms/genetics , Biomarkers, Tumor/genetics , DNA Methylation
11.
Nat Commun ; 10(1): 1338, 2019 03 22.
Article in English | MEDLINE | ID: mdl-30902979

ABSTRACT

Allele-specific protein-RNA binding is an essential aspect that may reveal functional genetic variants (GVs) mediating post-transcriptional regulation. Recently, genome-wide detection of in vivo binding of RNA-binding proteins is greatly facilitated by the enhanced crosslinking and immunoprecipitation (eCLIP) method. We developed a new computational approach, called BEAPR, to identify allele-specific binding (ASB) events in eCLIP-Seq data. BEAPR takes into account crosslinking-induced sequence propensity and variations between replicated experiments. Using simulated and actual data, we show that BEAPR largely outperforms often-used count analysis methods. Importantly, BEAPR overcomes the inherent overdispersion problem of these methods. Complemented by experimental validations, we demonstrate that the application of BEAPR to ENCODE eCLIP-Seq data of 154 proteins helps to predict functional GVs that alter splicing or mRNA abundance. Moreover, many GVs with ASB patterns have known disease relevance. Overall, BEAPR is an effective method that helps to address the outstanding challenge of functional interpretation of GVs.


Subject(s)
Alleles , Genetic Variation , RNA-Binding Proteins/metabolism , RNA/genetics , 3' Untranslated Regions/genetics , Amino Acid Motifs , Base Sequence , Computational Biology , Computer Simulation , Disease/genetics , Genetic Predisposition to Disease , Hep G2 Cells , Humans , K562 Cells , Polymorphism, Single Nucleotide/genetics , Protein Binding , Quantitative Trait Loci/genetics , RNA Helicases/metabolism , RNA Splicing/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Reproducibility of Results , Trans-Activators/metabolism
12.
Cell Rep ; 15(3): 666-679, 2016 Apr 19.
Article in English | MEDLINE | ID: mdl-27068461

ABSTRACT

Human pluripotent stem cells (hPSCs) require precise control of post-transcriptional RNA networks to maintain proliferation and survival. Using enhanced UV crosslinking and immunoprecipitation (eCLIP), we identify RNA targets of the IMP/IGF2BP family of RNA-binding proteins in hPSCs. At the broad region and binding site levels, IMP1 and IMP2 show reproducible binding to a large and overlapping set of 3' UTR-enriched targets. RNA Bind-N-seq applied to recombinant full-length IMP1 and IMP2 reveals CA-rich motifs that are enriched in eCLIP-defined binding sites. We observe that IMP1 loss in hPSCs recapitulates IMP1 phenotypes, including a reduction in cell adhesion and increase in cell death. For cell adhesion, we find IMP1 maintains levels of integrin mRNA specifically regulating RNA stability of ITGB5 in hPSCs. Additionally, we show that IMP1 can be linked to hPSC survival via direct target BCL2. Thus, transcriptome-wide binding profiles identify hPSC targets modulating well-characterized IMP1 roles.


Subject(s)
Cross-Linking Reagents/metabolism , Immunoprecipitation/methods , Pluripotent Stem Cells/cytology , Pluripotent Stem Cells/metabolism , RNA/metabolism , 3' Untranslated Regions/genetics , Base Sequence , Cell Adhesion , Cell Survival , Gene Expression Regulation , Human Embryonic Stem Cells/metabolism , Humans , Integrins/metabolism , Nucleotide Motifs/genetics , Protein Binding , Proto-Oncogene Proteins c-bcl-2/metabolism , RNA Stability , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/metabolism
13.
Nat Commun ; 7: 12143, 2016 07 05.
Article in English | MEDLINE | ID: mdl-27378374

ABSTRACT

The RNA-binding protein (RBP) TAF15 is implicated in amyotrophic lateral sclerosis (ALS). To compare TAF15 function to that of two ALS-associated RBPs, FUS and TDP-43, we integrate CLIP-seq and RNA Bind-N-Seq technologies, and show that TAF15 binds to ∼4,900 RNAs enriched for GGUA motifs in adult mouse brains. TAF15 and FUS exhibit similar binding patterns in introns, are enriched in 3' untranslated regions and alter genes distinct from TDP-43. However, unlike FUS and TDP-43, TAF15 has a minimal role in alternative splicing. In human neural progenitors, TAF15 and FUS affect turnover of their RNA targets. In human stem cell-derived motor neurons, the RNA profile associated with concomitant loss of both TAF15 and FUS resembles that observed in the presence of the ALS-associated mutation FUS R521G, but contrasts with late-stage sporadic ALS patients. Taken together, our findings reveal convergent and divergent roles for FUS, TAF15 and TDP-43 in RNA metabolism.


Subject(s)
Alternative Splicing/genetics , Amyotrophic Lateral Sclerosis/genetics , DNA-Binding Proteins/genetics , RNA-Binding Protein FUS/genetics , TATA-Binding Protein Associated Factors/genetics , 3' Untranslated Regions/genetics , Animals , Computational Biology/methods , DNA-Binding Proteins/metabolism , Disease Models, Animal , Female , Fibroblasts , Gene Knockdown Techniques , High-Throughput Nucleotide Sequencing/methods , Humans , Induced Pluripotent Stem Cells , Introns/genetics , Mice , Mice, Inbred C57BL , Motor Neurons/metabolism , Mutation , Oligonucleotides, Antisense/administration & dosage , Oligonucleotides, Antisense/genetics , Primary Cell Culture , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Small Interfering/metabolism , RNA-Binding Protein FUS/metabolism , Sequence Analysis, RNA/methods , TATA-Binding Protein Associated Factors/metabolism
14.
Neuron ; 92(4): 780-795, 2016 Nov 23.
Article in English | MEDLINE | ID: mdl-27773581

ABSTRACT

HnRNPA2B1 encodes an RNA binding protein associated with neurodegeneration. However, its function in the nervous system is unclear. Transcriptome-wide crosslinking and immunoprecipitation in mouse spinal cord discover UAGG motifs enriched within ∼2,500 hnRNP A2/B1 binding sites and an unexpected role for hnRNP A2/B1 in alternative polyadenylation. HnRNP A2/B1 loss results in alternative splicing (AS), including skipping of an exon in amyotrophic lateral sclerosis (ALS)-associated D-amino acid oxidase (DAO) that reduces D-serine metabolism. ALS-associated hnRNP A2/B1 D290V mutant patient fibroblasts and motor neurons differentiated from induced pluripotent stem cells (iPSC-MNs) demonstrate abnormal splicing changes, likely due to increased nuclear-insoluble hnRNP A2/B1. Mutant iPSC-MNs display decreased survival in long-term culture and exhibit hnRNP A2/B1 localization to cytoplasmic granules as well as exacerbated changes in gene expression and splicing upon cellular stress. Our findings provide a cellular resource and reveal RNA networks relevant to neurodegeneration, regulated by normal and mutant hnRNP A2/B1. VIDEO ABSTRACT.


Subject(s)
Alternative Splicing/genetics , Amyotrophic Lateral Sclerosis/genetics , Cell Survival/genetics , Fibroblasts/metabolism , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/genetics , Motor Neurons/metabolism , Protein Transport/genetics , Amyotrophic Lateral Sclerosis/metabolism , Animals , Case-Control Studies , D-Amino-Acid Oxidase/genetics , D-Amino-Acid Oxidase/metabolism , Fluorescent Antibody Technique , Gene Expression , Gene Expression Profiling , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/metabolism , Humans , Induced Pluripotent Stem Cells , Mice , Mutation , Polyadenylation
SELECTION OF CITATIONS
SEARCH DETAIL