Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 70
Filter
1.
bioRxiv ; 2024 Apr 03.
Article in English | MEDLINE | ID: mdl-38559255

ABSTRACT

Our understanding of how the DNA sequences of cis-regulatory elements encode transcription initiation patterns remains limited. Here we introduce CLIPNET, a deep learning model trained on population-scale PRO-cap data that accurately predicts the position and quantity of transcription initiation with single nucleotide resolution from DNA sequence. Interpretation of CLIPNET revealed a complex regulatory syntax consisting of DNA-protein interactions in five major positions between -200 and +50 bp relative to the transcription start site, as well as more subtle positional preferences among different transcriptional activators. Transcriptional activator and core promoter motifs occupy different positions and play distinct roles in regulating initiation, with the former driving initiation quantity and the latter initiation position. We identified core promoter motifs that explain initiation patterns in the majority of promoters and enhancers, including DPR motifs and AT-rich TBP binding sequences in TATA-less promoters. Our results provide insights into the sequence architecture governing transcription initiation.

2.
Nat Microbiol ; 9(1): 241-250, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38172625

ABSTRACT

Bacteria respond to environmental stimuli through precise regulation of transcription initiation and elongation. Bulk RNA sequencing primarily characterizes mature transcripts, so to identify actively transcribed loci we need to capture RNA polymerase (RNAP) complexed with nascent RNA. However, such capture methods have only previously been applied to culturable, genetically tractable organisms such as E. coli and B. subtilis. Here we apply precision run-on sequencing (PRO-seq) to profile nascent transcription in cultured E. coli and diverse uncultured bacteria. We demonstrate that PRO-seq can characterize the transcription of small, structured, or post-transcriptionally modified RNAs, which are often absent from bulk RNA-seq libraries. Applying PRO-seq to the human microbiome highlights taxon-specific RNAP pause motifs and pause-site distributions across non-coding RNA loci that reflect structure-coincident pausing. We also uncover concurrent transcription and cleavage of CRISPR guide RNAs and transfer RNAs. We demonstrate the utility of PRO-seq for exploring transcriptional dynamics in diverse microbial communities.


Subject(s)
Escherichia coli , RNA, Guide, CRISPR-Cas Systems , Humans , Escherichia coli/genetics , DNA-Directed RNA Polymerases/genetics , RNA/genetics , Gene Expression Profiling
3.
Nat Genet ; 55(8): 1370-1380, 2023 08.
Article in English | MEDLINE | ID: mdl-37430091

ABSTRACT

How enhancers control target gene expression over long genomic distances remains an important unsolved problem. Here we investigated enhancer-promoter communication by integrating data from nucleosome-resolution genomic contact maps, nascent transcription and perturbations affecting either RNA polymerase II (Pol II) dynamics or the activity of thousands of candidate enhancers. Integration of new Micro-C experiments with published CRISPRi data demonstrated that enhancers spend more time in close proximity to their target promoters in functional enhancer-promoter pairs compared to nonfunctional pairs, which can be attributed in part to factors unrelated to genomic position. Manipulation of the transcription cycle demonstrated a key role for Pol II in enhancer-promoter interactions. Notably, promoter-proximal paused Pol II itself partially stabilized interactions. We propose an updated model in which elements of transcriptional dynamics shape the duration or frequency of interactions to facilitate enhancer-promoter communication.


Subject(s)
Enhancer Elements, Genetic , RNA Polymerase II , RNA Polymerase II/genetics , RNA Polymerase II/metabolism , Enhancer Elements, Genetic/genetics , Promoter Regions, Genetic/genetics , Transcription, Genetic
4.
Nat Commun ; 14(1): 1753, 2023 03 29.
Article in English | MEDLINE | ID: mdl-36990976

ABSTRACT

During meiotic prophase I, spermatocytes must balance transcriptional activation with homologous recombination and chromosome synapsis, biological processes requiring extensive changes to chromatin state. We explored the interplay between chromatin accessibility and transcription through prophase I of mammalian meiosis by measuring genome-wide patterns of chromatin accessibility, nascent transcription, and processed mRNA. We find that Pol II is loaded on chromatin and maintained in a paused state early during prophase I. In later stages, paused Pol II is released in a coordinated transcriptional burst mediated by the transcription factors A-MYB and BRDT, resulting in ~3-fold increase in transcription. Transcriptional activity is temporally and spatially segregated from key steps of meiotic recombination: double strand breaks show evidence of chromatin accessibility earlier during prophase I and at distinct loci from those undergoing transcriptional activation, despite shared chromatin marks. Our findings reveal mechanisms underlying chromatin specialization in either transcription or recombination in meiotic cells.


Subject(s)
Meiosis , RNA Polymerase II , Animals , Male , Chromatin/genetics , Chromosomes , Gene Expression Regulation , Mammals/genetics , Meiosis/genetics , RNA Polymerase II/genetics , Spermatocytes , Proto-Oncogene Proteins/metabolism , Trans-Activators/metabolism , Nuclear Proteins/metabolism
5.
Res Sq ; 2023 Mar 24.
Article in English | MEDLINE | ID: mdl-36993251

ABSTRACT

Promoter-proximal pausing of RNA polymerase II (Pol II) is a key regulatory step during transcription. Despite the central role of pausing in gene regulation, we do not understand the evolutionary processes that led to the emergence of Pol II pausing or its transition to a rate-limiting step actively controlled by transcription factors. Here we analyzed transcription in species across the tree of life. We found that unicellular eukaryotes display a slow acceleration of Pol II near transcription start sites. This proto-paused-like state transitioned to a longer, focused pause in derived metazoans which coincided with the evolution of new subunits in the NELF and 7SK complexes. Depletion of NELF reverts the mammalian focal pause to a proto-pause-like state and compromises transcriptional activation for a set of heat shock genes. Collectively, this work details the evolutionary history of Pol II pausing and sheds light on how new transcriptional regulatory mechanisms evolve.

6.
BMC Cancer ; 23(1): 138, 2023 Feb 10.
Article in English | MEDLINE | ID: mdl-36765275

ABSTRACT

BACKGROUND: Rearranged during transfection (RET) tyrosine kinase signaling has been previously implicated in endocrine resistant breast cancer, however the mechanism by which this signaling cascade promotes resistance is currently not well described. We recently reported that glial cell-derived neurotrophic factor (GDNF)-RET signaling appears to promote a positive feedback loop with the transcription factor early growth response 1 (EGR1). Here we investigate the mechanism behind this feedback loop and test the hypothesis that GDNF-RET signaling forms a regulatory loop with EGR1 to upregulate cyclin D1 (CCND1) transcription, leading to cell cycle progression and tamoxifen resistance. METHODS: To gain a better understanding of the GDNF-RET-EGR1 resistance mechanism, we studied the GDNF-EGR1 positive feedback loop and the role of GDNF and EGR1 in endocrine resistance by modulating their transcription levels using CRISPR-dCAS9 in tamoxifen sensitive (TamS) and tamoxifen resistant (TamR) MCF-7 cells. Additionally, we performed kinetic studies using recombinant GDNF (rGDNF) treatment of TamS cells. Finally, we performed cell proliferation assays using rGDNF, tamoxifen (TAM), and Palbociclib treatments in TamS cells. Statistical significance for qPCR and chromatin immunoprecipitation (ChIP)-qPCR experiments were determined using a student's paired t-test and statistical significance for the cell viability assay was a one-way ANOVA. RESULTS: GDNF-RET signaling formed a positive feedback loop with EGR1 and also downregulated estrogen receptor 1 (ESR1) transcription. Upregulation of GDNF and EGR1 promoted tamoxifen resistance in TamS cells and downregulation of GDNF promoted tamoxifen sensitivity in TamR cells. Additionally, we show that rGDNF treatment activated GDNF-RET signaling in TamS cells, leading to recruitment of phospho-ELK-1 to the EGR1 promoter, upregulation of EGR1 mRNA and protein, binding of EGR1 to the GDNF and CCND1 promoters, increased GDNF protein expression, and subsequent upregulation of CCND1 mRNA levels. We also show that inhibition of cyclin D1 with Palbociclib, in the presence of rGDNF, decreases cell proliferation and resensitizes cells to TAM. CONCLUSION: Outcomes from these studies support the hypotheses that GDNF-RET signaling forms a positive feedback loop with the transcription factor EGR1, and that GDNF-RET-EGR1 signaling promotes endocrine resistance via signaling to cyclin D1. Inhibition of components of this signaling pathway could lead to therapeutic insights into the treatment of endocrine resistant breast cancer.


Subject(s)
Glial Cell Line-Derived Neurotrophic Factor , Tamoxifen , Cell Line, Tumor , Cell Proliferation , Cyclin D1/genetics , Cyclin D1/metabolism , Drug Resistance, Neoplasm/genetics , Feedback , Glial Cell Line-Derived Neurotrophic Factor/genetics , Glial Cell Line-Derived Neurotrophic Factor/metabolism , Kinetics , RNA, Messenger , Signal Transduction , Tamoxifen/pharmacology , Tamoxifen/therapeutic use , Transcription Factors , Humans
7.
Methods Mol Biol ; 2599: 215-226, 2023.
Article in English | MEDLINE | ID: mdl-36427152

ABSTRACT

Run-on and sequencing assays like GRO-seq, PRO-seq, and ChRO-seq allow for joint profiling of transcription activity of transcriptional regulatory elements (TREs), i.e., promoters and active enhancers, and target genes. Variation in biological conditions, such as treated vs. control, results in changes in the activity of transcription factors (TFs), which induces concerted changes in TREs and target genes. By modeling the differences between two biological conditions, we developed the computational pipeline known as tfTarget that predicts a set of putative TREs and target genes responding to each TF under the biological condition of interest. In this chapter, we demonstrate the use of the new web-based tfTarget in mapping transcription regulation using run-on sequencing data.


Subject(s)
Gene Expression Regulation , Regulatory Elements, Transcriptional , Transcription Factors/genetics , Promoter Regions, Genetic , Internet
8.
BMC Genomics ; 23(1): 792, 2022 Dec 01.
Article in English | MEDLINE | ID: mdl-36457077

ABSTRACT

Somatic mutations drive colorectal cancer (CRC) by disrupting gene regulatory mechanisms. Distinct combinations of mutations can result in unique changes to regulatory mechanisms leading to variability in the efficacy of therapeutics. MicroRNAs are important regulators of gene expression, and their activity can be altered by oncogenic mutations. However, it is unknown how distinct combinations of CRC-risk mutations differentially affect microRNAs. Here, using genetically-modified mouse intestinal organoid (enteroid) models, we identify 12 different modules of microRNA expression patterns across different combinations of mutations common in CRC. We also show that miR-24-3p is aberrantly upregulated in genetically-modified mouse enteroids irrespective of mutational context. Furthermore, we identify an enrichment of miR-24-3p predicted targets in downregulated gene lists from various mutational contexts compared to WT. In follow-up experiments, we demonstrate that miR-24-3p promotes CRC cell survival in multiple cell contexts. Our novel characterization of genotype-specific patterns of miRNA expression offer insight into the mechanisms that drive inter-tumor heterogeneity and highlight candidate microRNA therapeutic targets for the advancement of precision medicine for CRC.


Subject(s)
Colorectal Neoplasms , MicroRNAs , Animals , Mice , Cell Survival/genetics , Colorectal Neoplasms/genetics , Genotype , MicroRNAs/genetics , Organoids
9.
Genes Dev ; 2022 Aug 18.
Article in English | MEDLINE | ID: mdl-35981753

ABSTRACT

Promoter-proximal RNA Pol II pausing is a critical step in transcriptional control. Pol II pausing has been predominantly studied in tissue culture systems. While Pol II pausing has been shown to be required for mammalian development, the phenotypic and mechanistic details of this requirement are unknown. Here, we found that loss of Pol II pausing stalls pluripotent state transitions within the epiblast of the early mouse embryo. Using Nelfb -/- mice and a NELFB degron mouse pluripotent stem cell model, we show that embryonic stem cells (ESCs) representing the naïve state of pluripotency successfully initiate a transition program but fail to balance levels of induced and repressed genes and enhancers in the absence of NELF. We found an increase in chromatin-associated NELF during transition from the naïve to later pluripotent states. Overall, our work defines the acute and long-term molecular consequences of NELF loss and reveals a role for Pol II pausing in the pluripotency continuum as a modulator of cell state transitions.

10.
Elife ; 112022 07 01.
Article in English | MEDLINE | ID: mdl-35775732

ABSTRACT

How DNA sequence affects the dynamics and position of RNA Polymerase II (Pol II) during transcription remains poorly understood. Here, we used naturally occurring genetic variation in F1 hybrid mice to explore how DNA sequence differences affect the genome-wide distribution of Pol II. We measured the position and orientation of Pol II in eight organs collected from heterozygous F1 hybrid mice using ChRO-seq. Our data revealed a strong genetic basis for the precise coordinates of transcription initiation and promoter proximal pause, allowing us to redefine molecular models of core transcriptional processes. Our results implicate DNA sequence, including both known and novel DNA sequence motifs, as key determinants of the position of Pol II initiation and pause. We report evidence that initiation site selection follows a stochastic process similar to Brownian motion along the DNA template. We found widespread differences in the position of transcription termination, which impact the primary structure and stability of mature mRNA. Finally, we report evidence that allelic changes in transcription often affect mRNA and ncRNA expression across broad genomic domains. Collectively, we reveal how DNA sequences shape core transcriptional processes at single nucleotide resolution in mammals.


Subject(s)
RNA Polymerase II , Transcription, Genetic , Animals , Mammals/genetics , Mice , Promoter Regions, Genetic , RNA Polymerase II/genetics , RNA Polymerase II/metabolism , RNA, Messenger/metabolism , Transcription Initiation Site
11.
Nat Cancer ; 3(4): 505-517, 2022 04.
Article in English | MEDLINE | ID: mdl-35469013

ABSTRACT

Inferring single-cell compositions and their contributions to global gene expression changes from bulk RNA sequencing (RNA-seq) datasets is a major challenge in oncology. Here we develop Bayesian cell proportion reconstruction inferred using statistical marginalization (BayesPrism), a Bayesian method to predict cellular composition and gene expression in individual cell types from bulk RNA-seq, using patient-derived, scRNA-seq as prior information. We conduct integrative analyses in primary glioblastoma, head and neck squamous cell carcinoma and skin cutaneous melanoma to correlate cell type composition with clinical outcomes across tumor types, and explore spatial heterogeneity in malignant and nonmalignant cell states. We refine current cancer subtypes using gene expression annotation after exclusion of confounding nonmalignant cells. Finally, we identify genes whose expression in malignant cells correlates with macrophage infiltration, T cells, fibroblasts and endothelial cells across multiple tumor types. Our work introduces a new lens to accurately infer cellular composition and expression in large cohorts of bulk RNA-seq data.


Subject(s)
Head and Neck Neoplasms , Melanoma , Skin Neoplasms , Bayes Theorem , Endothelial Cells , Gene Expression , Humans , Melanoma/genetics , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Melanoma, Cutaneous Malignant
12.
Cancer Discov ; 12(7): 1782-1803, 2022 07 06.
Article in English | MEDLINE | ID: mdl-35443279

ABSTRACT

SETD2 is the sole histone methyltransferase responsible for H3K36me3, with roles in splicing, transcription initiation, and DNA damage response. Homozygous disruption of SETD2 yields a tumor suppressor effect in various cancers. However, SETD2 mutation is typically heterozygous in diffuse large B-cell lymphomas. Here we show that heterozygous Setd2 deficiency results in germinal center (GC) hyperplasia and increased competitive fitness, with reduced DNA damage checkpoint activity and apoptosis, resulting in accelerated lymphomagenesis. Impaired DNA damage sensing in Setd2-haploinsufficient germinal center B (GCB) and lymphoma cells associated with increased AICDA-induced somatic hypermutation, complex structural variants, and increased translocations including those activating MYC. DNA damage was selectively increased on the nontemplate strand, and H3K36me3 loss was associated with greater RNAPII processivity and mutational burden, suggesting that SETD2-mediated H3K36me3 is required for proper sensing of cytosine deamination. Hence, Setd2 haploinsufficiency delineates a novel GCB context-specific oncogenic pathway involving defective epigenetic surveillance of AICDA-mediated effects on transcribed genes. SIGNIFICANCE: Our findings define a B cell-specific oncogenic effect of SETD2 heterozygous mutation, which unleashes AICDA mutagenesis of nontemplate strand DNA in the GC reaction, resulting in lymphomas with heavy mutational burden. GC-derived lymphomas did not tolerate SETD2 homozygous deletion, pointing to a novel context-specific therapeutic vulnerability. This article is highlighted in the In This Issue feature, p. 1599.


Subject(s)
B-Lymphocytes , Cytidine Deaminase , Germinal Center , Haploinsufficiency , Histone-Lysine N-Methyltransferase , Somatic Hypermutation, Immunoglobulin , Cytidine Deaminase/metabolism , Histone-Lysine N-Methyltransferase/genetics , Homozygote , Humans , Sequence Deletion
13.
Nat Genet ; 54(3): 295-305, 2022 03.
Article in English | MEDLINE | ID: mdl-35273399

ABSTRACT

The role of histone modifications in transcription remains incompletely understood. Here, we examine the relationship between histone modifications and transcription using experimental perturbations combined with sensitive machine-learning tools. Transcription predicted the variation in active histone marks and complex chromatin states, like bivalent promoters, down to single-nucleosome resolution and at an accuracy that rivaled the correspondence between independent ChIP-seq experiments. Blocking transcription rapidly removed two punctate marks, H3K4me3 and H3K27ac, from chromatin indicating that transcription is required for active histone modifications. Transcription was also required for maintenance of H3K27me3, consistent with a role for RNA in recruiting PRC2. A subset of DNase-I-hypersensitive sites were refractory to prediction, precluding models where transcription initiates pervasively at any open chromatin. Our results, in combination with past literature, support a model in which active histone modifications serve a supportive, rather than an essential regulatory, role in transcription.


Subject(s)
Histones , Protein Processing, Post-Translational , Chromatin/genetics , Histone Code/genetics , Histones/genetics , Histones/metabolism , Nucleosomes/genetics , Protein Processing, Post-Translational/genetics
14.
J Immunol ; 208(5): 1007-1020, 2022 03 01.
Article in English | MEDLINE | ID: mdl-35181641

ABSTRACT

E-protein transcription factors limit group 2 innate lymphoid cell (ILC2) development while promoting T cell differentiation from common lymphoid progenitors. Inhibitors of DNA binding (ID) proteins block E-protein DNA binding in common lymphoid progenitors to allow ILC2 development. However, whether E-proteins influence ILC2 function upon maturity and activation remains unclear. Mice that overexpress ID1 under control of the thymus-restricted proximal Lck promoter (ID1tg/WT) have a large pool of primarily thymus-derived ILC2s in the periphery that develop in the absence of E-protein activity. We used these mice to investigate how the absence of E-protein activity affects ILC2 function and the genomic landscape in response to house dust mite (HDM) allergens. ID1tg/WT mice had increased KLRG1- ILC2s in the lung compared with wild-type (WT; ID1WT/WT) mice in response to HDM, but ID1tg/WT ILC2s had an impaired capacity to produce type 2 cytokines. Analysis of WT ILC2 accessible chromatin suggested that AP-1 and C/EBP transcription factors but not E-proteins were associated with ILC2 inflammatory gene programs. Instead, E-protein binding sites were enriched at functional genes in ILC2s during development that were later dynamically regulated in allergic lung inflammation, including genes that control ILC2 response to cytokines and interactions with T cells. Finally, ILC2s from ID1tg/WT compared with WT mice had fewer regions of open chromatin near functional genes that were enriched for AP-1 factor binding sites following HDM treatment. These data show that E-proteins shape the chromatin landscape during ILC2 development to dictate the functional capacity of mature ILC2s during allergic inflammation in the lung.


Subject(s)
Antigens, Dermatophagoides/immunology , Asthma/immunology , Inhibitor of Differentiation Protein 1/metabolism , T-Lymphocytes/immunology , Transcription Factors/metabolism , Adaptor Proteins, Signal Transducing/genetics , Allergens/immunology , Animals , Asthma/pathology , Cell Differentiation/immunology , Chromatin/metabolism , Cytokines/immunology , DNA-Binding Proteins/antagonists & inhibitors , Female , Lectins, C-Type/genetics , Lung/immunology , Lung/pathology , Male , Mice , Mice, Inbred C57BL , Pyroglyphidae/immunology , Receptors, Immunologic/genetics , Stem Cells/cytology , T-Lymphocytes/cytology , Transcription Factor AP-1/metabolism
15.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042807

ABSTRACT

Genomics encompasses the entire tree of life, both extinct and extant, and the evolutionary processes that shape this diversity. To date, genomic research has focused on humans, a small number of agricultural species, and established laboratory models. Fewer than 18,000 of ∼2,000,000 eukaryotic species (<1%) have a representative genome sequence in GenBank, and only a fraction of these have ancillary information on genome structure, genetic variation, gene expression, epigenetic modifications, and population diversity. This imbalance reflects a perception that human studies are paramount in disease research. Yet understanding how genomes work, and how genetic variation shapes phenotypes, requires a broad view that embraces the vast diversity of life. We have the technology to collect massive and exquisitely detailed datasets about the world, but expertise is siloed into distinct fields. A new approach, integrating comparative genomics with cell and evolutionary biology, ecology, archaeology, anthropology, and conservation biology, is essential for understanding and protecting ourselves and our world. Here, we describe potential for scientific discovery when comparative genomics works in close collaboration with a broad range of fields as well as the technical, scientific, and social constraints that must be addressed.


Subject(s)
Biodiversity , Biological Evolution , Genomics/methods , Animals , Evolution, Molecular , Genetic Variation/genetics , Genome/genetics , Genomics/trends , Humans , Phylogeny
16.
Nat Commun ; 12(1): 6590, 2021 11 15.
Article in English | MEDLINE | ID: mdl-34782625

ABSTRACT

The advent of animal husbandry and hunting increased human exposure to zoonotic pathogens. To understand how a zoonotic disease may have influenced human evolution, we study changes in human expression of anthrax toxin receptor 2 (ANTXR2), which encodes a cell surface protein necessary for Bacillus anthracis virulence toxins to cause anthrax disease. In immune cells, ANTXR2 is 8-fold down-regulated in all available human samples compared to non-human primates, indicating regulatory changes early in the evolution of modern humans. We also observe multiple genetic signatures consistent with recent positive selection driving a European-specific decrease in ANTXR2 expression in multiple tissues affected by anthrax toxins. Our observations fit a model in which humans adapted to anthrax disease following early ecological changes associated with hunting and scavenging, as well as a second period of adaptation after the rise of modern agriculture.


Subject(s)
Evolution, Molecular , Gene Expression Regulation , Receptors, Peptide/genetics , Receptors, Peptide/metabolism , Animals , Antigens, Bacterial , Bacillus anthracis/genetics , Bacterial Toxins , Cell Line , Down-Regulation , Humans , K562 Cells , Membrane Proteins/metabolism , Virulence , Zoonoses
17.
BMC Genomics ; 22(1): 751, 2021 Oct 19.
Article in English | MEDLINE | ID: mdl-34666684

ABSTRACT

BACKGROUND: The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins. RESULTS: We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo. CONCLUSIONS: Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation have distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis.


Subject(s)
Gene Expression Regulation, Developmental , Regulatory Sequences, Nucleic Acid , Animals , Chromatin/genetics , Promoter Regions, Genetic , Sea Urchins/genetics
18.
Bioinformatics ; 37(24): 4727-4736, 2021 12 11.
Article in English | MEDLINE | ID: mdl-34382072

ABSTRACT

MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-sequencing data (DENR), which models nascent RNA-sequencing read-counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical 'shape profile' of read-counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. AVAILABILITY AND IMPLEMENTATION: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
RNA Isoforms , RNA , Humans , RNA Isoforms/genetics , Software , Protein Isoforms/genetics , Sequence Analysis, RNA/methods , Eukaryotic Initiation Factors/genetics
19.
Genome Biol Evol ; 13(7)2021 07 06.
Article in English | MEDLINE | ID: mdl-34117762

ABSTRACT

In butterflies and moths, which exhibit highly variable sex determination mechanisms, the homogametic Z chromosome is deeply conserved and is featured in many genome assemblies. The evolution and origin of the female W sex chromosome, however, remains mostly unknown. Previous studies have proposed that a ZZ/Z0 sex determination system is ancestral to Lepidoptera, and that W chromosomes may originate from sex-linked B chromosomes. Here, we sequence and assemble the female Dryas iulia genome into 32 highly contiguous ordered and oriented chromosomes, including the Z and W sex chromosomes. We then use sex-specific Hi-C, ATAC-seq, PRO-seq, and whole-genome DNA sequence data sets to test if features of the D. iulia W chromosome are consistent with a hypothesized B chromosome origin. We show that the putative W chromosome displays female-associated DNA sequence, gene expression, and chromatin accessibility to confirm the sex-linked function of the W sequence. In contrast with expectations from studies of homologous sex chromosomes, highly repetitive DNA content on the W chromosome, the sole presence of domesticated repetitive elements in functional DNA, and lack of sequence homology with the Z chromosome or autosomes is most consistent with a B chromosome origin for the W, although it remains challenging to rule out extensive sequence divergence. Synteny analysis of the D. iulia W chromosome with other female lepidopteran genome assemblies shows no homology between W chromosomes and suggests multiple, independent origins of the W chromosome from a B chromosome likely occurred in butterflies.


Subject(s)
Butterflies , Moths , Animals , Butterflies/genetics , Female , Genome , Male , Moths/genetics , Sex Chromosomes/genetics , Synteny
20.
Mol Biol Evol ; 38(10): 4449-4462, 2021 09 27.
Article in English | MEDLINE | ID: mdl-34146107

ABSTRACT

Major changes in chromosome number and structure are linked to a series of evolutionary phenomena, including intrinsic barriers to gene flow or suppression of recombination due to chromosomal rearrangements. However, chromosome rearrangements can also affect the fundamental dynamics of molecular evolution within populations by changing relationships between linked loci and altering rates of recombination. Here, we build chromosome-level assembly Eueides isabella and, together with a recent chromosome-level assembly of Dryas iulia, examine the evolutionary consequences of multiple chromosome fusions in Heliconius butterflies. These assemblies pinpoint fusion points on 10 of the 20 autosomal chromosomes and reveal striking differences in the characteristics of fused and unfused chromosomes. The ten smallest autosomes in D. iulia and E. isabella, which have each fused to a longer chromosome in Heliconius, have higher repeat and GC content, and longer introns than predicted by their chromosome length. When fused, these characteristics change to become more in line with chromosome length. The fusions also led to reduced diversity, which likely reflects increased background selection and selection against introgression between diverging populations, following a reduction in per-base recombination rate. We further show that chromosome size and fusion impact turnover rates of functional loci at a macroevolutionary scale. Together these results provide further evidence that chromosome fusion in Heliconius likely had dramatic effects on population level processes shaping rates of neutral and adaptive divergence. These effects may have impacted patterns of diversification in Heliconius, a classic example of an adaptive radiation.


Subject(s)
Butterflies , Animals , Biological Evolution , Butterflies/genetics , Chromosomes/genetics , Evolution, Molecular , Genetic Variation
SELECTION OF CITATIONS
SEARCH DETAIL
...