RESUMO
Promoter-proximal RNA Pol II pausing is a critical step in transcriptional control. Pol II pausing has been predominantly studied in tissue culture systems. While Pol II pausing has been shown to be required for mammalian development, the phenotypic and mechanistic details of this requirement are unknown. Here, we found that loss of Pol II pausing stalls pluripotent state transitions within the epiblast of the early mouse embryo. Using Nelfb -/- mice and a NELFB degron mouse pluripotent stem cell model, we show that embryonic stem cells (ESCs) representing the naïve state of pluripotency successfully initiate a transition program but fail to balance levels of induced and repressed genes and enhancers in the absence of NELF. We found an increase in chromatin-associated NELF during transition from the naïve to later pluripotent states. Overall, our work defines the acute and long-term molecular consequences of NELF loss and reveals a role for Pol II pausing in the pluripotency continuum as a modulator of cell state transitions.
RESUMO
Because of diverged adaptative phenotypes, fish species of the genus Xiphophorus have contributed to a wide range of research for a century. Existing Xiphophorus genome assemblies are not at the chromosomal level and are prone to sequence gaps, thus hindering advancement of the intra- and inter-species differences for evolutionary, comparative, and translational biomedical studies. Herein, we assembled high-quality chromosome-level genome assemblies for three distantly related Xiphophorus species, namely, X. maculatus, X. couchianus, and X. hellerii Our overall goal is to precisely assess microevolutionary processes in the clade to ascertain molecular events that led to the divergence of the Xiphophorus species and to progress understanding of genetic incompatibility to disease. In particular, we measured intra- and inter-species divergence and assessed gene expression dysregulation in reciprocal interspecies hybrids among the three species. We found expanded gene families and positively selected genes associated with live bearing, a special mode of reproduction. We also found positively selected gene families are significantly enriched in nonpolymorphic transposable elements, suggesting the dispersal of these nonpolymorphic transposable elements has accompanied the evolution of the genes, possibly by incorporating new regulatory elements in support of the Britten-Davidson hypothesis. We characterized inter-specific polymorphisms, structural variants, and polymorphic transposable element insertions and assessed their association to interspecies hybridization-induced gene expression dysregulation related to specific disease states in humans.
Assuntos
Ciprinodontiformes , Elementos de DNA Transponíveis , Animais , Humanos , Elementos de DNA Transponíveis/genética , Epistasia Genética , Hibridização Genética , Ciprinodontiformes/genética , Ciprinodontiformes/metabolismoRESUMO
E-protein transcription factors limit group 2 innate lymphoid cell (ILC2) development while promoting T cell differentiation from common lymphoid progenitors. Inhibitors of DNA binding (ID) proteins block E-protein DNA binding in common lymphoid progenitors to allow ILC2 development. However, whether E-proteins influence ILC2 function upon maturity and activation remains unclear. Mice that overexpress ID1 under control of the thymus-restricted proximal Lck promoter (ID1tg/WT) have a large pool of primarily thymus-derived ILC2s in the periphery that develop in the absence of E-protein activity. We used these mice to investigate how the absence of E-protein activity affects ILC2 function and the genomic landscape in response to house dust mite (HDM) allergens. ID1tg/WT mice had increased KLRG1- ILC2s in the lung compared with wild-type (WT; ID1WT/WT) mice in response to HDM, but ID1tg/WT ILC2s had an impaired capacity to produce type 2 cytokines. Analysis of WT ILC2 accessible chromatin suggested that AP-1 and C/EBP transcription factors but not E-proteins were associated with ILC2 inflammatory gene programs. Instead, E-protein binding sites were enriched at functional genes in ILC2s during development that were later dynamically regulated in allergic lung inflammation, including genes that control ILC2 response to cytokines and interactions with T cells. Finally, ILC2s from ID1tg/WT compared with WT mice had fewer regions of open chromatin near functional genes that were enriched for AP-1 factor binding sites following HDM treatment. These data show that E-proteins shape the chromatin landscape during ILC2 development to dictate the functional capacity of mature ILC2s during allergic inflammation in the lung.
Assuntos
Antígenos de Dermatophagoides/imunologia , Asma/imunologia , Proteína 1 Inibidora de Diferenciação/metabolismo , Linfócitos T/imunologia , Fatores de Transcrição/metabolismo , Proteínas Adaptadoras de Transdução de Sinal/genética , Alérgenos/imunologia , Animais , Asma/patologia , Diferenciação Celular/imunologia , Cromatina/metabolismo , Citocinas/imunologia , Proteínas de Ligação a DNA/antagonistas & inibidores , Feminino , Lectinas Tipo C/genética , Pulmão/imunologia , Pulmão/patologia , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Pyroglyphidae/imunologia , Receptores Imunológicos/genética , Células-Tronco/citologia , Linfócitos T/citologia , Fator de Transcrição AP-1/metabolismoRESUMO
BACKGROUND: The red junglefowl, the wild outgroup of domestic chickens, has historically served as a reference for genomic studies of domestic chickens. These studies have provided insight into the etiology of traits of commercial importance. However, the use of a single reference genome does not capture diversity present among modern breeds, many of which have accumulated molecular changes due to drift and selection. While reference-based resequencing is well-suited to cataloging simple variants such as single-nucleotide changes and short insertions and deletions, it is mostly inadequate to discover more complex structural variation in the genome. METHODS: We present a pangenome for the domestic chicken consisting of thirty assemblies of chickens from different breeds and research lines. RESULTS: We demonstrate how this pangenome can be used to catalog structural variants present in modern breeds and untangle complex nested variation. We show that alignment of short reads from 100 diverse wild and domestic chickens to this pangenome reduces reference bias by 38%, which affects downstream genotyping results. This approach also allows for the accurate genotyping of a large and complex pair of structural variants at the K feathering locus using short reads, which would not be possible using a linear reference. CONCLUSIONS: We expect that this new paradigm of genomic reference will allow better pinpointing of exact mutations responsible for specific phenotypes, which will in turn be necessary for breeding chickens that meet new sustainability criteria and are resilient to quickly evolving pathogen threats.
Assuntos
Galinhas , Genoma , Animais , Galinhas/genética , Genótipo , Análise de Sequência de DNA , GenômicaRESUMO
BACKGROUND: Rearranged during transfection (RET) tyrosine kinase signaling has been previously implicated in endocrine resistant breast cancer, however the mechanism by which this signaling cascade promotes resistance is currently not well described. We recently reported that glial cell-derived neurotrophic factor (GDNF)-RET signaling appears to promote a positive feedback loop with the transcription factor early growth response 1 (EGR1). Here we investigate the mechanism behind this feedback loop and test the hypothesis that GDNF-RET signaling forms a regulatory loop with EGR1 to upregulate cyclin D1 (CCND1) transcription, leading to cell cycle progression and tamoxifen resistance. METHODS: To gain a better understanding of the GDNF-RET-EGR1 resistance mechanism, we studied the GDNF-EGR1 positive feedback loop and the role of GDNF and EGR1 in endocrine resistance by modulating their transcription levels using CRISPR-dCAS9 in tamoxifen sensitive (TamS) and tamoxifen resistant (TamR) MCF-7 cells. Additionally, we performed kinetic studies using recombinant GDNF (rGDNF) treatment of TamS cells. Finally, we performed cell proliferation assays using rGDNF, tamoxifen (TAM), and Palbociclib treatments in TamS cells. Statistical significance for qPCR and chromatin immunoprecipitation (ChIP)-qPCR experiments were determined using a student's paired t-test and statistical significance for the cell viability assay was a one-way ANOVA. RESULTS: GDNF-RET signaling formed a positive feedback loop with EGR1 and also downregulated estrogen receptor 1 (ESR1) transcription. Upregulation of GDNF and EGR1 promoted tamoxifen resistance in TamS cells and downregulation of GDNF promoted tamoxifen sensitivity in TamR cells. Additionally, we show that rGDNF treatment activated GDNF-RET signaling in TamS cells, leading to recruitment of phospho-ELK-1 to the EGR1 promoter, upregulation of EGR1 mRNA and protein, binding of EGR1 to the GDNF and CCND1 promoters, increased GDNF protein expression, and subsequent upregulation of CCND1 mRNA levels. We also show that inhibition of cyclin D1 with Palbociclib, in the presence of rGDNF, decreases cell proliferation and resensitizes cells to TAM. CONCLUSION: Outcomes from these studies support the hypotheses that GDNF-RET signaling forms a positive feedback loop with the transcription factor EGR1, and that GDNF-RET-EGR1 signaling promotes endocrine resistance via signaling to cyclin D1. Inhibition of components of this signaling pathway could lead to therapeutic insights into the treatment of endocrine resistant breast cancer.
Assuntos
Fator Neurotrófico Derivado de Linhagem de Célula Glial , Tamoxifeno , Linhagem Celular Tumoral , Proliferação de Células , Ciclina D1/genética , Ciclina D1/metabolismo , Resistencia a Medicamentos Antineoplásicos/genética , Retroalimentação , Fator Neurotrófico Derivado de Linhagem de Célula Glial/genética , Fator Neurotrófico Derivado de Linhagem de Célula Glial/metabolismo , Cinética , RNA Mensageiro , Transdução de Sinais , Tamoxifeno/farmacologia , Tamoxifeno/uso terapêutico , Fatores de Transcrição , HumanosRESUMO
BACKGROUND: The concentrations of distinct types of RNA in cells result from a dynamic equilibrium between RNA synthesis and decay. Despite the critical importance of RNA decay rates, current approaches for measuring them are generally labor-intensive, limited in sensitivity, and/or disruptive to normal cellular processes. Here, we introduce a simple method for estimating relative RNA half-lives that is based on two standard and widely available high-throughput assays: Precision Run-On sequencing (PRO-seq) and RNA sequencing (RNA-seq). RESULTS: Our method treats PRO-seq as a measure of transcription rate and RNA-seq as a measure of RNA concentration, and estimates the rate of RNA decay required for a steady-state equilibrium. We show that this approach can be used to assay relative RNA half-lives genome-wide, with good accuracy and sensitivity for both coding and noncoding transcription units. Using a structural equation model (SEM), we test several features of transcription units, nearby DNA sequences, and nearby epigenomic marks for associations with RNA stability after controlling for their effects on transcription. We find that RNA splicing-related features are positively correlated with RNA stability, whereas features related to miRNA binding and DNA methylation are negatively correlated with RNA stability. Furthermore, we find that a measure based on U1 binding and polyadenylation sites distinguishes between unstable noncoding and stable coding transcripts but is not predictive of relative stability within the mRNA or lincRNA classes. We also identify several histone modifications that are associated with RNA stability. CONCLUSION: We introduce an approach for estimating the relative half-lives of individual RNAs. Together, our estimation method and systematic analysis shed light on the pervasive impacts of RNA stability on cellular RNA concentrations.
Assuntos
Instabilidade Genômica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Estabilidade de RNA , Sequenciamento de Nucleotídeos em Larga Escala/instrumentação , Humanos , RNA-Seq/métodosRESUMO
BACKGROUND: The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins. RESULTS: We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo. CONCLUSIONS: Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation have distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis.
Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Sequências Reguladoras de Ácido Nucleico , Animais , Cromatina/genética , Regiões Promotoras Genéticas , Ouriços-do-Mar/genéticaRESUMO
BACKGROUND: Basenjis are considered an ancient dog breed of central African origins that still live and hunt with tribesmen in the African Congo. Nicknamed the barkless dog, Basenjis possess unique phylogeny, geographical origins and traits, making their genome structure of great interest. The increasing number of available canid reference genomes allows us to examine the impact the choice of reference genome makes with regard to reference genome quality and breed relatedness. RESULTS: Here, we report two high quality de novo Basenji genome assemblies: a female, China (CanFam_Bas), and a male, Wags. We conduct pairwise comparisons and report structural variations between assembled genomes of three dog breeds: Basenji (CanFam_Bas), Boxer (CanFam3.1) and German Shepherd Dog (GSD) (CanFam_GSD). CanFam_Bas is superior to CanFam3.1 in terms of genome contiguity and comparable overall to the high quality CanFam_GSD assembly. By aligning short read data from 58 representative dog breeds to three reference genomes, we demonstrate how the choice of reference genome significantly impacts both read mapping and variant detection. CONCLUSIONS: The growing number of high-quality canid reference genomes means the choice of reference genome is an increasingly critical decision in subsequent canid variant analyses. The basal position of the Basenji makes it suitable for variant analysis for targeted applications of specific dog breeds. However, we believe more comprehensive analyses across the entire family of canids is more suited to a pangenome approach. Collectively this work highlights the importance the choice of reference genome makes in all variation studies.
Assuntos
Lobos , Animais , China , Cromossomos , Cães , Feminino , Genoma , Genômica , Masculino , Lobos/genéticaRESUMO
Mitochondrial DNA (mtDNA) genes are long known to be cotranscribed in polycistrones, yet it remains impossible to study nascent mtDNA transcripts quantitatively in vivo using existing tools. To this end, we used deep sequencing (GRO-seq and PRO-seq) and analyzed nascent mtDNA-encoded RNA transcripts in diverse human cell lines and metazoan organisms. Surprisingly, accurate detection of human mtDNA transcription initiation sites (TISs) in the heavy and light strands revealed a novel conserved transcription pausing site near the light-strand TIS. This pausing site correlated with the presence of a bacterial pausing sequence motif, with reduced SNP density, and with a DNase footprinting signal in all tested cells. Its location within conserved sequence block 3 (CSBIII), just upstream of the known transcription-replication transition point, suggests involvement in such transition. Analysis of nonhuman organisms enabled de novo mtDNA sequence assembly, as well as detection of previously unknown mtDNA TIS, pausing, and transcription termination sites with unprecedented accuracy. Whereas mammals (Pan troglodytes, Macaca mulatta, Rattus norvegicus, and Mus musculus) showed a human-like mtDNA transcription pattern, the invertebrate pattern (Drosophila melanogaster and Caenorhabditis elegans) profoundly diverged. Our approach paves the path toward in vivo, quantitative, reference sequence-free analysis of mtDNA transcription in all eukaryotes.
Assuntos
DNA Mitocondrial/genética , Evolução Molecular , Sítio de Iniciação de Transcrição , Animais , DNA Mitocondrial/química , Humanos , Invertebrados/genética , Especificidade de Órgãos , Polimorfismo de Nucleotídeo Único , Primatas/genética , Roedores/genética , Iniciação da Transcrição Genética , TranscriptomaRESUMO
The American alligator, Alligator mississippiensis, like all crocodilians, has temperature-dependent sex determination, in which the sex of an embryo is determined by the incubation temperature of the egg during a critical period of development. The lack of genetic differences between male and female alligators leaves open the question of how the genes responsible for sex determination and differentiation are regulated. Insight into this question comes from the fact that exposing an embryo incubated at male-producing temperature to estrogen causes it to develop ovaries. Because estrogen response elements are known to regulate genes over long distances, a contiguous genome assembly is crucial for predicting and understanding their impact. We present an improved assembly of the American alligator genome, scaffolded with in vitro proximity ligation (Chicago) data. We use this assembly to scaffold two other crocodilian genomes based on synteny. We perform RNA sequencing of tissues from American alligator embryos to find genes that are differentially expressed between embryos incubated at male- versus female-producing temperature. Finally, we use the improved contiguity of our assembly along with the current model of CTCF-mediated chromatin looping to predict regions of the genome likely to contain estrogen-responsive genes. We find that these regions are significantly enriched for genes with female-biased expression in developing gonads after the critical period during which sex is determined by incubation temperature. We thus conclude that estrogen signaling is a major driver of female-biased gene expression in the post-temperature sensitive period gonads.
Assuntos
Jacarés e Crocodilos/genética , Sequência Conservada , Estrogênios/genética , Genoma , Transdução de Sinais , Jacarés e Crocodilos/embriologia , Animais , Fator de Ligação a CCCTC/metabolismo , Cromatina/metabolismo , Mapeamento de Sequências Contíguas , Estrogênios/metabolismo , Feminino , Masculino , Análise de Sequência de DNA , Processos de Determinação Sexual/genética , SinteniaRESUMO
BACKGROUND: Canine visceral hemangiosarcoma (HSA) is a highly aggressive cancer of endothelial origin that closely resembles visceral angiosarcoma in humans, both clinically and histopathologically. Currently there is an unmet need for new diagnostics and therapies for both forms of this disease. The goal of this study was to utilize Chromatin run-on sequencing (ChRO-seq) and immunohistochemistry (IHC) to identify gene and protein expression signatures that may be important drivers of HSA progression. RESULTS: ChRO-seq was performed on tissue isolated from 17 HSA samples and 4 normal splenic samples. Computational analysis was then used to identify differentially expressed genes and these factors were subjected to gene ontology analysis. ChRO-seq analysis revealed over a thousand differentially expressed genes in HSA tissue compared with normal splenic tissue (FDR < 0.005). Interestingly, the majority of genes overexpressed in HSA tumor tissue were associated with extracellular matrix (ECM) remodeling. This observation correlated well with our histological analysis, which found that HSA tumors contain a rich and complex collagen network. Additionally, we characterized the protein expression patterns of two highly overexpressed molecules identified in ChRO-seq analysis, podoplanin (PDPN) and laminin alpha 4 (LAMA4). We found that the expression of these two ECM-associated factors appeared to be largely limited to transformed endothelial cells within the HSA lesions. CONCLUSION: Outcomes from this study suggest that ECM remodeling plays an important role in HSA progression. Additionally, our study identified two potential novel biomarkers of HSA, PDPN and LAMA4. Interestingly, given that function-blocking anti-PDPN antibodies have shown anti-tumor effects in mouse models of canine melanoma, our studies raise the possibility that these types of therapeutic strategies could potentially be developed for treating canine HSA.
Assuntos
Doenças do Cão/patologia , Matriz Extracelular/patologia , Hemangiossarcoma/veterinária , Neoplasias Esplênicas/veterinária , Animais , Biomarcadores Tumorais , Cromatina/genética , Cromatina/metabolismo , Mapeamento Cromossômico , Cães , Regulação Neoplásica da Expressão Gênica , Ontologia Genética , Hemangiossarcoma/genética , Hemangiossarcoma/metabolismo , Glicoproteínas de Membrana/metabolismo , Baço/metabolismo , Neoplasias Esplênicas/genética , Neoplasias Esplênicas/metabolismoRESUMO
Intermittent hypoxia (IH) is a hallmark of obstructive sleep apnea (OSA) and induces metabolic dysfunction manifesting as inflammation, increased lipolysis and insulin resistance in visceral white adipose tissues (vWAT). However, the cell types and their corresponding transcriptional pathways underlying these functional perturbations are unknown. Here, we applied single nucleus RNA sequencing (snRNA-seq) coupled with aggregate RNA-seq methods to evaluate the cellular heterogeneity in vWAT following IH exposures mimicking OSA. C57BL/6 male mice were exposed to IH and room air (RA) for 6 weeks, and nuclei from vWAT were isolated and processed for snRNA-seq followed by differential expressed gene (DEGs) analyses by cell type, along with gene ontology and canonical pathways enrichment tests of significance. IH induced significant transcriptional changes compared to RA across 14 different cell types identified in vWAT. We identified cell-specific signature markers, transcriptional networks, metabolic signaling pathways, and cellular subpopulation enrichment in vWAT. Globally, we also identify 298 common regulated genes across multiple cellular types that are associated with metabolic pathways. Deconvolution of cell types in vWAT using global RNA-seq revealed that distinct adipocytes appear to be differentially implicated in key aspects of metabolic dysfunction. Thus, the heterogeneity of vWAT and its response to IH at the cellular level provides important insights into the metabolic morbidity of OSA and may possibly translate into therapeutic targets.
Assuntos
Adipócitos/metabolismo , Perfilação da Expressão Gênica , Hipóxia/metabolismo , Gordura Intra-Abdominal/metabolismo , Transcriptoma , Animais , Biologia Computacional/métodos , Regulação da Expressão Gênica , Ontologia Genética , Sequenciamento de Nucleotídeos em Larga Escala , Camundongos , Anotação de Sequência Molecular , Pequeno RNA não Traduzido , Análise de Célula ÚnicaRESUMO
PURPOSE: Nanocrystalline hydroxyapatite (nHA) cages have emerged as a new alternative to carbon fiber or polyether ether ketone (PEEK) devices to promote intervertebral fusion. No evidence has been published to date regarding rates of fusion for these devices after anterior cervical discectomy and fusion (ACDF). METHODS: Eight patients underwent one- or two-level ACDF with nHA intervertebral cages (Nanoss®-Cervical, Pioneer® Surgical Technology, Inc., Marquette, MI). Radiographs, neck disability index (NDI) and visual analog scores (VAS) for pain were taken preoperatively and at a minimum of 19 months postoperatively. RESULTS: At an average follow-up of 21 months, all eight patients (100%) achieved fusion as assessed by plain radiographs. Reduction in preoperative symptomology was comparable to previously published data with a mean reduction of neck VAS of 3, arm VAS of 6 and NDI reduced by 27%. Radiographs showed clear evidence of bridging bone. CONCLUSIONS: This series provides evidence that nHA intervertebral cages can successfully promote fusion after ACDF and may provide an alternative to carbon fiber and PEEK cages.
Assuntos
Materiais Biocompatíveis , Discotomia/métodos , Durapatita , Fixadores Internos , Fusão Vertebral/métodos , Adulto , Benzofenonas , Avaliação da Deficiência , Feminino , Seguimentos , Humanos , Cetonas , Masculino , Pessoa de Meia-Idade , Cervicalgia/diagnóstico por imagem , Cervicalgia/cirurgia , Medição da Dor , Polietilenoglicóis , Polímeros , Resultado do TratamentoRESUMO
Bacteria respond to environmental stimuli through precise regulation of transcription initiation and elongation. Bulk RNA sequencing primarily characterizes mature transcripts, so to identify actively transcribed loci we need to capture RNA polymerase (RNAP) complexed with nascent RNA. However, such capture methods have only previously been applied to culturable, genetically tractable organisms such as E. coli and B. subtilis. Here we apply precision run-on sequencing (PRO-seq) to profile nascent transcription in cultured E. coli and diverse uncultured bacteria. We demonstrate that PRO-seq can characterize the transcription of small, structured, or post-transcriptionally modified RNAs, which are often absent from bulk RNA-seq libraries. Applying PRO-seq to the human microbiome highlights taxon-specific RNAP pause motifs and pause-site distributions across non-coding RNA loci that reflect structure-coincident pausing. We also uncover concurrent transcription and cleavage of CRISPR guide RNAs and transfer RNAs. We demonstrate the utility of PRO-seq for exploring transcriptional dynamics in diverse microbial communities.
Assuntos
Escherichia coli , RNA Guia de Sistemas CRISPR-Cas , Humanos , Escherichia coli/genética , RNA Polimerases Dirigidas por DNA/genética , RNA/genética , Perfilação da Expressão GênicaRESUMO
Introduction: GWAS have identified multiple regions that confer risk for juvenile idiopathic arthritis (JIA). However, identifying the single nucleotide polymorphisms (SNPs) that drive disease risk is impeded by the SNPs' that identify risk loci being in linkage disequilibrium (LD) with hundreds of other SNPs. Since the causal SNPs remain unknown, it is difficult to identify target genes and use genetic information to inform patient care. We used genotyping and functional data in primary human monocytes/macrophages to nominate disease-driving SNPs on JIA risk haplotypes and identify their likely target genes. Methods: We identified JIA risk haplotypes using Immunochip data from Hinks et al (Nature Gen 2013) and the meta-analysis from McIntosh et al (Arthritis Rheum 2017). We used genotyping data from 3,939 children with JIA and 14,412 healthy controls to identify SNPs that: (1) were situated within open chromatin in multiple immune cell types and (2) were more common in children with JIA than the controls (p< 0.05). We intersected the chosen SNPs (n=846) with regions of bi-directional transcription initiation characteristic of non-coding regulatory regions detected using dREG to analyze GRO-seq data. Finally, we used MicroC data to identify gene promoters interacting with the regulatory regions harboring the candidate causal SNPs. Results: We identified 190 SNPs that overlap with dREG peaks in monocytes and126 SNPs that overlap with dREG peaks in macrophages. Of these SNPs, 101 were situated within dREG peaks in both monocytes and macrophages, suggesting that these SNPs exert their effects independent of the cellular activation state. MicroC data in monocytes identified 20 genes/transcripts whose promoters interact with the enhancers harboring the SNPs of interest. Conclusion: SNPs in JIA risk regions that are candidate causal variants can be further screened using functional data such as GRO-seq. This process identifies a finite number of candidate causal SNPs, the majority of which are likely to exert their biological effects independent of cellular activation state in monocytes. Three-dimensional chromatin data generated with MicroC identifies genes likely to be influenced by these SNPs. These studies demonstrate the importance of investigations into the role of innate immunity in JIA.
RESUMO
The ability of organisms to adapt to sudden extreme environmental changes produces some of the most drastic examples of rapid phenotypic evolution. The Mexican Tetra, Astyanax mexicanus, is abundant in the surface waters of northeastern Mexico, but repeated colonizations of cave environments have resulted in the independent evolution of troglomorphic phenotypes in several populations. Here, we present three chromosome-scale assemblies of this species, for one surface and two cave populations, enabling the first whole-genome comparisons between independently evolved cave populations to evaluate the genetic basis for the evolution of adaptation to the cave environment. Our assemblies represent the highest quality of sequence completeness with predicted protein-coding and noncoding gene metrics far surpassing prior resources and, to our knowledge, all long-read assembled teleost genomes, including zebrafish. Whole-genome synteny alignments show highly conserved gene order among cave forms in contrast to a higher number of chromosomal rearrangements when compared with other phylogenetically close or distant teleost species. By phylogenetically assessing gene orthology across distant branches of amniotes, we discover gene orthogroups unique to A. mexicanus. When compared with a representative surface fish genome, we find a rich amount of structural sequence diversity, defined here as the number and size of insertions and deletions as well as expanding and contracting repeats across cave forms. These new more complete genomic resources ensure higher trait resolution for comparative, functional, developmental, and genetic studies of drastic trait differences within a species.
Assuntos
Cavernas , Characidae , Cromossomos , Animais , Characidae/genética , Cromossomos/genética , Filogenia , Genoma , Genômica/métodos , Variação Genética , Locos de Características Quantitativas , Sintenia , FenótipoRESUMO
The fishing cat, Prionailurus viverrinus, faces a population decline, increasing the importance of maintaining healthy zoo populations. Unfortunately, zoo-managed individuals currently face a high prevalence of transitional cell carcinoma (TCC), a form of bladder cancer. To investigate the genetics of inherited diseases among captive fishing cats, we present a chromosome-scale assembly, generate the pedigree of the zoo-managed population, reaffirm the close genetic relationship with the Asian leopard cat (Prionailurus bengalensis), and identify 7.4 million single nucleotide variants (SNVs) and 23,432 structural variants (SVs) from whole genome sequencing (WGS) data of healthy and TCC cats. Only BRCA2 was found to have a high recurrent number of missense mutations in fishing cats diagnosed with TCC when compared to inherited human cancer risk variants. These new fishing cat genomic resources will aid conservation efforts to improve their genetic fitness and enhance the comparative study of feline genomes.
Assuntos
Carcinoma de Células de Transição , Neoplasias da Bexiga Urinária , Gatos , Animais , Humanos , Genoma/genética , Neoplasias da Bexiga Urinária/patologia , Carcinoma de Células de Transição/patologia , Genômica , Células Germinativas/patologiaRESUMO
Genome assembly can be challenging for species that are characterized by high amounts of polymorphism, heterozygosity, and large effective population sizes. High levels of heterozygosity can result in genome mis-assemblies and a larger than expected genome size due to the haplotig versions of a single locus being assembled as separate loci. Here, we describe the first chromosome-level genome for the eastern oyster, Crassostrea virginica. Publicly released and annotated in 2017, the assembly has a scaffold N50 of 54 mb and is over 97.3% complete based on BUSCO analysis. The genome assembly for the eastern oyster is a critical resource for foundational research into molluscan adaptation to a changing environment and for selective breeding for the aquaculture industry. Subsequent resequencing data suggested the presence of haplotigs in the original assembly, and we developed a post hoc method to break up chimeric contigs and mask haplotigs in published heterozygous genomes and evaluated improvements to the accuracy of downstream analysis. Masking haplotigs had a large impact on SNP discovery and estimates of nucleotide diversity and had more subtle and nuanced effects on estimates of heterozygosity, population structure analysis, and outlier detection. We show that haplotig masking can be a powerful tool for improving genomic inference, and we present an open, reproducible resource for the masking of haplotigs in any published genome.
Assuntos
Crassostrea , Animais , Crassostrea/genética , Genômica/métodos , Análise de Sequência de DNA , Polimorfismo Genético , Tamanho do GenomaRESUMO
Promoter-proximal pausing of RNA polymerase II (Pol II) is a key regulatory step during transcription. Despite the central role of pausing in gene regulation, we do not understand the evolutionary processes that led to the emergence of Pol II pausing or its transition to a rate-limiting step actively controlled by transcription factors. Here we analyzed transcription in species across the tree of life. Unicellular eukaryotes display a slow acceleration of Pol II near transcription start sites that transitioned to a longer-lived, focused pause in metazoans. This event coincided with the evolution of new subunits in the NELF and 7SK complexes. Depletion of NELF in mammals shifted the promoter-proximal buildup of Pol II from the pause site into the early gene body and compromised transcriptional activation for a set of heat shock genes. Our work details the evolutionary history of Pol II pausing and sheds light on how new transcriptional regulatory mechanisms evolve.