Search | Secretaria de Estado da Saúde

1.

Translational control of furina by an RNA regulon is important for left-right patterning, heart morphogenesis and cardiac valve function.

Nagorska, Agnieszka; Zaucker, Andreas; Lambert, Finnlay; Inman, Angus; Toral-Perez, Sara; Gorodkin, Jan; Wan, Yue; Smutny, Michael; Sampath, Karuna.

Development ; 150(23)2023 Dec 01.

Article in English | MEDLINE | ID: mdl-38032088

ABSTRACT

Heart development is a complex process that requires asymmetric positioning of the heart, cardiac growth and valve morphogenesis. The mechanisms controlling heart morphogenesis and valve formation are not fully understood. The pro-convertase FurinA functions in heart development across vertebrates. How FurinA activity is regulated during heart development is unknown. Through computational analysis of the zebrafish transcriptome, we identified an RNA motif in a variant FurinA transcript harbouring a long 3' untranslated region (3'UTR). The alternative 3'UTR furina isoform is expressed prior to organ positioning. Somatic deletions in the furina 3'UTR lead to embryonic left-right patterning defects. Reporter localisation and RNA-binding assays show that the furina 3'UTR forms complexes with the conserved RNA-binding translational repressor, Ybx1. Conditional ybx1 mutant embryos show premature and increased Furin reporter expression, abnormal cardiac morphogenesis and looping defects. Mutant ybx1 hearts have an expanded atrioventricular canal, abnormal sino-atrial valves and retrograde blood flow from the ventricle to the atrium. This is similar to observations in humans with heart valve regurgitation. Thus, the furina 3'UTR element/Ybx1 regulon is important for translational repression of FurinA and regulation of heart development.

Subjects

Regulon , Zebrafish , Animals , Humans , 3' Untranslated Regions , Regulon/genetics , Morphogenesis/genetics , Heart Valves , Zebrafish Proteins/genetics , Zebrafish Proteins/metabolism , Proprotein Convertases/genetics , Proprotein Convertases/metabolism

2.

CRISPRroots: on- and off-target assessment of RNA-seq data in CRISPR-Cas9 edited cells.

Corsi, Giulia I; Gadekar, Veerendra P; Gorodkin, Jan; Seemann, Stefan E.

Nucleic Acids Res ; 50(4): e20, 2022 02 28.

Article in English | MEDLINE | ID: mdl-34850137

ABSTRACT

The CRISPR-Cas9 genome editing tool is used to study genomic variants and gene knockouts, and can be combined with transcriptomic analyses to measure the effects of such alterations on gene expression. But how can one be sure that differential gene expression is due to a successful intended edit and not to an off-target event, without performing an often resource-demanding genome-wide sequencing of the edited cell or strain? To address this question we developed CRISPRroots: CRISPR-Cas9-mediated edits with accompanying RNA-seq data assessed for on-target and off-target sites. Our method combines Cas9 and guide RNA binding properties, gene expression changes, and sequence variants between edited and non-edited cells to discover potential off-targets. Applied on seven public datasets, CRISPRroots identified critical off-target candidates that were overlooked in all of the corresponding previous studies. CRISPRroots is available via https://rth.dk/resources/crispr.

Subjects

CRISPR-Cas Systems , Gene Editing , CRISPR-Cas Systems/genetics , Gene Editing/methods , Gene Knockout Techniques , Kinetoplastida Guide RNA/genetics , RNA-Seq

3.

Does rapid sequence divergence preclude RNA structure conservation in vertebrates?

Seemann, Stefan E; Mirza, Aashiq H; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Workman, Christopher T; Pociot, Flemming; Tommerup, Niels; Gorodkin, Jan; Ruzzo, Walter L.

Nucleic Acids Res ; 50(5): 2452-2463, 2022 03 21.

Article in English | MEDLINE | ID: mdl-35188540

ABSTRACT

Accelerated evolution of any portion of the genome is of significant interest, potentially signaling positive selection of phenotypic traits and adaptation. Accelerated evolution remains understudied for structured RNAs, despite the fact that an RNA's structure is often key to its function. RNA structures are typically characterized by compensatory (structure-preserving) basepair changes that are unexpected given the underlying sequence variation, i.e., they have evolved through negative selection on structure. We address the question of how fast the primary sequence of an RNA can change through evolution while conserving its structure. Specifically, we consider predicted and known structures in vertebrate genomes. After careful control of false discovery rates, we obtain 13 de novo structures (and three known Rfam structures) that we predict to have rapidly evolving sequences, defined as structures where the primary sequences of human and mouse have diverged at least twice as fast (1.5 times for Rfam) as nearby neutrally evolving sequences. Two of the three known structures function in translation inhibition related to infection and immune response. We conclude that rapid sequence divergence does not preclude RNA structure conservation in vertebrates, although these events are relatively rare.
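The selection criterion stated in this abstract (human-mouse divergence at least twice the nearby neutral rate, or 1.5 times for known Rfam structures) can be written as a simple ratio test. The sketch below is illustrative only: the function name, its arguments, and the idea of passing two precomputed divergence rates are assumptions, not the authors' pipeline, which also involves false discovery rate control that is not modeled here.

```python
def is_rapidly_evolving(structure_rate, neutral_rate, known_rfam=False):
    """Apply the ratio threshold described in the abstract.

    structure_rate: human-mouse sequence divergence of the structured region
    neutral_rate:   divergence of nearby neutrally evolving sequence
    known_rfam:     use the 1.5x threshold quoted for known Rfam structures
    (All three names are hypothetical, chosen for this illustration.)
    """
    if neutral_rate <= 0:
        raise ValueError("neutral_rate must be positive")
    threshold = 1.5 if known_rfam else 2.0
    return structure_rate / neutral_rate >= threshold
```

For example, a region with divergence 0.375 against a neutral rate of 0.25 has a ratio of 1.5, so under this sketch it would pass only if it were a known Rfam structure.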

Subjects

Genome , RNA , Animals , Molecular Evolution , Mice , Phylogeny , RNA/chemistry , RNA/genetics , Vertebrates/genetics

4.

The transcriptomic landscape of neurons carrying PSEN1 mutations reveals changes in extracellular matrix components and non-coding gene expression.

Corsi, Giulia I; Gadekar, Veerendra P; Haukedal, Henriette; Doncheva, Nadezhda T; Anthon, Christian; Ambardar, Sheetal; Palakodeti, Dasaradhi; Hyttel, Poul; Freude, Kristine; Seemann, Stefan E; Gorodkin, Jan.

Neurobiol Dis ; 178: 105980, 2023 03.

Article in English | MEDLINE | ID: mdl-36572121

ABSTRACT

Alzheimer's disease (AD) is a progressive and irreversible brain disorder, which can occur either sporadically, due to a complex combination of environmental, genetic, and epigenetic factors, or because of rare genetic variants in specific genes (familial AD, or fAD). A key hallmark of AD is the accumulation of amyloid beta (Aβ) and hyperphosphorylated Tau tangles in the brain, but the underlying pathomechanisms and interdependencies remain poorly understood. Here, we identify and characterise gene expression changes related to two fAD mutations (A79V and L150P) in the Presenilin-1 (PSEN1) gene. We do this by comparing the transcriptomes of glutamatergic forebrain neurons derived from fAD-mutant human induced pluripotent stem cells (hiPSCs) and their individual isogenic controls generated via precision CRISPR/Cas9 genome editing. Our analysis of Poly(A) RNA-seq data detects 1111 differentially expressed coding and non-coding genes significantly altered in fAD. Functional characterisation and pathway analysis of these genes reveal profound expression changes in constituents of the extracellular matrix, important to maintain the morphology, structural integrity, and plasticity of neurons, and in genes involved in calcium homeostasis and mitochondrial oxidative stress. Furthermore, by analysing total RNA-seq data we reveal that 30 out of 31 differentially expressed circular RNA genes are significantly upregulated in the fAD lines, and that these may contribute to the observed protein-coding gene expression changes. The results presented in this study contribute to a better understanding of the cellular mechanisms impacted in AD neurons, ultimately leading to neuronal damage and death.

Subjects

Alzheimer Disease , Induced Pluripotent Stem Cells , Humans , Amyloid beta-Peptides/metabolism , Transcriptome , Presenilin-1/genetics , Presenilin-1/metabolism , Induced Pluripotent Stem Cells/metabolism , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Mutation/genetics , Neurons/metabolism , Amyloid beta-Protein Precursor/genetics

5.

CRISPRon/off: CRISPR/Cas9 on- and off-target gRNA design.

Anthon, Christian; Corsi, Giulia Ilaria; Gorodkin, Jan.

Bioinformatics ; 38(24): 5437-5439, 2022 12 13.

Article in English | MEDLINE | ID: mdl-36271848

ABSTRACT

SUMMARY: The effectiveness of CRISPR/Cas9-mediated genome editing experiments largely depends on the guide RNA (gRNA) used by the CRISPR/Cas9 system for target recognition and cleavage activation. Careful design is necessary to select a gRNA with high editing efficiency at the on-target site and with minimum off-target potential. Here, we present our webserver for gRNA design with a user-friendly graphical interface, which provides interoperability between our on- and off-target prediction tools, CRISPRon and CRISPRoff, for a complete and streamlined gRNA selection. AVAILABILITY AND IMPLEMENTATION: The graphical interface uses the Integrative Genomics Viewer (IGV) JavaScript plugin. The backend tools are implemented in Python and C. The CRISPRon and CRISPRoff webservers and command-line tools are freely available at https://rth.dk/resources/crispr.

Subjects

CRISPR-Cas Systems , CRISPR-Cas Systems Guide RNA , CRISPR-Cas Systems/genetics , Gene Editing

6.

Alteration of microglial metabolism and inflammatory profile contributes to neurotoxicity in a hiPSC-derived microglia model of frontotemporal dementia 3.

Haukedal, Henriette; Syshøj Lorenzen, Signe; Winther Westi, Emil; Corsi, Giulia I; Gadekar, Veerendra P; McQuade, Amanda; Davtyan, Hayk; Doncheva, Nadezhda T; Schmid, Benjamin; Chandrasekaran, Abinaya; Seemann, Stefan E; Cirera, Susanna; Blurton-Jones, Mathew; Meyer, Morten; Gorodkin, Jan; Aldana, Blanca I; Freude, Kristine.

Brain Behav Immun ; 113: 353-373, 2023 10.

Article in English | MEDLINE | ID: mdl-37543250

ABSTRACT

Frontotemporal dementia (FTD) is a common cause of early-onset dementia, with no current treatment options. FTD linked to chromosome 3 (FTD3) is a rare sub-form of the disease, caused by a point mutation in the Charged Multivesicular Body Protein 2B (CHMP2B). This mutation causes neuronal phenotypes, such as mitochondrial deficiencies, accompanied by metabolic changes and interrupted endosomal-lysosomal fusion. However, the contribution of glial cells to FTD3 pathogenesis has, until recently, been largely unexplored. Glial cells play an important role in most neurodegenerative disorders as drivers and facilitators of neuroinflammation. Microglia are at the center of current investigations as potential pro-inflammatory drivers. While gliosis has been observed in FTD3 patient brains, it has not yet been systematically analyzed. In the light of this, we investigated the role of microglia in FTD3 by implementing human induced pluripotent stem cells (hiPSC) with either a heterozygous or homozygous CHMP2B mutation, introduced into a healthy control hiPSC line via CRISPR-Cas9 precision gene editing. These hiPSC were differentiated into microglia to evaluate the pro-inflammatory profile and metabolic state. Moreover, hiPSC-derived neurons were cultured with conditioned microglia media to investigate disease specific interactions between the two cell populations. Interestingly, we identified two divergent inflammatory microglial phenotypes resulting from the underlying mutations: a severe pro-inflammatory profile in CHMP2B homozygous FTD3 microglia, and an "unresponsive" CHMP2B heterozygous FTD3 microglial state. These findings correlate with our observations of increased phagocytic activity in CHMP2B homozygous, and impaired protein degradation in CHMP2B heterozygous FTD3 microglia. Metabolic mapping confirmed these differences, revealing a metabolic reprogramming of the CHMP2B FTD3 microglia, displayed as a compensatory up-regulation of glutamine metabolism in the CHMP2B homozygous FTD3 microglia. Intriguingly, conditioned CHMP2B homozygous FTD3 microglia media caused neurotoxic effects, which was not evident for the heterozygous microglia. Strikingly, IFN-γ treatment initiated an immune boost of the CHMP2B heterozygous FTD3 microglia, and conditioned microglia media exposure promoted neural outgrowth. Our findings indicate that the microglial profile, activity, and behavior are highly dependent on the status of the CHMP2B mutation. Our results suggest that the heterozygous state of the mutation in FTD3 patients could potentially be exploited in the form of immune-boosting intervention strategies to counteract neurodegeneration.

Subjects

Frontotemporal Dementia , Induced Pluripotent Stem Cells , Humans , Frontotemporal Dementia/genetics , Frontotemporal Dementia/metabolism , Frontotemporal Dementia/pathology , Induced Pluripotent Stem Cells/metabolism , Microglia/metabolism , Endosomal Sorting Complexes Required for Transport/genetics , Endosomal Sorting Complexes Required for Transport/metabolism

7.

Human pathways in animal models: possibilities and limitations.

Doncheva, Nadezhda T; Palasca, Oana; Yarani, Reza; Litman, Thomas; Anthon, Christian; Groenen, Martien A M; Stadler, Peter F; Pociot, Flemming; Jensen, Lars J; Gorodkin, Jan.

Nucleic Acids Res ; 49(4): 1859-1871, 2021 02 26.

Article in English | MEDLINE | ID: mdl-33524155

ABSTRACT

Animal models are crucial for advancing our knowledge about the molecular pathways involved in human diseases. However, it remains unclear to what extent tissue expression of pathways in healthy individuals is conserved between species. In addition, organism-specific information on pathways in animal models is often lacking. Within these limitations, we explore the possibilities that arise from publicly available data for the animal models mouse, rat, and pig. We approximate the animal pathway activity by integrating the human counterparts of curated pathways with tissue expression data from the models. Specifically, we compare whether the animal orthologs of the human genes are expressed in the same tissue. This is complicated by the lower coverage and worse quality of data in rat and pig as compared to mouse. Despite that, from 203 human KEGG pathways and the seven tissues with best experimental coverage, we identify 95 distinct pathways, for which the tissue expression in one animal model agrees better with human than the others. Our systematic pathway-tissue comparison between human and three animal models points to specific similarities with human and to distinct differences among the animal models, thereby suggesting the most suitable organism for modeling a human pathway or tissue.
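The comparison described in this abstract (checking whether the animal orthologs of a human pathway's genes are expressed in the same tissue) can be illustrated with a toy agreement score. This is a minimal sketch under stated assumptions: the data layout (dictionaries mapping genes to sets of tissues), the function names, and the simple match fraction are all hypothetical and are not taken from the paper, whose actual scoring and data handling may differ.

```python
def tissue_agreement(pathway_genes, human_expr, animal_expr, orthologs, tissue):
    """Fraction of pathway genes whose expressed/not-expressed call in
    `tissue` is the same in human and in the animal ortholog.

    human_expr / animal_expr: dict mapping gene -> set of tissues (assumed layout)
    orthologs: dict mapping human gene -> animal gene (assumed layout)
    Genes without an ortholog are skipped; returns None if none remain.
    """
    comparable = [g for g in pathway_genes if g in orthologs]
    if not comparable:
        return None
    matches = sum(
        (tissue in human_expr.get(g, set()))
        == (tissue in animal_expr.get(orthologs[g], set()))
        for g in comparable
    )
    return matches / len(comparable)


def best_model(pathway_genes, human_expr, models, tissue):
    """Return the name of the animal model with the highest agreement.

    models: dict mapping model name -> (animal_expr, orthologs)
    """
    scores = {}
    for name, (animal_expr, orthologs) in models.items():
        score = tissue_agreement(pathway_genes, human_expr, animal_expr, orthologs, tissue)
        if score is not None:
            scores[name] = score
    return max(scores, key=scores.get) if scores else None
```

Skipping genes without an ortholog mirrors the coverage problem the abstract mentions for rat and pig: a model with few mapped genes can score well on very little evidence, so a real analysis would need to account for coverage as well as agreement.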

Subjects

Animal Models , Animals , Gene Expression , Genome , Humans , Mice , Organ Specificity , Protein Interaction Mapping , Rats , Swine

8.

CRISPRi screen for enhancing heterologous α-amylase yield in Bacillus subtilis.

Geissler, Adrian Sven; Fehler, Annaleigh Ohrt; Poulsen, Line Dahl; González-Tortuero, Enrique; Kallehauge, Thomas Beuchert; Alkan, Ferhat; Anthon, Christian; Seemann, Stefan Ernst; Rasmussen, Michael Dolberg; Breüner, Anne; Hjort, Carsten; Vinther, Jeppe; Gorodkin, Jan.

J Ind Microbiol Biotechnol ; 50(1)2023 Feb 17.

Article in English | MEDLINE | ID: mdl-36564025

ABSTRACT

Yield improvements in cell factories can potentially be obtained by fine-tuning the regulatory mechanisms for gene candidates. In pursuit of such candidates, we performed RNA-sequencing of two α-amylase producing Bacillus strains and predicted hundreds of putative novel non-coding transcribed regions. Surprisingly, we found among hundreds of non-coding and structured RNA candidates that non-coding genomic regions are proportionally undergoing the highest changes in expression during fermentation. Since these classes of RNA are also understudied, we targeted the corresponding genomic regions with CRISPRi knockdown to test for any potential impact on the yield. From differential expression analysis, we selected 53 non-coding candidates. Although CRISPRi knockdowns target both the sense and the antisense strand, the CRISPRi experiment cannot link causes for yield changes to the sense or antisense disruption. Nevertheless, we observed several instances of strong changes in enzyme yield. The knockdown targeting the genomic region for a putative antisense RNA of the 3' UTR of the skfA-skfH operon led to a 21% increase in yield. In contrast, the knockdown targeting the genomic regions of putative antisense RNAs of the cytochrome c oxidase subunit 1 (ctaD), the sigma factor sigH, and the uncharacterized gene yhfT decreased yields by 31 to 43%.

Subjects

Bacillus subtilis , alpha-Amylases , alpha-Amylases/biosynthesis , alpha-Amylases/genetics , Bacillus subtilis/genetics , Bacillus subtilis/metabolism , RNA/genetics , RNA Sequence Analysis

9.

Flagella disruption in Bacillus subtilis increases amylase production yield.

Fehler, Annaleigh Ohrt; Kallehauge, Thomas Beuchert; Geissler, Adrian Sven; González-Tortuero, Enrique; Seemann, Stefan Ernst; Gorodkin, Jan; Vinther, Jeppe.

Microb Cell Fact ; 21(1): 131, 2022 Jul 02.

Article in English | MEDLINE | ID: mdl-35780132

ABSTRACT

BACKGROUND: Bacillus subtilis is a Gram-positive bacterium used as a cell factory for protein production. Over the last decades, the continued optimization of production strains has increased yields of enzymes, such as amylases, and made commercial applications feasible. However, current yields are still significantly lower than the theoretically possible yield based on the available carbon sources. In its natural environment, B. subtilis can respond to unfavorable growth conditions by differentiating into motile cells that use flagella to swim towards available nutrients. RESULTS: In this study, we analyze existing transcriptome data from a B. subtilis α-amylase production strain at different time points during a 5-day fermentation. We observe that genes of the fla/che operon, essential for flagella assembly and motility, are differentially expressed over time. To investigate whether expression of the flagella operon affects yield, we performed CRISPR-dCas9 based knockdown of the fla/che operon with sgRNAs targeting the genes flgE, fliR, and flhG, respectively. The knockdown resulted in inhibition of motility and a striking two- to threefold increase in α-amylase production yield. Moreover, replacing flgE (required for flagella hook assembly) with an erythromycin resistance gene followed by a transcription terminator increased α-amylase yield by about 30%. Transcript levels of the α-amylase were unaltered in the CRISPR-dCas9 knockdowns as well as the flgE deletion strain, but all manipulations disrupted the ability of cells to swim on agar. CONCLUSIONS: We demonstrate that the disruption of flagella in a B. subtilis α-amylase production strain, either by CRISPR-dCas9-based knockdown of the operon or by replacing flgE with an erythromycin resistance gene followed by a transcription terminator, increases the production of α-amylase in small-scale fermentation.

Subjects

Amylases , Bacillus subtilis , Flagella , alpha-Amylases , Amylases/genetics , Bacillus subtilis/genetics , Erythromycin , Flagella/genetics , alpha-Amylases/genetics , alpha-Amylases/metabolism

10.

Good guide, bad guide: spacer sequence-dependent cleavage efficiency of Cas12a.

Creutzburg, Sjoerd C A; Wu, Wen Y; Mohanraju, Prarthana; Swartjes, Thomas; Alkan, Ferhat; Gorodkin, Jan; Staals, Raymond H J; van der Oost, John.

Nucleic Acids Res ; 48(6): 3228-3243, 2020 04 06.

Article in English | MEDLINE | ID: mdl-31989168

ABSTRACT

Genome editing has recently made a revolutionary development with the introduction of the CRISPR-Cas technology. The programmable CRISPR-associated Cas9 and Cas12a nucleases generate specific dsDNA breaks in the genome, after which host DNA-repair mechanisms can be manipulated to implement the desired editing. Despite this spectacular progress, the efficiency of Cas9/Cas12a-based engineering can still be improved. Here, we address the variation in guide-dependent efficiency of Cas12a, and set out to reveal the molecular basis of this phenomenon. We established a sensitive and robust in vivo targeting assay based on loss of a target plasmid encoding the red fluorescent protein (mRFP). Our results suggest that folding of both the precursor guide (pre-crRNA) and the mature guide (crRNA) have a major influence on Cas12a activity. Especially, base pairing of the direct repeat, other than with itself, was found to be detrimental to the activity of Cas12a. Furthermore, we describe different approaches to minimize base-pairing interactions between the direct repeat and the variable part of the guide. We show that design of the 3' end of the guide, which is not involved in target strand base pairing, may result in substantial improvement of the guide's targeting potential and hence of its genome editing efficiency.

Subjects

Bacterial Proteins/genetics , CRISPR-Associated Proteins/genetics , CRISPR-Cas Systems/genetics , DNA Repair/genetics , Endodeoxyribonucleases/genetics , Gene Editing , CRISPR-Associated Protein 9/genetics , Escherichia coli/genetics , Luminescent Proteins/genetics , Plasmids/genetics , Kinetoplastida Guide RNA/genetics

11.

The identification and functional annotation of RNA structures conserved in vertebrates.

Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan.

Genome Res ; 27(8): 1371-1383, 2017 08.

Article in English | MEDLINE | ID: mdl-28487280

ABSTRACT

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ~516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality.

Subjects

Gene Expression Regulation , Nucleic Acid Conformation , RNA/chemistry , RNA/genetics , Transcriptional Regulatory Elements , Vertebrates/genetics , Animals , Base Sequence , Conserved Sequence , Human Genome , Humans , Mice , RNA/metabolism , RNA-Binding Proteins/metabolism , Sequence Homology , Genetic Transcription

12.

Inferring disease-associated long non-coding RNAs using genome-wide tissue expression profiles.

Pan, Xiaoyong; Jensen, Lars Juhl; Gorodkin, Jan.

Bioinformatics ; 35(9): 1494-1502, 2019 05 01.

Article in English | MEDLINE | ID: mdl-30295698

ABSTRACT

MOTIVATION: Long non-coding RNAs (lncRNAs) are important regulators in a wide variety of biological processes, which are linked to many diseases. Compared to protein-coding genes (PCGs), the association between diseases and lncRNAs is still not well studied. Thus, inferring disease-associated lncRNAs on a genome-wide scale has become imperative. RESULTS: In this study, we propose a machine learning-based method, DislncRF, which infers disease-associated lncRNAs on a genome-wide scale based on tissue expression profiles. DislncRF uses random forest models trained on expression profiles of known disease-associated PCGs across human tissues to extract general patterns between expression profiles and diseases. These models are then applied to score associations between lncRNAs and diseases. DislncRF was benchmarked against a gold standard dataset and compared to other methods. The results show that DislncRF yields promising performance and outperforms the existing methods. The utility of DislncRF is further substantiated on two diseases in which we find that top scoring candidates are supported by literature or independent datasets. AVAILABILITY AND IMPLEMENTATION: https://github.com/xypan1232/DislncRF. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Long Noncoding RNA/genetics , Genome , Humans , Machine Learning

13.

Translational co-regulation of a ligand and inhibitor by a conserved RNA element.

Zaucker, Andreas; Nagorska, Agnieszka; Kumari, Pooja; Hecker, Nikolai; Wang, Yin; Huang, Sizhou; Cooper, Ledean; Sivashanmugam, Lavanya; VijayKumar, Shruthi; Brosens, Jan; Gorodkin, Jan; Sampath, Karuna.

Nucleic Acids Res ; 46(1): 104-119, 2018 01 09.

Article in English | MEDLINE | ID: mdl-29059375

ABSTRACT

In many organisms, transcriptional and post-transcriptional regulation of components of pathways or processes has been reported. However, to date, there are few reports of translational co-regulation of multiple components of a developmental signaling pathway. Here, we show that an RNA element which we previously identified as a dorsal localization element (DLE) in the 3'UTR of zebrafish nodal-related1/squint (ndr1/sqt) ligand mRNA, is shared by the related ligand nodal-related2/cyclops (ndr2/cyc) and the nodal inhibitors, lefty1 (lft1) and lefty2 mRNAs. We investigated the activity of the DLEs through functional assays in live zebrafish embryos. The lft1 DLE localizes fluorescently labeled RNA similarly to the ndr1/sqt DLE. Similar to the ndr1/sqt 3'UTR, the lft1 and lft2 3'UTRs are bound by the RNA-binding protein (RBP) and translational repressor, Y-box binding protein 1 (Ybx1), whereas deletions in the DLE abolish binding to Ybx1. Analysis of zebrafish ybx1 mutants shows that Ybx1 represses lefty1 translation in embryos. CRISPR/Cas9-mediated inactivation of human YBX1 also results in human NODAL translational de-repression, suggesting broader conservation of the DLE RNA element/Ybx1 RBP module in regulation of Nodal signaling. Our findings demonstrate translational co-regulation of components of a signaling pathway by an RNA element conserved in both sequence and structure and an RBP, revealing a 'translational regulon'.

Subjects

Nonmammalian Embryo/metabolism , Developmental Gene Expression Regulation , Zebrafish Proteins/genetics , Zebrafish/genetics , 3' Untranslated Regions/genetics , Animals , Nonmammalian Embryo/embryology , HEK293 Cells , Humans , Intracellular Signaling Peptides and Proteins/genetics , Intracellular Signaling Peptides and Proteins/metabolism , Left-Right Determination Factors/genetics , Left-Right Determination Factors/metabolism , Ligands , Nodal Signaling Ligands/genetics , Nodal Signaling Ligands/metabolism , RNA/genetics , RNA/metabolism , Zebrafish/embryology , Zebrafish/metabolism , Zebrafish Proteins/metabolism

14.

Cytoscape StringApp: Network Analysis and Visualization of Proteomics Data.

Doncheva, Nadezhda T; Morris, John H; Gorodkin, Jan; Jensen, Lars J.

J Proteome Res ; 18(2): 623-632, 2019 02 01.

Article in English | MEDLINE | ID: mdl-30450911

ABSTRACT

Protein networks have become a popular tool for analyzing and visualizing the often long lists of proteins or genes obtained from proteomics and other high-throughput technologies. One of the most popular sources of such networks is the STRING database, which provides protein networks for more than 2000 organisms, including both physical interactions from experimental data and functional associations from curated pathways, automatic text mining, and prediction methods. However, its web interface is mainly intended for inspection of small networks and their underlying evidence. The Cytoscape software, on the other hand, is much better suited for working with large networks and offers greater flexibility in terms of network analysis, import, and visualization of additional data. To include both resources in the same workflow, we created stringApp, a Cytoscape app that makes it easy to import STRING networks into Cytoscape, retains the appearance and many of the features of STRING, and integrates data from associated databases. Here, we introduce many of the stringApp features and show how they can be used to carry out complex network analysis and visualization tasks on a typical proteomics data set, all through the Cytoscape user interface. stringApp is freely available from the Cytoscape app store: http://apps.cytoscape.org/apps/stringapp .

Subjects

Data Analysis , Proteomics/methods , Software , Computational Biology/methods , Internet , Protein Interaction Maps , User-Computer Interface

15.

Letter to the editor: Testing on external independent datasets is necessary to corroborate machine learning model improvement.

Corsi, Giulia Ilaria; Anthon, Christian; Gorodkin, Jan.

Bioinformatics ; 39(6)2023 06 01.

Article in English | MEDLINE | ID: mdl-37202356

Subjects

Machine Learning

16.

A splice-site variant in the lncRNA gene RP1-140A9.1 cosegregates in the large Volkmann cataract family.

Eiberg, Hans; Mikkelsen, Annemette F; Bak, Mads; Tommerup, Niels; Lund, Allan M; Wenzel, Anne; Sabarinathan, Radhakrishnan; Gorodkin, Jan; Bang-Berthelsen, Claus H; Hansen, Lars.

Mol Vis ; 25: 1-11, 2019.

Article in English | MEDLINE | ID: mdl-30820140

ABSTRACT

Purpose: To identify the mutation for Volkmann cataract (CTRCT8) at 1p36.33. Methods: The genes in the candidate region 1p36.33 were Sanger and parallel deep sequenced, and informative single nucleotide polymorphisms (SNPs) were identified for linkage analysis. Expression analysis with reverse transcription polymerase chain reaction (RT-PCR) of the candidate gene was performed using RNA from different human tissues. Quantitative reverse transcription polymerase chain reaction (qRT-PCR) analysis of the GNB1 gene was performed in affected and healthy individuals. Bioinformatic analysis of the linkage regions including the candidate gene was performed. Results: Linkage analysis of the 1p36.33 CCV locus applying new marker systems obtained with Sanger and deep sequencing reduced the candidate locus from 2.1 Mb to 0.389 Mb flanked by the markers STS-22AC and rs549772338 and resulted in a logarithm of the odds (LOD) score of Z = 21.67. The identified mutation, rs763295804, affects the donor splice site in the long non-coding RNA gene RP1-140A9.1 (ENSG00000231050). The gene including splice-site junctions is conserved in primates but not in other mammalian genomes, and two alternative transcripts were shown with RT-PCR. One of these transcripts represented a lens cell-specific transcript. Meta-analysis of the Cross-Linking-Immuno-Precipitation sequencing (CLIP-Seq) data suggested the RNA binding protein (RBP) eIF4AIII is an active counterpart for RP1-140A9.1, and several miRNA and transcription factor binding sites were predicted in the proximity of the mutation. ENCODE DNase I hypersensitivity and histone methylation and acetylation data suggest the genomic region may have regulatory functions. Conclusions: The mutation in RP1-140A9.1 suggests the long non-coding RNA as the candidate cataract gene associated with the autosomal dominant inherited congenital cataract from CCV. The mutation has the potential to destroy exon/intron splicing of both transcripts of RP1-140A9.1. Sanger and massive deep resequencing of the linkage region failed to identify alternative candidates, suggesting the mutation in RP1-140A9.1 is causative for the CCV phenotype.

Subjects

Cataract/congenital , Human Chromosome Pair 1/chemistry , Mutation , Long Noncoding RNA/genetics , Messenger RNA/genetics , Acetylation , Adult , Base Sequence , Binding Sites , Cataract/diagnosis , Cataract/genetics , Cataract/pathology , Eukaryotic Initiation Factor-4A/genetics , Eukaryotic Initiation Factor-4A/metabolism , Exons , Family , Female , Dominant Genes , Genetic Loci , Genetic Markers , High-Throughput Nucleotide Sequencing , Histones/genetics , Histones/metabolism , Humans , Introns , Male , Methylation , Middle Aged , Pedigree , RNA Splice Sites , RNA Splicing , Long Noncoding RNA/metabolism , Messenger RNA/metabolism

17.

RIsearch2: suffix array-based large-scale prediction of RNA-RNA interactions and siRNA off-targets.

Alkan, Ferhat; Wenzel, Anne; Palasca, Oana; Kerpedjiev, Peter; Rudebeck, Anders Frost; Stadler, Peter F; Hofacker, Ivo L; Gorodkin, Jan.

Nucleic Acids Res ; 45(8): e60, 2017 05 05.

Article in English | MEDLINE | ID: mdl-28108657

ABSTRACT

Intermolecular interactions of ncRNAs are at the core of gene regulation events, and identifying the full map of these interactions bears crucial importance for ncRNA functional studies. It is known that RNA-RNA interactions are built up by complementary base pairings between interacting RNAs and high level of complementarity between two RNA sequences is a powerful predictor of such interactions. Here, we present RIsearch2, a large-scale RNA-RNA interaction prediction tool that enables quick localization of potential near-complementary RNA-RNA interactions between given query and target sequences. In contrast to previous heuristics which either search for exact matches while including G-U wobble pairs or employ simplified energy models, we present a novel approach using a single integrated seed-and-extend framework based on suffix arrays. RIsearch2 enables fast discovery of candidate RNA-RNA interactions on genome/transcriptome-wide scale. We furthermore present an siRNA off-target discovery pipeline that not only predicts the off-target transcripts but also computes the off-targeting potential of a given siRNA. This is achieved by combining genome-wide RIsearch2 predictions with target site accessibilities and transcript abundance estimates. We show that this pipeline accurately predicts siRNA off-target interactions and enables off-targeting potential comparisons between different siRNA designs. RIsearch2 and the siRNA off-target discovery pipeline are available as stand-alone software packages from http://rth.dk/resources/risearch.
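
The seed step of a suffix-array search can be illustrated with a toy sketch. This is not the RIsearch2 implementation: it assumes exact Watson-Crick seeds only (no G-U wobble pairs, no energy model, no extension phase), and the function names and the `seed_len` parameter are hypothetical, chosen for illustration.

```python
from bisect import bisect_left, bisect_right

# Watson-Crick partners only; G-U wobble pairs are deliberately ignored here.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the sequence that pairs perfectly with `rna` (antiparallel)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def build_suffix_array(text):
    """Naive suffix array: start positions sorted by suffix (toy-sized input only)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_seed_sites(query, target, seed_len=6):
    """Return sorted (query_pos, target_pos) pairs where a query seed of
    length `seed_len` is perfectly complementary to the target."""
    sa = build_suffix_array(target)
    # Truncating sorted suffixes to a fixed length keeps them in sorted order,
    # so binary search on the truncated prefixes is valid.
    prefixes = [target[i:i + seed_len] for i in sa]
    hits = []
    for q in range(len(query) - seed_len + 1):
        site = reverse_complement(query[q:q + seed_len])
        lo = bisect_left(prefixes, site)
        hi = bisect_right(prefixes, site)
        hits.extend((q, sa[k]) for k in range(lo, hi))
    return sorted(hits)


if __name__ == "__main__":
    print(find_seed_sites("AAGGCU", "CCAGCCUUGG"))
```

In a full seed-and-extend scheme, each hit would then be extended in both directions and scored; that step is omitted here.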

Subjects

Statistical Models , Small Interfering RNA/genetics , Untranslated RNA/genetics , Software , Transcriptome , Algorithms , Base Pairing , Base Sequence , Tumor Cell Line , Humans , Genetic Models , Small Interfering RNA/metabolism , Untranslated RNA/metabolism

18.

Identification and characterization of novel conserved RNA structures in Drosophila.

Kirsch, Rebecca; Seemann, Stefan E; Ruzzo, Walter L; Cohen, Stephen M; Stadler, Peter F; Gorodkin, Jan.

BMC Genomics ; 19(1): 899, 2018 Dec 11.

Article in English | MEDLINE | ID: mdl-30537930

ABSTRACT

BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome. RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ~30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, the most statistically significant enrichment of CRSs was observed in pupae, mainly in exons of untranslated regions, promoters, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units. CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.

Subjects

Conserved Sequence , Drosophila melanogaster/genetics , RNA/chemistry , Animals , Base Composition/genetics , Base Sequence , Drosophila melanogaster/growth & development , Gene Expression Regulation , Molecular Sequence Annotation , Untranslated RNA/genetics , Software

19.

RNAscClust: clustering RNA sequences using structure conservation and graph based motifs.

Miladi, Milad; Junge, Alexander; Costa, Fabrizio; Seemann, Stefan E; Havgaard, Jakob Hull; Gorodkin, Jan; Backofen, Rolf.

Bioinformatics ; 33(14): 2089-2096, 2017 Jul 15.

Article in English | MEDLINE | ID: mdl-28334186

ABSTRACT

MOTIVATION: Clustering RNA sequences with common secondary structure is an essential step towards studying RNA function. Whereas structural RNA alignment strategies typically identify common structure for orthologous structured RNAs, clustering seeks to group paralogous RNAs based on structural similarities. However, existing approaches for clustering paralogous RNAs do not take the compensatory base pair changes obtained from structure conservation in orthologous sequences into account. RESULTS: Here, we present RNAscClust, the implementation of a new algorithm to cluster a set of structured RNAs taking their respective structural conservation into account. For a set of multiple structural alignments of RNA sequences, each containing a paralog sequence included in a structural alignment of its orthologs, RNAscClust computes minimum free-energy structures for each sequence using conserved base pairs as prior information for the folding. The paralogs are then clustered using a graph kernel-based strategy, which identifies common structural features. We show that the clustering accuracy clearly benefits from an increasing degree of compensatory base pair changes in the alignments. AVAILABILITY AND IMPLEMENTATION: RNAscClust is available at http://www.bioinf.uni-freiburg.de/Software/RNAscClust. CONTACT: gorodkin@rth.dk or backofen@informatik.uni-freiburg.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

RNA/chemistry , RNA Sequence Analysis/methods , Software , Algorithms , Cluster Analysis , Humans , Nucleic Acid Conformation

20.

Simultaneous DNA and RNA Mapping of Somatic Mitochondrial Mutations across Diverse Human Cancers.

Stewart, James B; Alaei-Mahabadi, Babak; Sabarinathan, Radhakrishnan; Samuelsson, Tore; Gorodkin, Jan; Gustafsson, Claes M; Larsson, Erik.

PLoS Genet ; 11(6): e1005333, 2015 Jun.

Article in English | MEDLINE | ID: mdl-26125550

ABSTRACT

Somatic mutations in the nuclear genome are required for tumor formation, but the functional consequences of somatic mitochondrial DNA (mtDNA) mutations are less understood. Here we identify somatic mtDNA mutations across 527 tumors and 14 cancer types, using an approach that takes advantage of evidence from both genomic and transcriptomic sequencing. We find that there is selective pressure against deleterious coding mutations, supporting that functional mitochondria are required in tumor cells, and also observe a strong mutational strand bias, compatible with endogenous replication-coupled errors as the major source of mutations. Interestingly, while allelic ratios in general were consistent in RNA compared to DNA, some mutations in tRNAs displayed strong allelic imbalances caused by accumulation of unprocessed tRNA precursors. The effect was explained by altered secondary structure, demonstrating that correct tRNA folding is a major determinant for processing of polycistronic mitochondrial transcripts. Additionally, the data suggest that tRNA clusters are preferably processed in the 3' to 5' direction. Our study gives insights into mtDNA function in cancer and answers questions regarding mitochondrial tRNA biogenesis that are difficult to address in controlled experimental systems.

Subjects

Mitochondria/genetics , Mutation , Neoplasms/genetics , Alleles , Mitochondrial DNA , Neoplasm DNA/genetics , Mitochondrial Genome , Humans , Neoplasm RNA , Transfer RNA/genetics , RNA Sequence Analysis
