Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 154
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
EMBO J ; 42(12): e112362, 2023 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-37155573

RESUMEN

eIF3, whose subunits are frequently overexpressed in cancer, regulates mRNA translation from initiation to termination, but mRNA-selective functions of individual subunits remain poorly defined. Using multiomic profiling upon acute depletion of eIF3 subunits, we observed that while eIF3a, b, e, and f markedly differed in their impact on eIF3 holo-complex formation and translation, they were each required for cancer cell proliferation and tumor growth. Remarkably, eIF3k showed the opposite pattern with depletion promoting global translation, cell proliferation, tumor growth, and stress resistance through repressing the synthesis of ribosomal proteins, especially RPS15A. Whereas ectopic expression of RPS15A mimicked the anabolic effects of eIF3k depletion, disruption of eIF3 binding to the 5'-UTR of RSP15A mRNA negated them. eIF3k and eIF3l are selectively downregulated in response to endoplasmic reticulum and oxidative stress. Supported by mathematical modeling, our data uncover eIF3k-l as a mRNA-specific module which, through controlling RPS15A translation, serves as a rheostat of ribosome content, possibly to secure spare translational capacity that can be mobilized during stress.


Asunto(s)
Factor 3 de Iniciación Eucariótica , Neoplasias , Humanos , Factor 3 de Iniciación Eucariótica/genética , Factor 3 de Iniciación Eucariótica/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Ribosomas/genética , Ribosomas/metabolismo , Proteínas Ribosómicas/genética , Proteínas Ribosómicas/metabolismo , Neoplasias/genética , Neoplasias/metabolismo , Biosíntesis de Proteínas
2.
PLoS Comput Biol ; 20(6): e1012214, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38848440

RESUMEN

CRISPR is a gene editing technology which enables precise in-vivo genome editing; but its potential is hampered by its relatively low specificity and sensitivity. Improving CRISPR's on-target and off-target effects requires a better understanding of its mechanism and determinants. Here we demonstrate, for the first time, the chromosomal 3D spatial structure's association with CRISPR's cleavage efficiency, and its predictive capabilities. We used high-resolution Hi-C data to estimate the 3D distance between different regions in the human genome and utilized these spatial properties to generate 3D-based features, characterizing each region's density. We evaluated these features based on empirical, in-vivo CRISPR efficiency data and compared them to 425 features used in state-of-the-art models. The 3D features ranked in the top 13% of the features, and significantly improved the predictive power of LASSO and xgboost models trained with these features. The features indicated that sites with lower spatial density demonstrated higher efficiency. Understanding how CRISPR is affected by the 3D DNA structure provides insight into CRISPR's mechanism in general and improves our ability to correctly predict CRISPR's cleavage as well as design sgRNAs for therapeutic and scientific use.


Asunto(s)
Sistemas CRISPR-Cas , Edición Génica , Genoma Humano , Humanos , Sistemas CRISPR-Cas/genética , Edición Génica/métodos , Genoma Humano/genética , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Biología Computacional/métodos , Genómica/métodos , Conformación de Ácido Nucleico , ADN/genética , ADN/química , ADN/metabolismo
3.
Cell ; 141(2): 344-54, 2010 Apr 16.
Artículo en Inglés | MEDLINE | ID: mdl-20403328

RESUMEN

Recent years have seen intensive progress in measuring protein translation. However, the contributions of coding sequences to the efficiency of the process remain unclear. Here, we identify a universally conserved profile of translation efficiency along mRNAs computed based on adaptation between coding sequences and the tRNA pool. In this profile, the first approximately 30-50 codons are, on average, translated with a low efficiency. Additionally, in eukaryotes, the last approximately 50 codons show the highest efficiency over the full coding sequence. The profile accurately predicts position-dependent ribosomal density along yeast genes. These data suggest that translation speed and, as a consequence, ribosomal density are encoded by coding sequences and the tRNA pool. We suggest that the slow "ramp" at the beginning of mRNAs serves as a late stage of translation initiation, forming an optimal and robust means to reduce ribosomal traffic jams, thus minimizing the cost of protein expression.


Asunto(s)
Evolución Biológica , Codón/metabolismo , Biosíntesis de Proteínas , ARN de Transferencia/metabolismo , Saccharomyces cerevisiae/genética , Selección Genética , ARN de Transferencia/genética , Ribosomas/metabolismo , Saccharomyces cerevisiae/citología , Saccharomyces cerevisiae/metabolismo
4.
Nucleic Acids Res ; 50(3): 1297-1316, 2022 02 22.
Artículo en Inglés | MEDLINE | ID: mdl-35100399

RESUMEN

Translation Complex Profile Sequencing (TCP-seq), a protocol that was developed and implemented on Saccharomyces cerevisiae, provides the footprints of the small subunit (SSU) of the ribosome (with additional factors) across the entire transcriptome of the analyzed organism. In this study, based on the TCP-seq data, we developed for the first-time a predictive model of the SSU density and analyzed the effect of transcript features on the dynamics of the SSU scan in the 5'UTR. Among others, our model is based on novel tools for detecting complex statistical relations tailored to TCP-seq. We quantitatively estimated the effect of several important features, including the context of the upstream AUG, the upstream ORF length and the mRNA folding strength. Specifically, we suggest that around 50% of the variance related to the read counts (RC) distribution near a start codon can be attributed to the AUG context score. We provide the first large scale direct quantitative evidence that shows that indeed AUG context affects the small sub-unit movement. In addition, we suggest that strong folding may cause the detachment of the SSU from the mRNA. We also identified a number of novel sequence motifs that can affect the SSU scan; some of these motifs affect transcription factors and RNA binding proteins. The results presented in this study provide a better understanding of the biophysical aspects related to the SSU scan along the 5'UTR and of translation initiation in S. cerevisiae, a fundamental step toward a comprehensive modeling of initiation.


Asunto(s)
Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae , Regiones no Traducidas 5' , Codón Iniciador/metabolismo , Biosíntesis de Proteínas , ARN Mensajero/metabolismo , Ribosomas/genética , Ribosomas/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
5.
Bioinformatics ; 36(22-23): 5398-5404, 2021 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-33320173

RESUMEN

MOTIVATION: MicroRNAs (miRNAs) are short (∼24nt), non-coding RNAs, which downregulate gene expression in many species and physiological processes. Many details regarding the mechanism which governs miRNA-mediated repression continue to elude researchers. RESULTS: We elucidate the interplay between the coding sequence and the 3'UTR, by using elastic net regularization and incorporating translation-related features to predict miRNA-mediated repression. We find that miRNA binding sites at the end of the coding sequence contribute to repression, and that weak binding sites are linked to effective de-repression, possibly as a result of competing with stronger binding sites. Furthermore, we propose a recycling model for miRNAs dissociated from the open reading frame (ORF) by traversing ribosomes, explaining the observed link between increased ribosome density/traversal speed and increased repression. We uncover a novel layer of interaction between the coding sequence and the 3'UTR (untranslated region) and suggest the ORF has a larger role than previously thought in the mechanism of miRNA-mediated repression. AVAILABILITY AND IMPLEMENTATION: The code is freely available at https://github.com/aescrdni/miRNA_model. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

6.
Semin Cell Dev Biol ; 90: 78-93, 2019 06.
Artículo en Inglés | MEDLINE | ID: mdl-30030143

RESUMEN

The regulation of gene expression is mediated via the complex three-dimensional (3D) conformation of the genetic material and its interactions with various intracellular factors. Various experimental and computational approaches have been developed in recent years for understating the relation between the 3D conformation of the genome and the phenotypes of cells in normal condition and diseases. In this review, we will discuss novel approaches for analyzing and modeling the 3D genomic conformation, focusing on deciphering disease-causing mutations that affect gene expression. We conclude that as this is a very challenging mission, an important direction should involve the comparative analysis of various 3D models from various organisms or cells.


Asunto(s)
Biología Computacional , Modelos Genéticos , Algoritmos , Expresión Génica , Humanos , Fenotipo
7.
PLoS Comput Biol ; 16(7): e1008038, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32649657

RESUMEN

The importance of mRNA translation models has been demonstrated across many fields of science and biotechnology. However, a whole cell model with codon resolution and biophysical dynamics is still lacking. We describe a whole cell model of translation for E. coli. The model simulates all major translation components in the cell: ribosomes, mRNAs and tRNAs. It also includes, for the first time, fundamental aspects of translation, such as competition for ribosomes and tRNAs at a codon resolution while considering tRNAs wobble interactions and tRNA recycling. The model uses parameters that are tightly inferred from large scale measurements of translation. Furthermore, we demonstrate a robust modelling approach which relies on state-of-the-art practices of translation modelling and also provides a framework for easy generalizations. This novel approach allows simulation of thousands of mRNAs that undergo translation in the same cell with common resources such as ribosomes and tRNAs in feasible time. Based on this model, we demonstrate, for the first time, the direct importance of competition for resources on translation and its accurate modelling. An effective supply-demand ratio (ESDR) measure, which is related to translation factors such as tRNAs, has been devised and utilized to show superior predictive power in complex scenarios of heterologous gene expression. The devised model is not only more accurate than the existing models, but, more importantly, provides a framework for analyzing complex whole cell translation problems and variables that haven't been explored before, making it important in various biomedical fields.


Asunto(s)
Codón/metabolismo , Biosíntesis de Proteínas , ARN de Transferencia/metabolismo , Ribosomas/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Cinética , Modelos Genéticos , Modelos Estadísticos , ARN Mensajero/metabolismo , Análisis de Regresión , Biología de Sistemas , Transcriptoma
8.
RNA Biol ; 18(12): 2296-2307, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-33691590

RESUMEN

ABSTRASTDue to the redundancy of the genetic code most amino acids are encoded by several 'synonymous' codons. These codons are used unevenly, and each organism demonstrates its own unique codon usage bias, where the 'preferred' codons are associated with tRNAs that are found in high concentrations. Therefore, for decades, the prevailing view had been that preferred and non-preferred codons are linked to high or slow translation rates, respectively.However, this simplified view is contrasted by the frequent failures of codon-optimization efforts and by evidence of non-preferred (i.e. 'slow') codons having specific roles important for efficient production of functional proteins. One such specific role of slower codons is the regulation of co-translational protein folding, a complex biophysical process that is very challenging to model or to measure.Here, we combined a genome-wide approach with experiments to investigate the role of slow codons in protein production and co-translational folding. We analysed homologous gene groups from divergent bacteria and identified positions of inter-species conservation of bias towards slow codons. We then generated mutants where the conserved slow codons are substituted with 'fast' ones, and experimentally studied the effects of these codon substitutions. Using cellular and biochemical approaches we find that at certain locations, slow-to-fast codon substitutions reduce protein expression, increase protein aggregation, and impair protein function.This report provides an approach for identifying functionally relevant regions with slower codons and demonstrates that such codons are important for protein expression and function.


Asunto(s)
Bacillus subtilis/genética , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Escherichia coli/genética , Secuencia de Aminoácidos , Bacillus subtilis/metabolismo , Proteínas Bacterianas/metabolismo , Uso de Codones , Secuencia Conservada , Escherichia coli/metabolismo , Evolución Molecular , Código Genético , Biosíntesis de Proteínas , Pliegue de Proteína , ARN de Transferencia/genética , Mutación Silenciosa
9.
RNA Biol ; 18(sup2): 684-698, 2021 11 12.
Artículo en Inglés | MEDLINE | ID: mdl-34586043

RESUMEN

The well-established Shine-Dalgarno model suggests that translation initiation in bacteria is regulated via base-pairing between ribosomal RNA (rRNA) and mRNA. We used novel computational analyses and modelling of 823 bacterial genomes coupled with experiments to demonstrate that rRNA-mRNA interactions are diverse and regulate all translation steps from pre-initiation to termination. Previous research has reported the significant influence of rRNA-mRNA interactions, mainly in the initiation phase of translation. The results reported in this paper suggest that, in addition to the rRNA-mRNA interactions near the start codon that trigger initiation in bacteria, rRNA-mRNA interactions affect all sub-stages of the translation process (pre-initiation, initiation, elongation, termination). As these interactions dictate translation efficiency, they serve as an evolutionary driving force for shaping transcripts in bacteria while considering trade-offs between the effects of different interactions across different transcript regions on translation efficacy and efficiency. We observed selection for strong interactions in regions where such interactions are likely to enhance initiation, regulate early elongation, and ensure translation termination fidelity. We discovered selection against strong interactions and for intermediate interactions in coding regions and presented evidence that these patterns maximize elongation efficiency while also enhancing initiation. These finding are relevant to all biomedical disciplines due to the centrality of the translation process and the effect of rRNA-mRNA interactions on transcript evolution.


Asunto(s)
Fenómenos Fisiológicos Bacterianos , Epistasis Genética , Células Procariotas/fisiología , Biosíntesis de Proteínas/genética , ARN Mensajero/genética , ARN Ribosómico/genética , Regiones no Traducidas 3' , Regiones no Traducidas 5' , Bacterias/genética , Sistemas de Lectura Abierta , ARN Ribosómico 16S/genética
10.
Nucleic Acids Res ; 47(7): 3344-3352, 2019 04 23.
Artículo en Inglés | MEDLINE | ID: mdl-30828719

RESUMEN

While bacterial operons have been thoroughly studied, few analyses of chloroplast operons exist, limiting the ability to study fundamental elements of these structures and utilize them for synthetic biology. Here, we describe the creation of a plastome-specific operon database (link provided below) achieved by combining experimental tools and predictive modeling. Using a Reverse-Transcription-PCR based method and published data, we determined the transcription-state of 213 gene pairs from four plastomes of evolutionary distinct organisms. By analyzing sequence-based features computed for our dataset, we were able to highlight fundamental characteristics differentiating between operon pairs and non-operon pairs. These include an interesting tendency toward maintaining similar messenger RNA-folding profiles in operon gene pairs, a feature that failed to yield any informative separation in cyanobacteria, suggesting that it catches unique traits of operon gene expression, which have evolved post-endosymbiosis. Subsequently, we used this feature set to train a random-forest classifier for operon prediction. As our results demonstrate the ability of our predictor to obtain accurate (84%) and robust predictions on unlabeled datasets, we proceeded to building operon maps for 2018 sequenced plastids. Our database may now present new opportunities for promoting metabolic engineering and synthetic biology in chloroplasts.


Asunto(s)
Algoritmos , Evolución Molecular , Operón/genética , Plantas/genética , Plastidios/genética , ARN de Planta/genética , Cloroplastos/genética , Bases de Datos Genéticas , Conjuntos de Datos como Asunto , Árboles de Decisión , Ingeniería Metabólica , Plantas/clasificación , Pliegue del ARN , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Transcripción Genética
11.
BMC Bioinformatics ; 21(1): 190, 2020 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-32414319

RESUMEN

BACKGROUND: Most organisms cannot be cultivated, as they live in unique ecological conditions that cannot be mimicked in the lab. Understanding the functionality of those organisms' genes and their interactions by performing large-scale measurements of transcription levels, protein-protein interactions or metabolism, is extremely difficult and, in some cases, impossible. Thus, efficient algorithms for deciphering genome functionality based only on the genomic sequences with no other experimental measurements are needed. RESULTS: In this study, we describe a novel algorithm that infers gene networks that we name Common Substring Network (CSN). The algorithm enables inferring novel regulatory relations among genes based only on the genomic sequence of a given organism and partial homolog/ortholog-based functional annotation. It can specifically infer the functional annotation of genes with unknown homology. This approach is based on the assumption that related genes, not necessarily homologs, tend to share sub-sequences, which may be related to common regulatory mechanisms, similar functionality of encoded proteins, common evolutionary history, and more. We demonstrate that CSNs, which are based on S. cerevisiae and E. coli genomes, have properties similar to 'traditional' biological networks inferred from experiments. Highly expressed genes tend to have higher degree nodes in the CSN, genes with similar protein functionality tend to be closer, and the CSN graph exhibits a power-law degree distribution. Also, we show how the CSN can be used for predicting gene interactions and functions. CONCLUSIONS: The reported results suggest that 'silent' code inside the transcript can help to predict central features of biological networks and gene function. This approach can help researchers to understand the genome of novel microorganisms, analyze metagenomic data, and can help to decipher new gene functions. AVAILABILITY: Our MATLAB implementation of CSN is available at https://www.cs.tau.ac.il/~tamirtul/CSN-Autogen.


Asunto(s)
Redes Reguladoras de Genes , Genoma , Algoritmos , Epistasis Genética , Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
12.
Mol Biol Evol ; 36(12): 2854-2860, 2019 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31503284

RESUMEN

Chloroplasts originated from an ancient cyanobacterium and still harbor a bacterial-like genome. However, the centrality of Shine-Dalgarno ribosome binding, which predominantly regulates proteobacterial translation initiation, is significantly decreased in chloroplasts. As plastid ribosomal RNA anti-Shine-Dalgarno elements are similar to their bacterial counterparts, these sites alone cannot explain this decline. By computational simulation we show that upstream point mutations modulate the local structure of ribosomal RNA in chloroplasts, creating significantly tighter structures around the anti-Shine-Dalgarno locus, which in-turn reduce the probability of ribosome binding. To validate our model, we expressed two reporter genes (mCherry, hydrogenase) harboring a Shine-Dalgarno motif in the Chlamydomonas reinhardtii chloroplast. Coexpressing them with a 16S ribosomal RNA, modified according to our model, significantly enhances mCherry and hydrogenase expression compared with coexpression with an endogenous 16S gene.


Asunto(s)
Cloroplastos/metabolismo , Evolución Molecular , Modelos Genéticos , Biosíntesis de Proteínas , ARN Ribosómico 16S/genética , Chlamydomonas reinhardtii , Simulación por Computador , Mutación Puntual , Pliegue del ARN , ARN Ribosómico 16S/metabolismo
13.
Bioinformatics ; 35(16): 2718-2723, 2019 08 15.
Artículo en Inglés | MEDLINE | ID: mdl-30596896

RESUMEN

MOTIVATION: The COP9 signalosome is a highly conserved multi-protein complex consisting of eight subunits, which influences key developmental pathways through its regulation of protein stability and transcription. In Arabidopsis thaliana, mutations in the COP9 signalosome exhibit a number of diverse pleiotropic phenotypes. Total or partial loss of COP9 signalosome function in Arabidopsis leads to misregulation of a number of genes involved in DNA methylation, suggesting that part of the pleiotropic phenotype is due to global effects on DNA methylation. RESULTS: We determined and analyzed the methylomes and transcriptomes of both partial- and total-loss-of-function Arabidopsis mutants of the COP9 signalosome. Our results support the hypothesis that the COP9 signalosome has a global genome-wide effect on methylation and that this effect is at least partially encoded in the DNA. Our analyses suggest that COP9 signalosome-dependent methylation is related to gene expression regulation in various ways. Differentially methylated regions tend to be closer in the 3D conformation of the genome to differentially expressed genes. These results suggest that the COP9 signalosome has a more comprehensive effect on gene expression than thought before, and this is partially related to regulation of methylation. The high level of COP9 signalosome conservation among eukaryotes may also suggest that COP9 signalosome regulates methylation not only in plants but also in other eukaryotes, including humans. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Arabidopsis , Arabidopsis/genética , Proteínas de Arabidopsis , Complejo del Señalosoma COP9 , Epigénesis Genética
14.
Bioinformatics ; 35(18): 3365-3371, 2019 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-30715207

RESUMEN

MOTIVATION: Regulation of the amount of protein that is synthesized from genes has proved to be a serious challenge in terms of analysis and prediction, and in terms of engineering and optimization, due to the large diversity in expression machinery across species. RESULTS: To address this challenge, we developed a methodology and a software tool (ChimeraUGEM) for predicting gene expression as well as adapting the coding sequence of a target gene to any host organism. We demonstrate these methods by predicting protein levels in seven organisms, in seven human tissues, and by increasing in vivo the expression of a synthetic gene up to 26-fold in the single-cell green alga Chlamydomonas reinhardtii. The underlying model is designed to capture sequence patterns and regulatory signals with minimal prior knowledge on the host organism and can be applied to a multitude of species and applications. AVAILABILITY AND IMPLEMENTATION: Source code (MATLAB, C) and binaries are freely available for download for non-commercial use at http://www.cs.tau.ac.il/~tamirtul/ChimeraUGEM/, and supported on macOS, Linux and Windows. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Expresión Génica , Programas Informáticos , Humanos , Sistemas de Lectura Abierta , Proteínas
15.
Biochem Soc Trans ; 48(4): 1519-1528, 2020 08 28.
Artículo en Inglés | MEDLINE | ID: mdl-32662820

RESUMEN

The transcript is populated with numerous overlapping codes that regulate all steps of gene expression. Deciphering these codes is very challenging due to the large number of variables involved, the non-modular nature of the codes, biases and limitations in current experimental approaches, our limited knowledge in gene expression regulation across the tree of life, and other factors. In recent years, it has been shown that computational modeling and algorithms can significantly accelerate the discovery of novel gene expression codes. Here, we briefly summarize the latest developments and different approaches in the field.


Asunto(s)
Biología Computacional , Regulación de la Expresión Génica , Modelos Genéticos , ARN Mensajero/genética , Algoritmos
16.
Phys Biol ; 17(3): 031002, 2020 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-31986496

RESUMEN

Messenger RNAs (mRNAs) consist of a coding region (open reading frame (ORF)) and two untranslated regions (UTRs), 5'UTR and 3'UTR. Ribosomes travel along the coding region, translating nucleotide triplets (called codons) to a chain of amino acids. The coding region was long believed to mainly encode the amino acid content of proteins, whereas regulatory signals reside in the UTRs and in other genomic regions. However, in recent years we have learned that the ORF is expansively populated with various regulatory signals, or codes, which are related to all gene expression steps and additional intracellular aspects. In this paper, we review the current knowledge related to overlapping codes inside the coding regions, such as the influence of synonymous codon usage on translation speed (and, in turn, the effect of translation speed on protein folding), ribosomal frameshifting, mRNA stability, methylation, splicing, transcription and more. All these codes come together and overlap in the ORF sequence, ensuring production of the right protein at the right time.


Asunto(s)
Sistemas de Lectura Abierta/genética , ARN/genética , Secuencia de Aminoácidos
17.
PLoS Comput Biol ; 15(7): e1007192, 2019 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-31265462

RESUMEN

[This corrects the article DOI: 10.1371/journal.pcbi.1006055.].

18.
Plant J ; 94(1): 22-31, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-29383789

RESUMEN

Various species of microalgae have recently emerged as promising host-organisms for use in biotechnology industries due to their unique properties. These include efficient conversion of sunlight into organic compounds, the ability to grow in extreme conditions and the occurrence of numerous post-translational modification pathways. However, the inability to obtain high levels of nuclear heterologous gene expression in microalgae hinders the development of the entire field. To overcome this limitation, we analyzed different sequence optimization algorithms while studying the effect of transcript sequence features on heterologous expression in the model microalga Chlamydomonas reinhardtii, whose genome consists of rare features such as a high GC content. Based on the analysis of genomic data, we created eight unique sequences coding for a synthetic ferredoxin-hydrogenase enzyme, used here as a reporter gene. Following in silico design, these synthetic genes were transformed into the C. reinhardtii nucleus, after which gene expression levels were measured. The empirical data, measured in vivo show a discrepancy of up to 65-fold between the different constructs. In this work we demonstrate how the combination of computational methods and our empirical results enable us to learn about the way gene expression is encoded in the C. reinhardtii transcripts. We describe the deleterious effect on overall expression of codons encoding for splicing signals. Subsequently, our analysis shows that utilization of a frequent subset of preferred codons results in elevated transcript levels, and that mRNA folding energy in the vicinity of translation initiation significantly affects gene expression.


Asunto(s)
Chlamydomonas reinhardtii/genética , Regulación de la Expresión Génica de las Plantas/genética , Transgenes/genética , Chlamydomonas reinhardtii/metabolismo , Codón/genética , Secuencia Conservada/genética , Iniciación de la Cadena Peptídica Traduccional/genética , Plantas Modificadas Genéticamente/genética , Plantas Modificadas Genéticamente/metabolismo , Pliegue del ARN/genética , ARN Mensajero/genética
19.
RNA ; 23(7): 983-994, 2017 07.
Artículo en Inglés | MEDLINE | ID: mdl-28363900

RESUMEN

Interactions between the ribosomal exit tunnel and the nascent peptide can affect translation elongation rates. While previous studies have already demonstrated the feasibility of such interactions, little is known about the nature of the stalling peptide sequences and their distribution in the proteome. Here we ask which peptide sequences tend to occupy the tunnel of stalled ribosomes and how they are distributed in the proteome. Using computational analysis of ribosome profiling data from S. cerevisiae, we identified for the first time dozens of short stalling peptide sequences and studied their statistical properties. We found that short peptide sequences associated with ribosome stalling tend significantly to be either over- or underrepresented in the proteome. We then showed that the stalling interactions may occur at different positions along the length of the tunnel, prominently close to the P-site. Our findings throw light on the determinants of nascent peptide-mediated ribosome stalling during translation elongation and support the novel conjecture that mRNA translation affects the proteomic distribution of short peptide sequences.


Asunto(s)
Péptidos/genética , Proteómica/métodos , Ribosomas/metabolismo , Saccharomyces cerevisiae/metabolismo , Secuencia de Aminoácidos , Biología Computacional/métodos , Péptidos/química , Péptidos/metabolismo , Biosíntesis de Proteínas , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
20.
Bioinformatics ; 34(19): 3241-3248, 2018 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-29718236

RESUMEN

Motivation: Understanding how viruses co-evolve with their hosts and adapt various genomic level strategies in order to ensure their fitness may have essential implications in unveiling the secrets of viral evolution, and in developing new vaccines and therapeutic approaches. Here, based on a novel genomic analysis of 2625 different viruses and 439 corresponding host organisms, we provide evidence of universal evolutionary selection for high dimensional 'silent' patterns of information hidden in the redundancy of viral genetic code. Results: Our model suggests that long substrings of nucleotides in the coding regions of viruses from all classes, often also repeat in the corresponding viral hosts from all domains of life. Selection for these substrings cannot be explained only by such phenomena as codon usage bias, horizontal gene transfer and the encoded proteins. Genes encoding structural proteins responsible for building the core of the viral particles were found to include more host-repeating substrings, and these substrings tend to appear in the middle parts of the viral coding regions. In addition, in human viruses these substrings tend to be enriched with motives related to transcription factors and RNA binding proteins. The host-repeating substrings are possibly related to the evolutionary pressure on the viruses to effectively interact with host's intracellular factors and to efficiently escape from the host's immune system. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Evolución Molecular , Código Genético , Genoma Viral , Genómica , Virus/genética , Codón , Transferencia de Gen Horizontal , Sistemas de Lectura Abierta , Selección Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA