Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Mol Cell ; 83(11): 1839-1855.e13, 2023 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-37267905

RESUMO

Localized translation is vital to polarized cells and requires precise and robust distribution of different mRNAs and ribosomes across the cell. However, the underlying molecular mechanisms are poorly understood and important players are lacking. Here, we discovered a Rab5 effector, the five-subunit endosomal Rab5 and RNA/ribosome intermediary (FERRY) complex, that recruits mRNAs and ribosomes to early endosomes through direct mRNA-interaction. FERRY displays preferential binding to certain groups of transcripts, including mRNAs encoding mitochondrial proteins. Deletion of FERRY subunits reduces the endosomal localization of transcripts in cells and has a significant impact on mRNA levels. Clinical studies show that genetic disruption of FERRY causes severe brain damage. We found that, in neurons, FERRY co-localizes with mRNA on early endosomes, and mRNA loaded FERRY-positive endosomes are in close proximity of mitochondria. FERRY thus transforms endosomes into mRNA carriers and plays a key role in regulating mRNA distribution and transport.


Assuntos
Endossomos , Proteínas rab5 de Ligação ao GTP , Proteínas rab5 de Ligação ao GTP/genética , Proteínas rab5 de Ligação ao GTP/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Endossomos/metabolismo , Transporte Biológico , Endocitose/fisiologia
2.
Nat Methods ; 20(5): 673-676, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37024650

RESUMO

The discovery of biomolecular condensates transformed our understanding of intracellular compartmentalization of molecules. To integrate interdisciplinary scientific knowledge about the function and composition of biomolecular condensates, we developed the crowdsourcing condensate database and encyclopedia ( cd-code.org ). CD-CODE is a community-editable platform, which includes a database of biomolecular condensates based on the literature, an encyclopedia of relevant scientific terms and a crowdsourcing web application. Our platform will accelerate the discovery and validation of biomolecular condensates, and facilitate efforts to understand their role in disease and as therapeutic targets.


Assuntos
Crowdsourcing , Bases de Dados Factuais , Software
3.
Mol Biol Evol ; 41(3)2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38421032

RESUMO

Errors in protein translation can lead to non-genetic, phenotypic mutations, including amino acid misincorporations. While phenotypic mutations can increase protein diversity, the systematic characterization of their proteome-wide frequencies and their evolutionary impact has been lacking. Here, we developed a mechanistic model of translation errors to investigate how selection acts on protein populations produced by amino acid misincorporations. We fitted the model to empirical observations of misincorporations obtained from over a hundred mass spectrometry datasets of E. coli and S. cerevisiae. We found that on average 20% to 23% of proteins synthesized in the cell are expected to harbor at least one amino acid misincorporation, and that deleterious misincorporations are less likely to occur. Combining misincorporation probabilities and the estimated fitness effects of amino acid substitutions in a population genetics framework, we found 74% of mistranslation events in E. coli and 94% in S. cerevisiae to be neutral. We further show that the set of available synonymous tRNAs is subject to evolutionary pressure, as the presence of missing tRNAs would increase codon-anticodon cross-reactivity and misincorporation error rates. Overall, we find that the translation machinery is likely optimal in E. coli and S. cerevisiae and that both local solutions at the level of codons and a global solution such as the tRNA pool can mitigate the impact of translation errors. We provide a framework to study the evolutionary impact of codon-specific translation errors and a method for their proteome-wide detection across organisms and conditions.


Assuntos
Proteoma , Saccharomyces cerevisiae , Proteoma/genética , Saccharomyces cerevisiae/genética , Biossíntese de Proteínas , Escherichia coli/genética , Aminoácidos/genética , RNA de Transferência/metabolismo , Códon/metabolismo , Mutação
4.
Bioinformatics ; 40(7)2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-38941503

RESUMO

MOTIVATION: Errors in the processing of genetic information during protein synthesis can lead to phenotypic mutations, such as amino acid substitutions, e.g. by transcription or translation errors. While genetic mutations can be readily identified using DNA sequencing, and mutations due to transcription errors by RNA sequencing, translation errors can only be identified proteome-wide using mass spectrometry. RESULTS: Here, we provide a Python package implementation of a high-throughput pipeline to detect amino acid substitutions in mass spectrometry datasets. Our tools enable users to process hundreds of mass spectrometry datasets in batch mode to detect amino acid substitutions and calculate codon-specific and site-specific translation error rates. deTELpy will facilitate the systematic understanding of amino acid misincorporation rates (translation error rates), and the inference of error models across organisms and under stress conditions, such as drug treatment or disease conditions. AVAILABILITY AND IMPLEMENTATION: deTELpy is implemented in Python 3 and is freely available with detailed documentation and practical examples at https://git.mpi-cbg.de/tothpetroczylab/detelpy and https://pypi.org/project/deTELpy/ and can be easily installed via pip install deTELpy.


Assuntos
Substituição de Aminoácidos , Espectrometria de Massas , Software , Espectrometria de Massas/métodos
5.
BMC Evol Biol ; 20(1): 109, 2020 08 26.
Artigo em Inglês | MEDLINE | ID: mdl-32842959

RESUMO

BACKGROUND: For decades, codon usage has been used as a measure of adaptation for translational efficiency and translation accuracy of a gene's coding sequence. These patterns of codon usage reflect both the selective and mutational environment in which the coding sequences evolved. Over this same period, gene transfer between lineages has become widely recognized as an important biological phenomenon. Nevertheless, most studies of codon usage implicitly assume that all genes within a genome evolved under the same selective and mutational environment, an assumption violated when introgression occurs. In order to better understand the effects of introgression on codon usage patterns and vice versa, we examine the patterns of codon usage in Lachancea kluyveri, a yeast which has experienced a large introgression. We quantify the effects of mutation bias and selection for translation efficiency on the codon usage pattern of the endogenous and introgressed exogenous genes using a Bayesian mixture model, ROC SEMPPR, which is built on mechanistic assumptions about protein synthesis and grounded in population genetics. RESULTS: We find substantial differences in codon usage between the endogenous and exogenous genes, and show that these differences can be largely attributed to differences in mutation bias favoring A/T ending codons in the endogenous genes while favoring C/G ending codons in the exogenous genes. Recognizing the two different signatures of mutation bias and selection improves our ability to predict protein synthesis rate by 42% and allowed us to accurately assess the decaying signal of endogenous codon mutation and preferences. In addition, using our estimates of mutation bias and selection, we identify Eremothecium gossypii as the closest relative to the exogenous genes, providing an alternative hypothesis about the origin of the exogenous genes, estimate that the introgression occurred ∼6×108 generation ago, and estimate its historic and current selection against mismatched codon usage. CONCLUSIONS: Our work illustrates how mechanistic, population genetic models like ROC SEMPPR can separate the effects of mutation and selection on codon usage and provide quantitative estimates from sequence data.


Assuntos
Uso do Códon , Genética Populacional , Modelos Genéticos , Saccharomycetales/genética , Seleção Genética , Teorema de Bayes , Mutação
6.
Mol Biol Evol ; 36(4): 834-851, 2019 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-30521036

RESUMO

We present a new phylogenetic approach, selection on amino acids and codons (SelAC), whose substitution rates are based on a nested model linking protein expression to population genetics. Unlike simpler codon models that assume a single substitution matrix for all sites, our model more realistically represents the evolution of protein-coding DNA under the assumption of consistent, stabilizing selection using a cost-benefit approach. This cost-benefit approach allows us to generate a set of 20 optimal amino acid-specific matrix families using just a handful of parameters and naturally links the strength of stabilizing selection to protein synthesis levels, which we can estimate. Using a yeast data set of 100 orthologs for 6 taxa, we find SelAC fits the data much better than popular models by 104-105 Akike information criterion units adjusted for small sample bias. Our results also indicated that nested, mechanistic models better predict observed data patterns highlighting the improvement in biological realism in amino acid sequence evolution that our model provides. Additional parameters estimated by SelAC indicate that a large amount of nonphylogenetic, but biologically meaningful, information can be inferred from existing data. For example, SelAC prediction of gene-specific protein synthesis rates correlates well with both empirical (r=0.33-0.48) and other theoretical predictions (r=0.45-0.64) for multiple yeast species. SelAC also provides estimates of the optimal amino acid at each site. Finally, because SelAC is a nested approach based on clearly stated biological assumptions, future modifications, such as including shifts in the optimal amino acid sequence within or across lineages, are possible.


Assuntos
Substituição de Aminoácidos , Técnicas Genéticas , Modelos Genéticos , Filogenia , Seleção Genética , Genética Populacional/métodos
7.
Bioinformatics ; 34(14): 2496-2498, 2018 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-29522124

RESUMO

Summary: AnaCoDa is an R package for estimating biologically relevant parameters of mixture models, such as selection against translation inefficiency, non-sense errors and ribosome pausing time, from genomic and high throughput datasets. AnaCoDa provides an adaptive Bayesian MCMC algorithm, fully implemented in C++ for high performance with an ergonomic R interface to improve usability. AnaCoDa employs a generic object-oriented design to allow users to extend the framework and implement their own models. Current models implemented in AnaCoDa can accurately estimate biologically relevant parameters given either protein coding sequences or ribosome foot-printing data. Optionally, AnaCoDa can utilize additional data sources, such as gene expression measurements, to aid model fitting and parameter estimation. By utilizing a hierarchical object structure, some parameters can vary between sets of genes while others can be shared. Genes may be assigned to clusters or membership may be estimated by AnaCoDa. This flexibility allows users to estimate the same model parameter under different biological conditions and categorize genes into different sets based on shared model properties embedded within the data. AnaCoDa also allows users to generate simulated data which can be used to aid model development and model analysis as well as evaluate model adequacy. Finally, AnaCoDa contains a set of visualization routines and the ability to revisit or re-initiate previous model fitting, providing researchers with a well rounded easy to use framework to analyze genome scale data. Availability and implementation: AnaCoDa is freely available under the Mozilla Public License 2.0 on CRAN (https://cran.r-project.org/web/packages/AnaCoDa/).


Assuntos
Códon , Genômica/métodos , Modelos Genéticos , Análise de Sequência de DNA/métodos , Software , Algoritmos , Teorema de Bayes
8.
BMC Bioinformatics ; 14 Suppl 3: S7, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23514582

RESUMO

BACKGROUND: Any method that de novo predicts protein function should do better than random. More challenging, it also ought to outperform simple homology-based inference. METHODS: Here, we describe a few methods that predict protein function exclusively through homology. Together, they set the bar or lower limit for future improvements. RESULTS AND CONCLUSIONS: During the development of these methods, we faced two surprises. Firstly, our most successful implementation for the baseline ranked very high at CAFA1. In fact, our best combination of homology-based methods fared only slightly worse than the top-of-the-line prediction method from the Jones group. Secondly, although the concept of homology-based inference is simple, this work revealed that the precise details of the implementation are crucial: not only did the methods span from top to bottom performers at CAFA, but also the reasons for these differences were unexpected. In this work, we also propose a new rigorous measure to compare predicted and experimental annotations. It puts more emphasis on the details of protein function than the other measures employed by CAFA and may best reflect the expectations of users. Clearly, the definition of proper goals remains one major objective for CAFA.


Assuntos
Proteínas/fisiologia , Homologia de Sequência de Aminoácidos , Algoritmos , Proteínas/genética
9.
Protein Sci ; 31(9): e4397, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-36040266

RESUMO

Errors in DNA replication generate genetic mutations, while errors in transcription and translation lead to phenotypic mutations. Phenotypic mutations are orders of magnitude more frequent than genetic ones, yet they are less understood. Here, we review the types of phenotypic mutations, their quantifications, and their role in protein evolution and disease. The diversity generated by phenotypic mutation can facilitate adaptive evolution. Indeed, phenotypic mutations, such as ribosomal frameshift and stop codon readthrough, sometimes serve to regulate protein expression and function. Phenotypic mutations have often been linked to fitness decrease and diseases. Thus, understanding the protein heterogeneity and phenotypic diversity caused by phenotypic mutations will advance our understanding of protein evolution and have implications on human health and diseases.


Assuntos
Replicação do DNA , Biossíntese de Proteínas , Códon de Terminação , Evolução Molecular , Humanos , Mutação
10.
PLoS One ; 11(2): e0149477, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26901434

RESUMO

The seven-transmembrane receptor Smoothened (Smo) activates all Hedgehog (Hh) signaling by translocation into the primary cilia (PC), but how this is regulated is not well understood. Here we show that Pitchfork (Pifo) and the G protein-coupled receptor associated sorting protein 2 (Gprasp2) are essential components of an Hh induced ciliary targeting complex able to regulate Smo translocation to the PC. Depletion of Pifo or Gprasp2 leads to failure of Smo translocation to the PC and lack of Hh target gene activation. Together, our results identify a novel protein complex that is regulated by Hh signaling and required for Smo ciliary trafficking and Hh pathway activation.


Assuntos
Proteínas Hedgehog/metabolismo , Proteínas de Homeodomínio/metabolismo , Receptores Acoplados a Proteínas G/metabolismo , Transdução de Sinais/fisiologia , Animais , Linhagem Celular , Cílios/genética , Cílios/metabolismo , Proteínas Hedgehog/genética , Proteínas de Homeodomínio/genética , Peptídeos e Proteínas de Sinalização Intracelular , Camundongos , Transporte Proteico/fisiologia , Receptores Acoplados a Proteínas G/genética , Receptor Smoothened
11.
Genome Biol Evol ; 7(6): 1559-79, 2015 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-25977456

RESUMO

Extracting biologically meaningful information from the continuing flood of genomic data is a major challenge in the life sciences. Codon usage bias (CUB) is a general feature of most genomes and is thought to reflect the effects of both natural selection for efficient translation and mutation bias. Here we present a mechanistically interpretable, Bayesian model (ribosome overhead costs Stochastic Evolutionary Model of Protein Production Rate [ROC SEMPPR]) to extract meaningful information from patterns of CUB within a genome. ROC SEMPPR is grounded in population genetics and allows us to separate the contributions of mutational biases and natural selection against translational inefficiency on a gene-by-gene and codon-by-codon basis. Until now, the primary disadvantage of similar approaches was the need for genome scale measurements of gene expression. Here, we demonstrate that it is possible to both extract accurate estimates of codon-specific mutation biases and translational efficiencies while simultaneously generating accurate estimates of gene expression, rather than requiring such information. We demonstrate the utility of ROC SEMPPR using the Saccharomyces cerevisiae S288c genome. When we compare our model fits with previous approaches we observe an exceptionally high agreement between estimates of both codon-specific parameters and gene expression levels ([Formula: see text] in all cases). We also observe strong agreement between our parameter estimates and those derived from alternative data sets. For example, our estimates of mutation bias and those from mutational accumulation experiments are highly correlated ([Formula: see text]). Our estimates of codon-specific translational inefficiencies and tRNA copy number-based estimates of ribosome pausing time ([Formula: see text]), and mRNA and ribosome profiling footprint-based estimates of gene expression ([Formula: see text]) are also highly correlated, thus supporting the hypothesis that selection against translational inefficiency is an important force driving the evolution of CUB. Surprisingly, we find that for particular amino acids, codon usage in highly expressed genes can still be largely driven by mutation bias and that failing to take mutation bias into account can lead to the misidentification of an amino acid's "optimal" codon. In conclusion, our method demonstrates that an enormous amount of biologically important information is encoded within genome scale patterns of codon usage, accessing this information does not require gene expression measurements, but instead carefully formulated biologically interpretable models.


Assuntos
Códon , Evolução Molecular , Genômica/métodos , Mutação , Biossíntese de Proteínas , Seleção Genética , Expressão Gênica , Modelos Genéticos , Saccharomyces cerevisiae/genética
12.
Evolution ; 68(1): 284-94, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24102483

RESUMO

Prior specification is an essential component of parameter estimation and model comparison in Approximate Bayesian computation (ABC). Oaks et al. present a simulation-based power analysis of msBayes and conclude that msBayes has low power to detect genuinely random divergence times across taxa, and suggest the cause is Lindley's paradox. Although the predictions are similar, we show that their findings are more fundamentally explained by insufficient prior sampling that arises with poorly chosen wide priors that critically undersample nonsimultaneous divergence histories of high likelihood. In a reanalysis of their data on Philippine Island vertebrates, we show how this problem can be circumvented by expanding upon a previously developed procedure that accommodates uncertainty in prior selection using Bayesian model averaging. When these procedures are used, msBayes supports recent divergences without support for synchronous divergence in the Oaks et al. data and we further present a simulation analysis that demonstrates that msBayes can have high power to detect asynchronous divergence under narrower priors for divergence time. Our findings highlight the need for exploration of plausible parameter space and prior sampling efficiency for ABC samplers in high dimensions. We discus potential improvements to msBayes and conclude that when used appropriately with model averaging, msBayes remains an effective and powerful tool.


Assuntos
Evolução Biológica , Clima , Modelos Biológicos , Animais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA