Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
Nature ; 620(7973): 434-444, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-37468638

RESUMEN

Advances in DNA sequencing and machine learning are providing insights into protein sequences and structures on an enormous scale1. However, the energetics driving folding are invisible in these structures and remain largely unknown2. The hidden thermodynamics of folding can drive disease3,4, shape protein evolution5-7 and guide protein engineering8-10, and new approaches are needed to reveal these thermodynamics for every sequence and structure. Here we present cDNA display proteolysis, a method for measuring thermodynamic folding stability for up to 900,000 protein domains in a one-week experiment. From 1.8 million measurements in total, we curated a set of around 776,000 high-quality folding stabilities covering all single amino acid variants and selected double mutants of 331 natural and 148 de novo designed protein domains 40-72 amino acids in length. Using this extensive dataset, we quantified (1) environmental factors influencing amino acid fitness, (2) thermodynamic couplings (including unexpected interactions) between protein sites, and (3) the global divergence between evolutionary amino acid usage and protein folding stability. We also examined how our approach could identify stability determinants in designed proteins and evaluate design methods. The cDNA display proteolysis method is fast, accurate and uniquely scalable, and promises to reveal the quantitative rules for how amino acid sequences encode folding stability.


Asunto(s)
Biología , Ingeniería de Proteínas , Pliegue de Proteína , Proteínas , Aminoácidos/genética , Aminoácidos/metabolismo , Biología/métodos , ADN Complementario/genética , Estabilidad Proteica , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Termodinámica , Proteolisis , Ingeniería de Proteínas/métodos , Dominios Proteicos/genética , Mutación
2.
PLoS Genet ; 19(8): e1010848, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37585488

RESUMEN

N-terminal ends of polypeptides are critical for the selective co-translational recruitment of N-terminal modification enzymes. However, it is unknown whether specific N-terminal signatures differentially regulate protein fate according to their cellular functions. In this work, we developed an in-silico approach to detect functional preferences in cellular N-terminomes, and identified in S. cerevisiae more than 200 Gene Ontology terms with specific N-terminal signatures. In particular, we discovered that Mitochondrial Targeting Sequences (MTS) show a strong and specific over-representation at position 2 of hydrophobic residues known to define potential substrates of the N-terminal acetyltransferase NatC. We validated mitochondrial precursors as co-translational targets of NatC by selective purification of translating ribosomes, and found that their N-terminal signature is conserved in Saccharomycotina yeasts. Finally, systematic mutagenesis of the position 2 in a prototypal yeast mitochondrial protein confirmed its critical role in mitochondrial protein import. Our work highlights the hydrophobicity of MTS N-terminal residues and their targeting by NatC as important features for the definition of the mitochondrial proteome, providing a molecular explanation for mitochondrial defects observed in yeast or human NatC-depleted cells. Functional mapping of N-terminal residues thus has the potential to support the discovery of novel mechanisms of protein regulation or targeting.


Asunto(s)
Proteoma , Saccharomyces cerevisiae , Humanos , Saccharomyces cerevisiae/genética , Secuencia de Aminoácidos , Proteoma/metabolismo , Transporte de Proteínas , Proteínas Fúngicas/metabolismo , Proteínas Mitocondriales/metabolismo
3.
Genome Res ; 31(8): 1462-1473, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-34266979

RESUMEN

Understanding how protein function has evolved and diversified is of great importance for human genetics and medicine. Here, we tackle the problem of describing the whole transcript variability observed in several species by generalizing the definition of splicing graph. We provide a practical solution to construct parsimonious evolutionary splicing graphs where each node is a minimal transcript building block defined across species. We show a clear link between the functional relevance, tissue regulation, and conservation of alternative transcripts on a set of 50 genes. By scaling up to the whole human protein-coding genome, we identify a few thousand genes where alternative splicing modulates the number and composition of pseudorepeats. We have implemented our approach in ThorAxe, an efficient, versatile, robust, and freely available computational tool.


Asunto(s)
Empalme Alternativo , Empalme del ARN , Genoma Humano , Humanos
4.
Bioinformatics ; 39(39 Suppl 1): i544-i552, 2023 06 30.
Artículo en Inglés | MEDLINE | ID: mdl-37387162

RESUMEN

MOTIVATION: The spectacular recent advances in protein and protein complex structure prediction hold promise for reconstructing interactomes at large-scale and residue resolution. Beyond determining the 3D arrangement of interacting partners, modeling approaches should be able to unravel the impact of sequence variations on the strength of the association. RESULTS: In this work, we report on Deep Local Analysis, a novel and efficient deep learning framework that relies on a strikingly simple deconstruction of protein interfaces into small locally oriented residue-centered cubes and on 3D convolutions recognizing patterns within cubes. Merely based on the two cubes associated with the wild-type and the mutant residues, DLA accurately estimates the binding affinity change for the associated complexes. It achieves a Pearson correlation coefficient of 0.735 on about 400 mutations on unseen complexes. Its generalization capability on blind datasets of complexes is higher than the state-of-the-art methods. We show that taking into account the evolutionary constraints on residues contributes to predictions. We also discuss the influence of conformational variability on performance. Beyond the predictive power on the effects of mutations, DLA is a general framework for transferring the knowledge gained from the available non-redundant set of complex protein structures to various tasks. For instance, given a single partially masked cube, it recovers the identity and physicochemical class of the central residue. Given an ensemble of cubes representing an interface, it predicts the function of the complex. AVAILABILITY AND IMPLEMENTATION: Source code and models are available at http://gitlab.lcqb.upmc.fr/DLA/DLA.git.


Asunto(s)
Evolución Biológica , Programas Informáticos , Mutación
5.
Proteomics ; 23(17): e2200159, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37403279

RESUMEN

Physical interactions between proteins are central to all biological processes. Yet, the current knowledge of who interacts with whom in the cell and in what manner relies on partial, noisy, and highly heterogeneous data. Thus, there is a need for methods comprehensively describing and organizing such data. LEVELNET is a versatile and interactive tool for visualizing, exploring, and comparing protein-protein interaction (PPI) networks inferred from different types of evidence. LEVELNET helps to break down the complexity of PPI networks by representing them as multi-layered graphs and by facilitating the direct comparison of their subnetworks toward biological interpretation. It focuses primarily on the protein chains whose 3D structures are available in the Protein Data Bank. We showcase some potential applications, such as investigating the structural evidence supporting PPIs associated to specific biological processes, assessing the co-localization of interaction partners, comparing the PPI networks obtained through computational experiments versus homology transfer, and creating PPI benchmarks with desired properties.


Asunto(s)
Mapeo de Interacción de Proteínas , Mapas de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Proteínas/metabolismo , Bases de Datos de Proteínas , Biología Computacional
6.
J Struct Biol ; 215(3): 107997, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37453591

RESUMEN

Alternative splicing of repeats in proteins provides a mechanism for rewiring and fine-tuning protein interaction networks. In this work, we developed a robust and versatile method, ASPRING, to identify alternatively spliced protein repeats from gene annotations. ASPRING leverages evolutionary meaningful alternative splicing-aware hierarchical graphs to provide maps between protein repeats sequences and 3D structures. We re-think the definition of repeats by explicitly accounting for transcript diversity across several genes/species. Using a stringent sequence-based similarity criterion, we detected over 5,000 evolutionary conserved repeats by screening virtually all human protein-coding genes and their orthologs across a dozen species. Through a joint analysis of their sequences and structures, we extracted specificity-determining sequence signatures and assessed their implication in experimentally resolved and modelled protein interactions. Our findings demonstrate the widespread alternative usage of protein repeats in modulating protein interactions and open avenues for targeting repeat-mediated interactions.


Asunto(s)
Empalme Alternativo , Proteínas , Humanos , Empalme Alternativo/genética , Proteínas/genética
7.
Bioinformatics ; 38(9): 2615-2616, 2022 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-35188186

RESUMEN

SUMMARY: ASES is a versatile tool for assessing the impact of alternative splicing (AS), initiation and termination of transcription on protein diversity in evolution. It identifies exon and transcript orthogroups from a set of input genes/species for comparative transcriptomics analyses. It computes an evolutionary splicing graph, where the nodes are exon orthogroups, allowing for a direct evaluation of AS conservation. It also reconstructs a transcripts' phylogenetic forest to date the appearance of specific transcripts and explore the events that have shaped them. ASES web server features a highly interactive interface enabling the synchronous selection of events, exons or transcripts in the different outputs, and the visualization and retrieval of the corresponding amino acid sequences, for subsequent 3D structure prediction. AVAILABILITY AND IMPLEMENTATION: http://www.lcqb.upmc.fr/Ases. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Empalme Alternativo , Proteínas , Filogenia , Exones , Proteínas/química , Empalme del ARN
8.
Bioinformatics ; 38(19): 4505-4512, 2022 09 30.
Artículo en Inglés | MEDLINE | ID: mdl-35962985

RESUMEN

MOTIVATION: With the recent advances in protein 3D structure prediction, protein interactions are becoming more central than ever before. Here, we address the problem of determining how proteins interact with one another. More specifically, we investigate the possibility of discriminating near-native protein complex conformations from incorrect ones by exploiting local environments around interfacial residues. RESULTS: Deep Local Analysis (DLA)-Ranker is a deep learning framework applying 3D convolutions to a set of locally oriented cubes representing the protein interface. It explicitly considers the local geometry of the interfacial residues along with their neighboring atoms and the regions of the interface with different solvent accessibility. We assessed its performance on three docking benchmarks made of half a million acceptable and incorrect conformations. We show that DLA-Ranker successfully identifies near-native conformations from ensembles generated by molecular docking. It surpasses or competes with other deep learning-based scoring functions. We also showcase its usefulness to discover alternative interfaces. AVAILABILITY AND IMPLEMENTATION: http://gitlab.lcqb.upmc.fr/dla-ranker/DLA-Ranker.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Proteínas , Simulación del Acoplamiento Molecular , Conformación Proteica , Proteínas/química , Unión Proteica
9.
PLoS Comput Biol ; 18(1): e1009825, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-35089918

RESUMEN

Proteins ensure their biological functions by interacting with each other. Hence, characterising protein interactions is fundamental for our understanding of the cellular machinery, and for improving medicine and bioengineering. Over the past years, a large body of experimental data has been accumulated on who interacts with whom and in what manner. However, these data are highly heterogeneous and sometimes contradictory, noisy, and biased. Ab initio methods provide a means to a "blind" protein-protein interaction network reconstruction. Here, we report on a molecular cross-docking-based approach for the identification of protein partners. The docking algorithm uses a coarse-grained representation of the protein structures and treats them as rigid bodies. We applied the approach to a few hundred of proteins, in the unbound conformations, and we systematically investigated the influence of several key ingredients, such as the size and quality of the interfaces, and the scoring function. We achieved some significant improvement compared to previous works, and a very high discriminative power on some specific functional classes. We provide a readout of the contributions of shape and physico-chemical complementarity, interface matching, and specificity, in the predictions. In addition, we assessed the ability of the approach to account for protein surface multiple usages, and we compared it with a sequence-based deep learning method. This work may contribute to guiding the exploitation of the large amounts of protein structural models now available toward the discovery of unexpected partners and their complex structure characterisation.


Asunto(s)
Sitios de Unión/fisiología , Simulación del Acoplamiento Molecular , Conformación Proteica , Mapas de Interacción de Proteínas/fisiología , Proteínas , Algoritmos , Biología Computacional , Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Proteínas/química , Proteínas/metabolismo
10.
Proteins ; 89(12): 1770-1786, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34519095

RESUMEN

The potential of deep learning has been recognized in the protein structure prediction community for some time, and became indisputable after CASP13. In CASP14, deep learning has boosted the field to unanticipated levels reaching near-experimental accuracy. This success comes from advances transferred from other machine learning areas, as well as methods specifically designed to deal with protein sequences and structures, and their abstractions. Novel emerging approaches include (i) geometric learning, that is, learning on representations such as graphs, three-dimensional (3D) Voronoi tessellations, and point clouds; (ii) pretrained protein language models leveraging attention; (iii) equivariant architectures preserving the symmetry of 3D space; (iv) use of large meta-genome databases; (v) combinations of protein representations; and (vi) finally truly end-to-end architectures, that is, differentiable models starting from a sequence and returning a 3D structure. Here, we provide an overview and our opinion of the novel deep learning approaches developed in the last 2 years and widely used in CASP14.


Asunto(s)
Secuencia de Aminoácidos , Conformación Proteica , Proteínas , Programas Informáticos , Biología Computacional , Bases de Datos de Proteínas , Aprendizaje Profundo , Proteínas/química , Proteínas/metabolismo , Análisis de Secuencia de Proteína
11.
PLoS Comput Biol ; 16(2): e1007624, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-32012150

RESUMEN

Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in part due to the inherent complexity associated to protein surface deformability and evolution. In this work, we present a method that contributes to decipher such complexity by predicting protein-DNA interfaces and characterizing their properties. It relies on three biologically and physically meaningful descriptors, namely evolutionary conservation, physico-chemical properties and surface geometry. We carefully assessed its performance on several hundreds of protein structures and compared it to several machine-learning state-of-the-art methods. Our approach achieves a higher sensitivity compared to the other methods, with a similar precision. Importantly, we show that it is able to unravel 'hidden' binding sites by applying it to unbound protein structures and to proteins binding to DNA via multiple sites and in different conformations. It is also applicable to the detection of RNA-binding sites, without significant loss of performance. This confirms that DNA and RNA-binding sites share similar properties. Our method is implemented as a fully automated tool, [Formula: see text], freely accessible at: http://www.lcqb.upmc.fr/JET2DNA. We also provide a new dataset of 187 protein-DNA complex structures, along with a subset of 82 associated unbound structures. The set represents the largest body of high-resolution crystallographic structures of protein-DNA complexes, use biological protein assemblies as DNA-binding units, and covers all major types of protein-DNA interactions. It is available at: http://www.lcqb.upmc.fr/PDNAbenchmarks.


Asunto(s)
Evolución Biológica , Proteínas de Unión al ADN/metabolismo , ADN/metabolismo , Proteínas/metabolismo , Algoritmos , Aprendizaje Automático
12.
Biophys J ; 118(10): 2513-2525, 2020 05 19.
Artículo en Inglés | MEDLINE | ID: mdl-32330413

RESUMEN

Large macromolecules, including proteins and their complexes, very often adopt multiple conformations. Some of them can be seen experimentally, for example with x-ray crystallography or cryo-electron microscopy. This structural heterogeneity is not occasional and is frequently linked with specific biological function. Thus, the accurate description of macromolecular conformational transitions is crucial for understanding fundamental mechanisms of life's machinery. We report on a real-time method to predict such transitions by extrapolating from instantaneous eigen motions, computed using the normal mode analysis, to a series of twists. We demonstrate the applicability of our approach to the prediction of a wide range of motions, including large collective opening-closing transitions and conformational changes induced by partner binding. We also highlight particularly difficult cases of very small transitions between crystal and solution structures. Our method guarantees preservation of the protein structure during the transition and allows accessing conformations that are unreachable with classical normal mode analysis. We provide practical solutions to describe localized motions with a few low-frequency modes and to relax some geometrical constraints along the predicted transitions. This work opens the way to the systematic description of protein motions, whatever their degree of collectivity. Our method is freely available as a part of the NOn-Linear rigid Block (NOLB) package.


Asunto(s)
Proteínas , Microscopía por Crioelectrón , Cristalografía por Rayos X , Modelos Moleculares , Conformación Proteica
13.
BMC Bioinformatics ; 21(Suppl 19): 573, 2020 Dec 21.
Artículo en Inglés | MEDLINE | ID: mdl-33349244

RESUMEN

BACKGROUND: Coiled-coils are described as stable structural motifs, where two or more helices wind around each other. However, coiled-coils are associated with local mobility and intrinsic disorder. Intrinsically disordered regions in proteins are characterized by lack of stable secondary and tertiary structure under physiological conditions in vitro. They are increasingly recognized as important for protein function. However, characterizing their behaviour in solution and determining precisely the extent of disorder of a protein region remains challenging, both experimentally and computationally. RESULTS: In this work, we propose a computational framework to quantify the extent of disorder within a coiled-coil in solution and to help design substitutions modulating such disorder. Our method relies on the analysis of conformational ensembles generated by relatively short all-atom Molecular Dynamics (MD) simulations. We apply it to the phosphoprotein multimerisation domains (PMD) of Measles virus (MeV) and Nipah virus (NiV), both forming tetrameric left-handed coiled-coils. We show that our method can help quantify the extent of disorder of the C-terminus region of MeV and NiV PMDs from MD simulations of a few tens of nanoseconds, and without requiring an extensive exploration of the conformational space. Moreover, this study provided a conceptual framework for the rational design of substitutions aimed at modulating the stability of the coiled-coils. By assessing the impact of four substitutions known to destabilize coiled-coils, we derive a set of rules to control MeV PMD structural stability and cohesiveness. We therefore design two contrasting substitutions, one increasing the stability of the tetramer and the other increasing its flexibility. CONCLUSIONS: Our method can be considered as a platform to reason about how to design substitutions aimed at regulating flexibility and stability.


Asunto(s)
Biología Computacional/métodos , Proteínas Virales/química , Secuencia de Aminoácidos , Virus del Sarampión/metabolismo , Simulación de Dinámica Molecular , Virus Nipah/metabolismo , Dominios Proteicos , Estabilidad Proteica , Estructura Secundaria de Proteína , Proteínas Virales/metabolismo
14.
Mol Biol Evol ; 36(11): 2604-2619, 2019 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-31406981

RESUMEN

The systematic and accurate description of protein mutational landscapes is a question of utmost importance in biology, bioengineering, and medicine. Recent progress has been achieved by leveraging on the increasing wealth of genomic data and by modeling intersite dependencies within biological sequences. However, state-of-the-art methods remain time consuming. Here, we present Global Epistatic Model for predicting Mutational Effects (GEMME) (www.lcqb.upmc.fr/GEMME), an original and fast method that predicts mutational outcomes by explicitly modeling the evolutionary history of natural sequences. This allows accounting for all positions in a sequence when estimating the effect of a given mutation. GEMME uses only a few biologically meaningful and interpretable parameters. Assessed against 50 high- and low-throughput mutational experiments, it overall performs similarly or better than existing methods. It accurately predicts the mutational landscapes of a wide range of protein families, including viral ones and, more generally, of much conserved families. Given an input alignment, it generates the full mutational landscape of a protein in a matter of minutes. It is freely available as a package and a webserver at www.lcqb.upmc.fr/GEMME/.

15.
Proteins ; 87(11): 952-965, 2019 11.
Artículo en Inglés | MEDLINE | ID: mdl-31199528

RESUMEN

The growing body of experimental and computational data describing how proteins interact with each other has emphasized the multiplicity of protein interactions and the complexity underlying protein surface usage and deformability. In this work, we propose new concepts and methods toward deciphering such complexity. We introduce the notion of interacting region to account for the multiple usage of a protein's surface residues by several partners and for the variability of protein interfaces coming from molecular flexibility. We predict interacting patches by crossing evolutionary, physicochemical and geometrical properties of the protein surface with information coming from complete cross-docking (CC-D) simulations. We show that our predictions match well interacting regions and that the different sources of information are complementary. We further propose an indicator of whether a protein has a few or many partners. Our prediction strategies are implemented in the dynJET2 algorithm and assessed on a new dataset of 262 protein on which we performed CC-D. The code and the data are available at: http://www.lcqb.upmc.fr/dynJET2/.


Asunto(s)
Proteínas/metabolismo , Algoritmos , Animales , Sitios de Unión , Humanos , Simulación del Acoplamiento Molecular , Unión Proteica , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas , Proteínas/química , Programas Informáticos
16.
Proteins ; 87(12): 1200-1221, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31612567

RESUMEN

We present the results for CAPRI Round 46, the third joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of 20 targets including 14 homo-oligomers and 6 heterocomplexes. Eight of the homo-oligomer targets and one heterodimer comprised proteins that could be readily modeled using templates from the Protein Data Bank, often available for the full assembly. The remaining 11 targets comprised 5 homodimers, 3 heterodimers, and two higher-order assemblies. These were more difficult to model, as their prediction mainly involved "ab-initio" docking of subunit models derived from distantly related templates. A total of ~30 CAPRI groups, including 9 automatic servers, submitted on average ~2000 models per target. About 17 groups participated in the CAPRI scoring rounds, offered for most targets, submitting ~170 models per target. The prediction performance, measured by the fraction of models of acceptable quality or higher submitted across all predictors groups, was very good to excellent for the nine easy targets. Poorer performance was achieved by predictors for the 11 difficult targets, with medium and high quality models submitted for only 3 of these targets. A similar performance "gap" was displayed by scorer groups, highlighting yet again the unmet challenge of modeling the conformational changes of the protein components that occur upon binding or that must be accounted for in template-based modeling. Our analysis also indicates that residues in binding interfaces were less well predicted in this set of targets than in previous Rounds, providing useful insights for directions of future improvements.


Asunto(s)
Biología Computacional , Conformación Proteica , Proteínas/ultraestructura , Programas Informáticos , Algoritmos , Sitios de Unión/genética , Bases de Datos de Proteínas , Modelos Moleculares , Unión Proteica/genética , Mapeo de Interacción de Proteínas , Proteínas/química , Proteínas/genética , Homología Estructural de Proteína
17.
PLoS Comput Biol ; 14(3): e1005992, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-29543809

RESUMEN

We present a new educational initiative called Meet-U that aims to train students for collaborative work in computational biology and to bridge the gap between education and research. Meet-U mimics the setup of collaborative research projects and takes advantage of the most popular tools for collaborative work and of cloud computing. Students are grouped in teams of 4-5 people and have to realize a project from A to Z that answers a challenging question in biology. Meet-U promotes "coopetition," as the students collaborate within and across the teams and are also in competition with each other to develop the best final product. Meet-U fosters interactions between different actors of education and research through the organization of a meeting day, open to everyone, where the students present their work to a jury of researchers and jury members give research seminars. This very unique combination of education and research is strongly motivating for the students and provides a formidable opportunity for a scientific community to unite and increase its visibility. We report on our experience with Meet-U in two French universities with master's students in bioinformatics and modeling, with protein-protein docking as the subject of the course. Meet-U is easy to implement and can be straightforwardly transferred to other fields and/or universities. All the information and data are available at www.meet-u.org.


Asunto(s)
Biología Computacional/educación , Biología Computacional/métodos , Investigación/educación , Humanos , Proyectos de Investigación , Estudiantes , Universidades
18.
Nucleic Acids Res ; 45(D1): D236-D242, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899675

RESUMEN

The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET2 at large-scale on more than 20 000 chains. JET2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET2 on all interfaces are: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein-protein interactions modulation and interaction surface redesign.


Asunto(s)
Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Animales , Sitios de Unión , Humanos , Internet , Modelos Moleculares , Unión Proteica , Conformación Proteica , Programas Informáticos
19.
Nucleic Acids Res ; 44(18): 8826-8841, 2016 Oct 14.
Artículo en Inglés | MEDLINE | ID: mdl-27580715

RESUMEN

The discovery of novel specific ribosome-associated factors challenges the assumption that translation relies on standardized molecular machinery. In this work, we demonstrate that Tma108, an uncharacterized translation machinery-associated factor in yeast, defines a subpopulation of cellular ribosomes specifically involved in the translation of less than 200 mRNAs encoding proteins with ATP or Zinc binding domains. Using ribonucleoparticle dissociation experiments we established that Tma108 directly interacts with the nascent protein chain. Additionally, we have shown that translation of the first 35 amino acids of Asn1, one of the Tma108 targets, is necessary and sufficient to recruit Tma108, suggesting that it is loaded early during translation. Comparative genomic analyses, molecular modeling and directed mutagenesis point to Tma108 as an original M1 metallopeptidase, which uses its putative catalytic peptide-binding pocket to bind the N-terminus of its targets. The involvement of Tma108 in co-translational regulation is attested by a drastic change in the subcellular localization of ATP2 mRNA upon Tma108 inactivation. Tma108 is a unique example of a nascent chain-associated factor with high selectivity and its study illustrates the existence of other specific translation-associated factors besides RNA binding proteins.


Asunto(s)
Aminopeptidasas/metabolismo , Biosíntesis de Proteínas , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Adenosina Trifosfato/metabolismo , Aminopeptidasas/química , Hibridación Fluorescente in Situ , Mitocondrias/genética , Mitocondrias/metabolismo , Extensión de la Cadena Peptídica de Translación , Unión Proteica , ATPasas de Translocación de Protón/genética , Transporte de ARN , ARN Mensajero/genética , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/química , Proteínas de Unión al ARN/metabolismo , Ribosomas/metabolismo , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Zinc/metabolismo
20.
Proteins ; 85(1): 137-154, 2017 01.
Artículo en Inglés | MEDLINE | ID: mdl-27802579

RESUMEN

Cells are interactive living systems where proteins movements, interactions and regulation are substantially free from centralized management. How protein physico-chemical and geometrical properties determine who interact with whom remains far from fully understood. We show that characterizing how a protein behaves with many potential interactors in a complete cross-docking study leads to a sharp identification of its cellular/true/native partner(s). We define a sociability index, or S-index, reflecting whether a protein likes or not to pair with other proteins. Formally, we propose a suitable normalization function that accounts for protein sociability and we combine it with a simple interface-based (ranking) score to discriminate partners from non-interactors. We show that sociability is an important factor and that the normalization permits to reach a much higher discriminative power than shape complementarity docking scores. The social effect is also observed with more sophisticated docking algorithms. Docking conformations are evaluated using experimental binding sites. These latter approximate in the best possible way binding sites predictions, which have reached high accuracy in recent years. This makes our analysis helpful for a global understanding of partner identification and for suggesting discriminating strategies. These results contradict previous findings claiming the partner identification problem being solvable solely with geometrical docking. Proteins 2016; 85:137-154. © 2016 Wiley Periodicals, Inc.


Asunto(s)
Modelos Estadísticos , Mapeo de Interacción de Proteínas/estadística & datos numéricos , Proteínas/química , Sitios de Unión , Escherichia coli/química , Humanos , Simulación del Acoplamiento Molecular , Unión Proteica , Dominios y Motivos de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Estructura Secundaria de Proteína , Programas Informáticos , Virus/química
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA