Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
1.
J Proteome Res ; 22(7): 2199-2217, 2023 07 07.
Article in English | MEDLINE | ID: mdl-37235544

ABSTRACT

Generating top-down tandem mass spectra (MS/MS) from complex mixtures of proteoforms benefits from improvements in fractionation, separation, fragmentation, and mass analysis. The algorithms to match MS/MS to sequences have undergone a parallel evolution, with both spectral alignment and match-counting approaches producing high-quality proteoform-spectrum matches (PrSMs). This study assesses state-of-the-art algorithms for top-down identification (ProSight PD, TopPIC, MSPathFinderT, and pTop) in their yield of PrSMs while controlling false discovery rate. We evaluated deconvolution engines (ThermoFisher Xtract, Bruker AutoMSn, Matrix Science Mascot Distiller, TopFD, and FLASHDeconv) in both ThermoFisher Orbitrap-class and Bruker maXis Q-TOF data (PXD033208) to produce consistent precursor charges and mass determinations. Finally, we sought post-translational modifications (PTMs) in proteoforms from bovine milk (PXD031744) and human ovarian tissue. Contemporary identification workflows produce excellent PrSM yields, although approximately half of all identified proteoforms from these four pipelines were specific to only one workflow. Deconvolution algorithms disagree on precursor masses and charges, contributing to identification variability. Detection of PTMs is inconsistent among algorithms. In bovine milk, 18% of PrSMs produced by pTop and TopMG were singly phosphorylated, but this percentage fell to 1% for one algorithm. Applying multiple search engines produces more comprehensive assessments of experiments. Top-down algorithms would benefit from greater interoperability.


Subject(s)
Proteome , Tandem Mass Spectrometry , Humans , Proteome/genetics , Proteomics , Software , Protein Processing, Post-Translational
2.
Microbiol Spectr ; : e0382622, 2023 Feb 27.
Article in English | MEDLINE | ID: mdl-36847572

ABSTRACT

The genus Yersinia includes a large variety of nonpathogenic and life-threatening pathogenic bacteria, which cause a broad spectrum of diseases in humans and animals, such as plague, enteritis, Far East scarlet-like fever (FESLF), and enteric redmouth disease. Like most clinically relevant microorganisms, Yersinia spp. are currently subjected to intense multi-omics investigations whose numbers have increased extensively in recent years, generating massive amounts of data useful for diagnostic and therapeutic developments. The lack of a simple and centralized way to exploit these data led us to design Yersiniomics, a web-based platform allowing straightforward analysis of Yersinia omics data. Yersiniomics contains a curated multi-omics database at its core, gathering 200 genomic, 317 transcriptomic, and 62 proteomic data sets for Yersinia species. It integrates genomic, transcriptomic, and proteomic browsers, a genome viewer, and a heatmap viewer to navigate within genomes and experimental conditions. For streamlined access to structural and functional properties, it directly links each gene to GenBank, the Kyoto Encyclopedia of Genes and Genomes (KEGG), UniProt, InterPro, IntAct, and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and each experiment to Gene Expression Omnibus (GEO), the European Nucleotide Archive (ENA), or the Proteomics Identifications Database (PRIDE). Yersiniomics provides a powerful tool for microbiologists to assist with investigations ranging from specific gene studies to systems biology studies. IMPORTANCE The expanding genus Yersinia is composed of multiple nonpathogenic species and a few pathogenic species, including the deadly etiologic agent of plague, Yersinia pestis. In 2 decades, the number of genomic, transcriptomic, and proteomic studies on Yersinia grew massively, delivering a wealth of data. We developed Yersiniomics, an interactive web-based platform, to centralize and analyze omics data sets on Yersinia species. The platform allows user-friendly navigation between genomic data, expression data, and experimental conditions. Yersiniomics will be a valuable tool to microbiologists.

3.
J Mass Spectrom ; 58(3): e4909, 2023 Mar.
Article in English | MEDLINE | ID: mdl-36822210

ABSTRACT

In antibody-based drug research, a complete characterization of antibody proteoforms covering both the amino acid sequence and all posttranslational modifications remains a major concern. The usual mass spectrometry-based approach to achieve this goal is bottom-up proteomics, which relies on the digestion of antibodies but does not allow the diversity of proteoforms to be assessed. Middle-down and top-down approaches have recently emerged as attractive alternatives but are not yet mastered and thus used in routine by many analytical chemistry laboratories. The work described here aims at providing guidelines to achieve the best sequence coverage for the fragmentation of intact light and heavy chains generated from a simple reduction of intact antibodies using Orbitrap mass spectrometry. Three parameters were found crucial to this aim: the use of an electron-based activation technique, the multiplex selection of precursor ions of different charge states, and the combination of replicates.


Subject(s)
Antibodies, Monoclonal , Tandem Mass Spectrometry , Antibodies, Monoclonal/analysis , Antibodies, Monoclonal/chemistry , Chromatography, Liquid/methods , Tandem Mass Spectrometry/methods , Amino Acid Sequence , Protein Processing, Post-Translational
4.
Bioinformatics ; 38(5): 1261-1268, 2022 02 07.
Article in English | MEDLINE | ID: mdl-34908131

ABSTRACT

MOTIVATION: Protein-protein interactions (PPIs) are key elements in numerous biological pathways and the subject of a growing number of drug discovery projects including against infectious diseases. Designing drugs on PPI targets remains a difficult task and requires extensive efforts to qualify a given interaction as an eligible target. To this end, besides the evident need to determine the role of PPIs in disease-associated pathways and their experimental characterization as therapeutics targets, prediction of their capacity to be bound by other protein partners or modulated by future drugs is of primary importance. RESULTS: We present InDeep, a tool for predicting functional binding sites within proteins that could either host protein epitopes or future drugs. Leveraging deep learning on a curated dataset of PPIs, this tool can proceed to enhanced functional binding site predictions either on experimental structures or along molecular dynamics trajectories. The benchmark of InDeep demonstrates that our tool outperforms state-of-the-art ligandable binding sites predictors when assessing PPI targets but also conventional targets. This offers new opportunities to assist drug design projects on PPIs by identifying pertinent binding pockets at or in the vicinity of PPI interfaces. AVAILABILITY AND IMPLEMENTATION: The tool is available on GitLab at https://gitlab.pasteur.fr/InDeep/InDeep. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Neural Networks, Computer , Proteins , Proteins/chemistry , Binding Sites , Protein Binding , Drug Design
5.
BMC Bioinformatics ; 22(1): 190, 2021 Apr 14.
Article in English | MEDLINE | ID: mdl-33853521

ABSTRACT

BACKGROUND: Harmonin Homogy Domains (HHD) are recently identified orphan domains of about 70 residues folded in a compact five alpha-helix bundle that proved to be versatile in terms of function, allowing for direct binding to a partner as well as regulating the affinity and specificity of adjacent domains for their own targets. Adding their small size and rather simple fold, HHDs appear as convenient modules to regulate protein-protein interactions in various biological contexts. Surprisingly, only nine HHDs have been detected in six proteins, mainly expressed in sensory neurons. RESULTS: Here, we built a profile Hidden Markov Model to screen the entire UniProtKB for new HHD-containing proteins. Every hit was manually annotated, using a clustering approach, confirming that only a few proteins contain HHDs. We report the phylogenetic coverage of each protein and build a phylogenetic tree to trace the evolution of HHDs. We suggest that a HHD ancestor is shared with Paired Amphipathic Helices (PAH) domains, a four-helix bundle partially sharing fold and functional properties. We characterized amino-acid sequences of the various HHDs using pairwise BLASTP scoring coupled with community clustering and manually assessed sequence features among each individual family. These sequence features were analyzed using reported structures as well as homology models to highlight structural motifs underlying HHDs fold. We show that functional divergence is carried out by subtle differences in sequences that automatized approaches failed to detect. CONCLUSIONS: We provide the first HHD databases, including sequences and conservation, phylogenic trees and a list of HHD variants found in the auditory system, which are available for the community. This case study highlights surprising phylogenetic properties found in orphan domains and will assist further studies of HHDs. We unveil the implication of HHDs in their various binding interfaces using conservation across families and a new protein-protein surface predictor. Finally, we discussed the functional consequences of three identified pathogenic HHD variants involved in Hoyeraal-Hreidarsson syndrome and of three newly reported pathogenic variants identified in patients suffering from Usher Syndrome.


Subject(s)
Dyskeratosis Congenita , Membrane Proteins , Amino Acid Sequence , Fetal Growth Retardation , Humans , Membrane Proteins/genetics , Phylogeny
6.
Bioinformatics ; 37(1): 89-96, 2021 Apr 09.
Article in English | MEDLINE | ID: mdl-33416858

ABSTRACT

MOTIVATION: One avenue to address the paucity of clinically testable targets is to reinvestigate the druggable genome by tackling complicated types of targets such as Protein-Protein Interactions (PPIs). Given the challenge to target those interfaces with small chemical compounds, it has become clear that learning from successful examples of PPI modulation is a powerful strategy. Freely accessible databases of PPI modulators that provide the community with tractable chemical and pharmacological data, as well as powerful tools to query them, are therefore essential to stimulate new drug discovery projects on PPI targets. RESULTS: Here, we present the new version iPPI-DB, our manually curated database of PPI modulators. In this completely redesigned version of the database, we introduce a new web interface relying on crowdsourcing for the maintenance of the database. This interface was created to enable community contributions, whereby external experts can suggest new database entries. Moreover, the data model, the graphical interface, and the tools to query the database have been completely modernized and improved. We added new PPI modulators, new PPI targets and extended our focus to stabilizers of PPIs as well. AVAILABILITY AND IMPLEMENTATION: The iPPI-DB server is available at https://ippidb.pasteur.fr The source code for this server is available at https://gitlab.pasteur.fr/ippidb/ippidb-web/ and is distributed under GPL licence (http://www.gnu.org/licences/gpl). Queries can be shared through persistent links according to the FAIR data standards. Data can be downloaded from the website as csv files. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

7.
J Phys Chem A ; 124(51): 10637-10648, 2020 Dec 24.
Article in English | MEDLINE | ID: mdl-33170681

ABSTRACT

We describe methods for physics-based protein design and some recent applications from our work. We present the physical interpretation of a MC simulation in sequence space and show that sequences and conformations form a well-defined statistical ensemble, explored with Monte Carlo and Boltzmann sampling. The folded state energy combines molecular mechanics for solutes with continuum electrostatics for solvent. We usually assume one or a few fixed protein backbone structures and discrete side chain rotamers. Methods based on molecular dynamics, which introduce additional backbone and side chain flexibility, are under development. The redesign of a PDZ domain and an aminoacyl-tRNA synthetase enzyme were successful. We describe a versatile, adaptive, Wang-Landau MC method that can be used to design for substrate affinity, catalytic rate, catalytic efficiency, or the specificity of these properties. The methods are transferable to all biomolecules, can be systematically improved, and give physical insights.


Subject(s)
Proteins/chemistry , Algorithms , Computational Chemistry , Data Interpretation, Statistical , Molecular Dynamics Simulation , Monte Carlo Method , Protein Conformation , Protein Folding , Software , Thermodynamics
8.
J Integr Bioinform ; 15(2)2018 Jun 21.
Article in English | MEDLINE | ID: mdl-29927748

ABSTRACT

Proteomic and transcriptomic technologies resulted in massive biological datasets, their interpretation requiring sophisticated computational strategies. Efficient and intuitive real-time analysis remains challenging. We use proteomic data on 1417 proteins of the green microalga Chlamydomonas reinhardtii to investigate physicochemical parameters governing selectivity of three cysteine-based redox post translational modifications (PTM): glutathionylation (SSG), nitrosylation (SNO) and disulphide bonds (SS) reduced by thioredoxins. We aim to understand underlying molecular mechanisms and structural determinants through integration of redox proteome data from gene- to structural level. Our interactive visual analytics approach on an 8.3 m2 display wall of 25 MPixel resolution features stereoscopic three dimensions (3D) representation performed by UnityMol WebGL. Virtual reality headsets complement the range of usage configurations for fully immersive tasks. Our experiments confirm that fast access to a rich cross-linked database is necessary for immersive analysis of structural data. We emphasize the possibility to display complex data structures and relationships in 3D, intrinsic to molecular structure visualization, but less common for omics-network analysis. Our setup is powered by MinOmics, an integrated analysis pipeline and visualization framework dedicated to multi-omics analysis. MinOmics integrates data from various sources into a materialized physical repository. We evaluate its performance, a design criterion for the framework.


Subject(s)
Algal Proteins/metabolism , Chlamydomonas reinhardtii/metabolism , Computer Graphics , Protein Processing, Post-Translational , Software , Algal Proteins/chemistry , Chlamydomonas reinhardtii/chemistry , Imaging, Three-Dimensional , Models, Structural , Oxidation-Reduction , Protein Conformation , Protein Interaction Maps , Proteome/analysis , Proteomics/methods , Virtual Reality
9.
J Mol Graph Model ; 71: 192-199, 2017 01.
Article in English | MEDLINE | ID: mdl-27939931

ABSTRACT

The stereospecificity of aminoacyl-tRNA synthetases helps exclude d-amino acids from protein synthesis and could perhaps be engineered to allow controlled d-amino acylation of tRNA. We use molecular dynamics simulations to probe the stereospecificity of the class I tyrosyl- and glutaminyl-tRNA synthetases (TyrRS, GlnRS), including wildtype enzymes and three point mutants suggested by three different protein design methods. l/d binding free energy differences are obtained by alchemically and reversibly transforming the ligand from L to D in simulations of the protein-ligand complex. The D81Q mutation in Escherichia coli TyrRS is homologous to the D81R mutant shown earlier to have inverted stereospecificity. D81Q is predicted to lead to a rotated ligand backbone and an increased, not a decreased l-Tyr preference. The E36Q mutation in Methanococcus jannaschii TyrRS has a predicted l/d binding free energy difference ΔΔG of just 0.5±0.9kcal/mol, compared to 3.1±0.8kcal/mol for the wildtype enzyme (favoring l-Tyr). The ligand ammonium position is preserved in the d-Tyr complex, while the carboxylate is shifted. Wildtype GlnRS has a similar preference for l-glutaminyl adenylate; the R260Q mutant has an increased preference, even though Arg260 makes a large contribution to the wildtype ΔΔG value.


Subject(s)
Amino Acyl-tRNA Synthetases/chemistry , Molecular Dynamics Simulation , Protein Biosynthesis/genetics , Tyrosine-tRNA Ligase/chemistry , Adenosine Monophosphate/chemistry , Amino Acids/chemistry , Amino Acids/genetics , Amino Acyl-tRNA Synthetases/genetics , Escherichia coli/enzymology , Methanocaldococcus/enzymology , Point Mutation , Protein Engineering , Tyrosine-tRNA Ligase/genetics
10.
J Chem Theory Comput ; 12(12): 6035-6048, 2016 Dec 13.
Article in English | MEDLINE | ID: mdl-27775883

ABSTRACT

Multistate protein design explores side chain mutations, with the backbone allowed to sample a small, predetermined library of conformations. To achieve Boltzmann sampling of sequences and conformations, we use a hybrid Monte Carlo (MC) scheme: a trial hop between backbone models is followed by a short MC segment where side chain rotamers adjust to the new backbone, before applying a Metropolis-like acceptance test. The theoretical form and a practical approximation for the acceptance test are derived. We then compute backbone conformational free energies for two SH2 and SH3 proteins using different routes and protocols, and verify that for simple test problems, the free energy behaves like a state function, a hallmark of Boltzmann sampling.


Subject(s)
Proteins/chemistry , Catalytic Domain , Enzymes/chemistry , Enzymes/metabolism , Molecular Dynamics Simulation , Monte Carlo Method , Protein Structure, Tertiary , Proteins/metabolism , Thermodynamics , src Homology Domains
11.
Methods Mol Biol ; 1414: 77-97, 2016.
Article in English | MEDLINE | ID: mdl-27094287

ABSTRACT

This chapter describes the organization and use of Proteus, a multitool computational suite for the optimization of protein and ligand conformations and sequences, and the calculation of pK α shifts and relative binding affinities. The software offers the use of several molecular mechanics force fields and solvent models, including two generalized Born variants, and a large range of scoring functions, which can combine protein stability, ligand affinity, and ligand specificity terms, for positive and negative design. We present in detail the steps for structure preparation, system setup, construction of the interaction energy matrix, protein sequence and structure optimizations, pK α calculations, and ligand titration calculations. We discuss illustrative examples, including the chemical/structural optimization of a complex between the MHC class II protein HLA-DQ8 and the vinculin epitope, and the chemical optimization of the compstatin analog Ac-Val4Trp/His9Ala, which regulates the function of protein C3 of the complement system.


Subject(s)
Proteins/chemistry , Binding Sites , Ligands , Thermodynamics
12.
Proteins ; 84(2): 240-53, 2016 Feb.
Article in English | MEDLINE | ID: mdl-26676967

ABSTRACT

D-Amino acids are largely excluded from protein synthesis, yet they are of great interest in biotechnology. Unnatural amino acids have been introduced into proteins using engineered aminoacyl-tRNA synthetases (aaRSs), and this strategy might be applicable to D-amino acids. Several aaRSs can aminoacylate their tRNA with a D-amino acid; of these, tyrosyl-tRNA synthetase (TyrRS) has the weakest stereospecificity. We use computational protein design to suggest active site mutations in Escherichia coli TyrRS that could increase its D-Tyr binding further, relative to L-Tyr. The mutations selected all modify one or more sidechain charges in the Tyr binding pocket. We test their effect by probing the aminoacyl-adenylation reaction through pyrophosphate exchange experiments. We also perform extensive alchemical free energy simulations to obtain L-Tyr/D-Tyr binding free energy differences. Agreement with experiment is good, validating the structural models and detailed thermodynamic predictions the simulations provide. The TyrRS stereospecificity proves hard to engineer through charge-altering mutations in the first and second coordination shells of the Tyr ammonium group. Of six mutants tested, two are active towards D-Tyr; one of these has an inverted stereospecificity, with a large preference for D-Tyr. However, its activity is low. Evidently, the TyrRS stereospecificity is robust towards charge rearrangements near the ligand. Future design may have to consider more distant and/or electrically neutral target mutations, and possibly design for binding of the transition state, whose structure however can only be modeled.


Subject(s)
Escherichia coli Proteins/chemistry , Escherichia coli Proteins/metabolism , Tyrosine-tRNA Ligase/chemistry , Tyrosine-tRNA Ligase/metabolism , Escherichia coli Proteins/genetics , Molecular Dynamics Simulation , Protein Engineering , Stereoisomerism , Thermodynamics , Tyrosine-tRNA Ligase/genetics
13.
J Comput Chem ; 37(4): 404-15, 2016 Feb 05.
Article in English | MEDLINE | ID: mdl-26503829

ABSTRACT

A computational protein design method is extended to allow Monte Carlo simulations where two ligands are titrated into a protein binding pocket, yielding binding free energy differences. These provide a stringent test of the physical model, including the energy surface and sidechain rotamer definition. As a test, we consider tyrosyl-tRNA synthetase (TyrRS), which has been extensively redesigned experimentally. We consider its specificity for its substrate l-tyrosine (l-Tyr), compared to the analogs d-Tyr, p-acetyl-, and p-azido-phenylalanine (ac-Phe, az-Phe). We simulate l- and d-Tyr binding to TyrRS and six mutants, and compare the structures and binding free energies to a more rigorous "MD/GBSA" procedure: molecular dynamics with explicit solvent for structures and a Generalized Born + Surface Area model for binding free energies. Next, we consider l-Tyr, ac- and az-Phe binding to six other TyrRS variants. The titration results are sensitive to the precise rotamer definition, which involves a short energy minimization for each sidechain pair to help relax bad contacts induced by the discrete rotamer set. However, when designed mutant structures are rescored with a standard GBSA energy model, results agree well with the more rigorous MD/GBSA. As a third test, we redesign three amino acid positions in the substrate coordination sphere, with either l-Tyr or d-Tyr as the ligand. For two, we obtain good agreement with experiment, recovering the wildtype residue when l-Tyr is the ligand and a d-Tyr specific mutant when d-Tyr is the ligand. For the third, we recover His with either ligand, instead of wildtype Gln.


Subject(s)
Thermodynamics , Tyrosine-tRNA Ligase/chemistry , Tyrosine-tRNA Ligase/metabolism , Tyrosine/chemistry , Tyrosine/metabolism , Binding Sites/drug effects , Ligands , Molecular Dynamics Simulation , Monte Carlo Method , Mutant Proteins/chemistry , Mutant Proteins/genetics , Mutant Proteins/metabolism , Protein Binding , Tyrosine-tRNA Ligase/genetics
14.
Future Med Chem ; 7(17): 2317-31, 2015.
Article in English | MEDLINE | ID: mdl-26599419

ABSTRACT

AIM: Molecular dynamics simulations and normal mode analysis are well-established approaches to generate receptor conformational ensembles (RCEs) for ligand docking and virtual screening. Here, we report new fast molecular dynamics-based and normal mode analysis-based protocols combined with conformational pocket classifications to efficiently generate RCEs. MATERIALS & METHODS: We assessed our protocols on two well-characterized protein targets showing local active site flexibility, dihydrofolate reductase and large collective movements, CDK2. The performance of the RCEs was validated by distinguishing known ligands of dihydrofolate reductase and CDK2 among a dataset of diverse chemical decoys. RESULTS & DISCUSSION: Our results show that different simulation protocols can be efficient for generation of RCEs depending on different kind of protein flexibility.


Subject(s)
Molecular Dynamics Simulation , Binding Sites , Cyclin-Dependent Kinase 2/chemistry , Cyclin-Dependent Kinase 2/metabolism , High-Throughput Screening Assays , Humans , Ligands , Protein Structure, Tertiary , Tetrahydrofolate Dehydrogenase/chemistry , Tetrahydrofolate Dehydrogenase/metabolism
15.
J Comput Chem ; 34(28): 2472-84, 2013 Oct 30.
Article in English | MEDLINE | ID: mdl-24037756

ABSTRACT

We describe an automated procedure for protein design, implemented in a flexible software package, called Proteus. System setup and calculation of an energy matrix are done with the XPLOR modeling program and its sophisticated command language, supporting several force fields and solvent models. A second program provides algorithms to search sequence space. It allows a decomposition of the system into groups, which can be combined in different ways in the energy function, for both positive and negative design. The whole procedure can be controlled by editing 2-4 scripts. Two applications consider the tyrosyl-tRNA synthetase enzyme and its successful redesign to bind both O-methyl-tyrosine and D-tyrosine. For the latter, we present Monte Carlo simulations where the D-tyrosine concentration is gradually increased, displacing L-tyrosine from the binding pocket and yielding the binding free energy difference, in good agreement with experiment. Complete redesign of the Crk SH3 domain is presented. The top 10000 sequences are all assigned to the correct fold by the SUPERFAMILY library of Hidden Markov Models. Finally, we report the acid/base behavior of the SNase protein. Sidechain protonation is treated as a form of mutation; it is then straightforward to perform constant-pH Monte Carlo simulations, which yield good agreement with experiment. Overall, the software can be used for a wide range of application, producing not only native-like sequences but also thermodynamic properties with errors that appear comparable to other current software packages.


Subject(s)
Computational Biology , Proteins/chemistry , Software , Algorithms , Hydrogen-Ion Concentration , Models, Molecular , Molecular Dynamics Simulation , Monte Carlo Method , Protein Unfolding , Proto-Oncogene Proteins c-crk/chemistry , Tyrosine/analogs & derivatives , Tyrosine/chemistry , Tyrosine/metabolism , Tyrosine-tRNA Ligase/chemistry , Tyrosine-tRNA Ligase/metabolism , src Homology Domains
SELECTION OF CITATIONS
SEARCH DETAIL
...