Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 68
Filter
1.
PLoS Comput Biol ; 18(2): e1009855, 2022 02.
Article in English | MEDLINE | ID: mdl-35143481

ABSTRACT

Antimicrobial resistance presents a significant health care crisis. The mutation F98Y in Staphylococcus aureus dihydrofolate reductase (SaDHFR) confers resistance to the clinically important antifolate trimethoprim (TMP). Propargyl-linked antifolates (PLAs), next generation DHFR inhibitors, are much more resilient than TMP against this F98Y variant, yet this F98Y substitution still reduces efficacy of these agents. Surprisingly, differences in the enantiomeric configuration at the stereogenic center of PLAs influence the isomeric state of the NADPH cofactor. To understand the molecular basis of F98Y-mediated resistance and how PLAs' inhibition drives NADPH isomeric states, we used protein design algorithms in the osprey protein design software suite to analyze a comprehensive suite of structural, biophysical, biochemical, and computational data. Here, we present a model showing how F98Y SaDHFR exploits a different anomeric configuration of NADPH to evade certain PLAs' inhibition, while other PLAs remain unaffected by this resistance mechanism.


Subject(s)
Folic Acid Antagonists , Staphylococcal Infections , Drug Resistance, Bacterial/genetics , Folic Acid Antagonists/chemistry , Folic Acid Antagonists/pharmacology , Humans , NADP/metabolism , Staphylococcus aureus/genetics , Staphylococcus aureus/metabolism , Tetrahydrofolate Dehydrogenase/chemistry , Tetrahydrofolate Dehydrogenase/genetics , Tetrahydrofolate Dehydrogenase/metabolism , Trimethoprim/chemistry , Trimethoprim/metabolism , Trimethoprim/pharmacology
2.
PLoS Comput Biol ; 16(6): e1007447, 2020 06.
Article in English | MEDLINE | ID: mdl-32511232

ABSTRACT

The K* algorithm provably approximates partition functions for a set of states (e.g., protein, ligand, and protein-ligand complex) to a user-specified accuracy ε. Often, reaching an ε-approximation for a particular set of partition functions takes a prohibitive amount of time and space. To alleviate some of this cost, we introduce two new algorithms into the osprey suite for protein design: fries, a Fast Removal of Inadequately Energied Sequences, and EWAK*, an Energy Window Approximation to K*. fries pre-processes the sequence space to limit a design to only the most stable, energetically favorable sequence possibilities. EWAK* then takes this pruned sequence space as input and, using a user-specified energy window, calculates K* scores using the lowest energy conformations. We expect fries/EWAK* to be most useful in cases where there are many unstable sequences in the design sequence space and when users are satisfied with enumerating the low-energy ensemble of conformations. In combination, these algorithms provably retain calculational accuracy while limiting the input sequence space and the conformations included in each partition function calculation to only the most energetically favorable, effectively reducing runtime while still enriching for desirable sequences. This combined approach led to significant speed-ups compared to the previous state-of-the-art multi-sequence algorithm, BBK*, while maintaining its efficiency and accuracy, which we show across 40 different protein systems and a total of 2,826 protein design problems. Additionally, as a proof of concept, we used these new algorithms to redesign the protein-protein interface (PPI) of the c-Raf-RBD:KRas complex. The Ras-binding domain of the protein kinase c-Raf (c-Raf-RBD) is the tightest known binder of KRas, a protein implicated in difficult-to-treat cancers. fries/EWAK* accurately retrospectively predicted the effect of 41 different sets of mutations in the PPI of the c-Raf-RBD:KRas complex. Notably, these mutations include mutations whose effect had previously been incorrectly predicted using other computational methods. Next, we used fries/EWAK* for prospective design and discovered a novel point mutation that improves binding of c-Raf-RBD to KRas in its active, GTP-bound state (KRasGTP). We combined this new mutation with two previously reported mutations (which were highly-ranked by osprey) to create a new variant of c-Raf-RBD, c-Raf-RBD(RKY). fries/EWAK* in osprey computationally predicted that this new variant binds even more tightly than the previous best-binding variant, c-Raf-RBD(RK). We measured the binding affinity of c-Raf-RBD(RKY) using a bio-layer interferometry (BLI) assay, and found that this new variant exhibits single-digit nanomolar affinity for KRasGTP, confirming the computational predictions made with fries/EWAK*. This new variant binds roughly five times more tightly than the previous best known binder and roughly 36 times more tightly than the design starting point (wild-type c-Raf-RBD). This study steps through the advancement and development of computational protein design by presenting theory, new algorithms, accurate retrospective designs, new prospective designs, and biochemical validation.


Subject(s)
Computational Biology , Protein Engineering/methods , Proto-Oncogene Proteins c-raf/chemistry , Proto-Oncogene Proteins p21(ras)/chemistry , Algorithms , Computers , Humans , Interferometry , Lectins/chemistry , Ligands , Models, Statistical , Programming Languages , Protein Binding , Protein Domains , Software
3.
J Comput Chem ; 39(30): 2494-2507, 2018 11 15.
Article in English | MEDLINE | ID: mdl-30368845

ABSTRACT

We present osprey 3.0, a new and greatly improved release of the osprey protein design software. Osprey 3.0 features a convenient new Python interface, which greatly improves its ease of use. It is over two orders of magnitude faster than previous versions of osprey when running the same algorithms on the same hardware. Moreover, osprey 3.0 includes several new algorithms, which introduce substantial speedups as well as improved biophysical modeling. It also includes GPU support, which provides an additional speedup of over an order of magnitude. Like previous versions of osprey, osprey 3.0 offers a unique package of advantages over other design software, including provable design algorithms that account for continuous flexibility during design and model conformational entropy. Finally, we show here empirically that osprey 3.0 accurately predicts the effect of mutations on protein-protein binding. Osprey 3.0 is available at http://www.cs.duke.edu/donaldlab/osprey.php as free and open-source software. © 2018 Wiley Periodicals, Inc.


Subject(s)
Protein Conformation , Proteins/chemistry , Software , Algorithms , Models, Molecular , Protein Binding
4.
Bioinformatics ; 33(14): i5-i12, 2017 Jul 15.
Article in English | MEDLINE | ID: mdl-28882005

ABSTRACT

MOTIVATION: When proteins mutate or bind to ligands, their backbones often move significantly, especially in loop regions. Computational protein design algorithms must model these motions in order to accurately optimize protein stability and binding affinity. However, methods for backbone conformational search in design have been much more limited than for sidechain conformational search. This is especially true for combinatorial protein design algorithms, which aim to search a large sequence space efficiently and thus cannot rely on temporal simulation of each candidate sequence. RESULTS: We alleviate this difficulty with a new parameterization of backbone conformational space, which represents all degrees of freedom of a specified segment of protein chain that maintain valid bonding geometry (by maintaining the original bond lengths and angles and ω dihedrals). In order to search this space, we present an efficient algorithm, CATS, for computing atomic coordinates as a function of our new continuous backbone internal coordinates. CATS generalizes the iMinDEE and EPIC protein design algorithms, which model continuous flexibility in sidechain dihedrals, to model continuous, appropriately localized flexibility in the backbone dihedrals ϕ and ψ as well. We show using 81 test cases based on 29 different protein structures that CATS finds sequences and conformations that are significantly lower in energy than methods with less or no backbone flexibility do. In particular, we show that CATS can model the viability of an antibody mutation known experimentally to increase affinity, but that appears sterically infeasible when modeled with less or no backbone flexibility. AVAILABILITY AND IMPLEMENTATION: Our code is available as free software at https://github.com/donaldlab/OSPREY_refactor . CONTACT: mhallen@ttic.edu or brd+ismb17@cs.duke.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Proteins/chemistry , Software , Algorithms , Models, Molecular , Mutation , Protein Conformation , Proteins/genetics
5.
PLoS Comput Biol ; 13(3): e1005346, 2017 03.
Article in English | MEDLINE | ID: mdl-28358804

ABSTRACT

Protein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the protein system to be designed can thus be represented using a sparse residue interaction graph, where the number of interacting residue pairs is less than all pairs of mutable residues, and the corresponding GMEC is called the sparse GMEC. However, ignoring some pairwise residue interactions can lead to a change in the energy, conformation, or sequence of the sparse GMEC vs. the original or the full GMEC. Despite the widespread use of sparse residue interaction graphs in protein design, the above mentioned effects of their use have not been previously analyzed. To analyze the costs and benefits of designing with sparse residue interaction graphs, we computed the GMECs for 136 different protein design problems both with and without distance and energy cutoffs, and compared their energies, conformations, and sequences. Our analysis shows that the differences between the GMECs depend critically on whether or not the design includes core, boundary, or surface residues. Moreover, neglecting long-range interactions can alter local interactions and introduce large sequence differences, both of which can result in significant structural and functional changes. Designs on proteins with experimentally measured thermostability show it is beneficial to compute both the full and the sparse GMEC accurately and efficiently. To this end, we show that a provable, ensemble-based algorithm can efficiently compute both GMECs by enumerating a small number of conformations, usually fewer than 1000. This provides a novel way to combine sparse residue interaction graphs with provable, ensemble-based algorithms to reap the benefits of sparse residue interaction graphs while avoiding their potential inaccuracies.


Subject(s)
Algorithms , Proteins/chemistry , Amino Acid Sequence , Animals , Computational Biology , Computer Graphics , Humans , Models, Molecular , Protein Conformation , Protein Engineering , Proteins/genetics , Software , Thermodynamics
6.
Proc Natl Acad Sci U S A ; 112(3): 749-54, 2015 Jan 20.
Article in English | MEDLINE | ID: mdl-25552560

ABSTRACT

Methods to accurately predict potential drug target mutations in response to early-stage leads could drive the design of more resilient first generation drug candidates. In this study, a structure-based protein design algorithm (K* in the OSPREY suite) was used to prospectively identify single-nucleotide polymorphisms that confer resistance to an experimental inhibitor effective against dihydrofolate reductase (DHFR) from Staphylococcus aureus. Four of the top-ranked mutations in DHFR were found to be catalytically competent and resistant to the inhibitor. Selection of resistant bacteria in vitro reveals that two of the predicted mutations arise in the background of a compensatory mutation. Using enzyme kinetics, microbiology, and crystal structures of the complexes, we determined the fitness of the mutant enzymes and strains, the structural basis of resistance, and the compensatory relationship of the mutations. To our knowledge, this work illustrates the first application of protein design algorithms to prospectively predict viable resistance mutations that arise in bacteria under antibiotic pressure.


Subject(s)
Algorithms , Folic Acid Antagonists/pharmacology , Proteins/chemistry , Drug Resistance/genetics , Polymorphism, Single Nucleotide , Staphylococcus aureus/enzymology , Tetrahydrofolate Dehydrogenase/drug effects
7.
Proc Natl Acad Sci U S A ; 111(4): 1391-6, 2014 Jan 28.
Article in English | MEDLINE | ID: mdl-24474763

ABSTRACT

The membrane proximal external region (MPER) of HIV-1 glycoprotein (gp) 41 is involved in viral-host cell membrane fusion. It contains short amino acid sequences that are binding sites for the HIV-1 broadly neutralizing antibodies 2F5, 4E10, and 10E8, making these binding sites important targets for HIV-1 vaccine development. We report a high-resolution structure of a designed MPER trimer assembled on a detergent micelle. The NMR solution structure of this trimeric domain, designated gp41-M-MAT, shows that the three MPER peptides each adopt symmetric α-helical conformations exposing the amino acid side chains of the antibody binding sites. The helices are closely associated at their N termini, bend between the 2F5 and 4E10 epitopes, and gradually separate toward the C termini, where they associate with the membrane. The mAbs 2F5 and 4E10 bind gp41-M-MAT with nanomolar affinities, consistent with the substantial exposure of their respective epitopes in the trimer structure. The traditional structure determination of gp41-M-MAT using the Xplor-NIH protocol was validated by independently determining the structure using the DISCO sparse-data protocol, which exploits geometric arrangement algorithms that guarantee to compute all structures and assignments that satisfy the data.


Subject(s)
Antibodies, Neutralizing/immunology , Biopolymers/immunology , HIV Envelope Protein gp41/immunology , HIV-1/immunology , Amino Acid Sequence , Antibodies, Neutralizing/chemistry , Biopolymers/chemistry , HIV Envelope Protein gp41/chemistry , Magnetic Resonance Spectroscopy , Models, Molecular , Molecular Sequence Data
8.
J Comput Chem ; 37(12): 1048-58, 2016 May 05.
Article in English | MEDLINE | ID: mdl-26833706

ABSTRACT

One of the main challenges in computational protein design (CPD) is the huge size of the protein sequence and conformational space that has to be computationally explored. Recently, we showed that state-of-the-art combinatorial optimization technologies based on Cost Function Network (CFN) processing allow speeding up provable rigid backbone protein design methods by several orders of magnitudes. Building up on this, we improved and injected CFN technology into the well-established CPD package Osprey to allow all Osprey CPD algorithms to benefit from associated speedups. Because Osprey fundamentally relies on the ability of A* to produce conformations in increasing order of energy, we defined new A* strategies combining CFN lower bounds, with new side-chain positioning-based branching scheme. Beyond the speedups obtained in the new A*-CFN combination, this novel branching scheme enables a much faster enumeration of suboptimal sequences, far beyond what is reachable without it. Together with the immediate and important speedups provided by CFN technology, these developments directly benefit to all the algorithms that previously relied on the DEE/ A* combination inside Osprey* and make it possible to solve larger CPD problems with provable algorithms.


Subject(s)
Algorithms , Computational Biology , Proteins/chemistry , Amino Acid Sequence , Drug Design , Protein Conformation
9.
Proteins ; 83(6): 1151-64, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25846627

ABSTRACT

Flexibility and dynamics are important for protein function and a protein's ability to accommodate amino acid substitutions. However, when computational protein design algorithms search over protein structures, the allowed flexibility is often reduced to a relatively small set of discrete side-chain and backbone conformations. While simplifications in scoring functions and protein flexibility are currently necessary to computationally search the vast protein sequence and conformational space, a rigid representation of a protein causes the search to become brittle and miss low-energy structures. Continuous rotamers more closely represent the allowed movement of a side chain within its torsional well and have been successfully incorporated into the protein design framework to design biomedically relevant protein systems. The use of continuous rotamers in protein design enables algorithms to search a larger conformational space than previously possible, but adds additional complexity to the design search. To design large, complex systems with continuous rotamers, new algorithms are needed to increase the efficiency of the search. We present two methods, PartCR and HOT, that greatly increase the speed and efficiency of protein design with continuous rotamers. These methods specifically target the large errors in energetic terms that are used to bound pairwise energies during the design search. By tightening the energy bounds, additional pruning of the conformation space can be achieved, and the number of conformations that must be enumerated to find the global minimum energy conformation is greatly reduced.


Subject(s)
Computational Biology/methods , Protein Conformation , Proteins/chemistry , Proteins/metabolism , Algorithms , Amino Acid Sequence , Models, Molecular , Protein Engineering
10.
Proteins ; 83(4): 651-61, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25620116

ABSTRACT

Protein structure determination by NMR has predominantly relied on simulated annealing-based conformational search for a converged fold using primarily distance constraints, including constraints derived from nuclear Overhauser effects, paramagnetic relaxation enhancement, and cysteine crosslinkings. Although there is no guarantee that the converged fold represents the global minimum of the conformational space, it is generally accepted that good convergence is synonymous to the global minimum. Here, we show such a criterion breaks down in the presence of large numbers of ambiguous constraints from NMR experiments on homo-oligomeric protein complexes. A systematic evaluation of the conformational solutions that satisfy the NMR constraints of a trimeric membrane protein, DAGK, reveals 9 distinct folds, including the reported NMR and crystal structures. This result highlights the fundamental limitation of global fold determination for homo-oligomeric proteins using ambiguous distance constraints and provides a systematic solution for exhaustive enumeration of all satisfying solutions.


Subject(s)
Magnetic Resonance Spectroscopy , Models, Molecular , Protein Subunits/chemistry , Proteins/chemistry , Protein Conformation , Protein Folding , Protein Multimerization , Protein Subunits/metabolism , Proteins/metabolism
11.
Proteins ; 83(10): 1859-1877, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26235965

ABSTRACT

Despite significant successes in structure-based computational protein design in recent years, protein design algorithms must be improved to increase the biological accuracy of new designs. Protein design algorithms search through an exponential number of protein conformations, protein ensembles, and amino acid sequences in an attempt to find globally optimal structures with a desired biological function. To improve the biological accuracy of protein designs, it is necessary to increase both the amount of protein flexibility allowed during the search and the overall size of the design, while guaranteeing that the lowest-energy structures and sequences are found. DEE/A*-based algorithms are the most prevalent provable algorithms in the field of protein design and can provably enumerate a gap-free list of low-energy protein conformations, which is necessary for ensemble-based algorithms that predict protein binding. We present two classes of algorithmic improvements to the A* algorithm that greatly increase the efficiency of A*. First, we analyze the effect of ordering the expansion of mutable residue positions within the A* tree and present a dynamic residue ordering that reduces the number of A* nodes that must be visited during the search. Second, we propose new methods to improve the conformational bounds used to estimate the energies of partial conformations during the A* search. The residue ordering techniques and improved bounds can be combined for additional increases in A* efficiency. Our enhancements enable all A*-based methods to more fully search protein conformation space, which will ultimately improve the accuracy of complex biomedically relevant designs.


Subject(s)
Computational Biology/methods , Protein Engineering/methods , Sequence Analysis, Protein/methods , Algorithms , Amino Acid Sequence , Protein Conformation , Software
12.
J Virol ; 88(21): 12669-82, 2014 Nov.
Article in English | MEDLINE | ID: mdl-25142607

ABSTRACT

UNLABELLED: Over the past 5 years, a new generation of highly potent and broadly neutralizing HIV-1 antibodies has been identified. These antibodies can protect against lentiviral infection in nonhuman primates (NHPs), suggesting that passive antibody transfer would prevent HIV-1 transmission in humans. To increase the protective efficacy of such monoclonal antibodies, we employed next-generation sequencing, computational bioinformatics, and structure-guided design to enhance the neutralization potency and breadth of VRC01, an antibody that targets the CD4 binding site of the HIV-1 envelope. One variant, VRC07-523, was 5- to 8-fold more potent than VRC01, neutralized 96% of viruses tested, and displayed minimal autoreactivity. To compare its protective efficacy to that of VRC01 in vivo, we performed a series of simian-human immunodeficiency virus (SHIV) challenge experiments in nonhuman primates and calculated the doses of VRC07-523 and VRC01 that provide 50% protection (EC50). VRC07-523 prevented infection in NHPs at a 5-fold lower concentration than VRC01. These results suggest that increased neutralization potency in vitro correlates with improved protection against infection in vivo, documenting the improved functional efficacy of VRC07-523 and its potential clinical relevance for protecting against HIV-1 infection in humans. IMPORTANCE: In the absence of an effective HIV-1 vaccine, alternative strategies are needed to block HIV-1 transmission. Direct administration of HIV-1-neutralizing antibodies may be able to prevent HIV-1 infections in humans. This approach could be especially useful in individuals at high risk for contracting HIV-1 and could be used together with antiretroviral drugs to prevent infection. To optimize the chance of success, such antibodies can be modified to improve their potency, breadth, and in vivo half-life. Here, knowledge of the structure of a potent neutralizing antibody, VRC01, that targets the CD4-binding site of the HIV-1 envelope protein was used to engineer a next-generation antibody with 5- to 8-fold increased potency in vitro. When administered to nonhuman primates, this antibody conferred protection at a 5-fold lower concentration than the original antibody. Our studies demonstrate an important correlation between in vitro assays used to evaluate the therapeutic potential of antibodies and their in vivo effectiveness.


Subject(s)
Antibodies, Neutralizing/immunology , HIV Antibodies/immunology , HIV-1/immunology , Immunization, Passive/methods , Simian Acquired Immunodeficiency Syndrome/prevention & control , Animals , Antibodies, Neutralizing/administration & dosage , Antibodies, Neutralizing/genetics , HIV Antibodies/administration & dosage , HIV Antibodies/genetics , HIV-1/genetics , Macaca mulatta , Male , Molecular Sequence Data , Sequence Analysis, DNA
13.
Bioinformatics ; 30(12): i255-i263, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-24931991

ABSTRACT

MOTIVATION: Structure-based computational protein design (SCPR) is an important topic in protein engineering. Under the assumption of a rigid backbone and a finite set of discrete conformations of side-chains, various methods have been proposed to address this problem. A popular method is to combine the dead-end elimination (DEE) and A* tree search algorithms, which provably finds the global minimum energy conformation (GMEC) solution. RESULTS: In this article, we improve the efficiency of computing A* heuristic functions for protein design and propose a variant of A* algorithm in which the search process can be performed on a single GPU in a massively parallel fashion. In addition, we make some efforts to address the memory exceeding problem in A* search. As a result, our enhancements can achieve a significant speedup of the A*-based protein design algorithm by four orders of magnitude on large-scale test data through pre-computation and parallelization, while still maintaining an acceptable memory overhead. We also show that our parallel A* search algorithm could be successfully combined with iMinDEE, a state-of-the-art DEE criterion, for rotamer pruning to further improve SCPR with the consideration of continuous side-chain flexibility. AVAILABILITY: Our software is available and distributed open-source under the GNU Lesser General License Version 2.1 (GNU, February 1999). The source code can be downloaded from http://www.cs.duke.edu/donaldlab/osprey.php or http://iiis.tsinghua.edu.cn/∼compbio/software.html.


Subject(s)
Algorithms , Protein Conformation , Protein Engineering/methods , Amino Acid Sequence , Computational Biology/methods
14.
Commun ACM ; 62(10): 76-84, 2019 Oct.
Article in English | MEDLINE | ID: mdl-31607753
15.
Protein Eng Des Sel ; 372024 Jan 29.
Article in English | MEDLINE | ID: mdl-38757573

ABSTRACT

With over 270 unique occurrences in the human genome, peptide-recognizing PDZ domains play a central role in modulating polarization, signaling, and trafficking pathways. Mutations in PDZ domains lead to diseases such as cancer and cystic fibrosis, making PDZ domains attractive targets for therapeutic intervention. D-peptide inhibitors offer unique advantages as therapeutics, including increased metabolic stability and low immunogenicity. Here, we introduce DexDesign, a novel OSPREY-based algorithm for computationally designing de novo D-peptide inhibitors. DexDesign leverages three novel techniques that are broadly applicable to computational protein design: the Minimum Flexible Set, K*-based Mutational Scan, and Inverse Alanine Scan. We apply these techniques and DexDesign to generate novel D-peptide inhibitors of two biomedically important PDZ domain targets: CAL and MAST2. We introduce a framework for analyzing de novo peptides-evaluation along a replication/restitution axis-and apply it to the DexDesign-generated D-peptides. Notably, the peptides we generated are predicted to bind their targets tighter than their targets' endogenous ligands, validating the peptides' potential as lead inhibitors. We also provide an implementation of DexDesign in the free and open source computational protein design software OSPREY.


Subject(s)
Algorithms , Peptides , Peptides/chemistry , Peptides/pharmacology , Humans , Drug Design , PDZ Domains
16.
J Comput Biol ; 2024 Oct 04.
Article in English | MEDLINE | ID: mdl-39364612

ABSTRACT

D-peptides, the mirror image of canonical L-peptides, offer numerous biological advantages that make them effective therapeutics. This article details how to use DexDesign, the newest OSPREY-based algorithm, for designing these D-peptides de novo. OSPREY physics-based models precisely mimic energy-equivariant reflection operations, enabling the generation of D-peptide scaffolds from L-peptide templates. Due to the scarcity of D-peptide:L-protein structural data, DexDesign calls a geometric hashing algorithm, Method of Accelerated Search for Tertiary Ensemble Representatives, as a subroutine to produce a synthetic structural dataset. DexDesign enables mixed-chirality designs with a new user interface and also reduces the conformation and sequence search space using three new design techniques: Minimum Flexible Set, Inverse Alanine Scanning, and K*-based Mutational Scanning.

17.
bioRxiv ; 2024 Feb 14.
Article in English | MEDLINE | ID: mdl-38405797

ABSTRACT

With over 270 unique occurrences in the human genome, peptide-recognizing PDZ domains play a central role in modulating polarization, signaling, and trafficking pathways. Mutations in PDZ domains lead to diseases such as cancer and cystic fibrosis, making PDZ domains attractive targets for therapeutic intervention. D-peptide inhibitors offer unique advantages as therapeutics, including increased metabolic stability and low immunogenicity. Here, we introduce DexDesign, a novel OSPREY-based algorithm for computationally designing de novo D-peptide inhibitors. DexDesign leverages three novel techniques that are broadly applicable to computational protein design: the Minimum Flexible Set, K*-based Mutational Scan, and Inverse Alanine Scan, which enable exponential reductions in the size of the peptide sequence search space. We apply these techniques and DexDesign to generate novel D-peptide inhibitors of two biomedically important PDZ domain targets: CAL and MAST2. We introduce a new framework for analyzing de novo peptides-evaluation along a replication/restitution axis-and apply it to the DexDesign-generated D-peptides. Notably, the peptides we generated are predicted to bind their targets tighter than their targets' endogenous ligands, validating the peptides' potential as lead therapeutic candidates. We provide an implementation of DexDesign in the free and open source computational protein design software OSPREY.

18.
Proteins ; 81(1): 18-39, 2013 Jan.
Article in English | MEDLINE | ID: mdl-22821798

ABSTRACT

Computational protein and drug design generally require accurate modeling of protein conformations. This modeling typically starts with an experimentally determined protein structure and considers possible conformational changes due to mutations or new ligands. The DEE/A* algorithm provably finds the global minimum-energy conformation (GMEC) of a protein assuming that the backbone does not move and the sidechains take on conformations from a set of discrete, experimentally observed conformations called rotamers. DEE/A* can efficiently find the overall GMEC for exponentially many mutant sequences. Previous improvements to DEE/A* include modeling ensembles of sidechain conformations and either continuous sidechain or backbone flexibility. We present a new algorithm, DEEPer (Dead-End Elimination with Perturbations), that combines these advantages and can also handle much more extensive backbone flexibility and backbone ensembles. DEEPer provably finds the GMEC or, if desired by the user, all conformations and sequences within a specified energy window of the GMEC. It includes the new abilities to handle arbitrarily large backbone perturbations and to generate ensembles of backbone conformations. It also incorporates the shear, an experimentally observed local backbone motion never before used in design. Additionally, we derive a new method to accelerate DEE/A*-based calculations, indirect pruning, that is particularly useful for DEEPer. In 67 benchmark tests on 64 proteins, DEEPer consistently identified lower-energy conformations than previous methods did, indicating more accurate modeling. Additional tests demonstrated its ability to incorporate larger, experimentally observed backbone conformational changes and to model realistic conformational ensembles. These capabilities provide significant advantages for modeling protein mutations and protein-ligand interactions.


Subject(s)
Algorithms , Computational Biology/methods , Proteins/chemistry , Databases, Protein , Entropy , Models, Molecular , Protein Conformation , Software
19.
PLoS Comput Biol ; 8(1): e1002335, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22279426

ABSTRACT

UNLABELLED: Optimizing amino acid conformation and identity is a central problem in computational protein design. Protein design algorithms must allow realistic protein flexibility to occur during this optimization, or they may fail to find the best sequence with the lowest energy. Most design algorithms implement side-chain flexibility by allowing the side chains to move between a small set of discrete, low-energy states, which we call rigid rotamers. In this work we show that allowing continuous side-chain flexibility (which we call continuous rotamers) greatly improves protein flexibility modeling. We present a large-scale study that compares the sequences and best energy conformations in 69 protein-core redesigns using a rigid-rotamer model versus a continuous-rotamer model. We show that in nearly all of our redesigns the sequence found by the continuous-rotamer model is different and has a lower energy than the one found by the rigid-rotamer model. Moreover, the sequences found by the continuous-rotamer model are more similar to the native sequences. We then show that the seemingly easy solution of sampling more rigid rotamers within the continuous region is not a practical alternative to a continuous-rotamer model: at computationally feasible resolutions, using more rigid rotamers was never better than a continuous-rotamer model and almost always resulted in higher energies. Finally, we present a new protein design algorithm based on the dead-end elimination (DEE) algorithm, which we call iMinDEE, that makes the use of continuous rotamers feasible in larger systems. iMinDEE guarantees finding the optimal answer while pruning the search space with close to the same efficiency of DEE. AVAILABILITY: Software is available under the Lesser GNU Public License v3. Contact the authors for source code.


Subject(s)
Protein Engineering/methods , Proteins/chemistry , Algorithms , Amino Acids/chemistry , Computational Biology/methods , Computer Simulation , Databases, Protein , Models, Molecular , Protein Conformation , Software , Thermodynamics
20.
PLoS Comput Biol ; 8(4): e1002477, 2012.
Article in English | MEDLINE | ID: mdl-22532795

ABSTRACT

The cystic fibrosis transmembrane conductance regulator (CFTR) is an epithelial chloride channel mutated in patients with cystic fibrosis (CF). The most prevalent CFTR mutation, ΔF508, blocks folding in the endoplasmic reticulum. Recent work has shown that some ΔF508-CFTR channel activity can be recovered by pharmaceutical modulators ("potentiators" and "correctors"), but ΔF508-CFTR can still be rapidly degraded via a lysosomal pathway involving the CFTR-associated ligand (CAL), which binds CFTR via a PDZ interaction domain. We present a study that goes from theory, to new structure-based computational design algorithms, to computational predictions, to biochemical testing and ultimately to epithelial-cell validation of novel, effective CAL PDZ inhibitors (called "stabilizers") that rescue ΔF508-CFTR activity. To design the "stabilizers", we extended our structural ensemble-based computational protein redesign algorithm K* to encompass protein-protein and protein-peptide interactions. The computational predictions achieved high accuracy: all of the top-predicted peptide inhibitors bound well to CAL. Furthermore, when compared to state-of-the-art CAL inhibitors, our design methodology achieved higher affinity and increased binding efficiency. The designed inhibitor with the highest affinity for CAL (kCAL01) binds six-fold more tightly than the previous best hexamer (iCAL35), and 170-fold more tightly than the CFTR C-terminus. We show that kCAL01 has physiological activity and can rescue chloride efflux in CF patient-derived airway epithelial cells. Since stabilizers address a different cellular CF defect from potentiators and correctors, our inhibitors provide an additional therapeutic pathway that can be used in conjunction with current methods.


Subject(s)
Carrier Proteins/antagonists & inhibitors , Carrier Proteins/chemistry , Cystic Fibrosis Transmembrane Conductance Regulator/antagonists & inhibitors , Cystic Fibrosis Transmembrane Conductance Regulator/ultrastructure , Drug Design , Membrane Proteins/antagonists & inhibitors , Membrane Proteins/chemistry , PDZ Domains , Peptides/chemistry , Adaptor Proteins, Signal Transducing , Binding Sites , Computer Simulation , Golgi Matrix Proteins , Membrane Transport Proteins , Models, Chemical , Models, Molecular , Protein Binding
SELECTION OF CITATIONS
SEARCH DETAIL