Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
J Comput Chem ; 37(12): 1048-58, 2016 May 05.
Artigo em Inglês | MEDLINE | ID: mdl-26833706

RESUMO

One of the main challenges in computational protein design (CPD) is the huge size of the protein sequence and conformational space that has to be computationally explored. Recently, we showed that state-of-the-art combinatorial optimization technologies based on Cost Function Network (CFN) processing allow speeding up provable rigid backbone protein design methods by several orders of magnitudes. Building up on this, we improved and injected CFN technology into the well-established CPD package Osprey to allow all Osprey CPD algorithms to benefit from associated speedups. Because Osprey fundamentally relies on the ability of A* to produce conformations in increasing order of energy, we defined new A* strategies combining CFN lower bounds, with new side-chain positioning-based branching scheme. Beyond the speedups obtained in the new A*-CFN combination, this novel branching scheme enables a much faster enumeration of suboptimal sequences, far beyond what is reachable without it. Together with the immediate and important speedups provided by CFN technology, these developments directly benefit to all the algorithms that previously relied on the DEE/ A* combination inside Osprey* and make it possible to solve larger CPD problems with provable algorithms.


Assuntos
Algoritmos , Biologia Computacional , Proteínas/química , Sequência de Aminoácidos , Desenho de Fármacos , Conformação Proteica
2.
Proteins ; 83(6): 1151-64, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25846627

RESUMO

Flexibility and dynamics are important for protein function and a protein's ability to accommodate amino acid substitutions. However, when computational protein design algorithms search over protein structures, the allowed flexibility is often reduced to a relatively small set of discrete side-chain and backbone conformations. While simplifications in scoring functions and protein flexibility are currently necessary to computationally search the vast protein sequence and conformational space, a rigid representation of a protein causes the search to become brittle and miss low-energy structures. Continuous rotamers more closely represent the allowed movement of a side chain within its torsional well and have been successfully incorporated into the protein design framework to design biomedically relevant protein systems. The use of continuous rotamers in protein design enables algorithms to search a larger conformational space than previously possible, but adds additional complexity to the design search. To design large, complex systems with continuous rotamers, new algorithms are needed to increase the efficiency of the search. We present two methods, PartCR and HOT, that greatly increase the speed and efficiency of protein design with continuous rotamers. These methods specifically target the large errors in energetic terms that are used to bound pairwise energies during the design search. By tightening the energy bounds, additional pruning of the conformation space can be achieved, and the number of conformations that must be enumerated to find the global minimum energy conformation is greatly reduced.


Assuntos
Biologia Computacional/métodos , Conformação Proteica , Proteínas/química , Proteínas/metabolismo , Algoritmos , Sequência de Aminoácidos , Modelos Moleculares , Engenharia de Proteínas
3.
Proteins ; 83(10): 1859-1877, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26235965

RESUMO

Despite significant successes in structure-based computational protein design in recent years, protein design algorithms must be improved to increase the biological accuracy of new designs. Protein design algorithms search through an exponential number of protein conformations, protein ensembles, and amino acid sequences in an attempt to find globally optimal structures with a desired biological function. To improve the biological accuracy of protein designs, it is necessary to increase both the amount of protein flexibility allowed during the search and the overall size of the design, while guaranteeing that the lowest-energy structures and sequences are found. DEE/A*-based algorithms are the most prevalent provable algorithms in the field of protein design and can provably enumerate a gap-free list of low-energy protein conformations, which is necessary for ensemble-based algorithms that predict protein binding. We present two classes of algorithmic improvements to the A* algorithm that greatly increase the efficiency of A*. First, we analyze the effect of ordering the expansion of mutable residue positions within the A* tree and present a dynamic residue ordering that reduces the number of A* nodes that must be visited during the search. Second, we propose new methods to improve the conformational bounds used to estimate the energies of partial conformations during the A* search. The residue ordering techniques and improved bounds can be combined for additional increases in A* efficiency. Our enhancements enable all A*-based methods to more fully search protein conformation space, which will ultimately improve the accuracy of complex biomedically relevant designs.


Assuntos
Biologia Computacional/métodos , Engenharia de Proteínas/métodos , Análise de Sequência de Proteína/métodos , Algoritmos , Sequência de Aminoácidos , Conformação Proteica , Software
4.
J Virol ; 88(21): 12669-82, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-25142607

RESUMO

UNLABELLED: Over the past 5 years, a new generation of highly potent and broadly neutralizing HIV-1 antibodies has been identified. These antibodies can protect against lentiviral infection in nonhuman primates (NHPs), suggesting that passive antibody transfer would prevent HIV-1 transmission in humans. To increase the protective efficacy of such monoclonal antibodies, we employed next-generation sequencing, computational bioinformatics, and structure-guided design to enhance the neutralization potency and breadth of VRC01, an antibody that targets the CD4 binding site of the HIV-1 envelope. One variant, VRC07-523, was 5- to 8-fold more potent than VRC01, neutralized 96% of viruses tested, and displayed minimal autoreactivity. To compare its protective efficacy to that of VRC01 in vivo, we performed a series of simian-human immunodeficiency virus (SHIV) challenge experiments in nonhuman primates and calculated the doses of VRC07-523 and VRC01 that provide 50% protection (EC50). VRC07-523 prevented infection in NHPs at a 5-fold lower concentration than VRC01. These results suggest that increased neutralization potency in vitro correlates with improved protection against infection in vivo, documenting the improved functional efficacy of VRC07-523 and its potential clinical relevance for protecting against HIV-1 infection in humans. IMPORTANCE: In the absence of an effective HIV-1 vaccine, alternative strategies are needed to block HIV-1 transmission. Direct administration of HIV-1-neutralizing antibodies may be able to prevent HIV-1 infections in humans. This approach could be especially useful in individuals at high risk for contracting HIV-1 and could be used together with antiretroviral drugs to prevent infection. To optimize the chance of success, such antibodies can be modified to improve their potency, breadth, and in vivo half-life. Here, knowledge of the structure of a potent neutralizing antibody, VRC01, that targets the CD4-binding site of the HIV-1 envelope protein was used to engineer a next-generation antibody with 5- to 8-fold increased potency in vitro. When administered to nonhuman primates, this antibody conferred protection at a 5-fold lower concentration than the original antibody. Our studies demonstrate an important correlation between in vitro assays used to evaluate the therapeutic potential of antibodies and their in vivo effectiveness.


Assuntos
Anticorpos Neutralizantes/imunologia , Anticorpos Anti-HIV/imunologia , HIV-1/imunologia , Imunização Passiva/métodos , Síndrome de Imunodeficiência Adquirida dos Símios/prevenção & controle , Animais , Anticorpos Neutralizantes/administração & dosagem , Anticorpos Neutralizantes/genética , Anticorpos Anti-HIV/administração & dosagem , Anticorpos Anti-HIV/genética , HIV-1/genética , Macaca mulatta , Masculino , Dados de Sequência Molecular , Análise de Sequência de DNA
5.
PLoS Comput Biol ; 8(1): e1002335, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22279426

RESUMO

UNLABELLED: Optimizing amino acid conformation and identity is a central problem in computational protein design. Protein design algorithms must allow realistic protein flexibility to occur during this optimization, or they may fail to find the best sequence with the lowest energy. Most design algorithms implement side-chain flexibility by allowing the side chains to move between a small set of discrete, low-energy states, which we call rigid rotamers. In this work we show that allowing continuous side-chain flexibility (which we call continuous rotamers) greatly improves protein flexibility modeling. We present a large-scale study that compares the sequences and best energy conformations in 69 protein-core redesigns using a rigid-rotamer model versus a continuous-rotamer model. We show that in nearly all of our redesigns the sequence found by the continuous-rotamer model is different and has a lower energy than the one found by the rigid-rotamer model. Moreover, the sequences found by the continuous-rotamer model are more similar to the native sequences. We then show that the seemingly easy solution of sampling more rigid rotamers within the continuous region is not a practical alternative to a continuous-rotamer model: at computationally feasible resolutions, using more rigid rotamers was never better than a continuous-rotamer model and almost always resulted in higher energies. Finally, we present a new protein design algorithm based on the dead-end elimination (DEE) algorithm, which we call iMinDEE, that makes the use of continuous rotamers feasible in larger systems. iMinDEE guarantees finding the optimal answer while pruning the search space with close to the same efficiency of DEE. AVAILABILITY: Software is available under the Lesser GNU Public License v3. Contact the authors for source code.


Assuntos
Engenharia de Proteínas/métodos , Proteínas/química , Algoritmos , Aminoácidos/química , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas , Modelos Moleculares , Conformação Proteica , Software , Termodinâmica
6.
PLoS Comput Biol ; 8(4): e1002477, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22532795

RESUMO

The cystic fibrosis transmembrane conductance regulator (CFTR) is an epithelial chloride channel mutated in patients with cystic fibrosis (CF). The most prevalent CFTR mutation, ΔF508, blocks folding in the endoplasmic reticulum. Recent work has shown that some ΔF508-CFTR channel activity can be recovered by pharmaceutical modulators ("potentiators" and "correctors"), but ΔF508-CFTR can still be rapidly degraded via a lysosomal pathway involving the CFTR-associated ligand (CAL), which binds CFTR via a PDZ interaction domain. We present a study that goes from theory, to new structure-based computational design algorithms, to computational predictions, to biochemical testing and ultimately to epithelial-cell validation of novel, effective CAL PDZ inhibitors (called "stabilizers") that rescue ΔF508-CFTR activity. To design the "stabilizers", we extended our structural ensemble-based computational protein redesign algorithm K* to encompass protein-protein and protein-peptide interactions. The computational predictions achieved high accuracy: all of the top-predicted peptide inhibitors bound well to CAL. Furthermore, when compared to state-of-the-art CAL inhibitors, our design methodology achieved higher affinity and increased binding efficiency. The designed inhibitor with the highest affinity for CAL (kCAL01) binds six-fold more tightly than the previous best hexamer (iCAL35), and 170-fold more tightly than the CFTR C-terminus. We show that kCAL01 has physiological activity and can rescue chloride efflux in CF patient-derived airway epithelial cells. Since stabilizers address a different cellular CF defect from potentiators and correctors, our inhibitors provide an additional therapeutic pathway that can be used in conjunction with current methods.


Assuntos
Proteínas de Transporte/antagonistas & inibidores , Proteínas de Transporte/química , Regulador de Condutância Transmembrana em Fibrose Cística/antagonistas & inibidores , Regulador de Condutância Transmembrana em Fibrose Cística/ultraestrutura , Desenho de Fármacos , Proteínas de Membrana/antagonistas & inibidores , Proteínas de Membrana/química , Domínios PDZ , Peptídeos/química , Proteínas Adaptadoras de Transdução de Sinal , Sítios de Ligação , Simulação por Computador , Proteínas da Matriz do Complexo de Golgi , Proteínas de Membrana Transportadoras , Modelos Químicos , Modelos Moleculares , Ligação Proteica
8.
Methods Enzymol ; 523: 87-107, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23422427

RESUMO

UNLABELLED: We have developed a suite of protein redesign algorithms that improves realistic in silico modeling of proteins. These algorithms are based on three characteristics that make them unique: (1) improved flexibility of the protein backbone, protein side-chains, and ligand to accurately capture the conformational changes that are induced by mutations to the protein sequence; (2) modeling of proteins and ligands as ensembles of low-energy structures to better approximate binding affinity; and (3) a globally optimal protein design search, guaranteeing that the computational predictions are optimal with respect to the input model. Here, we illustrate the importance of these three characteristics. We then describe OSPREY, a protein redesign suite that implements our protein design algorithms. OSPREY has been used prospectively, with experimental validation, in several biomedically relevant settings. We show in detail how OSPREY has been used to predict resistance mutations and explain why improved flexibility, ensembles, and provability are essential for this application. AVAILABILITY: OSPREY is free and open source under a Lesser GPL license. The latest version is OSPREY 2.0. The program, user manual, and source code are available at www.cs.duke.edu/donaldlab/software.php. CONTACT: osprey@cs.duke.edu.


Assuntos
Algoritmos , Proteínas/química , Estrutura Secundária de Proteína , Análise de Sequência de Proteína , Software
9.
J Comput Biol ; 18(11): 1661-79, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21970619

RESUMO

A major bottleneck in protein structure determination via nuclear magnetic resonance (NMR) is the lengthy and laborious process of assigning resonances and nuclear Overhauser effect (NOE) cross peaks. Recent studies have shown that accurate backbone folds can be determined using sparse NMR data, such as residual dipolar couplings (RDCs) or backbone chemical shifts. This opens a question of whether we can also determine the accurate protein side-chain conformations using sparse or unassigned NMR data. We attack this question by using unassigned nuclear Overhauser effect spectroscopy (NOESY) data, which records the through-space dipolar interactions between protons nearby in three-dimensional (3D) space. We propose a Bayesian approach with a Markov random field (MRF) model to integrate the likelihood function derived from observed experimental data, with prior information (i.e., empirical molecular mechanics energies) about the protein structures. We unify the side-chain structure prediction problem with the side-chain structure determination problem using unassigned NMR data, and apply the deterministic dead-end elimination (DEE) and A* search algorithms to provably find the global optimum solution that maximizes the posterior probability. We employ a Hausdorff-based measure to derive the likelihood of a rotamer or a pairwise rotamer interaction from unassigned NOESY data. In addition, we apply a systematic and rigorous approach to estimate the experimental noise in NMR data, which also determines the weighting factor of the data term in the scoring function derived from the Bayesian framework. We tested our approach on real NMR data of three proteins: the FF Domain 2 of human transcription elongation factor CA150 (FF2), the B1 domain of Protein G (GB1), and human ubiquitin. The promising results indicate that our algorithm can be applied in high-resolution protein structure determination. Since our approach does not require any NOE assignment, it can accelerate the NMR structure determination process.


Assuntos
Aminoácidos/química , Teorema de Bayes , Simulação por Computador , Modelos Moleculares , Algoritmos , Sequência de Aminoácidos , Proteínas de Bactérias/química , Humanos , Funções Verossimilhança , Espectroscopia de Ressonância Magnética , Cadeias de Markov , Conformação Proteica , Estrutura Terciária de Proteína , Razão Sinal-Ruído , Fatores de Elongação da Transcrição/química , Ubiquitina/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA