Your browser doesn't support javascript.
loading
Minimization-Aware Recursive K*: A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape.
Jou, Jonathan D; Holt, Graham T; Lowegard, Anna U; Donald, Bruce R.
Afiliação
  • Jou JD; Department of Computer Science, Duke University, Durham, North Carolina.
  • Holt GT; Department of Computer Science, Duke University, Durham, North Carolina.
  • Lowegard AU; Computational Biology and Bioinformatics Program, Duke University, Durham, North Carolina.
  • Donald BR; Department of Computer Science, Duke University, Durham, North Carolina.
J Comput Biol ; 27(4): 550-564, 2020 04.
Article em En | MEDLINE | ID: mdl-31855059
ABSTRACT
Protein design algorithms that model continuous sidechain flexibility and conformational ensembles better approximate the in vitro and in vivo behavior of proteins. The previous state of the art, iMinDEE-A*-K*, computes provable ɛ-approximations to partition functions of protein states (e.g., bound vs. unbound) by computing provable, admissible pairwise-minimized energy lower bounds on protein conformations, and using the A* enumeration algorithm to return a gap-free list of lowest-energy conformations. iMinDEE-A*-K* runs in time sublinear in the number of conformations, but can be trapped in loosely-bounded, low-energy conformational wells containing many conformations with highly similar energies. That is, iMinDEE-A*-K* is unable to exploit the correlation between protein conformation and energy similar conformations often have similar energy. We introduce two new concepts that exploit this correlation Minimization-Aware Enumeration and Recursive K*. We combine these two insights into a novel algorithm, Minimization-Aware Recursive K* (MARK*), which tightens bounds not on single conformations, but instead on distinct regions of the conformation space. We compare the performance of iMinDEE-A*-K* versus MARK* by running the Branch and Bound over K* (BBK*) algorithm, which provably returns sequences in order of decreasing K* score, using either iMinDEE-A*-K* or MARK* to approximate partition functions. We show on 200 design problems that MARK* not only enumerates and minimizes vastly fewer conformations than the previous state of the art, but also runs up to 2 orders of magnitude faster. Finally, we show that MARK* not only efficiently approximates the partition function, but also provably approximates the energy landscape. To our knowledge, MARK* is the first algorithm to do so. We use MARK* to analyze the change in energy landscape of the bound and unbound states of an HIV-1 capsid protein C-terminal domain in complex with a camelid VHH, and measure the change in conformational entropy induced by binding. Thus, MARK* both accelerates existing designs and offers new capabilities not possible with previous algorithms.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Conformação Proteica / Software / Proteínas / Biologia Computacional Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Conformação Proteica / Software / Proteínas / Biologia Computacional Idioma: En Ano de publicação: 2020 Tipo de documento: Article