Your browser doesn't support javascript.
loading
A combinatorially complete epistatic fitness landscape in an enzyme active site.
Johnston, Kadina E; Almhjell, Patrick J; Watkins-Dulaney, Ella J; Liu, Grace; Porter, Nicholas J; Yang, Jason; Arnold, Frances H.
Afiliação
  • Johnston KE; Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA 91125.
  • Almhjell PJ; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125.
  • Watkins-Dulaney EJ; Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA 91125.
  • Liu G; Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA 91125.
  • Porter NJ; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125.
  • Yang J; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125.
  • Arnold FH; Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA 91125.
Proc Natl Acad Sci U S A ; 121(32): e2400439121, 2024 Aug 06.
Article em En | MEDLINE | ID: mdl-39074291
ABSTRACT
Protein engineering often targets binding pockets or active sites which are enriched in epistasis-nonadditive interactions between amino acid substitutions-and where the combined effects of multiple single substitutions are difficult to predict. Few existing sequence-fitness datasets capture epistasis at large scale, especially for enzyme catalysis, limiting the development and assessment of model-guided enzyme engineering approaches. We present here a combinatorially complete, 160,000-variant fitness landscape across four residues in the active site of an enzyme. Assaying the native reaction of a thermostable ß-subunit of tryptophan synthase (TrpB) in a nonnative environment yielded a landscape characterized by significant epistasis and many local optima. These effects prevent simulated directed evolution approaches from efficiently reaching the global optimum. There is nonetheless wide variability in the effectiveness of different directed evolution approaches, which together provide experimental benchmarks for computational and machine learning workflows. The most-fit TrpB variants contain a substitution that is nearly absent in natural TrpB sequences-a result that conservation-based predictions would not capture. Thus, although fitness prediction using evolutionary data can enrich in more-active variants, these approaches struggle to identify and differentiate among the most-active variants, even for this near-native function. Overall, this work presents a large-scale testing ground for model-guided enzyme engineering and suggests that efficient navigation of epistatic fitness landscapes can be improved by advances in both machine learning and physical modeling.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Triptofano Sintase / Domínio Catalítico / Epistasia Genética Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Triptofano Sintase / Domínio Catalítico / Epistasia Genética Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2024 Tipo de documento: Article