Your browser doesn't support javascript.
loading
ECNet is an evolutionary context-integrated deep learning framework for protein engineering.
Luo, Yunan; Jiang, Guangde; Yu, Tianhao; Liu, Yang; Vo, Lam; Ding, Hantian; Su, Yufeng; Qian, Wesley Wei; Zhao, Huimin; Peng, Jian.
Afiliação
  • Luo Y; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Jiang G; Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Yu T; Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Liu Y; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Vo L; Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Ding H; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Su Y; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Qian WW; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA.
  • Zhao H; Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA. zhao5@illinois.edu.
  • Peng J; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA. jianpeng@illinois.edu.
Nat Commun ; 12(1): 5743, 2021 09 30.
Article em En | MEDLINE | ID: mdl-34593817
ABSTRACT
Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integrated neural network), a deep-learning algorithm that exploits evolutionary contexts to predict functional fitness for protein engineering. This algorithm integrates local evolutionary context from homologous sequences that explicitly model residue-residue epistasis for the protein of interest with the global evolutionary context that encodes rich semantic and structural features from the enormous protein sequence universe. As such, it enables accurate mapping from sequence to function and provides generalization from low-order mutants to higher-order mutants. We show that ECNet predicts the sequence-function relationship more accurately as compared to existing machine learning algorithms by using ~50 deep mutational scanning and random mutagenesis datasets. Moreover, we used ECNet to guide the engineering of TEM-1 ß-lactamase and identified variants with improved ampicillin resistance with high success rates.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Engenharia de Proteínas / Evolução Molecular / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Revista: Nat Commun Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Engenharia de Proteínas / Evolução Molecular / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Revista: Nat Commun Ano de publicação: 2021 Tipo de documento: Article