Your browser doesn't support javascript.
loading
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.
Dieckhaus, Henry; Brocidiacono, Michael; Randolph, Nicholas Z; Kuhlman, Brian.
Affiliation
  • Dieckhaus H; Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, NC 27599.
  • Brocidiacono M; Division of Chemical Biology and Medicinal Chemistry, University of North Carolina Eshelman School of Pharmacy, Chapel Hill, NC 27599.
  • Randolph NZ; Division of Chemical Biology and Medicinal Chemistry, University of North Carolina Eshelman School of Pharmacy, Chapel Hill, NC 27599.
  • Kuhlman B; Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, NC 27599.
Proc Natl Acad Sci U S A ; 121(6): e2314853121, 2024 Feb 06.
Article in En | MEDLINE | ID: mdl-38285937
ABSTRACT
Amino acid mutations that lower a protein's thermodynamic stability are implicated in numerous diseases, and engineered proteins with enhanced stability can be important in research and medicine. Computational methods for predicting how mutations perturb protein stability are, therefore, of great interest. Despite recent advancements in protein design using deep learning, in silico prediction of stability changes has remained challenging, in part due to a lack of large, high-quality training datasets for model development. Here, we describe ThermoMPNN, a deep neural network trained to predict stability changes for protein point mutations given an initial structure. In doing so, we demonstrate the utility of a recently released megascale stability dataset for training a robust stability model. We also employ transfer learning to leverage a second, larger dataset by using learned features extracted from ProteinMPNN, a deep neural network trained to predict a protein's amino acid sequence given its three-dimensional structure. We show that our method achieves state-of-the-art performance on established benchmark datasets using a lightweight model architecture that allows for rapid, scalable predictions. Finally, we make ThermoMPNN readily available as a tool for stability prediction and design.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Neural Networks, Computer Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Proc Natl Acad Sci U S A / Proc. Natl. Acad. Sci. U. S. A / Proceedings of the national academy of sciences of the United States of America Year: 2024 Document type: Article Country of publication: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Neural Networks, Computer Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Proc Natl Acad Sci U S A / Proc. Natl. Acad. Sci. U. S. A / Proceedings of the national academy of sciences of the United States of America Year: 2024 Document type: Article Country of publication: United States