DeepFold: enhancing protein structure prediction through optimized loss functions, improved template features, and re-optimized energy function.
Lee, Jae-Won; Won, Jong-Hyun; Jeon, Seonggwang; Choo, Yujin; Yeon, Yubin; Oh, Jin-Seon; Kim, Minsoo; Kim, SeonHwa; Joung, InSuk; Jang, Cheongjae; Lee, Sung Jong; Kim, Tae Hyun; Jin, Kyong Hwan; Song, Giltae; Kim, Eun-Sol; Yoo, Jejoong; Paek, Eunok; Noh, Yung-Kyun; Joo, Keehyoung.
Affiliation
  • Lee JW; Department of Computer Science, Hanyang University, Seoul 04763, Korea.
  • Won JH; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Jeon S; Department of Computer Science, Hanyang University, Seoul 04763, Korea.
  • Choo Y; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Yeon Y; Department of Computer Science, Hanyang University, Seoul 04763, Korea.
  • Oh JS; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Kim M; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Kim S; Department of Artificial Intelligence, Hanyang University, Seoul 04763, Korea.
  • Joung I; Department of Computer Science, Hanyang University, Seoul 04763, Korea.
  • Jang C; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Lee SJ; Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.
  • Kim TH; Department of Artificial Intelligence, Hanyang University, Seoul 04763, Korea.
  • Jin KH; Department of Physics, Sungkyunkwan University, Suwon 16419, Korea.
  • Song G; School of Electrical Engineering, Korea University, Seoul 02841, Korea.
  • Kim ES; Standigm Inc., Seoul 06234, Korea.
  • Yoo J; Artificial Intelligence Institute, Hanyang University, Seoul 04763, Korea.
  • Paek E; Basic Science Research Institute, Changwon National University, Changwon 51140, Korea.
  • Noh YK; Department of Computer Science, Hanyang University, Seoul 04763, Korea.
  • Joo K; School of Electrical Engineering, Korea University, Seoul 02841, Korea.
Bioinformatics; 39(12), 2023 Dec 01.
Article in En | MEDLINE | ID: mdl-37995286
MOTIVATION: Predicting protein structures with high accuracy is a critical challenge for the broad community of life sciences and industry. Despite the progress made by deep neural networks such as AlphaFold2, the quality of detailed structures, such as side-chains, still needs to improve alongside that of protein backbone structures.

RESULTS: Building upon the successes of AlphaFold2, we modified the losses for side-chain torsion angles and frame aligned point error, added loss functions for side-chain confidence and secondary structure prediction, and replaced template feature generation with a new alignment method based on conditional random fields. We also performed re-optimization by conformational space annealing, using a molecular mechanics energy function that integrates the potential energies obtained from distogram and side-chain prediction. In the CASP15 blind test for single protein and domain modeling (109 domains), DeepFold ranked fourth among 132 groups, with improvements in the details of the structure in terms of backbone, side-chain, and MolProbity. In terms of protein backbone accuracy, DeepFold achieved a median GDT-TS score of 88.64, compared with 85.88 for AlphaFold2. For TBM-easy/hard targets, DeepFold ranked at the top based on Z-scores for GDT-TS. This shows its practical value to the structural biology community, which demands highly accurate structures. In addition, a thorough analysis of 55 domains from 39 targets with publicly available structures indicates that DeepFold shows superior side-chain accuracy and MolProbity scores among the top-performing groups.

AVAILABILITY AND IMPLEMENTATION: DeepFold tools are open-source software available at https://github.com/newtonjoo/deepfold.
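For readers unfamiliar with the GDT-TS figures quoted in the abstract, the following is a minimal sketch of how such a score is commonly defined: the average, over distance cutoffs of 1, 2, 4, and 8 Å, of the percentage of Cα atoms in the model that lie within the cutoff of their position in the reference structure. It is not DeepFold's or CASP's evaluation code. It assumes the two structures are already superposed, whereas the official assessment searches over many superpositions to maximize each percentage, so this simplified version can only underestimate the official score. The function name `gdt_ts` and the toy coordinates are hypothetical.

```python
import math


def gdt_ts(pred_ca, ref_ca, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT-TS (0 to 100) for already-superposed C-alpha coordinates.

    Assumption: pred_ca and ref_ca are equal-length sequences of (x, y, z)
    tuples in angstroms, listed in the same residue order, and no further
    superposition search is performed.
    """
    if len(pred_ca) != len(ref_ca) or not ref_ca:
        raise ValueError("coordinate lists must be non-empty and of equal length")

    # Per-residue distance between the model and reference C-alpha atoms.
    dists = [math.dist(p, r) for p, r in zip(pred_ca, ref_ca)]

    # Fraction of residues within each cutoff, then the mean over cutoffs.
    fractions = [sum(d <= c for d in dists) / len(dists) for c in cutoffs]
    return 100.0 * sum(fractions) / len(cutoffs)


# Toy example (hypothetical coordinates): per-residue deviations of
# 0.5, 1.5, 3.0 and 9.0 angstroms from the reference.
ref = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
pred = [(0.5, 0.0, 0.0), (3.8, 1.5, 0.0), (7.6, 0.0, 3.0), (11.4, 9.0, 0.0)]

score = gdt_ts(pred, ref)
print(score)  # 56.25
```

In the toy example, 1 of 4 residues falls within 1 Å, 2 within 2 Å, and 3 within both 4 Å and 8 Å, giving (25 + 50 + 75 + 75) / 4 = 56.25. A model identical to the reference scores 100.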
Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / Proteins Language: En Journal: Bioinformatics Journal subject: Medical Informatics Year: 2023 Document type: Article Country of publication: United Kingdom
