Your browser doesn't support javascript.
loading
Comprehensive assessment of protein loop modeling programs on large-scale datasets: prediction accuracy and efficiency.
Wang, Tianyue; Wang, Langcheng; Zhang, Xujun; Shen, Chao; Zhang, Odin; Wang, Jike; Wu, Jialu; Jin, Ruofan; Zhou, Donghao; Chen, Shicheng; Liu, Liwei; Wang, Xiaorui; Hsieh, Chang-Yu; Chen, Guangyong; Pan, Peichen; Kang, Yu; Hou, Tingjun.
Affiliation
  • Wang T; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Wang L; Department of Pathology, New York University Medical Center, 550 First Avenue, New York, NY 10016, USA.
  • Zhang X; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Shen C; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Zhang O; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Wang J; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Wu J; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Jin R; College of Life Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Zhou D; Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China.
  • Chen S; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Liu L; Advanced Computing and Storage Laboratory, Central Research Institute, 2012 Laboratories, Huawei Technologies Co., Ltd., Shenzhen 518129, Guangdong, China.
  • Wang X; State Key Laboratory of Quality Research in Chinese Medicines, Macau University of Science and Technology, Macao, China.
  • Hsieh CY; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Chen G; Zhejiang Lab, Zhejiang University, Hangzhou 311121, Zhejiang, China.
  • Pan P; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Kang Y; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Hou T; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
Brief Bioinform ; 25(1)2023 11 22.
Article in En | MEDLINE | ID: mdl-38171930
ABSTRACT
Protein loops play a critical role in the dynamics of proteins and are essential for numerous biological functions, and various computational approaches to loop modeling have been proposed over the past decades. However, a comprehensive understanding of the strengths and weaknesses of each method is lacking. In this work, we constructed two high-quality datasets (i.e. the General dataset and the CASP dataset) and systematically evaluated the accuracy and efficiency of 13 commonly used loop modeling approaches from the perspective of loop lengths, protein classes and residue types. The results indicate that the knowledge-based method FREAD generally outperforms the other tested programs in most cases, but encountered challenges when predicting loops longer than 15 and 30 residues on the CASP and General datasets, respectively. The ab initio method Rosetta NGK demonstrated exceptional modeling accuracy for short loops with four to eight residues and achieved the highest success rate on the CASP dataset. The well-known AlphaFold2 and RoseTTAFold require more resources for better performance, but they exhibit promise for predicting loops longer than 16 and 30 residues in the CASP and General datasets. These observations can provide valuable insights for selecting suitable methods for specific loop modeling tasks and contribute to future advancements in the field.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Brief Bioinform / Brief. bioinform / Briefings in bioinformatics Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Brief Bioinform / Brief. bioinform / Briefings in bioinformatics Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Type: Article Affiliation country: China