Your browser doesn't support javascript.
loading
Accurate and efficient protein sequence design through learning concise local environment of residues.
Huang, Bin; Fan, Tingwen; Wang, Kaiyue; Zhang, Haicang; Yu, Chungong; Nie, Shuyu; Qi, Yangshuo; Zheng, Wei-Mou; Han, Jian; Fan, Zheng; Sun, Shiwei; Ye, Sheng; Yang, Huaiyi; Bu, Dongbo.
Afiliação
  • Huang B; Key Lab of Intelligent Information Processing, SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
  • Fan T; University of Chinese Academy of Sciences, Beijing 100110, China.
  • Wang K; Key Lab of Microbial Physiological & Metabolic Engineering, State Key Lab of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.
  • Zhang H; Beijing Advanced Innovation Center for Big Data-based Precision Medicine, School of Engineering Medicine, Beihang University, Beijing 100083, China.
  • Yu C; Key Laboratory of Big Data-based Precision Medicine (Beihang University), Ministry of Industry and Information Technology of the People's Republic of China, Beijing 100083, China.
  • Nie S; Key Lab of Intelligent Information Processing, SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
  • Qi Y; University of Chinese Academy of Sciences, Beijing 100110, China.
  • Zheng WM; Zhongke Big Data Academy, Zhengzhou, Henan 450046, China.
  • Han J; Key Lab of Intelligent Information Processing, SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
  • Fan Z; University of Chinese Academy of Sciences, Beijing 100110, China.
  • Sun S; Zhongke Big Data Academy, Zhengzhou, Henan 450046, China.
  • Ye S; Key Lab of Microbial Physiological & Metabolic Engineering, State Key Lab of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.
  • Yang H; School of Life Sciences, Hebei University, Baoding, Hebei 071002, China.
  • Bu D; Key Lab of Microbial Physiological & Metabolic Engineering, State Key Lab of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.
Bioinformatics ; 39(3)2023 03 01.
Article em En | MEDLINE | ID: mdl-36916746
ABSTRACT
MOTIVATION Computational protein sequence design has been widely applied in rational protein engineering and increasing the design accuracy and efficiency is highly desired.

RESULTS:

Here, we present ProDESIGN-LE, an accurate and efficient approach to protein sequence design. ProDESIGN-LE adopts a concise but informative representation of the residue's local environment and trains a transformer to learn the correlation between local environment of residues and their amino acid types. For a target backbone structure, ProDESIGN-LE uses the transformer to assign an appropriate residue type for each position based on its local environment within this structure, eventually acquiring a designed sequence with all residues fitting well with their local environments. We applied ProDESIGN-LE to design sequences for 68 naturally occurring and 129 hallucinated proteins within 20 s per protein on average. The designed proteins have their predicted structures perfectly resembling the target structures with a state-of-the-art average TM-score exceeding 0.80. We further experimentally validated ProDESIGN-LE by designing five sequences for an enzyme, chloramphenicol O-acetyltransferase type III (CAT III), and recombinantly expressing the proteins in Escherichia coli. Of these proteins, three exhibited excellent solubility, and one yielded monomeric species with circular dichroism spectra consistent with the natural CAT III protein. AVAILABILITY AND IMPLEMENTATION The source code of ProDESIGN-LE is available at https//github.com/bigict/ProDESIGN-LE.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Proteínas Tipo de estudo: Prognostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Proteínas Tipo de estudo: Prognostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China