Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
1.
Nature ; 561(7724): 485-491, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30209393

RESUMO

The regular arrangements of ß-strands around a central axis in ß-barrels and of α-helices in coiled coils contrast with the irregular tertiary structures of most globular proteins, and have fascinated structural biologists since they were first discovered. Simple parametric models have been used to design a wide range of α-helical coiled-coil structures, but to date there has been no success with ß-barrels. Here we show that accurate de novo design of ß-barrels requires considerable symmetry-breaking to achieve continuous hydrogen-bond connectivity and eliminate backbone strain. We then build ensembles of ß-barrel backbone models with cavity shapes that match the fluorogenic compound DFHBI, and use a hierarchical grid-based search method to simultaneously optimize the rigid-body placement of DFHBI in these cavities and the identities of the surrounding amino acids to achieve high shape and chemical complementarity. The designs have high structural accuracy and bind and fluorescently activate DFHBI in vitro and in Escherichia coli, yeast and mammalian cells. This de novo design of small-molecule binding activity, using backbones custom-built to bind the ligand, should enable the design of increasingly sophisticated ligand-binding proteins, sensors and catalysts that are not limited by the backbone geometries available in known protein structures.


Assuntos
Compostos de Benzil/química , Fluorescência , Imidazolinas/química , Proteínas/química , Animais , Compostos de Benzil/análise , Células COS , Chlorocebus aethiops , Escherichia coli , Proteínas de Fluorescência Verde/genética , Proteínas de Fluorescência Verde/metabolismo , Ligação de Hidrogênio , Imidazolinas/análise , Ligantes , Ligação Proteica , Domínios Proteicos , Dobramento de Proteína , Estabilidade Proteica , Estrutura Secundária de Proteína , Reprodutibilidade dos Testes , Leveduras
2.
J Comput Chem ; 44(14): 1360-1368, 2023 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-36847771

RESUMO

Cryo-electron microscopy (cryo-EM) is gaining large attention for high-resolution protein structure determination in solutions. However, a very high percentage of cryo-EM structures correspond to resolutions of 3-5 Å, making the structures difficult to be used in in silico drug design. In this study, we analyze how useful cryo-EM protein structures are for in silico drug design by evaluating ligand docking accuracy. From realistic cross-docking scenarios using medium resolution (3-5 Å) cryo-EM structures and a popular docking tool Autodock-Vina, only 20% of docking succeeded, when the success rate doubles in the same kind of cross-docking but using high-resolution (<2 Å) crystal structures instead. We decipher the reason for failures by decomposing the contribution from resolution-dependent and independent factors. The heterogeneity in the protein side-chain and backbone conformations is identified as the major resolution-dependent factor causing docking difficulty from our analysis, while intrinsic receptor flexibility mainly comprises the resolution-independent factor. We demonstrate the flexibility implementation in current ligand docking tools is able to rescue only a portion of failures (10%), and the limited performance was majorly due to potential structural errors than conformational changes. Our work suggests the strong necessity of more robust method developments on ligand docking and EM modeling techniques in order to fully utilize cryo-EM structures for in silico drug design.


Assuntos
Benchmarking , Proteínas , Microscopia Crioeletrônica/métodos , Ligantes , Proteínas/química , Desenho de Fármacos , Conformação Proteica
3.
Proc Natl Acad Sci U S A ; 117(3): 1496-1503, 2020 01 21.
Artigo em Inglês | MEDLINE | ID: mdl-31896580

RESUMO

The prediction of interresidue contacts and distances from coevolutionary data using deep learning has considerably advanced protein structure prediction. Here, we build on these advances by developing a deep residual network for predicting interresidue orientations, in addition to distances, and a Rosetta-constrained energy-minimization protocol for rapidly and accurately generating structure models guided by these restraints. In benchmark tests on 13th Community-Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP13)- and Continuous Automated Model Evaluation (CAMEO)-derived sets, the method outperforms all previously described structure-prediction methods. Although trained entirely on native proteins, the network consistently assigns higher probability to de novo-designed proteins, identifying the key fold-determining residues and providing an independent quantitative measure of the "ideality" of a protein structure. The method promises to be useful for a broad range of protein structure prediction and design problems.


Assuntos
Conformação Proteica , Análise de Sequência de Proteína/métodos , Software , Animais , Aprendizado Profundo , Humanos
4.
J Am Chem Soc ; 144(34): 15519-15528, 2022 08 31.
Artigo em Inglês | MEDLINE | ID: mdl-35972994

RESUMO

Although interest in stabilized α-helical peptides as next-generation therapeutics for modulating biomolecular interfaces is increasing, peptides have limited functionality and stability due to their small size. In comparison, α-helical ligands based on proteins can make steric clash with targets due to their large size. Here, we report the design of a monomeric pseudo-isolated α-helix (mPIH) system in which proteins behave as if they are peptides. The designed proteins contain α-helix ligands that do not require any covalent chemical modification, do not have frayed ends, and importantly can make sterically favorable interactions similar to isolated peptides. An optimal mPIH showed a more than 100-fold increase in target selectivity, which might be related to the advantages in conformational selection due to the absence of frayed ends. The α-helical ligand in the mPIH displayed high thermal stability well above human body temperature and showed reversible and rapid folding/unfolding transitions. Thus, mPIH can become a promising protein-based platform for developing stabilized α-helix pharmaceuticals.


Assuntos
Peptídeos , Proteínas , Sequência de Aminoácidos , Dicroísmo Circular , Humanos , Peptídeos/química , Conformação Proteica em alfa-Hélice , Dobramento de Proteína , Estrutura Secundária de Proteína
5.
Proteins ; 89(12): 1824-1833, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34324224

RESUMO

For CASP14, we developed deep learning-based methods for predicting homo-oligomeric and hetero-oligomeric contacts and used them for oligomer modeling. To build structure models, we developed an oligomer structure generation method that utilizes predicted interchain contacts to guide iterative restrained minimization from random backbone structures. We supplemented this gradient-based fold-and-dock method with template-based and ab initio docking approaches using deep learning-based subunit predictions on 29 assembly targets. These methods produced oligomer models with summed Z-scores 5.5 units higher than the next best group, with the fold-and-dock method having the best relative performance. Over the eight targets for which this method was used, the best of the five submitted models had average oligomer TM-score of 0.71 (average oligomer TM-score of the next best group: 0.64), and explicit modeling of inter-subunit interactions improved modeling of six out of 40 individual domains (ΔGDT-TS > 2.0).


Assuntos
Modelos Moleculares , Conformação Proteica , Proteínas , Software , Biologia Computacional , Bases de Dados de Proteínas , Aprendizado Profundo , Ligação Proteica , Subunidades Proteicas/química , Subunidades Proteicas/metabolismo , Proteínas/química , Proteínas/metabolismo , Análise de Sequência de Proteína
6.
Proteins ; 89(12): 1722-1733, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34331359

RESUMO

The trRosetta structure prediction method employs deep learning to generate predicted residue-residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings and template information weighted by sequence similarity to the target. We also developed a refinement pipeline that recombines models generated by template-free and template utilizing versions of trRosetta guided by the DeepAccNet accuracy predictor. Both benchmark tests and CASP results show that the new pipeline is a considerable improvement over the original trRosetta, and it is faster and requires less computing resources, completing the entire modeling process in a median < 3 h in CASP14. Our human group improved results with this pipeline primarily by identifying additional homologous sequences for input into the network. We also used the DeepAccNet accuracy predictor to guide Rosetta high-resolution refinement for submissions in the regular and refinement categories; although performance was quite good on a CASP relative scale, the overall improvements were rather modest in part due to missing inter-domain or inter-chain contacts.


Assuntos
Biologia Computacional/métodos , Aprendizado Profundo , Estrutura Terciária de Proteína , Proteínas , Software , Humanos , Metagenoma/genética , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Análise de Sequência de Proteína
7.
PLoS Comput Biol ; 16(9): e1008103, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32956350

RESUMO

Highly coordinated water molecules are frequently an integral part of protein-protein and protein-ligand interfaces. We introduce an updated energy model that efficiently captures the energetic effects of these ordered water molecules on the surfaces of proteins. A two-stage method is developed in which polar groups arranged in geometries suitable for water placement are first identified, then a modified Monte Carlo simulation allows highly coordinated waters to be placed on the surface of a protein while simultaneously sampling amino acid side chain orientations. This "semi-explicit" water model is implemented in Rosetta and is suitable for both structure prediction and protein design. We show that our new approach and energy model yield significant improvements in native structure recovery of protein-protein and protein-ligand docking discrimination tests.


Assuntos
Sítios de Ligação/fisiologia , Simulação de Acoplamento Molecular , Ligação Proteica/fisiologia , Proteínas , Água , Algoritmos , Aminoácidos/química , Aminoácidos/metabolismo , Ligação de Hidrogênio , Ligantes , Método de Monte Carlo , Proteínas/química , Proteínas/metabolismo , Água/química , Água/metabolismo
8.
Proc Natl Acad Sci U S A ; 115(12): 3054-3059, 2018 03 20.
Artigo em Inglês | MEDLINE | ID: mdl-29507254

RESUMO

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.


Assuntos
Simulação por Computador , Modelos Químicos , Biologia Computacional/métodos , Modelos Moleculares , Simulação de Dinâmica Molecular , Conformação Proteica , Dobramento de Proteína , Termodinâmica
9.
Proteins ; 87(12): 1276-1282, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31325340

RESUMO

Because proteins generally fold to their lowest free energy states, energy-guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co-evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT-HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root-mean-square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult.


Assuntos
Biologia Computacional/métodos , Conformação Proteica , Dobramento de Proteína , Proteínas/química , Algoritmos , Modelos Moleculares , Reprodutibilidade dos Testes , Termodinâmica
10.
Proteins ; 86 Suppl 1: 113-121, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-28940798

RESUMO

We describe several notable aspects of our structure predictions using Rosetta in CASP12 in the free modeling (FM) and refinement (TR) categories. First, we had previously generated (and published) models for most large protein families lacking experimentally determined structures using Rosetta guided by co-evolution based contact predictions, and for several targets these models proved better starting points for comparative modeling than any known crystal structure-our model database thus starts to fulfill one of the goals of the original protein structure initiative. Second, while our "human" group simply submitted ROBETTA models for most targets, for six targets expert intervention improved predictions considerably; the largest improvement was for T0886 where we correctly parsed two discontinuous domains guided by predicted contact maps to accurately identify a structural homolog of the same fold. Third, Rosetta all atom refinement followed by MD simulations led to consistent but small improvements when starting models were close to the native structure, and larger but less consistent improvements when starting models were further away.


Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Conformação Proteica , Dobramento de Proteína , Proteínas/química , Algoritmos , Cristalografia por Raios X , Humanos , Análise de Sequência de Proteína
11.
Proteins ; 86 Suppl 1: 283-291, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-28913931

RESUMO

Many naturally occurring protein systems function primarily as symmetric assemblies. Prediction of the quaternary structure of these assemblies is an important biological problem. This article describes automated tools we have developed for predicting the structures of symmetric protein assemblies in the Robetta structure prediction server. We assess the performance of this pipeline on a set of targets from the recent CASP12/CAPRI blind quaternary structure prediction experiment. Our approach successfully predicted 5 of 7 symmetric assemblies in this challenge, and was assessed as the best participating server group, and 1 of only 2 groups (human or server) with 2 predictions judged as high quality by the assessors. We also assess the method on a broader set of 22 natively symmetric CASP12 targets, where we show that oligomeric modeling can improve the accuracy of monomeric structure determination, particularly in highly intertwined oligomers.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Modelos Moleculares , Conformação Proteica , Multimerização Proteica , Proteínas/química , Software , Humanos , Análise de Sequência de Proteína
12.
J Chem Inf Model ; 58(6): 1234-1243, 2018 06 25.
Artigo em Inglês | MEDLINE | ID: mdl-29786430

RESUMO

The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In this study, a new ECL2 conformational sampling method involving both template-based and ab initio sampling was developed. Inspired by the observation of similar ECL2 structures of closely related GPCRs, a template-based sampling method employing loop structure templates selected from the structure database was developed. A new metric for evaluating similarity of the target loop to templates was introduced for template selection. An ab initio loop sampling method was also developed to treat cases without highly similar templates. The ab initio method is based on the previously developed fragment assembly and loop closure method. A new sampling component that takes advantage of secondary structure prediction was added. In addition, a conserved disulfide bridge restraining ECL2 conformation was predicted and analytically incorporated into sampling, reducing the effective dimension of the conformational search space. The sampling method was combined with an existing energy function for comparison with previously reported loop structure prediction methods, and the benchmark test demonstrated outstanding performance.


Assuntos
Receptores Acoplados a Proteínas G/química , Animais , Bases de Dados de Proteínas , Dissulfetos/química , Humanos , Modelos Moleculares , Conformação Proteica , Estrutura Secundária de Proteína
13.
Proteins ; 84 Suppl 1: 314-22, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-26205421

RESUMO

We report new Rosetta-based approaches to tackling the major issues that confound protein structure refinement, and the testing of these approaches in the CASP11 experiment. Automated refinement protocols were developed that integrate a range of sampling methods using parallel computation and multiobjective optimization. In CASP11, we used a more aggressive large-scale structure rebuilding approach for poor starting models, and a less aggressive local rebuilding plus core refinement approach for starting models likely to be closer to the native structure. The more incorrectly modeled a structure was predicted to be, the more it was allowed to vary during refinement. The CASP11 experiment revealed strengths and weaknesses of the approaches: the high-resolution strategy incorporating local rebuilding with core refinement consistently improved starting structures, while the low-resolution strategy incorporating the reconstruction of large parts of the structures improved starting models in some cases but often considerably worsened them, largely because of model selection issues. Overall, the results suggest the high-resolution refinement protocol is a promising method orthogonal to other approaches, while the low-resolution refinement method clearly requires further development. Proteins 2016; 84(Suppl 1):314-322. © 2015 Wiley Periodicals, Inc.


Assuntos
Biologia Computacional/estatística & dados numéricos , Modelos Estatísticos , Simulação de Dinâmica Molecular , Proteínas/química , Software , Algoritmos , Motivos de Aminoácidos , Benchmarking , Biologia Computacional/métodos , Humanos , Internet , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos , Termodinâmica
14.
Proteins ; 84 Suppl 1: 181-8, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-26857542

RESUMO

In CASP11 we generated protein structure models using simulated ambiguous and unambiguous nuclear Overhauser effect (NOE) restraints with a two stage protocol. Low resolution models were generated guided by the unambiguous restraints using continuous chain folding for alpha and alpha-beta proteins, and iterative annealing for all beta proteins to take advantage of the strand pairing information implicit in the restraints. The Rosetta fragment/model hybridization protocol was then used to recombine and regularize these models, and refine them in the Rosetta full atom energy function guided by both the unambiguous and the ambiguous restraints. Fifteen out of 19 targets were modeled with GDT-TS quality scores greater than 60 for Model 1, significantly improving upon the non-assisted predictions. Our results suggest that atomic level accuracy is achievable using sparse NOE data when there is at least one correctly assigned NOE for every residue. Proteins 2016; 84(Suppl 1):181-188. © 2016 Wiley Periodicals, Inc.


Assuntos
Biologia Computacional/estatística & dados numéricos , Modelos Moleculares , Modelos Estatísticos , Proteínas/química , Software , Algoritmos , Motivos de Aminoácidos , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas , Cooperação Internacional , Internet , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas
15.
Nucleic Acids Res ; 41(Web Server issue): W384-8, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23737448

RESUMO

The quality of model structures generated by contemporary protein structure prediction methods strongly depends on the degree of similarity between the target and available template structures. Therefore, the importance of improving template-based model structures beyond the accuracy available from template information has been emphasized in the structure prediction community. The GalaxyRefine web server, freely available at http://galaxy.seoklab.org/refine, is based on a refinement method that has been successfully tested in CASP10. The method first rebuilds side chains and performs side-chain repacking and subsequent overall structure relaxation by molecular dynamics simulation. According to the CASP10 assessment, this method showed the best performance in improving the local structure quality. The method can improve both global and local structure quality on average, when used for refining the models generated by state-of-the-art protein structure prediction servers.


Assuntos
Conformação Proteica , Software , Internet , Simulação de Dinâmica Molecular
16.
Proteins ; 82(4): 620-32, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24155158

RESUMO

We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47), were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions-20 groups submitted a total of 195 models-were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models-a very good docking performance per se-only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 Å, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall the best interface water predictions was achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.


Assuntos
Colicinas/química , Mapeamento de Interação de Proteínas , Água/química , Algoritmos , Biologia Computacional , Modelos Moleculares , Simulação de Acoplamento Molecular , Conformação Proteica
17.
Bioinformatics ; 29(8): 1078-80, 2013 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-23413437

RESUMO

SUMMARY: A large number of proteins function as homo-oligomers; therefore, predicting homo-oligomeric structure of proteins is of primary importance for understanding protein function at the molecular level. Here, we introduce a web server for prediction of protein homo-oligomer structure. The server takes a protein monomer structure as input and predicts its homo-oligomer structure from oligomer templates selected based on sequence and tertiary/quaternary structure similarity. Using protein model structures as input, the server shows clear improvement over the best methods of CASP9 in predicting oligomeric structures from amino acid sequences. AVAILABILITY: http://galaxy.seoklab.org/gemini. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Multimerização Proteica , Software , Internet , Proteínas/química , Análise de Sequência de Proteína , Homologia de Sequência de Aminoácidos , Homologia Estrutural de Proteína
18.
Nucleic Acids Res ; 40(Web Server issue): W294-7, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22649060

RESUMO

Three-dimensional protein structures provide invaluable information for understanding and regulating biological functions of proteins. The GalaxyWEB server predicts protein structure from sequence by template-based modeling and refines loop or terminus regions by ab initio modeling. This web server is based on the method tested in CASP9 (9th Critical Assessment of techniques for protein Structure Prediction) as 'Seok-server', which was assessed to be among top performing template-based modeling servers. The method generates reliable core structures from multiple templates and re-builds unreliable loops or termini by using an optimization-based refinement method. In addition to structure prediction, a user can also submit a refinement only job by providing a starting model structure and locations of loops or termini to refine. The web server can be freely accessed at http://galaxy.seoklab.org/.


Assuntos
Conformação Proteica , Software , Internet , Análise de Sequência de Proteína , Interface Usuário-Computador
19.
J Chem Theory Comput ; 20(7): 2689-2695, 2024 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-38547871

RESUMO

Mapping the ensemble of protein conformations that contribute to function and can be targeted by small molecule drugs remains an outstanding challenge. Here, we explore the use of variational autoencoders for reducing the challenge of dimensionality in the protein structure ensemble generation problem. We convert high-dimensional protein structural data into a continuous, low-dimensional representation, carry out a search in this space guided by a structure quality metric, and then use RoseTTAFold guided by the sampled structural information to generate 3D structures. We use this approach to generate ensembles for the cancer relevant protein K-Ras, train the VAE on a subset of the available K-Ras crystal structures and MD simulation snapshots, and assess the extent of sampling close to crystal structures withheld from training. We find that our latent space sampling procedure rapidly generates ensembles with high structural quality and is able to sample within 1 Å of held-out crystal structures, with a consistency higher than that of MD simulation or AlphaFold2 prediction. The sampled structures sufficiently recapitulate the cryptic pockets in the held-out K-Ras structures to allow for small molecule docking.


Assuntos
Proteínas , Proteínas/química , Conformação Proteica , Simulação por Computador
20.
J Chem Theory Comput ; 2024 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-39109987

RESUMO

With the recent introduction of deep learning techniques into the prediction of biomolecular structures, structure prediction performance has significantly improved, and the potential for biomedical applications has increased considerably. The prediction of protein-ligand complex structures, applicable to the atomistic understanding of biomolecular functions and the effective design of drug molecules, has also improved with the introduction of deep learning. In this paper, it is demonstrated that docking performance can be greatly enhanced by training an energy function that encapsulates physical effects using deep learning within the framework of the traditional protein-ligand docking method. The advantage of this method, called GalaxyDock-DL, lies in its minimal overfitting to the training data compared to several existing deep learning-based protein-ligand docking methods. Unlike some recent deep learning methods, it does not use information about known binding pocket center positions. Instead, the results of this docking method show a systematic dependence on the physical properties of the target protein-ligand complexes such as atomic thermal fluctuations and binding affinity. GalaxyDock-DL utilizes the global optimization technique of the conventional protein-ligand docking method, GalaxyDock, and a neural network energy function trained to stabilize the native state compared to non-native states, just as physical free energy does. This physical principle-based approach suggests directions not only for future structure prediction involving structurally flexible biomolecular complexes but also for predicting binding affinity, thereby providing guidance for the effective design of biofunctional ligands.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa