Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
Proteomics ; 23(17): e2200322, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-36529945

RESUMEN

Proteins and nucleic acids are key components in many processes in living cells, and interactions between proteins and nucleic acids are often crucial pathway components. In many cases, large flexibility of proteins as they interact with nucleic acids is key to their function. To understand the mechanisms of these processes, it is necessary to consider the 3D atomic structures of such protein-nucleic acid complexes. When such structures are not yet experimentally determined, protein docking can be used to computationally generate useful structure models. However, such docking has long had the limitation that the consideration of flexibility is usually limited to small movements or to small structures. We previously developed a method of flexible protein docking which could model ordered proteins which undergo large-scale conformational changes, which we also showed was compatible with nucleic acids. Here, we elaborate on the ability of that pipeline, Flex-LZerD, to model specifically interactions between proteins and nucleic acids, and demonstrate that Flex-LZerD can model more interactions and types of conformational change than previously shown.


Asunto(s)
Ácidos Nucleicos , Conformación Proteica , Unión Proteica , Ácidos Nucleicos/metabolismo , Proteínas/metabolismo
2.
Proteomics ; 23(17): e2200323, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37365936

RESUMEN

Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.


Asunto(s)
Proteínas , Reproducibilidad de los Resultados , Proteínas/metabolismo , Unión Proteica
3.
Proteins ; 91(12): 1658-1683, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37905971

RESUMEN

We present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homodimers, 3 homo-trimers, 13 heterodimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target. A total of 21 941 models submitted by these groups and by 15 CAPRI scorer groups were evaluated using the CAPRI model quality measures and the DockQ score consolidating these measures. The prediction performance was quantified by a weighted score based on the number of models of acceptable quality or higher submitted by each group among their five best models. Results show substantial progress achieved across a significant fraction of the 60+ participating groups. High-quality models were produced for about 40% of the targets compared to 8% two years earlier. This remarkable improvement is due to the wide use of the AlphaFold2 and AlphaFold2-Multimer software and the confidence metrics they provide. Notably, expanded sampling of candidate solutions by manipulating these deep learning inference engines, enriching multiple sequence alignments, or integration of advanced modeling tools, enabled top performing groups to exceed the performance of a standard AlphaFold2-Multimer version used as a yard stick. This notwithstanding, performance remained poor for complexes with antibodies and nanobodies, where evolutionary relationships between the binding partners are lacking, and for complexes featuring conformational flexibility, clearly indicating that the prediction of protein complexes remains a challenging problem.


Asunto(s)
Algoritmos , Mapeo de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Conformación Proteica , Unión Proteica , Simulación del Acoplamiento Molecular , Biología Computacional/métodos , Programas Informáticos
4.
Nucleic Acids Res ; 49(W1): W359-W365, 2021 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-33963854

RESUMEN

Protein complexes are involved in many important processes in living cells. To understand the mechanisms of these processes, it is necessary to solve the 3D structures of the protein complexes. When protein complex structures have not yet been determined by experiment, protein-protein docking tools can be used to computationally model the structures of these complexes. Here, we present a webserver which provides access to LZerD and Multi-LZerD protein docking tools. The protocol provided by the server have performed consistently among the top in the CAPRI blind evaluation. LZerD docks pairs of structures, while Multi-LZerD can dock three or more structures simultaneously. LZerD uses a soft protein surface representation with 3D Zernike descriptors and explores the binding pose space using geometric hashing. Multi-LZerD performs multi-chain docking by combining pairwise solutions by LZerD. Both methods output full-atom docked models of the input proteins. Users can also input distance constraints between interacting or non-interacting residues as well as residues that locate at the interface or far from the interface. The webserver is equipped with a user-friendly panel that visualizes the distribution and structures of binding poses of top scoring models. The LZerD webserver is available at https://lzerd.kiharalab.org.


Asunto(s)
Simulación del Acoplamiento Molecular/métodos , Complejos Multiproteicos/química , Programas Informáticos , Antígenos CD/química , Proteínas Bacterianas/química , Moléculas de Adhesión Celular/química , Enoil-ACP Reductasa (NADH)/química , Humanos , Internet
5.
Bioinformatics ; 36(7): 2113-2118, 2020 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-31746961

RESUMEN

MOTIVATION: Many important cellular processes involve physical interactions of proteins. Therefore, determining protein quaternary structures provide critical insights for understanding molecular mechanisms of functions of the complexes. To complement experimental methods, many computational methods have been developed to predict structures of protein complexes. One of the challenges in computational protein complex structure prediction is to identify near-native models from a large pool of generated models. RESULTS: We developed a convolutional deep neural network-based approach named DOcking decoy selection with Voxel-based deep neural nEtwork (DOVE) for evaluating protein docking models. To evaluate a protein docking model, DOVE scans the protein-protein interface of the model with a 3D voxel and considers atomic interaction types and their energetic contributions as input features applied to the neural network. The deep learning models were trained and validated on docking models available in the ZDock and DockGround databases. Among the different combinations of features tested, almost all outperformed existing scoring functions. AVAILABILITY AND IMPLEMENTATION: Codes available at http://github.com/kiharalab/DOVE, http://kiharalab.org/dove/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Redes Neurales de la Computación , Proteínas
6.
Proteins ; 88(8): 948-961, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-31697428

RESUMEN

We report the performance of the protein docking prediction pipeline of our group and the results for Critical Assessment of Prediction of Interactions (CAPRI) rounds 38-46. The pipeline integrates programs developed in our group as well as other existing scoring functions. The core of the pipeline is the LZerD protein-protein docking algorithm. If templates of the target complex are not found in PDB, the first step of our docking prediction pipeline is to run LZerD for a query protein pair. Meanwhile, in the case of human group prediction, we survey the literature to find information that can guide the modeling, such as protein-protein interface information. In addition to any literature information and binding residue prediction, generated docking decoys were selected by a rank aggregation of statistical scoring functions. The top 10 decoys were relaxed by a short molecular dynamics simulation before submission to remove atom clashes and improve side-chain conformations. In these CAPRI rounds, our group, particularly the LZerD server, showed robust performance. On the other hand, there are failed cases where some other groups were successful. To understand weaknesses of our pipeline, we analyzed sources of errors for failed targets. Since we noted that structure refinement is a step that needs improvement, we newly performed a comparative study of several refinement approaches. Finally, we show several examples that illustrate successful and unsuccessful cases by our group.


Asunto(s)
Simulación del Acoplamiento Molecular , Péptidos/química , Proteínas/química , Programas Informáticos , Algoritmos , Secuencia de Aminoácidos , Sitios de Unión , Humanos , Ligandos , Péptidos/metabolismo , Unión Proteica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Dominios y Motivos de Interacción de Proteínas , Mapeo de Interacción de Proteínas , Proteínas/metabolismo , Proyectos de Investigación , Homología Estructural de Proteína
7.
PLoS Comput Biol ; 15(4): e1006969, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30978181

RESUMEN

Proteins are involved in almost all functions in a living cell, and functions of proteins are realized by their tertiary structures. Obtaining a global perspective of the variety and distribution of protein structures lays a foundation for our understanding of the building principle of protein structures. In light of the rapid accumulation of low-resolution structure data from electron tomography and cryo-electron microscopy, here we map and classify three-dimensional (3D) surface shapes of proteins into a similarity space. Surface shapes of proteins were represented with 3D Zernike descriptors, mathematical moment-based invariants, which have previously been demonstrated effective for biomolecular structure similarity search. In addition to single chains of proteins, we have also analyzed the shape space occupied by protein complexes. From the mapping, we have obtained various new insights into the relationship between shapes, main-chain folds, and complex formation. The unique view obtained from shape mapping opens up new ways to understand design principles, functions, and evolution of proteins.


Asunto(s)
Elementos Estructurales de las Proteínas/fisiología , Proteínas/química , Proteínas/clasificación , Biología Computacional/métodos , Bases de Datos de Proteínas , Modelos Moleculares , Conformación Proteica , Programas Informáticos
8.
Proteins ; 87(12): 1200-1221, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31612567

RESUMEN

We present the results for CAPRI Round 46, the third joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of 20 targets including 14 homo-oligomers and 6 heterocomplexes. Eight of the homo-oligomer targets and one heterodimer comprised proteins that could be readily modeled using templates from the Protein Data Bank, often available for the full assembly. The remaining 11 targets comprised 5 homodimers, 3 heterodimers, and two higher-order assemblies. These were more difficult to model, as their prediction mainly involved "ab-initio" docking of subunit models derived from distantly related templates. A total of ~30 CAPRI groups, including 9 automatic servers, submitted on average ~2000 models per target. About 17 groups participated in the CAPRI scoring rounds, offered for most targets, submitting ~170 models per target. The prediction performance, measured by the fraction of models of acceptable quality or higher submitted across all predictors groups, was very good to excellent for the nine easy targets. Poorer performance was achieved by predictors for the 11 difficult targets, with medium and high quality models submitted for only 3 of these targets. A similar performance "gap" was displayed by scorer groups, highlighting yet again the unmet challenge of modeling the conformational changes of the protein components that occur upon binding or that must be accounted for in template-based modeling. Our analysis also indicates that residues in binding interfaces were less well predicted in this set of targets than in previous Rounds, providing useful insights for directions of future improvements.


Asunto(s)
Biología Computacional , Conformación Proteica , Proteínas/ultraestructura , Programas Informáticos , Algoritmos , Sitios de Unión/genética , Bases de Datos de Proteínas , Modelos Moleculares , Unión Proteica/genética , Mapeo de Interacción de Proteínas , Proteínas/química , Proteínas/genética , Homología Estructural de Proteína
9.
PLoS Comput Biol ; 14(1): e1005937, 2018 01.
Artículo en Inglés | MEDLINE | ID: mdl-29329283

RESUMEN

Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LzerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes.


Asunto(s)
Biología Computacional/métodos , Mapeo de Interacción de Proteínas/métodos , Proteínas/química , Algoritmos , Toxina del Cólera/química , Bases de Datos de Proteínas , Helicobacter pylori/metabolismo , Humanos , Modelos Estadísticos , Simulación del Acoplamiento Molecular , Unión Proteica , Dominios Proteicos , Programas Informáticos , Termodinámica
10.
PLoS Comput Biol ; 13(4): e1005485, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28394890

RESUMEN

Disordered protein-protein interactions (PPIs), those involving a folded protein and an intrinsically disordered protein (IDP), are prevalent in the cell, including important signaling and regulatory pathways. IDPs do not adopt a single dominant structure in isolation but often become ordered upon binding. To aid understanding of the molecular mechanisms of disordered PPIs, it is crucial to obtain the tertiary structure of the PPIs. However, experimental methods have difficulty in solving disordered PPIs and existing protein-protein and protein-peptide docking methods are not able to model them. Here we present a novel computational method, IDP-LZerD, which models the conformation of a disordered PPI by considering the biophysical binding mechanism of an IDP to a structured protein, whereby a local segment of the IDP initiates the interaction and subsequently the remaining IDP regions explore and coalesce around the initial binding site. On a dataset of 22 disordered PPIs with IDPs up to 69 amino acids, successful predictions were made for 21 bound and 18 unbound receptors. The successful modeling provides additional support for biophysical principles. Moreover, the new technique significantly expands the capability of protein structure modeling and provides crucial insights into the molecular mechanisms of disordered PPIs.


Asunto(s)
Biología Computacional/métodos , Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/metabolismo , Modelos Moleculares , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Animales , Sitios de Unión , Humanos , Ratones , Unión Proteica
11.
Methods ; 131: 22-32, 2017 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-28802714

RESUMEN

A core concept behind modern drug discovery is finding a small molecule that modulates a function of a target protein. This concept has been successfully applied since the mid-1970s. However, the efficiency of drug discovery is decreasing because the druggable target space in the human proteome is limited. Recently, protein-protein interaction (PPI) has been identified asan emerging target space for drug discovery. PPI plays a pivotal role in biological pathways including diseases. Current human interactome research suggests that the number of PPIs is between 130,000 and 650,000, and only a small number of them have been targeted as drug targets. For traditional drug targets, in silico structure-based methods have been successful in many cases. However, their performance suffers on PPI interfaces because PPI interfaces are different in five major aspects: From a geometric standpoint, they have relatively large interface regions, flat geometry, and the interface surface shape tends to fluctuate upon binding. Also, their interactions are dominated by hydrophobic atoms, which is different from traditional binding-pocket-targeted drugs. Finally, PPI targets usually lack natural molecules that bind to the target PPI interface. Here, we first summarize characteristics of PPI interfaces and their known binders. Then, we will review existing in silico structure-based approaches for discovering small molecules that bind to PPI interfaces.


Asunto(s)
Diseño de Fármacos , Descubrimiento de Drogas/métodos , Simulación del Acoplamiento Molecular , Proteínas/metabolismo , Bibliotecas de Moléculas Pequeñas/química , Biología Computacional , Descubrimiento de Drogas/tendencias , Humanos , Terapia Molecular Dirigida/métodos , Unión Proteica/efectos de los fármacos , Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas/efectos de los fármacos , Proteínas/química , Bibliotecas de Moléculas Pequeñas/farmacología , Relación Estructura-Actividad
12.
J Chem Inf Model ; 56(9): 1676-91, 2016 09 26.
Artículo en Inglés | MEDLINE | ID: mdl-27500657

RESUMEN

Virtual screening has become an indispensable procedure in drug discovery. Virtual screening methods can be classified into two categories: ligand-based and structure-based. While the former have advantages, including being quick to compute, in general they are relatively weak at discovering novel active compounds because they use known actives as references. On the other hand, structure-based methods have higher potential to find novel compounds because they directly predict the binding affinity of a ligand in a target binding pocket, albeit with substantially lower speed than ligand-based methods. Here we report a novel structure-based virtual screening method, PL-PatchSurfer2. In PL-PatchSurfer2, protein and ligand surfaces are represented by a set of overlapping local patches, each of which is represented by three-dimensional Zernike descriptors (3DZDs). By means of 3DZDs, the shapes and physicochemical complementarities of local surface regions of a pocket surface and a ligand molecule can be concisely and effectively computed. Compared with the previous version of the program, the performance of PL-PatchSurfer2 is substantially improved by the addition of two more features, atom-based hydrophobicity and hydrogen-bond acceptors and donors. Benchmark studies showed that PL-PatchSurfer2 performed better than or comparable to popular existing methods. Particularly, PL-PatchSurfer2 significantly outperformed existing methods when apo-form or template-based protein models were used for queries. The computational time of PL-PatchSurfer2 is about 20 times shorter than those of conventional structure-based methods. The PL-PatchSurfer2 program is available at http://www.kiharalab.org/plps2/ .


Asunto(s)
Evaluación Preclínica de Medicamentos/métodos , Sitios de Unión , Ligandos , Simulación del Acoplamiento Molecular , Conformación Proteica , Interfaz Usuario-Computador
13.
BMC Bioinformatics ; 16: 181, 2015 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-26025554

RESUMEN

BACKGROUND: The Electron Microscopy DataBank (EMDB) is growing rapidly, accumulating biological structural data obtained mainly by electron microscopy and tomography, which are emerging techniques for determining large biomolecular complex and subcellular structures. Together with the Protein Data Bank (PDB), EMDB is becoming a fundamental resource of the tertiary structures of biological macromolecules. To take full advantage of this indispensable resource, the ability to search the database by structural similarity is essential. However, unlike high-resolution structures stored in PDB, methods for comparing low-resolution electron microscopy (EM) density maps in EMDB are not well established. RESULTS: We developed a computational method for efficiently searching low-resolution EM maps. The method uses a compact fingerprint representation of EM maps based on the 3D Zernike descriptor, which is derived from a mathematical series expansion for EM maps that are considered as 3D functions. The method is implemented in a web server named EM-SURFER, which allows users to search against the entire EMDB in real-time. EM-SURFER compares the global shapes of EM maps. Examples of search results from different types of query structures are discussed. CONCLUSIONS: We developed EM-SURFER, which retrieves structurally relevant matches for query EM maps from EMDB within seconds. The unique capability of EM-SURFER to detect 3D shape similarity of low-resolution EM maps should prove invaluable in structural biology.


Asunto(s)
Biología Computacional , Bases de Datos de Proteínas , Internet , Microscopía Electrónica/métodos , Proteínas/química , Programas Informáticos , Humanos , Imagenología Tridimensional , Modelos Moleculares
14.
J Mol Biol ; 436(6): 168486, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38336197

RESUMEN

Membrane proteins play crucial roles in various cellular processes, and their interactions with other proteins in and on the membrane are essential for their proper functioning. While an increasing number of structures of more membrane proteins are being determined, the available structure data is still sparse. To gain insights into the mechanisms of membrane protein complexes, computational docking methods are necessary due to the challenge of experimental determination. Here, we introduce Mem-LZerD, a rigid-body membrane docking algorithm designed to take advantage of modern membrane modeling and protein docking techniques to facilitate the docking of membrane protein complexes. Mem-LZerD is based on the LZerD protein docking algorithm, which has been constantly among the top servers in many rounds of CAPRI protein docking assessment. By employing a combination of geometric hashing, newly constrained by the predicted membrane height and tilt angle, and model scoring accounting for the energy of membrane insertion, we demonstrate the capability of Mem-LZerD to model diverse membrane protein-protein complexes. Mem-LZerD successfully performed unbound docking on 13 of 21 (61.9%) transmembrane complexes in an established benchmark, more than shown by previous approaches. It was additionally tested on new datasets of 44 transmembrane complexes and 92 peripheral membrane protein complexes, of which it successfully modeled 35 (79.5%) and 15 (16.3%) complexes respectively. When non-blind orientations of peripheral targets were included, the number of successes increased to 54 (58.7%). We further demonstrate that Mem-LZerD produces complex models which are suitable for molecular dynamics simulation. Mem-LZerD is made available at https://lzerd.kiharalab.org.


Asunto(s)
Proteínas de la Membrana , Algoritmos , Proteínas de la Membrana/química , Simulación del Acoplamiento Molecular , Unión Proteica , Conformación Proteica , Programas Informáticos
15.
Methods Mol Biol ; 2690: 355-373, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37450159

RESUMEN

Interactions of proteins with other macromolecules have important structural and functional roles in the basic processes of living cells. To understand and elucidate the mechanisms of interactions, it is important to know the 3D structures of the complexes. Proteomes contain numerous protein-protein complexes, for which experimentally determined structures often do not exist. Computational techniques can be a practical alternative to obtain useful complex structure models. Here, we present a web server that provides access to the LZerD and Multi-LZerD protein docking tools, which can perform both pairwise and multi-chain docking. The web server is user-friendly, with options to visualize the distribution and structures of binding poses of top-scoring models. The LZerD web server is available at https://lzerd.kiharalab.org . This chapter dictates the algorithm and step-by-step procedure to model the monomeric structures with AttentiveDist, and also provides the detail of pairwise LZerD docking, and multi-LZerD. This also provided case studies for each of the three modules.


Asunto(s)
Biología Computacional , Programas Informáticos , Simulación del Acoplamiento Molecular , Biología Computacional/métodos , Algoritmos , Proteoma , Internet , Unión Proteica
16.
bioRxiv ; 2023 Nov 09.
Artículo en Inglés | MEDLINE | ID: mdl-37961264

RESUMEN

Membrane proteins play crucial roles in various cellular processes, and their interactions with other proteins in and on the membrane are essential for their proper functioning. While an increasing number of structures of more membrane proteins are being determined, the available structure data is still sparse. To gain insights into the mechanisms of membrane protein complexes, computational docking methods are necessary due to the challenge of experimental determination. Here, we introduce Mem-LZerD, a rigid-body membrane docking algorithm designed to take advantage of modern membrane modeling and protein docking techniques to facilitate the docking of membrane protein complexes. Mem-LZerD is based on the LZerD protein docking algorithm, which has been constantly among the top servers in many rounds of CAPRI protein docking assessment. By employing a combination of geometric hashing, newly constrained by the predicted membrane height and tilt angle, and model scoring accounting for the energy of membrane insertion, we demonstrate the capability of Mem-LZerD to model diverse membrane protein-protein complexes. Mem-LZerD successfully performed unbound docking on 13 of 21 (61.9%) transmembrane complexes in an established benchmark, more than shown by previous approaches. It was additionally tested on new datasets of 44 transmembrane complexes and 92 peripheral membrane protein complexes, of which it successfully modeled 35 (79.5%) and 15 (16.3%) complexes respectively. When non-blind orientations of peripheral targets were included, the number of successes increased to 54 (58.7%). We further demonstrate that Mem-LZerD produces complex models which are suitable for molecular dynamics simulation. Mem-LZerD is made available at https://lzerd.kiharalab.org.

17.
bioRxiv ; 2023 Dec 04.
Artículo en Inglés | MEDLINE | ID: mdl-38106114

RESUMEN

Protein-peptide interactions play a key role in biological processes. Understanding the interactions that occur within a receptor-peptide complex can help in discovering and altering their biological functions. Various computational methods for modeling the structures of receptor-peptide complexes have been developed. Recently, accurate structure prediction enabled by deep learning methods has significantly advanced the field of structural biology. AlphaFold (AF) is among the top-performing structure prediction methods and has highly accurate structure modeling performance on single-chain targets. Shortly after the release of AlphaFold, AlphaFold-Multimer (AFM) was developed in a similar fashion as AF for prediction of protein complex structures. AFM has achieved competitive performance in modeling protein-peptide interactions compared to previous computational methods; however, still further improvement is needed. Here, we present DistPepFold, which improves protein-peptide complex docking using an AFM-based architecture through a privileged knowledge distillation approach. DistPepFold leverages a teacher model that uses native interaction information during training and transfers its knowledge to a student model through a teacher-student distillation process. We evaluated DistPepFold's docking performance on two protein-peptide complex datasets and showed that DistPepFold outperforms AFM. Furthermore, we demonstrate that the student model was able to learn from the teacher model to make structural improvements based on AFM predictions.

18.
Biomolecules ; 13(4)2023 03 29.
Artículo en Inglés | MEDLINE | ID: mdl-37189363

RESUMEN

Lowe Syndrome (LS) is a condition due to mutations in the OCRL1 gene, characterized by congenital cataracts, intellectual disability, and kidney malfunction. Unfortunately, patients succumb to renal failure after adolescence. This study is centered in investigating the biochemical and phenotypic impact of patient's OCRL1 variants (OCRL1VAR). Specifically, we tested the hypothesis that some OCRL1VAR are stabilized in a non-functional conformation by focusing on missense mutations affecting the phosphatase domain, but not changing residues involved in binding/catalysis. The pathogenic and conformational characteristics of the selected variants were evaluated in silico and our results revealed some OCRL1VAR to be benign, while others are pathogenic. Then we proceeded to monitor the enzymatic activity and function in kidney cells of the different OCRL1VAR. Based on their enzymatic activity and presence/absence of phenotypes, the variants segregated into two categories that also correlated with the severity of the condition they induce. Overall, these two groups mapped to opposite sides of the phosphatase domain. In summary, our findings highlight that not every mutation affecting the catalytic domain impairs OCRL1's enzymatic activity. Importantly, data support the inactive-conformation hypothesis. Finally, our results contribute to establishing the molecular and structural basis for the observed heterogeneity in severity/symptomatology displayed by patients.


Asunto(s)
Síndrome Oculocerebrorrenal , Humanos , Síndrome Oculocerebrorrenal/genética , Monoéster Fosfórico Hidrolasas/genética , Monoéster Fosfórico Hidrolasas/química , Mutación , Mutación Missense , Fenotipo
19.
J Mol Biol ; 434(21): 167820, 2022 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-36089054

RESUMEN

Proteins are key components in many processes in living cells, and physical interactions with other proteins and nucleic acids often form key parts of their functions. In many cases, large flexibility of proteins as they interact is key to their function. To understand the mechanisms of these processes, it is necessary to consider the 3D structures of such protein complexes. When such structures are not yet experimentally determined, protein docking has long been present to computationally generate useful structure models. However, protein docking has long had the limitation that the consideration of flexibility is usually limited to very small movements or very small structures. Methods have been developed which handle minor flexibility via normal mode or other structure sampling, but new methods are required to model ordered proteins which undergo large-scale conformational changes to elucidate their function at the molecular level. Here, we present Flex-LZerD, a framework for docking such complexes. Via partial assembly multidomain docking and an iterative normal mode analysis admitting curvilinear motions, we demonstrate the ability to model the assembly of a variety of protein-protein and protein-nucleic acid complexes.


Asunto(s)
Dominios y Motivos de Interacción de Proteínas , Proteínas , Simulación del Acoplamiento Molecular , Unión Proteica , Proteínas/química
20.
Front Mol Biosci ; 9: 969394, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36090027

RESUMEN

Numerous biological processes in a cell are carried out by protein complexes. To understand the molecular mechanisms of such processes, it is crucial to know the quaternary structures of the complexes. Although the structures of protein complexes have been determined by biophysical experiments at a rapid pace, there are still many important complex structures that are yet to be determined. To supplement experimental structure determination of complexes, many computational protein docking methods have been developed; however, most of these docking methods are designed only for docking with two chains. Here, we introduce a novel method, RL-MLZerD, which builds multiple protein complexes using reinforcement learning (RL). In RL-MLZerD a multi-chain assembly process is considered as a series of episodes of selecting and integrating pre-computed pairwise docking models in a RL framework. RL is effective in correctly selecting plausible pairwise models that fit well with other subunits in a complex. When tested on a benchmark dataset of protein complexes with three to five chains, RL-MLZerD showed better modeling performance than other existing multiple docking methods under different evaluation criteria, except against AlphaFold-Multimer in unbound docking. Also, it emerged that the docking order of multi-chain complexes can be naturally predicted by examining preferred paths of episodes in the RL computation.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA