Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 51
Filtrar
1.
PLoS Comput Biol ; 17(8): e1008844, 2021 08.
Artigo em Inglês | MEDLINE | ID: mdl-34370723

RESUMO

Many biological processes are mediated by protein-protein interactions (PPIs). Because protein domains are the building blocks of proteins, PPIs likely rely on domain-domain interactions (DDIs). Several attempts exist to infer DDIs from PPI networks but the produced datasets are heterogeneous and sometimes not accessible, while the PPI interactome data keeps growing. We describe a new computational approach called "PPIDM" (Protein-Protein Interactions Domain Miner) for inferring DDIs using multiple sources of PPIs. The approach is an extension of our previously described "CODAC" (Computational Discovery of Direct Associations using Common neighbors) method for inferring new edges in a tripartite graph. The PPIDM method has been applied to seven widely used PPI resources, using as "Gold-Standard" a set of DDIs extracted from 3D structural databases. Overall, PPIDM has produced a dataset of 84,552 non-redundant DDIs. Statistical significance (p-value) is calculated for each source of PPI and used to classify the PPIDM DDIs in Gold (9,175 DDIs), Silver (24,934 DDIs) and Bronze (50,443 DDIs) categories. Dataset comparison reveals that PPIDM has inferred from the 2017 releases of PPI sources about 46% of the DDIs present in the 2020 release of the 3did database, not counting the DDIs present in the Gold-Standard. The PPIDM dataset contains 10,229 DDIs that are consistent with more than 13,300 PPIs extracted from the IMEx database, and nearly 23,300 DDIs (27.5%) that are consistent with more than 214,000 human PPIs extracted from the STRING database. Examples of newly inferred DDIs covering more than 10 PPIs in the IMEx database are provided. Further exploitation of the PPIDM DDI reservoir includes the inventory of possible partners of a protein of interest and characterization of protein interactions at the domain level in combination with other methods. The result is publicly available at http://ppidm.loria.fr/.


Assuntos
Domínios e Motivos de Interação entre Proteínas , Mapeamento de Interação de Proteínas/estatística & dados numéricos , Mapas de Interação de Proteínas , Algoritmos , Biologia Computacional , Mineração de Dados/estatística & dados numéricos , Bases de Dados de Proteínas/estatística & dados numéricos , Humanos , Software
2.
Nat Methods ; 15(1): 67-72, 2018 01.
Artigo em Inglês | MEDLINE | ID: mdl-29155427

RESUMO

Protein structures are key to understanding biomolecular mechanisms and diseases, yet their interpretation is hampered by limited knowledge of their biologically relevant quaternary structure (QS). A critical challenge in inferring QS information from crystallographic data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we tackled this problem by developing strategies for aligning and comparing QS states across both homologs and data repositories. QS conservation across homologs proved remarkably strong at predicting biological relevance and is implemented in two methods, QSalign and anti-QSalign, for annotating homo-oligomers and monomers, respectively. QS conservation across repositories is implemented in QSbio (http://www.QSbio.org), which approaches the accuracy of manual curation and allowed us to predict >100,000 QS states across the Protein Data Bank. Based on this high-quality data set, we analyzed pairs of structurally conserved interfaces, and this analysis revealed a striking plasticity whereby evolutionary distant interfaces maintain similar interaction geometries through widely divergent chemical properties.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Multimerização Proteica , Estrutura Quaternária de Proteína , Proteínas/química , Humanos , Modelos Moleculares , Ligação Proteica
4.
Proteins ; 88(8): 1121-1128, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32506478

RESUMO

Protein docking algorithms aim to predict the 3D structure of a protein complex from the structures of its separated components. In the past, most docking algorithms focused on docking pairs of proteins to form dimeric complexes. However, attention is now turning towards the more difficult problem of using docking methods to predict the structures of multicomponent complexes. In both cases, however, the constituent proteins often change conformation upon complex formation, and this can cause many algorithms to fail to detect near-native binding orientations due to the high number of atomic steric clashes in the list of candidate solutions. An increasingly popular way to retain more near-native orientations is to define one or more restraints that serve to modulate or override the effect of steric clashes. Here, we present an updated version of our "EROS-DOCK" docking algorithm which has been extended to dock arbitrary dimeric and trimeric complexes, and to allow the user to define residue-residue or atom-atom interaction restraints. Our results show that using even just one residue-residue restraint in each interaction interface is sufficient to increase the number of cases with acceptable solutions within the top 10 from 51 to 121 out of 173 pairwise docking cases, and to successfully dock 8 out of 11 trimeric complexes.


Assuntos
Algoritmos , Simulação de Acoplamento Molecular , Peptídeos/química , Proteínas/química , Software , Sequência de Aminoácidos , Sítios de Ligação , Humanos , Ligantes , Peptídeos/metabolismo , Ligação Proteica , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Domínios e Motivos de Interação entre Proteínas , Mapeamento de Interação de Proteínas , Multimerização Proteica , Proteínas/metabolismo , Projetos de Pesquisa , Homologia Estrutural de Proteína , Termodinâmica
5.
Bioinformatics ; 35(23): 5003-5010, 2019 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-31125060

RESUMO

MOTIVATION: Protein-protein docking algorithms aim to predict the 3D structure of a binary complex using the structures of the individual proteins. This typically involves searching and scoring in a 6D space. Many docking algorithms use FFT techniques to exhaustively cover the search space and to accelerate the scoring calculation. However, FFT docking results often depend on the initial protein orientations with respect to the Fourier sampling grid. Furthermore, Fourier-transforming a physics-base force field can involve a serious loss of precision. RESULTS: Here, we present EROS-DOCK, an algorithm to rigidly dock two proteins using a series of exhaustive 3D rotational searches in which non-clashing orientations are scored using the ATTRACT coarse-grained force field model. The rotational space is represented as a quaternion 'π-ball', which is systematically sub-divided in a 'branch-and-bound' manner, allowing efficient pruning of rotations that will give steric clashes. The algorithm was tested on 173 Docking Benchmark complexes, and results were compared with those of ATTRACT and ZDOCK. According to the CAPRI quality criteria, EROS-DOCK typically gives more acceptable or medium quality solutions than ATTRACT and ZDOCK. AVAILABILITY AND IMPLEMENTATION: The EROS-DOCK program is available for download at http://erosdock.loria.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Simulação de Acoplamento Molecular , Ligação Proteica , Conformação Proteica , Proteínas
6.
Proc Natl Acad Sci U S A ; 113(30): E4286-93, 2016 07 26.
Artigo em Inglês | MEDLINE | ID: mdl-27412858

RESUMO

Energy evaluation using fast Fourier transforms (FFTs) enables sampling billions of putative complex structures and hence revolutionized rigid protein-protein docking. However, in current methods, efficient acceleration is achieved only in either the translational or the rotational subspace. Developing an efficient and accurate docking method that expands FFT-based sampling to five rotational coordinates is an extensively studied but still unsolved problem. The algorithm presented here retains the accuracy of earlier methods but yields at least 10-fold speedup. The improvement is due to two innovations. First, the search space is treated as the product manifold [Formula: see text], where [Formula: see text] is the rotation group representing the space of the rotating ligand, and [Formula: see text] is the space spanned by the two Euler angles that define the orientation of the vector from the center of the fixed receptor toward the center of the ligand. This representation enables the use of efficient FFT methods developed for [Formula: see text] Second, we select the centers of highly populated clusters of docked structures, rather than the lowest energy conformations, as predictions of the complex, and hence there is no need for very high accuracy in energy evaluation. Therefore, it is sufficient to use a limited number of spherical basis functions in the Fourier space, which increases the efficiency of sampling while retaining the accuracy of docking results. A major advantage of the method is that, in contrast to classical approaches, increasing the number of correlation function terms is computationally inexpensive, which enables using complex energy functions for scoring.


Assuntos
Algoritmos , Análise de Fourier , Simulação de Acoplamento Molecular/métodos , Conformação Proteica , Proteínas/química , Espectroscopia de Ressonância Magnética/métodos , Ligação Proteica , Proteínas/metabolismo , Reprodutibilidade dos Testes , Rotação , Termodinâmica
7.
BMC Bioinformatics ; 19(Suppl 14): 413, 2018 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-30453875

RESUMO

BACKGROUND: Families of related proteins and their different functions may be described systematically using common classifications and ontologies such as Pfam and GO (Gene Ontology), for example. However, many proteins consist of multiple domains, and each domain, or some combination of domains, can be responsible for a particular molecular function. Therefore, identifying which domains should be associated with a specific function is a non-trivial task. RESULTS: We describe a general approach for the computational discovery of associations between different sets of annotations by formalising the problem as a bipartite graph enrichment problem in the setting of a tripartite graph. We call this approach "CODAC" (for COmputational Discovery of Direct Associations using Common Neighbours). As one application of this approach, we describe "GODomainMiner" for associating GO terms with protein domains. We used GODomainMiner to predict GO-domain associations between each of the 3 GO ontology namespaces (MF, BP, and CC) and the Pfam, CATH, and SCOP domain classifications. Overall, GODomainMiner yields average enrichments of 15-, 41- and 25-fold GO-domain associations compared to the existing GO annotations in these 3 domain classifications, respectively. CONCLUSIONS: These associations could potentially be used to annotate many of the protein chains in the Protein Databank and protein sequences in UniProt whose domain composition is known but which currently lack GO annotation.


Assuntos
Biologia Computacional/métodos , Ontologia Genética , Proteínas/química , Algoritmos , Sequência de Aminoácidos , Área Sob a Curva , Bases de Dados de Proteínas , Anotação de Sequência Molecular , Domínios Proteicos
8.
BMC Bioinformatics ; 18(1): 107, 2017 Feb 13.
Artigo em Inglês | MEDLINE | ID: mdl-28193156

RESUMO

BACKGROUND: Many entries in the protein data bank (PDB) are annotated to show their component protein domains according to the Pfam classification, as well as their biological function through the enzyme commission (EC) numbering scheme. However, despite the fact that the biological activity of many proteins often arises from specific domain-domain and domain-ligand interactions, current on-line resources rarely provide a direct mapping from structure to function at the domain level. Since the PDB now contains many tens of thousands of protein chains, and since protein sequence databases can dwarf such numbers by orders of magnitude, there is a pressing need to develop automatic structure-function annotation tools which can operate at the domain level. RESULTS: This article presents ECDomainMiner, a novel content-based filtering approach to automatically infer associations between EC numbers and Pfam domains. ECDomainMiner finds a total of 20,728 non-redundant EC-Pfam associations with a F-measure of 0.95 with respect to a "Gold Standard" test set extracted from InterPro. Compared to the 1515 manually curated EC-Pfam associations in InterPro, ECDomainMiner infers a 13-fold increase in the number of EC-Pfam associations. CONCLUSION: These EC-Pfam associations could be used to annotate some 58,722 protein chains in the PDB which currently lack any EC annotation. The ECDomainMiner database is publicly available at http://ecdm.loria.fr/ .


Assuntos
Biologia Computacional/métodos , Mineração de Dados/métodos , Bases de Dados de Proteínas , Enzimas , Proteínas , Enzimas/química , Enzimas/genética , Enzimas/metabolismo , Proteínas/química , Proteínas/genética , Proteínas/metabolismo
9.
Proteins ; 85(3): 463-469, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-27701764

RESUMO

Many of the modeling targets in the blind CASP-11/CAPRI-30 experiment were protein homo-dimers and homo-tetramers. Here, we perform a retrospective docking-based analysis of the perfectly symmetrical CAPRI Round 30 targets whose crystal structures have been published. Starting from the CASP "stage-2" fold prediction models, we show that using our recently developed "SAM" polar Fourier symmetry docking algorithm combined with NAMD energy minimization often gives acceptable or better 3D models of the target complexes. We also use SAM to analyze the overall quality of all CASP structural models for the selected targets from a docking-based perspective. We demonstrate that docking only CASP "center" structures for the selected targets provides a fruitful and economical docking strategy. Furthermore, our results show that many of the CASP models are dockable in the sense that they can lead to acceptable or better models of symmetrical complexes. Even though SAM is very fast, using docking and NAMD energy minimization to pull out acceptable docking models from a large ensemble of docked CASP models is computationally expensive. Nonetheless, thanks to our SAM docking algorithm, we expect that applying our docking protocol on a modern computer cluster will give us the ability to routinely model 3D structures of symmetrical protein complexes from CASP-quality models. Proteins 2017; 85:463-469. © 2016 Wiley Periodicals, Inc.


Assuntos
Algoritmos , Biologia Computacional/métodos , Simulação de Acoplamento Molecular/métodos , Proteínas/química , Software , Motivos de Aminoácidos , Benchmarking , Sítios de Ligação , Cristalografia por Raios X , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas , Multimerização Proteica , Projetos de Pesquisa , Homologia Estrutural de Proteína , Termodinâmica
10.
Bioinformatics ; 32(17): 2650-8, 2016 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-27187202

RESUMO

MOTIVATION: Calculating multiple protein structure alignments (MSAs) is important for understanding functional and evolutionary relationships between protein families, and for modeling protein structures by homology. While incorporating backbone flexibility promises to circumvent many of the limitations of rigid MSA algorithms, very few flexible MSA algorithms exist today. This article describes several novel improvements to the Kpax algorithm which allow high quality flexible MSAs to be calculated. This article also introduces a new Gaussian-based MSA quality measure called 'M-score', which circumvents the pitfalls of RMSD-based quality measures. RESULTS: As well as calculating flexible MSAs, the new version of Kpax can also score MSAs from other aligners and from previously aligned reference datasets. Results are presented for a large-scale evaluation of the Homstrad, SABmark and SISY benchmark sets using Kpax and Matt as examples of state-of-the-art flexible aligners and 3DCOMB as an example of a state-of-the-art rigid aligner. These results demonstrate the utility of the M-score as a measure of MSA quality and show that high quality MSAs may be achieved when structural flexibility is properly taken into account. AVAILABILITY AND IMPLEMENTATION: Kpax 5.0 may be downloaded for academic use at http://kpax.loria.fr/ CONTACT: dave.ritchie@inria.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Proteínas , Alinhamento de Sequência , Algoritmos , Modelos Estatísticos , Distribuição Normal , Software
11.
Bioinformatics ; 32(17): i693-i701, 2016 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-27587691

RESUMO

MOTIVATION: Docking prediction algorithms aim to find the native conformation of a complex of proteins from knowledge of their unbound structures. They rely on a combination of sampling and scoring methods, adapted to different scales. Polynomial Expansion of Protein Structures and Interactions for Docking (PEPSI-Dock) improves the accuracy of the first stage of the docking pipeline, which will sharpen up the final predictions. Indeed, PEPSI-Dock benefits from the precision of a very detailed data-driven model of the binding free energy used with a global and exhaustive rigid-body search space. As well as being accurate, our computations are among the fastest by virtue of the sparse representation of the pre-computed potentials and FFT-accelerated sampling techniques. Overall, this is the first demonstration of a FFT-accelerated docking method coupled with an arbitrary-shaped distance-dependent interaction potential. RESULTS: First, we present a novel learning process to compute data-driven distant-dependent pairwise potentials, adapted from our previous method used for rescoring of putative protein-protein binding poses. The potential coefficients are learned by combining machine-learning techniques with physically interpretable descriptors. Then, we describe the integration of the deduced potentials into a FFT-accelerated spherical sampling provided by the Hex library. Overall, on a training set of 163 heterodimers, PEPSI-Dock achieves a success rate of 91% mid-quality predictions in the top-10 solutions. On a subset of the protein docking benchmark v5, it achieves 44.4% mid-quality predictions in the top-10 solutions when starting from bound structures and 20.5% when starting from unbound structures. The method runs in 5-15 min on a modern laptop and can easily be extended to other types of interactions. AVAILABILITY AND IMPLEMENTATION: https://team.inria.fr/nano-d/software/PEPSI-Dock CONTACT: sergei.grudinin@inria.fr.


Assuntos
Algoritmos , Modelos Teóricos , Conformação Molecular , Ligação Proteica , Aprendizado de Máquina , Simulação de Acoplamento Molecular , Conformação Proteica , Proteínas
12.
Proteins ; 84 Suppl 1: 323-48, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27122118

RESUMO

We present the results for CAPRI Round 30, the first joint CASP-CAPRI experiment, which brought together experts from the protein structure prediction and protein-protein docking communities. The Round comprised 25 targets from amongst those submitted for the CASP11 prediction experiment of 2014. The targets included mostly homodimers, a few homotetramers, and two heterodimers, and comprised protein chains that could readily be modeled using templates from the Protein Data Bank. On average 24 CAPRI groups and 7 CASP groups submitted docking predictions for each target, and 12 CAPRI groups per target participated in the CAPRI scoring experiment. In total more than 9500 models were assessed against the 3D structures of the corresponding target complexes. Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations. Targets with ambiguous or inaccurate oligomeric state assignments, often featuring crystal contact-sized interfaces, represented a confounding factor. For those, a much poorer prediction performance was achieved, while nonetheless often providing helpful clues on the correct oligomeric state of the protein. The prediction performance was very poor for genuine tetrameric targets, where the inaccuracy of the homology-built subunit models and the smaller pair-wise interfaces severely limited the ability to derive the correct assembly mode. Our analysis also shows that docking procedures tend to perform better than standard homology modeling techniques and that highly accurate models of the protein components are not always required to identify their association modes with acceptable accuracy. Proteins 2016; 84(Suppl 1):323-348. © 2016 Wiley Periodicals, Inc.


Assuntos
Biologia Computacional/estatística & dados numéricos , Modelos Estatísticos , Simulação de Acoplamento Molecular , Simulação de Dinâmica Molecular , Proteínas/química , Software , Algoritmos , Motivos de Aminoácidos , Bactérias/química , Sítios de Ligação , Biologia Computacional/métodos , Humanos , Cooperação Internacional , Internet , Ligação Proteica , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Multimerização Proteica , Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos , Termodinâmica
13.
Nucleic Acids Res ; 42(Database issue): D389-95, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24271397

RESUMO

Comparing, classifying and modelling protein structural interactions can enrich our understanding of many biomolecular processes. This contribution describes Kbdock (http://kbdock.loria.fr/), a database system that combines the Pfam domain classification with coordinate data from the PDB to analyse and model 3D domain-domain interactions (DDIs). Kbdock can be queried using Pfam domain identifiers, protein sequences or 3D protein structures. For a given query domain or pair of domains, Kbdock retrieves and displays a non-redundant list of homologous DDIs or domain-peptide interactions in a common coordinate frame. Kbdock may also be used to search for and visualize interactions involving different, but structurally similar, Pfam families. Thus, structural DDI templates may be proposed even when there is little or no sequence similarity to the query domains.


Assuntos
Bases de Dados de Proteínas , Domínios e Motivos de Interação entre Proteínas , Sítios de Ligação , Internet , Modelos Moleculares , Simulação de Acoplamento Molecular , Mapeamento de Interação de Proteínas , Mapas de Interação de Proteínas , Proteínas/classificação , Alinhamento de Sequência , Análise de Sequência de Proteína
14.
J Chem Inf Model ; 55(9): 1804-23, 2015 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-26251970

RESUMO

The in silico prediction of unwanted side effects (SEs) caused by the promiscuous behavior of drugs and their targets is highly relevant to the pharmaceutical industry. Considerable effort is now being put into computational and experimental screening of several suspected off-target proteins in the hope that SEs might be identified early, before the cost associated with developing a drug candidate rises steeply. Following this need, we present a new method called GESSE to predict potential SEs of drugs from their physicochemical properties (three-dimensional shape plus chemistry) and to target protein data extracted from predicted drug-target relationships. The GESSE approach uses a canonical correlation analysis of the full drug-target and drug-SE matrices, and it then calculates a probability that each drug in the resulting drug-target matrix will have a given SE using a Bayesian discriminant analysis (DA) technique. The performance of GESSE is quantified using retrospective (external database) analysis and literature examples by means of area under the ROC curve analysis, "top hit rates", misclassification rates, and a χ(2) independence test. Overall, the robust and very promising retrospective statistics obtained and the many SE predictions that have experimental corroboration demonstrate that GESSE can successfully predict potential drug-SE profiles of candidate drug compounds from their predicted drug-target relationships.


Assuntos
Sistemas de Liberação de Medicamentos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Curva ROC , Estudos Retrospectivos
15.
Proteins ; 82(1): 34-44, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23775700

RESUMO

In spite of the abundance of oligomeric proteins within a cell, the structural characterization of protein-protein interactions is still a challenging task. In particular, many of these interactions involve heteromeric complexes, which are relatively difficult to determine experimentally. Hence there is growing interest in using computational techniques to model such complexes. However, assembling large heteromeric complexes computationally is a highly combinatorial problem. Nonetheless the problem can be simplified greatly by considering interactions between protein trimers. After dimers and monomers, triangular trimers (i.e. trimers with pair-wise contacts between all three pairs of proteins) are the most frequently observed quaternary structural motifs according to the three-dimensional (3D) complex database. This article presents DockTrina, a novel protein docking method for modeling the 3D structures of nonsymmetrical triangular trimers. The method takes as input pair-wise contact predictions from a rigid body docking program. It then scans and scores all possible combinations of pairs of monomers using a very fast root mean square deviation test. Finally, it ranks the predictions using a scoring function which combines triples of pair-wise contact terms and a geometric clash penalty term. The overall approach takes less than 2 min per complex on a modern desktop computer. The method is tested and validated using a benchmark set of 220 bound and seven unbound protein trimer structures. DockTrina will be made available at http://nano-d.inrialpes.fr/software/docktrina.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Moleculares , Simulação de Acoplamento Molecular , Complexos Multiproteicos/química , Multimerização Proteica/genética , Software
16.
Proteins ; 82(4): 620-32, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24155158

RESUMO

We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47), were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions-20 groups submitted a total of 195 models-were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models-a very good docking performance per se-only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 Å, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall the best interface water predictions was achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.


Assuntos
Colicinas/química , Mapeamento de Interação de Proteínas , Água/química , Algoritmos , Biologia Computacional , Modelos Moleculares , Simulação de Acoplamento Molecular , Conformação Proteica
17.
J Chem Inf Model ; 54(3): 720-34, 2014 Mar 24.
Artigo em Inglês | MEDLINE | ID: mdl-24494653

RESUMO

Polypharmacology is now recognized as an increasingly important aspect of drug design. We previously introduced the Gaussian ensemble screening (GES) approach to predict relationships between drug classes rapidly without requiring thousands of bootstrap comparisons as in current promiscuity prediction approaches. Here we present the GES "computational polypharmacology fingerprint" (CPF), the first target fingerprint to encode drug promiscuity information. The similarity between the 3D shapes and chemical properties of ligands is calculated using PARAFIT and our HPCC programs to give a consensus shape-plus-chemistry ligand similarity score, and ligand promiscuity for a given set of targets is quantified using the GES fingerprints. To demonstrate our approach, we calculated the CPFs for a set of ligands from DrugBank that are related to some 800 targets. The performance of the approach was measured by comparing our CPF with an in-house "experimental polypharmacology fingerprint" (EPF) built using publicly available experimental data for the targets that comprise the fingerprint. Overall, the GES CPF gives very low fall-out while still giving high precision. We present examples of polypharmacology relationships predicted by our approach that have been experimentally validated. This demonstrates that our CPF approach can successfully describe drug-target relationships and can serve as a novel drug repurposing method for proposing new targets for preclinical compounds and clinical drug candidates.


Assuntos
Desenho de Fármacos , Reposicionamento de Medicamentos/métodos , Preparações Farmacêuticas/química , Bases de Dados de Produtos Farmacêuticos , Ligantes , Modelos Moleculares , Distribuição Normal , Polifarmacologia
18.
J Struct Biol ; 184(2): 348-54, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24060989

RESUMO

Fitting high resolution protein structures into low resolution cryo-electron microscopy (cryo-EM) density maps is an important technique for modeling the atomic structures of very large macromolecular assemblies. This article presents "gEMfitter", a highly parallel fast Fourier transform (FFT) EM density fitting program which can exploit the special hardware properties of modern graphics processor units (GPUs) to accelerate both the translational and rotational parts of the correlation search. In particular, by using the GPU's special texture memory hardware to rotate 3D voxel grids, the cost of rotating large 3D density maps is almost completely eliminated. Compared to performing 3D correlations on one core of a contemporary central processor unit (CPU), running gEMfitter on a modern GPU gives up to 26-fold speed-up. Furthermore, using our parallel processing framework, this speed-up increases linearly with the number of CPUs or GPUs used. Thus, it is now possible to use routinely more robust but more expensive 3D correlation techniques. When tested on low resolution experimental cryo-EM data for the GroEL-GroES complex, we demonstrate the satisfactory fitting results that may be achieved by using a locally normalised cross-correlation with a Laplacian pre-filter, while still being up to three orders of magnitude faster than the well-known COLORES program.


Assuntos
Imageamento Tridimensional , Software , Algoritmos , Chaperonina 60/química , Chaperonina 60/ultraestrutura , Microscopia Crioeletrônica , Análise de Fourier , Modelos Moleculares , Estrutura Quaternária de Proteína , Recombinases Rec A/química , Recombinases Rec A/ultraestrutura
19.
Proteins ; 81(12): 2150-8, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24123156

RESUMO

Protein docking algorithms aim to calculate the three-dimensional (3D) structure of a protein complex starting from its unbound components. Although ab initio docking algorithms are improving, there is a growing need to use homology modeling techniques to exploit the rapidly increasing volumes of structural information that now exist. However, most current homology modeling approaches involve finding a pair of complete single-chain structures in a homologous protein complex to use as a 3D template, despite the fact that protein complexes are often formed from one or more domain-domain interactions (DDIs). To model 3D protein complexes by domain-domain homology, we have developed a case-based reasoning approach called KBDOCK which systematically identifies and reuses domain family binding sites from our database of nonredundant DDIs. When tested on 54 protein complexes from the Protein Docking Benchmark, our approach provides a near-perfect way to model single-domain protein complexes when full-homology templates are available, and it extends our ability to model more difficult cases when only partial or incomplete templates exist. These promising early results highlight the need for a new and diverse docking benchmark set, specifically designed to assess homology docking approaches.


Assuntos
Biologia Computacional/métodos , Mapeamento de Interação de Proteínas/métodos , Proteínas/química , Algoritmos , Sítios de Ligação , Bases de Dados de Proteínas , Modelos Moleculares , Simulação de Acoplamento Molecular , Linguagens de Programação , Conformação Proteica , Software
20.
BMC Struct Biol ; 13: 25, 2013 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-24144335

RESUMO

BACKGROUND: Picking images of particles in cryo-electron micrographs is an important step in solving the 3D structures of large macromolecular assemblies. However, in order to achieve sub-nanometre resolution it is often necessary to capture and process many thousands or even several millions of 2D particle images. Thus, a computational bottleneck in reaching high resolution is the accurate and automatic picking of particles from raw cryo-electron micrographs. RESULTS: We have developed "gEMpicker", a highly parallel correlation-based particle picking tool. To our knowledge, gEMpicker is the first particle picking program to use multiple graphics processor units (GPUs) to accelerate the calculation. When tested on the publicly available keyhole limpet hemocyanin dataset, we find that gEMpicker gives similar results to the FindEM program. However, compared to calculating correlations on one core of a contemporary central processor unit (CPU), running gEMpicker on a modern GPU gives a speed-up of about 27 ×. To achieve even higher processing speeds, the basic correlation calculations are accelerated considerably by using a hierarchy of parallel programming techniques to distribute the calculation over multiple GPUs and CPU cores attached to multiple nodes of a computer cluster. By using a theoretically optimal reduction algorithm to collect and combine the cluster calculation results, the speed of the overall calculation scales almost linearly with the number of cluster nodes available. CONCLUSIONS: The very high picking throughput that is now possible using GPU-powered workstations or computer clusters will help experimentalists to achieve higher resolution 3D reconstructions more rapidly than before.


Assuntos
Microscopia Crioeletrônica/métodos , Hemocianinas/química , Processamento de Imagem Assistida por Computador/métodos , Software , Algoritmos , Imageamento Tridimensional , Reconhecimento Automatizado de Padrão
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa