Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
J Chem Inf Model ; 62(7): 1644-1653, 2022 04 11.
Artículo en Inglés | MEDLINE | ID: mdl-35318851

RESUMEN

The Torsion Library is a collection of torsion motifs associated with angle distributions, derived from crystallographic databases. It is used in strain assessment, conformer generation, and geometry optimization. A hierarchical structure of expert curated SMARTS defines the chemical environments of rotatable bonds and associates these with preferred angles. SMARTS can be very complex and full of implications, which make them difficult to maintain manually. Recent developments in automatically comparing SMARTS patterns can be applied to the Torsion Library to ensure its correctness. We specifically discuss the implementation and the limits of such a procedure in the context of torsion motifs and show several examples of how the Torsion Library benefits from this. All automated changes are validated manually and then shown to have an effect on the angle distributions by correcting matching behavior. The corrected Torsion Library itself is available including both PDB as well as CSD histograms in the Supporting Information and can be used to evaluate rotatable bonds at https://torsions.zbh.uni-hamburg.de.


Asunto(s)
Conformación Molecular , Bases de Datos Factuales , Biblioteca de Genes
2.
J Chem Inf Model ; 60(12): 6502-6522, 2020 12 28.
Artículo en Inglés | MEDLINE | ID: mdl-33258376

RESUMEN

Scoring and numerical optimization of protein-ligand poses is an integral part of docking tools. Although many scoring functions exist, many of them are not continuously differentiable and they are rarely explicitly analyzed with respect to their numerical optimization behavior. Here, we present a consistent scheme for pose scoring and gradient-based pose optimization. It consists of a novel variant of the BFGS algorithm enabling step-length control, named LSL-BFGS (limited step length BFGS), and the empirical JAMDA scoring function designed for pose prediction and good numerical optimizability. The JAMDA scoring function shows a high pose prediction performance in the CASF-2016 docking power benchmark, top-ranking a pose with an RMSD of ≤2 Å in about 89% of the cases. The combination of JAMDA scoring with the LSL-BFGS algorithm shows a significantly higher optimization locality (i.e., no excessive movement of poses) than with the classical BFGS algorithm while retaining the characteristically low number of scoring function evaluations. The JAMDA scoring and optimization scheme is freely available for noncommercial use and academic research.


Asunto(s)
Algoritmos , Proteínas , Benchmarking , Ligandos , Simulación del Acoplamiento Molecular , Unión Proteica , Proteínas/metabolismo
3.
Nucleic Acids Res ; 48(W1): W48-W53, 2020 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-32297936

RESUMEN

Due to the increasing amount of publicly available protein structures searching, enriching and investigating these data still poses a challenging task. The ProteinsPlus web service (https://proteins.plus) offers a broad range of tools addressing these challenges. The web interface to the tool collection focusing on protein-ligand interactions has been geared towards easy and intuitive access to a large variety of functionality for life scientists. Since our last publication, the ProteinsPlus web service has been extended by additional services as well as it has undergone substantial infrastructural improvements. A keyword search functionality was added on the start page of ProteinsPlus enabling users to work on structures without knowing their PDB code. The tool collection has been augmented by three tools: StructureProfiler validates ligands and active sites using selection criteria of well-established protein-ligand benchmark data sets, WarPP places water molecules in the ligand binding sites of a protein, and METALizer calculates, predicts and scores coordination geometries of metal ions based on surrounding complex atoms. Additionally, all tools provided by ProteinsPlus are available through a REST service enabling the automated integration in structure processing and modeling pipelines.


Asunto(s)
Proteínas/química , Programas Informáticos , Sitios de Unión , Ligandos , Metales/química , Modelos Moleculares , Proteínas/metabolismo , Agua/química
4.
J Chem Inf Model ; 59(2): 731-742, 2019 02 25.
Artículo en Inglés | MEDLINE | ID: mdl-30747530

RESUMEN

Computer-aided drug design methods such as docking, pharmacophore searching, 3D database searching, and the creation of 3D-QSAR models need conformational ensembles to handle the flexibility of small molecules. Here, we present Conformator, an accurate and effective knowledge-based algorithm for generating conformer ensembles. With 99.9% of all test molecules processed, Conformator stands out by its robustness with respect to input formats, molecular geometries, and the handling of macrocycles. With an extended set of rules for sampling torsion angles, a novel algorithm for macrocycle conformer generation, and a new clustering algorithm for the assembly of conformer ensembles, Conformator reaches a median minimum root-mean-square deviation (measured between protein-bound ligand conformations and ensembles of a maximum of 250 conformers) of 0.47 Å with no significant difference to the highest-ranked commercial algorithm OMEGA and significantly higher accuracy than seven free algorithms, including the RDKit DG algorithm. Conformator is freely available for noncommercial use and academic research.


Asunto(s)
Diseño de Fármacos , Conformación Molecular , Algoritmos , Análisis por Conglomerados , Compuestos Macrocíclicos/química , Modelos Moleculares , Relación Estructura-Actividad Cuantitativa , Factores de Tiempo
5.
Bioinformatics ; 35(5): 874-876, 2019 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-30124779

RESUMEN

MOTIVATION: Three-dimensional protein structures are important starting points for elucidating protein function and applications like drug design. Computational methods in this area rely on high quality validation datasets which are usually manually assembled. Due to the increase in published structures as well as the increasing demand for specially tailored validation datasets, automatic procedures should be adopted. RESULTS: StructureProfiler is a new tool for automatic, objective and customizable profiling of X-ray protein structures based on the most frequently applied selection criteria currently in use to assemble benchmark datasets. As examples, four dataset configurations (Astex, Iridium, Platinum, combined), all results of the combined tests and the list of all PDB Ids passing the combined criteria set are attached in the Supplementary Material. AVAILABILITY AND IMPLEMENTATION: StructureProfiler is available as part of the ProteinsPlus web service http://proteins.plus and as standalone tool in the NAOMI ChemBio Suite. Dataset updates together with the tool can be found on http://www.zbh.uni-hamburg.de/structureprofiler. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Biología Computacional , Diseño de Fármacos , Proteínas
6.
J Chem Inf Model ; 57(10): 2437-2447, 2017 10 23.
Artículo en Inglés | MEDLINE | ID: mdl-28981269

RESUMEN

Macromolecular structures resolved by X-ray crystallography are essential for life science research. While some methods exist to automatically quantify the quality of the electron density fit, none of them is without flaws. Especially the question of how well individual parts like atoms, small fragments, or molecules are supported by electron density is difficult to quantify. While taking experimental uncertainties correctly into account, they do not offer an answer on how reliable an individual atom position is. A rapid quantification of this atomic position reliability would be highly valuable in structure-based molecular design. To overcome this limitation, we introduce the electron density score EDIA for individual atoms and molecular fragments. EDIA assesses rapidly, automatically, and intuitively the fit of individual as well as multiple atoms (EDIAm) into electron density accompanied by an integrated error analysis. The computation is based on the standard 2fo - fc electron density map in combination with the model of the molecular structure. For evaluating partial structures, EDIAm shows significant advantages compared to the real-space R correlation coefficient (RSCC) and the real-space difference density Z score (RSZD) from the molecular modeler's point of view. Thus, EDIA abolishes the time-consuming step of visually inspecting the electron density during structure selection and curation. It supports daily modeling tasks of medicinal and computational chemists and enables a fully automated assembly of large-scale, high-quality structure data sets. Furthermore, EDIA scores can be applied for model validation and method development in computer-aided molecular design. In contrast to measuring the deviation from the structure model by root-mean-squared deviation, EDIA scores allow comparison to the underlying experimental data taking its uncertainty into account.


Asunto(s)
Cristalografía por Rayos X/métodos , Electrones , Modelos Moleculares , Ligandos , Fragmentos de Péptidos/química
7.
J Biotechnol ; 261: 207-214, 2017 Nov 10.
Artículo en Inglés | MEDLINE | ID: mdl-28610996

RESUMEN

Nowadays, computational approaches are an integral part of life science research. Problems related to interpretation of experimental results, data analysis, or visualization tasks highly benefit from the achievements of the digital era. Simulation methods facilitate predictions of physicochemical properties and can assist in understanding macromolecular phenomena. Here, we will give an overview of the methods developed in our group that aim at supporting researchers from all life science areas. Based on state-of-the-art approaches from structural bioinformatics and cheminformatics, we provide software covering a wide range of research questions. Our all-in-one web service platform ProteinsPlus (http://proteins.plus) offers solutions for pocket and druggability prediction, hydrogen placement, structure quality assessment, ensemble generation, protein-protein interaction classification, and 2D-interaction visualization. Additionally, we provide a software package that contains tools targeting cheminformatics problems like file format conversion, molecule data set processing, SMARTS editing, fragment space enumeration, and ligand-based virtual screening. Furthermore, it also includes structural bioinformatics solutions for inverse screening, binding site alignment, and searching interaction patterns across structure libraries. The software package is available at http://software.zbh.uni-hamburg.de.


Asunto(s)
Biología Computacional , Internet , Programas Informáticos , Bases de Datos de Proteínas
8.
J Med Chem ; 60(10): 4245-4257, 2017 05 25.
Artículo en Inglés | MEDLINE | ID: mdl-28497966

RESUMEN

Protein-ligand interactions are the fundamental basis for molecular design in pharmaceutical research, biocatalysis, and agrochemical development. Especially hydrogen bonds are known to have special geometric requirements and therefore deserve a detailed analysis. In modeling approaches a more general description of hydrogen bond geometries, using distance and directionality, is applied. A first study of their geometries was performed based on 15 protein structures in 1982. Currently there are about 95 000 protein-ligand structures available in the PDB, providing a solid foundation for a new large-scale statistical analysis. Here, we report a comprehensive investigation of geometric and functional properties of hydrogen bonds. Out of 22 defined functional groups, eight are fully in accordance with theoretical predictions while 14 show variations from expected values. On the basis of these results, we derived interaction geometries to improve current computational models. It is expected that these observations will be useful in designing new chemical structures for biological applications.


Asunto(s)
Proteínas/metabolismo , Animales , Bases de Datos de Proteínas , Descubrimiento de Drogas , Humanos , Enlace de Hidrógeno , Ligandos , Simulación del Acoplamiento Molecular , Proteínas/química
9.
Proteins ; 85(8): 1550-1566, 2017 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-28486771

RESUMEN

Reliable computational prediction of protein side chain conformations and the energetic impact of amino acid mutations are the key aspects for the optimization of biotechnologically relevant enzymatic reactions using structure-based design. By improving the protein stability, higher yields can be achieved. In addition, tuning the substrate selectivity of an enzymatic reaction by directed mutagenesis can lead to higher turnover rates. This work presents a novel approach to predict the conformation of a side chain mutation along with the energetic effect on the protein structure. The HYDE scoring concept applied here describes the molecular interactions primarily by evaluating the effect of dehydration and hydrogen bonding on molecular structures in aqueous solution. Here, we evaluate its capability of side-chain conformation prediction in classic remutation experiments. Furthermore, we present a new data set for evaluating "cross-mutations," a new experiment that resembles real-world application scenarios more closely. This data set consists of protein pairs with up to five point mutations. Thus, structural changes are attributed to point mutations only. In the cross-mutation experiment, the original protein structure is mutated with the aim to predict the structure of the side chain as in the paired mutated structure. The comparison of side chain conformation prediction ("remutation") showed that the performance of HYDEprotein is qualitatively comparable to state-of-the art methods. The ability of HYDEprotein to predict the energetic effect of a mutation is evaluated in the third experiment. Herein, the effect on protein stability is predicted correctly in 70% of the evaluated cases. Proteins 2017; 85:1550-1566. © 2017 Wiley Periodicals, Inc.


Asunto(s)
Aminoácidos/química , Mutación Puntual , Agua/química , beta-Glucosidasa/química , Sustitución de Aminoácidos , Aminoácidos/genética , Desecación , Humanos , Enlace de Hidrógeno , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Estabilidad Proteica , Programas Informáticos , Soluciones , Relación Estructura-Actividad , Termodinámica , beta-Glucosidasa/genética
10.
Nucleic Acids Res ; 45(W1): W337-W343, 2017 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-28472372

RESUMEN

With currently more than 126 000 publicly available structures and an increasing growth rate, the Protein Data Bank constitutes a rich data source for structure-driven research in fields like drug discovery, crop science and biotechnology in general. Typical workflows in these areas involve manifold computational tools for the analysis and prediction of molecular functions. Here, we present the ProteinsPlus web server that offers a unified easy-to-use interface to a broad range of tools for the early phase of structure-based molecular modeling. This includes solutions for commonly required pre-processing tasks like structure quality assessment (EDIA), hydrogen placement (Protoss) and the search for alternative conformations (SIENA). Beyond that, it also addresses frequent problems as the generation of 2D-interaction diagrams (PoseView), protein-protein interface classification (HyPPI) as well as automatic pocket detection and druggablity assessment (DoGSiteScorer). The unified ProteinsPlus interface covering all featured approaches provides various facilities for intuitive input and result visualization, case-specific parameterization and download options for further processing. Moreover, its generalized workflow allows the user a quick familiarization with the different tools. ProteinsPlus also stores the calculated results temporarily for future request and thus facilitates convenient result communication and re-access. The server is freely available at http://proteins.plus.


Asunto(s)
Conformación Proteica , Programas Informáticos , Sitios de Unión , Hidrógeno/química , Internet , Ligandos , Modelos Moleculares , Mapeo de Interacción de Proteínas , Proteínas/química
11.
J Chem Inf Model ; 57(3): 529-539, 2017 03 27.
Artículo en Inglés | MEDLINE | ID: mdl-28206754

RESUMEN

We developed a cheminformatics pipeline for the fully automated selection and extraction of high-quality protein-bound ligand conformations from X-ray structural data. The pipeline evaluates the validity and accuracy of the 3D structures of small molecules according to multiple criteria, including their fit to the electron density and their physicochemical and structural properties. Using this approach, we compiled two high-quality datasets from the Protein Data Bank (PDB): a comprehensive dataset and a diversified subset of 4626 and 2912 structures, respectively. The datasets were applied to benchmarking seven freely available conformer ensemble generators: Balloon (two different algorithms), the RDKit standard conformer ensemble generator, the Experimental-Torsion basic Knowledge Distance Geometry (ETKDG) algorithm, Confab, Frog2 and Multiconf-DOCK. Substantial differences in the performance of the individual algorithms were observed, with RDKit and ETKDG generally achieving a favorable balance of accuracy, ensemble size and runtime. The Platinum datasets are available for download from http://www.zbh.uni-hamburg.de/platinum_dataset .


Asunto(s)
Diseño de Fármacos , Informática/métodos , Benchmarking , Ligandos , Modelos Moleculares , Conformación Molecular , Platino (Metal)/química , Platino (Metal)/metabolismo , Proteínas/metabolismo , Factores de Tiempo
12.
J Chem Inf Model ; 56(1): 1-5, 2016 Jan 25.
Artículo en Inglés | MEDLINE | ID: mdl-26679290

RESUMEN

The Torsion Library contains hundreds of rules for small molecule conformations which have been derived from the Cambridge Structural Database (CSD) and are curated by molecular design experts. The torsion rules are encoded as SMARTS patterns and categorize rotatable bonds via a traffic light coloring scheme. We have systematically revised all torsion rules to better identify highly strained conformations and minimize the number of false alerts for CSD small molecule X-ray structures. For this new release, we added or substantially modified 78 torsion patterns and reviewed all angles and tolerance intervals. The overall number of red alerts for a filtered CSD data set with 130 000 structures was reduced by a factor of 4 compared to the predecessor. This is of clear advantage in 3D virtual screening where hits should only be removed by a conformational filter if they are in energetically inaccessible conformations.


Asunto(s)
Biología Computacional/métodos , Conformación Molecular , Bibliotecas de Moléculas Pequeñas/química , Bases de Datos Farmacéuticas , Diseño de Fármacos , Modelos Moleculares
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA