RESUMEN
Several geometric-based methods have been developed for the last two to three decades to detect and identify cavities (i.e., putative binding sites) on proteins, as needed to study protein-ligand interactions and protein docking. This paper introduces a new protein cavity method, called CavVis, which combines voxelization (i.e., a grid of voxels) and an analytic formulation of Gaussian surfaces that approximates the solvent-excluded surface. This method builds upon visibility of points on protein surface to find its cavities. Specifically, the visibility criterion combines three concepts we borrow from computer graphics, the field-of-view of each surface point, voxel ray casting, and back-face culling.
Asunto(s)
Algoritmos , Modelos Moleculares , Proteínas/química , Proteínas/metabolismo , Ligandos , Distribución Normal , Conformación Proteica , Propiedades de SuperficieRESUMEN
BACKGROUND: Protein cavities play a key role in biomolecular recognition and function, particularly in protein-ligand interactions, as usual in drug discovery and design. Grid-based cavity detection methods aim at finding cavities as aggregates of grid nodes outside the molecule, under the condition that such cavities are bracketed by nodes on the molecule surface along a set of directions (not necessarily aligned with coordinate axes). Therefore, these methods are sensitive to scanning directions, a problem that we call cavity ground-and-walls ambiguity, i.e., they depend on the position and orientation of the protein in the discretized domain. Also, it is hard to distinguish grid nodes belonging to protein cavities amongst all those outside the protein, a problem that we call cavity ceiling ambiguity. RESULTS: We solve those two ambiguity problems using two implicit isosurfaces of the protein, the protein surface itself (called inner isosurface) that excludes all its interior nodes from any cavity, and the outer isosurface that excludes most of its exterior nodes from any cavity. Summing up, the cavities are formed from nodes located between these two isosurfaces. It is worth noting that these two surfaces do not need to be evaluated (i.e., sampled), triangulated, and rendered on the screen to find the cavities in between; their defining analytic functions are enough to determine which grid nodes are in the empty space between them. CONCLUSION: This article introduces a novel geometric algorithm to detect cavities on the protein surface that takes advantage of the real analytic functions describing two Gaussian surfaces of a given protein.
Asunto(s)
Algoritmos , Proteínas/química , Ligandos , Distribución Normal , Proteínas/metabolismo , Propiedades de SuperficieRESUMEN
BACKGROUND: Plasmid DNA molecules are closed circular molecules that are widely used in life sciences, particularly in gene therapy research. Monte Carlo methods have been used for several years to simulate the conformational behavior of DNA molecules. In each iteration these simulation methods randomly generate a new trial conformation, which is either accepted or rejected according to a criterion based on energy calculations and stochastic rules. These simulation trials are generated using a method based on crankshaft motion that, apart from some slight improvements, has remained the same for many years. RESULTS: In this paper, we present a new algorithm for the deformation of plasmid DNA molecules for Monte Carlo simulations. The move underlying our algorithm preserves the size and connectivity of straight-line segments of the plasmid DNA skeleton. We also present the results of three experiments comparing our deformation move with the standard and biased crankshaft moves in terms of acceptance ratio of the trials, energy and temperature evolution, and average displacement of the molecule. Our algorithm can also be used as a generic geometric algorithm for the deformation of regular polygons or polylines that preserves the connections and lengths of their segments. CONCLUSION: Compared with both crankshaft moves, our move generates simulation trials with higher acceptance ratios and smoother deformations, making it suitable for real-time visualization of plasmid DNA coiling. For that purpose, we have adopted a DNA assembly algorithm that uses nucleotides as building blocks.
Asunto(s)
Algoritmos , ADN/química , Modelos Moleculares , Método de Montecarlo , Conformación de Ácido Nucleico , Plásmidos/genética , ADN/metabolismo , MovimientoRESUMEN
Extensive research has been applied to discover new techniques and methods to model protein-ligand interactions. In particular, considerable efforts focused on identifying candidate binding sites, which quite often are active sites that correspond to protein pockets or cavities. Thus, these cavities play an important role in molecular docking. However, there is no established benchmark to assess the accuracy of new cavity detection methods. In practice, each new technique is evaluated using a small set of proteins with known binding sites as ground-truth. However, studies supported by large datasets of known cavities and/or binding sites and statistical classification (i.e., false positives, false negatives, true positives, and true negatives) would yield much stronger and reliable assessments. To this end, we propose CavBench, a generic and extensible benchmark to compare different cavity detection methods relative to diverse ground truth datasets (e.g., PDBsum) using statistical classification methods.
Asunto(s)
Modelos Moleculares , Proteínas/química , Programas Informáticos , Algoritmos , Conformación Proteica , Reproducibilidad de los Resultados , Sensibilidad y EspecificidadRESUMEN
DNA encodes the genetic information of most living beings, except viruses that use RNA. Unlike other types of molecules, DNA is not usually described by its atomic structure being instead usually described by its base-pair sequence, i.e., the textual sequence of its subsidiary molecules known as nucleotides ( adenine (A), cytosine (C), guanine (G), and thymine (T)). The three-dimensional assembling of DNA molecules based on its base-pair sequence has been, for decades, a topic of interest for many research groups all over the world. In this paper, we survey the major methods found in the literature to assemble and visualize DNA molecules from their base-pair sequences. We divided these methods into three categories: predictive methods, adaptive methods, and thermodynamic methods . Predictive methods aim to predict a conformation of the DNA from its base pair sequence, while the goal of adaptive methods is to assemble DNA base-pairs sequences along previously known conformations, as needed in scenarios such as DNA Monte Carlo simulations. Unlike these two geometric methods, thermodynamic methods are energy-based and aim to predict secondary structural motifs of DNA in cases where hydrogen bonds between base pairs might be broken because of temperature changes. We also present the major software tools that implements predictive, adaptive, and thermodynamic methods.