Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Bioinformatics ; 30(4): 593-5, 2014 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-24336804

RESUMEN

SUMMARY: Modern scientific investigation is generating increasingly larger datasets, yet analyzing these data with current tools is challenging. DIVE is a software framework intended to facilitate big data analysis and reduce the time to scientific insight. Here, we present features of the framework and demonstrate DIVE's application to the Dynameomics project, looking specifically at two proteins. AVAILABILITY AND IMPLEMENTATION: Binaries and documentation are available at http://www.dynameomics.org/DIVE/DIVESetup.exe.


Asunto(s)
Biología Computacional/métodos , Gráficos por Computador , Documentación/métodos , Proteínas Mutantes/metabolismo , Programas Informáticos , Simulación por Computador , Humanos , Proteínas Mutantes/genética , Mutación/genética , Superóxido Dismutasa/genética , Superóxido Dismutasa/metabolismo , Superóxido Dismutasa-1 , Proteína p53 Supresora de Tumor/genética , Proteína p53 Supresora de Tumor/metabolismo
2.
Biophys J ; 101(8): 2053-60, 2011 Oct 19.
Artículo en Inglés | MEDLINE | ID: mdl-22004760

RESUMEN

The folding pathway of the small α/ß protein GB1 has been extensively studied during the past two decades using both theoretical and experimental approaches. These studies provided a consensus view that the protein folds in a two-state manner. Here, we reassessed the folding of GB1, both by experiments and simulations, and detected the presence of an on-pathway intermediate. This intermediate has eluded earlier experimental characterization and is distinct from the collapsed state previously identified using ultrarapid mixing. Failure to identify the presence of an intermediate affects some of the conclusions that have been drawn for GB1, a popular model for protein folding studies.


Asunto(s)
Proteínas Bacterianas/química , Pliegue de Proteína , Concentración de Iones de Hidrógeno , Cinética , Simulación de Dinámica Molecular , Conformación Proteica , Termodinámica
3.
BMC Bioinformatics ; 12: 334, 2011 Aug 10.
Artículo en Inglés | MEDLINE | ID: mdl-21831299

RESUMEN

BACKGROUND: Molecular dynamics (MD) simulations offer the ability to observe the dynamics and interactions of both whole macromolecules and individual atoms as a function of time. Taken in context with experimental data, atomic interactions from simulation provide insight into the mechanics of protein folding, dynamics, and function. The calculation of atomic interactions or contacts from an MD trajectory is computationally demanding and the work required grows exponentially with the size of the simulation system. We describe the implementation of a spatial indexing algorithm in our multi-terabyte MD simulation database that significantly reduces the run-time required for discovery of contacts. The approach is applied to the Dynameomics project data. Spatial indexing, also known as spatial hashing, is a method that divides the simulation space into regular sized bins and attributes an index to each bin. Since, the calculation of contacts is widely employed in the simulation field, we also use this as the basis for testing compression of data tables. We investigate the effects of compression of the trajectory coordinate tables with different options of data and index compression within MS SQL SERVER 2008. RESULTS: Our implementation of spatial indexing speeds up the calculation of contacts over a 1 nanosecond (ns) simulation window by between 14% and 90% (i.e., 1.2 and 10.3 times faster). For a 'full' simulation trajectory (51 ns) spatial indexing reduces the calculation run-time between 31 and 81% (between 1.4 and 5.3 times faster). Compression resulted in reduced table sizes but resulted in no significant difference in the total execution time for neighbour discovery. The greatest compression (~36%) was achieved using page level compression on both the data and indexes. CONCLUSIONS: The spatial indexing scheme significantly decreases the time taken to calculate atomic contacts and could be applied to other multidimensional neighbor discovery problems. The speed up enables on-the-fly calculation and visualization of contacts and rapid cross simulation analysis for knowledge discovery. Using page compression for the atomic coordinate tables and indexes saves ~36% of disk space without any significant decrease in calculation time and should be considered for other non-transactional databases in MS SQL SERVER 2008.


Asunto(s)
Bases de Datos de Proteínas , Simulación de Dinámica Molecular , Proteínas/química , Algoritmos , Compresión de Datos , Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/química
4.
Biophys J ; 98(11): 2671-81, 2010 Jun 02.
Artículo en Inglés | MEDLINE | ID: mdl-20513412

RESUMEN

The goal of the Dynameomics project is to perform, store, and analyze molecular dynamics simulations of representative proteins, of all known globular folds, in their native state and along their unfolding pathways. To analyze unfolding simulations, the location of the protein along the unfolding reaction coordinate (RXN) must be determined. Properties such as the fraction of native contacts and radius of gyration are often used; however, there is an issue regarding degeneracy with these properties, as native and nonnative species can overlap. Here, we used 15 physical properties of the protein to construct a multidimensional-embedded, one-dimensional RXN coordinate that faithfully captures the complex nature of unfolding. The unfolding RXN coordinates for 188 proteins (1534 simulations and 22.9 mus in explicit water) were calculated. Native, transition, intermediate, and denatured states were readily identified with the use of this RXN coordinate. A global native ensemble based on the native-state properties of the 188 proteins was created. This ensemble was shown to be effective for calculating RXN coordinates for folds outside the initial 188 targets. These RXN coordinates enable, high-throughput assignment of conformational states, which represents an important step in comparing protein properties across fold space as well as characterizing the unfolding of individual proteins.


Asunto(s)
Bases de Datos de Proteínas , Simulación de Dinámica Molecular , Pliegue de Proteína , Proteínas/química , Análisis de Componente Principal , Conformación Proteica , Desnaturalización Proteica , Agua/química
5.
Protein Eng Des Sel ; 32(7): 331-345, 2019 12 31.
Artículo en Inglés | MEDLINE | ID: mdl-31868211

RESUMEN

The Dynameomics project contains native state and unfolding simulations of 807 protein domains, where each domain is representative of a different metafold; these metafolds encompass ~97% of protein fold space. There is a long-standing question in structural biology as to whether proteins in the same fold family share the same folding/unfolding characteristics. Using molecular dynamics simulations from the Dynameomics project, we conducted a detailed study of protein unfolding/folding pathways for 5 protein domains from the immunoglobulin (Ig)-like ß-sandwich metafold (the highest ranked metafold in our database). The domains have sequence similarities ranging from 4 to 15% and are all from different SCOP superfamilies, yet they share the same overall Ig-like topology. Despite having very different amino acid sequences, the dominant unfolding pathway is very similar for the 5 proteins, and the secondary structures that are peripheral to the aligned, shared core domain add variability to the unfolding pathway. Aligned residues in the core domain display consensus structure in the transition state primarily through conservation of hydrophobic positions. Commonalities in the obligate folding nucleus indicate that insights into the major events in the folding/unfolding of other domains from this metafold may be obtainable from unfolding simulations of a few representative proteins.


Asunto(s)
Inmunoglobulinas/química , Desplegamiento Proteico , Simulación de Dinámica Molecular , Conformación Proteica en Lámina beta
6.
Sci Rep ; 9(1): 11873, 2019 08 15.
Artículo en Inglés | MEDLINE | ID: mdl-31417097

RESUMEN

Diffusional motion within the crowded environment of the cell is known to be crucial to cellular function as it drives the interactions of proteins. However, the relationships between protein diffusion, shape and interaction, and the evolutionary selection mechanisms that arise as a consequence, have not been investigated. Here, we study the dynamics of triaxial ellipsoids of equivalent steric volume to proteins at different aspect ratios and volume fractions using a combination of Brownian molecular dynamics and geometric packing. In general, proteins are found to have a shape, approximately Golden in aspect ratio, that give rise to the highest critical volume fraction resisting gelation, corresponding to the fastest long-time self-diffusion in the cell. The ellipsoidal shape also directs random collisions between proteins away from sites that would promote aggregation and loss of function to more rapidly evolving nonsticky regions on the surface, and further provides a greater tolerance to mutation.


Asunto(s)
Evolución Biológica , Modelos Moleculares , Modelos Teóricos , Proteínas/química , Algoritmos , Fenómenos Fisiológicos Celulares , Bases de Datos de Proteínas , Difusión , Transporte de Proteínas , Proteínas/genética , Proteínas/metabolismo , Relación Estructura-Actividad
7.
Protein Eng Des Sel ; 21(6): 379-86, 2008 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-18411222

RESUMEN

The Dynameomics project is our effort to characterize the native-state dynamics and folding/unfolding pathways of representatives of all known protein folds by way of molecular dynamics simulations, as described by Beck et al. (in Protein Eng. Des. Select., the first paper in this series). The data produced by these simulations are highly multidimensional in structure and multi-terabytes in size. Both of these features present significant challenges for storage, retrieval and analysis. For optimal data modeling and flexibility, we needed a platform that supported both multidimensional indices and hierarchical relationships between related types of data and that could be integrated within our data warehouse, as described in the accompanying paper directly preceding this one. For these reasons, we have chosen On-line Analytical Processing (OLAP), a multi-dimensional analysis optimized database, as an analytical platform for these data. OLAP is a mature technology in the financial sector, but it has not been used extensively for scientific analysis. Our project is further more unusual for its focus on the multidimensional and analytical capabilities of OLAP rather than its aggregation capacities. The dimensional data model and hierarchies are very flexible. The query language is concise for complex analysis and rapid data retrieval. OLAP shows great promise for the dynamic protein analysis for bioengineering and biomedical applications. In addition, OLAP may have similar potential for other scientific and engineering applications involving large and complex datasets.


Asunto(s)
Bases de Datos de Proteínas , Proteínas/química , Lenguajes de Programación
8.
Protein Eng Des Sel ; 21(6): 369-77, 2008 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-18411223

RESUMEN

Dynameomics is a project to investigate and catalog the native-state dynamics and thermal unfolding pathways of representatives of all protein folds using solvated molecular dynamics simulations, as described in the preceding paper. Here we introduce the design of the molecular dynamics data warehouse, a scalable, reliable repository that houses simulation data that vastly simplifies management and access. In the succeeding paper, we describe the development of a complementary multidimensional database. A single protein unfolding or native-state simulation can take weeks to months to complete, and produces gigabytes of coordinate and analysis data. Mining information from over 3000 completed simulations is complicated and time-consuming. Even the simplest queries involve writing intricate programs that must be built from low-level file system access primitives and include significant logic to correctly locate and parse data of interest. As a result, programs to answer questions that require data from hundreds of simulations are very difficult to write. Thus, organization and access to simulation data have been major obstacles to the discovery of new knowledge in the Dynameomics project. This repository is used internally and is the foundation of the Dynameomics portal site http://www.dynameomics.org. By organizing simulation data into a scalable, manageable and accessible form, we can begin to address substantial questions that move us closer to solving biomedical and bioengineering problems.


Asunto(s)
Simulación por Computador , Bases de Datos de Proteínas , Proteínas/química , Modelos Moleculares , Lenguajes de Programación
9.
Protein Eng Des Sel ; 21(6): 353-68, 2008 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-18411224

RESUMEN

The goal of Dynameomics is to perform atomistic molecular dynamics (MD) simulations of representative proteins from all known folds in explicit water in their native state and along their thermal unfolding pathways. Here we present 188-fold representatives and their native state simulations and analyses. These 188 targets represent 67% of all the structures in the Protein Data Bank. The behavior of several specific targets is highlighted to illustrate general properties in the full dataset and to demonstrate the role of MD in understanding protein function and stability. As an example of what can be learned from mining the Dynameomics database, we identified a protein fold with heightened localized dynamics. In one member of this fold family, the motion affects the exposure of its phosphorylation site and acts as an entropy sink to offset another portion of the protein that is relatively immobile in order to present a consistent interface for protein docking. In another member of this family, a polymorphism in the highly mobile region leads to a host of disease phenotypes. We have constructed a web site to provide access to a novel hybrid relational/multidimensional database (described in the succeeding two papers) to view and interrogate simulations of the top 30 targets: http://www.dynameomics.org. The Dynameomics database, currently the largest collection of protein simulations and protein structures in the world, should also be useful for determining the rules governing protein folding and kinetic stability, which should aid in deciphering genomic information and for protein engineering and design.


Asunto(s)
Proteínas/química , Fosforilación , Desnaturalización Proteica , Pliegue de Proteína , Agua/química
10.
J Mol Graph Model ; 24(5): 396-403, 2006 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-16290077

RESUMEN

Mechanical resistance of a protein under external force is known to depend on the amino acid sequence, unfolding rate constant, topology and the direction of force applied. To assess the affect of force direction on mechanical resistance, molecular dynamics (MD) simulations of the partial unfolding of titin I27 have been carried out by applying a ramp of force between the N-terminus and the alpha-carbon of each amino acid, respectively. The results arbitrarily place the amino acids in a hierarchy in terms of the time at which an unfolding intermediate is formed. The onset of unfolding is indeed affected by force direction; directions that give maximum leverage (for the A strand to detach) unfold to the intermediate quicker than directions that give least leverage. Moreover, the change in the time taken to reach the intermediate, hence the change in mechanical resistance, can be attributed to beta-strand topology. The simulations indicate that experimentally multi-directional forced unfolding could be used to reveal and study strand topology, and suggests that direction of applied force, topology and mechanical resistance are all closely related.


Asunto(s)
Simulación por Computador , Proteínas Musculares/metabolismo , Desnaturalización Proteica , Proteínas Quinasas/metabolismo , Conectina , Enlace de Hidrógeno , Matemática , Microscopía de Fuerza Atómica , Modelos Moleculares , Proteínas Musculares/química , Proteínas Quinasas/química , Estructura Secundaria de Proteína , Análisis Espectral , Factores de Tiempo
11.
Structure ; 18(4): 423-35, 2010 Mar 14.
Artículo en Inglés | MEDLINE | ID: mdl-20399180

RESUMEN

The dynamic behavior of proteins is important for an understanding of their function and folding. We have performed molecular dynamics simulations of the native state and unfolding pathways of over 2000 protein/peptide systems (approximately 11,000 independent simulations) representing the majority of folds in globular proteins. These data are stored and organized using an innovative database approach, which can be mined to obtain both general and specific information about the dynamics and folding/unfolding of proteins, relevant subsets thereof, and individual proteins. Here we describe the project in general terms and the type of information contained in the database. Then we provide examples of mining the database for information relevant to protein folding, structure building, the effect of single-nucleotide polymorphisms, and drug design. The native state simulation data and corresponding analyses for the 100 most populated metafolds, together with related resources, are publicly accessible through http://www.dynameomics.org.


Asunto(s)
Proteínas/química , Algoritmos , Animales , Biología Computacional/métodos , Bases de Datos de Proteínas , Humanos , Modelos Moleculares , Conformación Molecular , Polimorfismo de Nucleótido Simple , Desnaturalización Proteica , Pliegue de Proteína , Proteómica/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA