Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 35
Filtrar
1.
J Biomol NMR ; 73(1-2): 5-9, 2019 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-30580387

RESUMO

The growth of the biological nuclear magnetic resonance (NMR) field and the development of new experimental technology have mandated the revision and enlargement of the NMR-STAR ontology used to represent experiments, spectral and derived data, and supporting metadata. We present here a brief description of the NMR-STAR ontology and software tools for manipulating NMR-STAR data files, editing the files, extracting selected data, and creating data visualizations. Detailed information on these is accessible from the links provided.


Assuntos
Ontologias Biológicas , Ressonância Magnética Nuclear Biomolecular , Armazenamento e Recuperação da Informação , Software , Vocabulário Controlado
2.
Biophys J ; 112(8): 1529-1534, 2017 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-28445744

RESUMO

Advances in computation have been enabling many recent advances in biomolecular applications of NMR. Due to the wide diversity of applications of NMR, the number and variety of software packages for processing and analyzing NMR data is quite large, with labs relying on dozens, if not hundreds of software packages. Discovery, acquisition, installation, and maintenance of all these packages is a burdensome task. Because the majority of software packages originate in academic labs, persistence of the software is compromised when developers graduate, funding ceases, or investigators turn to other projects. To simplify access to and use of biomolecular NMR software, foster persistence, and enhance reproducibility of computational workflows, we have developed NMRbox, a shared resource for NMR software and computation. NMRbox employs virtualization to provide a comprehensive software environment preconfigured with hundreds of software packages, available as a downloadable virtual machine or as a Platform-as-a-Service supported by a dedicated compute cloud. Ongoing development includes a metadata harvester to regularize, annotate, and preserve workflows and facilitate and enhance data depositions to BioMagResBank, and tools for Bayesian inference to enhance the robustness and extensibility of computational analyses. In addition to facilitating use and preservation of the rich and dynamic software environment for biomolecular NMR, NMRbox fosters the development and deployment of a new class of metasoftware packages. NMRbox is freely available to not-for-profit users.


Assuntos
Ressonância Magnética Nuclear Biomolecular , Software , Acesso à Informação , Teorema de Bayes , Computação em Nuvem , Internet , Metadados
3.
J Biomol NMR ; 64(1): 17-25, 2016 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-26724815

RESUMO

Data validation plays an important role in ensuring the reliability and reproducibility of studies. NMR investigations of the functional properties, dynamics, chemical kinetics, and structures of proteins depend critically on the correctness of chemical shift assignments. We present a novel probabilistic method named ARECA for validating chemical shift assignments that relies on the nuclear Overhauser effect data . ARECA has been evaluated through its application to 26 case studies and has been shown to be complementary to, and usually more reliable than, approaches based on chemical shift databases. ARECA is available online at http://areca.nmrfam.wisc.edu/.


Assuntos
Ressonância Magnética Nuclear Biomolecular/métodos , Proteínas/química , Reprodutibilidade dos Testes
4.
J Biomol NMR ; 63(1): 77-83, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26195077

RESUMO

MolProbity is a powerful software program for validating structures of proteins and nucleic acids. Although MolProbity includes scripts for batch analysis of structures, because these scripts analyze structures one at a time, they are not well suited for the validation of a large dataset of structures. We have created a version of MolProbity (MolProbity-HTC) that circumvents these limitations and takes advantage of a high-throughput computing cluster by using the HTCondor software. MolProbity-HTC enables the longitudinal analysis of large sets of structures, such as those deposited in the PDB or generated through theoretical computation-tasks that would have been extremely time-consuming using previous versions of MolProbity. We have used MolProbity-HTC to validate the entire PDB, and have developed a new visual chart for the BioMagResBank website that enables users to easily ascertain the quality of each model in an NMR ensemble and to compare the quality of those models to the rest of the PDB.


Assuntos
Ácidos Nucleicos/química , Proteínas/química , Software , Estatística como Assunto , Espectroscopia de Ressonância Magnética , Reprodutibilidade dos Testes
5.
Biochim Biophys Acta ; 1834(2): 487-98, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23232152

RESUMO

Intrinsically disordered proteins (IDPs) and proteins with long disordered regions are highly abundant in various proteomes. Despite their lack of well-defined ordered structure, these proteins and regions are frequently involved in crucial biological processes. Although in recent years these proteins have attracted the attention of many researchers, IDPs represent a significant challenge for structural characterization since these proteins can impact many of the processes in the structure determination pipeline. Here we investigate the effects of IDPs on the structure determination process and the utility of disorder prediction in selecting and improving proteins for structural characterization. Examination of the extent of intrinsic disorder in existing crystal structures found that relatively few protein crystal structures contain extensive regions of intrinsic disorder. Although intrinsic disorder is not the only cause of crystallization failures and many structured proteins cannot be crystallized, filtering out highly disordered proteins from structure-determination target lists is still likely to be cost effective. Therefore it is desirable to avoid highly disordered proteins from structure-determination target lists and we show that disorder prediction can be applied effectively to enrich structure determination pipelines with proteins more likely to yield crystal structures. For structural investigation of specific proteins, disorder prediction can be used to improve targets for structure determination. Finally, a framework for considering intrinsic disorder in the structure determination pipeline is proposed.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Proteínas/genética , Proteômica , Cristalografia por Raios X , Estrutura Terciária de Proteína
6.
Nucleic Acids Res ; 40(Database issue): D519-24, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22139937

RESUMO

For many macromolecular NMR ensembles from the Protein Data Bank (PDB) the experiment-based restraint lists are available, while other experimental data, mainly chemical shift values, are often available from the BioMagResBank. The accuracy and precision of the coordinates in these macromolecular NMR ensembles can be improved by recalculation using the available experimental data and present-day software. Such efforts, however, generally fail on half of all NMR ensembles due to the syntactic and semantic heterogeneity of the underlying data and the wide variety of formats used for their deposition. We have combined the remediated restraint information from our NMR Restraints Grid (NRG) database with available chemical shifts from the BioMagResBank and the Common Interface for NMR structure Generation (CING) structure validation reports into the weekly updated NRG-CING database (http://nmr.cmbi.ru.nl/NRG-CING). Eleven programs have been included in the NRG-CING production pipeline to arrive at validation reports that list for each entry the potential inconsistencies between the coordinates and the available experimental NMR data. The longitudinal validation of these data in a publicly available relational database yields a set of indicators that can be used to judge the quality of every macromolecular structure solved with NMR. The remediated NMR experimental data sets and validation reports are freely available online.


Assuntos
Bases de Dados de Proteínas , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica , Reprodutibilidade dos Testes , Integração de Sistemas
7.
J Biomol NMR ; 53(4): 311-20, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22689068

RESUMO

Biomolecular NMR chemical shift data are key information for the functional analysis of biomolecules and the development of new techniques for NMR studies utilizing chemical shift statistical information. Structural genomics projects are major contributors to the accumulation of protein chemical shift information. The management of the large quantities of NMR data generated by each project in a local database and the transfer of the data to the public databases are still formidable tasks because of the complicated nature of NMR data. Here we report an automated and efficient system developed for the deposition and annotation of a large number of data sets including (1)H, (13)C and (15)N resonance assignments used for the structure determination of proteins. We have demonstrated the feasibility of our system by applying it to over 600 entries from the internal database generated by the RIKEN Structural Genomics/Proteomics Initiative (RSGI) to the public database, BioMagResBank (BMRB). We have assessed the quality of the deposited chemical shifts by comparing them with those predicted from the PDB coordinate entry for the corresponding protein. The same comparison for other matched BMRB/PDB entries deposited from 2001-2011 has been carried out and the results suggest that the RSGI entries greatly improved the quality of the BMRB database. Since the entries include chemical shifts acquired under strikingly similar experimental conditions, these NMR data can be expected to be a promising resource to improve current technologies as well as to develop new NMR methods for protein studies.


Assuntos
Bases de Dados de Proteínas , Ressonância Magnética Nuclear Biomolecular/métodos , Software , Genômica/métodos , Proteínas/química , Proteômica/métodos , Controle de Qualidade
8.
Magn Reson (Gott) ; 2(2): 765-775, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-37905229

RESUMO

Hydrogen bonding between an amide group and the p-π cloud of an aromatic ring was first identified in a protein in the 1980s. Subsequent surveys of high-resolution X-ray crystal structures found multiple instances, but their preponderance was determined to be infrequent. Hydrogen atoms participating in a hydrogen bond to the p-π cloud of an aromatic ring are expected to experience an upfield chemical shift arising from a shielding ring current shift. We surveyed the Biological Magnetic Resonance Data Bank for amide hydrogens exhibiting unusual shifts as well as corroborating nuclear Overhauser effects between the amide protons and ring protons. We found evidence that Trp residues are more likely to be involved in p-π hydrogen bonds than other aromatic amino acids, whereas His residues are more likely to be involved in in-plane hydrogen bonds, with a ring nitrogen acting as the hydrogen acceptor. The p-π hydrogen bonds may be more abundant than previously believed. The inclusion in NMR structure refinement protocols of shift effects in amide protons from aromatic sidechains, or explicit hydrogen bond restraints between amides and aromatic rings, could improve the local accuracy of sidechain orientations in solution NMR protein structures, but their impact on global accuracy is likely be limited.

9.
J Biomol NMR ; 48(2): 85-92, 2010 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-20680401

RESUMO

We present a suite of software for the complete and easy deposition of NMR data to the PDB and BMRB. This suite uses the CCPN framework and introduces a freely downloadable, graphical desktop application called CcpNmr Entry Completion Interface (ECI) for the secure editing of experimental information and associated datasets through the lifetime of an NMR project. CCPN projects can be created within the CcpNmr Analysis software or by importing existing NMR data files using the CcpNmr FormatConverter. After further data entry and checking with the ECI, the project can then be rapidly deposited to the PDBe using AutoDep, or exported as a complete deposition NMR-STAR file. In full CCPN projects created with ECI, it is straightforward to select chemical shift lists, restraint data sets, structural ensembles and all relevant associated experimental collection details, which all are or will become mandatory when depositing to the PDB. Instructions and download information for the ECI are available from the PDBe web site at http://www.ebi.ac.uk/pdbe/nmr/deposition/eci.html .


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Ressonância Magnética Nuclear Biomolecular , Proteínas/química , Proteínas/classificação , Interface Usuário-Computador
10.
Nucleic Acids Res ; 36(Database issue): D402-8, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17984079

RESUMO

The BioMagResBank (BMRB: www.bmrb.wisc.edu) is a repository for experimental and derived data gathered from nuclear magnetic resonance (NMR) spectroscopic studies of biological molecules. BMRB is a partner in the Worldwide Protein Data Bank (wwPDB). The BMRB archive consists of four main data depositories: (i) quantitative NMR spectral parameters for proteins, peptides, nucleic acids, carbohydrates and ligands or cofactors (assigned chemical shifts, coupling constants and peak lists) and derived data (relaxation parameters, residual dipolar couplings, hydrogen exchange rates, pK(a) values, etc.), (ii) databases for NMR restraints processed from original author depositions available from the Protein Data Bank, (iii) time-domain (raw) spectral data from NMR experiments used to assign spectral resonances and determine the structures of biological macromolecules and (iv) a database of one- and two-dimensional (1)H and (13)C one- and two-dimensional NMR spectra for over 250 metabolites. The BMRB website provides free access to all of these data. BMRB has tools for querying the archive and retrieving information and an ftp site (ftp.bmrb.wisc.edu) where data in the archive can be downloaded in bulk. Two BMRB mirror sites exist: one at the PDBj, Protein Research Institute, Osaka University, Osaka, Japan (bmrb.protein.osaka-u.ac.jp) and the other at CERM, University of Florence, Florence, Italy (bmrb.postgenomicnmr.net/). The site at Osaka also accepts and processes data depositions.


Assuntos
Bases de Dados Factuais , Ressonância Magnética Nuclear Biomolecular , Carboidratos/química , Internet , Ligantes , Ácidos Nucleicos/química , Peptídeos/química , Proteínas/química , Interface Usuário-Computador
11.
Nucleic Acids Res ; 36(Database issue): D426-33, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18073189

RESUMO

The Worldwide Protein Data Bank (wwPDB; wwpdb.org) is the international collaboration that manages the deposition, processing and distribution of the PDB archive. The online PDB archive at ftp://ftp.wwpdb.org is the repository for the coordinates and related information for more than 47 000 structures, including proteins, nucleic acids and large macromolecular complexes that have been determined using X-ray crystallography, NMR and electron microscopy techniques. The members of the wwPDB-RCSB PDB (USA), MSD-EBI (Europe), PDBj (Japan) and BMRB (USA)-have remediated this archive to address inconsistencies that have been introduced over the years. The scope and methods used in this project are presented.


Assuntos
Bases de Dados de Proteínas , Substâncias Macromoleculares/química , Arquivos , Cristalografia por Raios X , Bases de Dados de Proteínas/normas , Dicionários Químicos como Assunto , Internet , Microscopia Eletrônica , Ressonância Magnética Nuclear Biomolecular , Ácidos Nucleicos/química , Proteínas/química , Reprodutibilidade dos Testes , Terminologia como Assunto
12.
Methods Mol Biol ; 2112: 187-218, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32006287

RESUMO

The Biological Magnetic Resonance Data Bank (BioMagResBank or BMRB), founded in 1988, serves as the archive for data generated by nuclear magnetic resonance (NMR) spectroscopy of biological systems. NMR spectroscopy is unique among biophysical approaches in its ability to provide a broad range of atomic and higher-level information relevant to the structural, dynamic, and chemical properties of biological macromolecules, as well as report on metabolite and natural product concentrations in complex mixtures and their chemical structures. BMRB became a core member of the Worldwide Protein Data Bank (wwPDB) in 2007, and the BMRB archive is now a core archive of the wwPDB. Currently, about 10% of the structures deposited into the PDB archive are based on NMR spectroscopy. BMRB stores experimental and derived data from biomolecular NMR studies. Newer BMRB biopolymer depositions are divided about evenly between those associated with structure determinations (atomic coordinates and supporting information archived in the PDB) and those reporting experimental information on molecular dynamics, conformational transitions, ligand binding, assigned chemical shifts, or other results from NMR spectroscopy. BMRB also provides resources for NMR studies of metabolites and other small molecules that are often macromolecular ligands and/or nonstandard residues. This chapter is directed to the structural biology community rather than the metabolomics and natural products community. Our goal is to describe various BMRB services offered to structural biology researchers and how they can be accessed and utilized. These services can be classified into four main groups: (1) data deposition, (2) data retrieval, (3) data analysis, and (4) services for NMR spectroscopists and software developers. The chapter also describes the NMR-STAR data format used by BMRB and the tools provided to facilitate its use. For programmers, BMRB offers an application programming interface (API) and libraries in the Python and R languages that enable users to develop their own BMRB-based tools for data analysis, visualization, and manipulation of NMR-STAR formatted files. BMRB also provides users with direct access tools through the NMRbox platform.


Assuntos
Substâncias Macromoleculares/química , Biologia Molecular/métodos , Conformação Proteica , Proteínas/química , Bases de Dados de Proteínas , Ligantes , Ressonância Magnética Nuclear Biomolecular/métodos , Software
13.
J Biomol NMR ; 45(4): 389-96, 2009 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-19809795

RESUMO

Several pilot experiments have indicated that improvements in older NMR structures can be expected by applying modern software and new protocols (Nabuurs et al. in Proteins 55:483-186, 2004; Nederveen et al. in Proteins 59:662-672, 2005; Saccenti and Rosato in J Biomol NMR 40:251-261, 2008). A recent large scale X-ray study also has shown that modern software can significantly improve the quality of X-ray structures that were deposited more than a few years ago (Joosten et al. in J. Appl Crystallogr 42:376-384, 2009; Sanderson in Nature 459:1038-1039, 2009). Recalculation of three-dimensional coordinates requires that the original experimental data are available and complete, and are semantically and syntactically correct, or are at least correct enough to be reconstructed. For multiple reasons, including a lack of standards, the heterogeneity of the experimental data and the many NMR experiment types, it has not been practical to parse a large proportion of the originally deposited NMR experimental data files related to protein NMR structures. This has made impractical the automatic recalculation, and thus improvement, of the three dimensional coordinates of these structures. We here describe a large-scale international collaborative effort to make all deposited experimental NMR data semantically and syntactically homogeneous, and thus useful for further research. A total of 4,014 out of 5,266 entries were 'cleaned' in this process. For 1,387 entries, human intervention was needed. Continuous efforts in automating the parsing of both old, and newly deposited files is steadily decreasing this fraction. The cleaned data files are available from the NMR restraints grid at http://restraintsgrid.bmrb.wisc.edu .


Assuntos
Bases de Dados de Proteínas/normas , Ressonância Magnética Nuclear Biomolecular/métodos , Ácidos Nucleicos , Proteínas , Processamento Eletrônico de Dados , Humanos , Padrões de Referência , Software
14.
Curr Opin Biotechnol ; 43: 56-61, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27643760

RESUMO

The metabolome, the collection of small molecules associated with an organism, is a growing subject of inquiry, with the data utilized for data-intensive systems biology, disease diagnostics, biomarker discovery, and the broader characterization of small molecules in mixtures. Owing to their close proximity to the functional endpoints that govern an organism's phenotype, metabolites are highly informative about functional states. The field of metabolomics identifies and quantifies endogenous and exogenous metabolites in biological samples. Information acquired from nuclear magnetic spectroscopy (NMR), mass spectrometry (MS), and the published literature, as processed by statistical approaches, are driving increasingly wider applications of metabolomics. This review focuses on the role of databases and software tools in advancing the rigor, robustness, reproducibility, and validation of metabolomics studies.


Assuntos
Bases de Dados Factuais , Espectroscopia de Ressonância Magnética/métodos , Metaboloma , Metabolômica/métodos , Software , Biologia de Sistemas/métodos , Animais , Humanos , Imageamento por Ressonância Magnética
15.
Structure ; 25(12): 1916-1927, 2017 12 05.
Artigo em Inglês | MEDLINE | ID: mdl-29174494

RESUMO

The Worldwide PDB recently launched a deposition, biocuration, and validation tool: OneDep. At various stages of OneDep data processing, validation reports for three-dimensional structures of biological macromolecules are produced. These reports are based on recommendations of expert task forces representing crystallography, nuclear magnetic resonance, and cryoelectron microscopy communities. The reports provide useful metrics with which depositors can evaluate the quality of the experimental data, the structural model, and the fit between them. The validation module is also available as a stand-alone web server and as a programmatically accessible web service. A growing number of journals require the official wwPDB validation reports (produced at biocuration) to accompany manuscripts describing macromolecular structures. Upon public release of the structure, the validation report becomes part of the public PDB archive. Geometric quality scores for proteins in the PDB archive have improved over the past decade.


Assuntos
Bases de Dados de Proteínas/normas , Estudos de Validação como Assunto , Análise de Sequência de Proteína/métodos , Análise de Sequência de Proteína/normas
16.
Structure ; 25(3): 536-545, 2017 03 07.
Artigo em Inglês | MEDLINE | ID: mdl-28190782

RESUMO

OneDep, a unified system for deposition, biocuration, and validation of experimentally determined structures of biological macromolecules to the PDB archive, has been developed as a global collaboration by the worldwide PDB (wwPDB) partners. This new system was designed to ensure that the wwPDB could meet the evolving archiving requirements of the scientific community over the coming decades. OneDep unifies deposition, biocuration, and validation pipelines across all wwPDB, EMDB, and BMRB deposition sites with improved focus on data quality and completeness in these archives, while supporting growth in the number of depositions and increases in their average size and complexity. In this paper, we describe the design, functional operation, and supporting infrastructure of the OneDep system, and provide initial performance assessments.


Assuntos
Proteínas/química , Curadoria de Dados , Bases de Dados de Proteínas , Internet , Modelos Moleculares , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica , Interface Usuário-Computador
17.
J Biomed Semantics ; 7(1): 16, 2016 05 05.
Artigo em Inglês | MEDLINE | ID: mdl-27927232

RESUMO

BACKGROUND: The nuclear magnetic resonance (NMR) spectroscopic data for biological macromolecules archived at the BioMagResBank (BMRB) provide a rich resource of biophysical information at atomic resolution. The NMR data archived in NMR-STAR ASCII format have been implemented in a relational database. However, it is still fairly difficult for users to retrieve data from the NMR-STAR files or the relational database in association with data from other biological databases. FINDINGS: To enhance the interoperability of the BMRB database, we present a full conversion of BMRB entries to two standard structured data formats, XML and RDF, as common open representations of the NMR-STAR data. Moreover, a SPARQL endpoint has been deployed. The described case study demonstrates that a simple query of the SPARQL endpoints of the BMRB, UniProt, and Online Mendelian Inheritance in Man (OMIM), can be used in NMR and structure-based analysis of proteins combined with information of single nucleotide polymorphisms (SNPs) and their phenotypes. CONCLUSIONS: We have developed BMRB/XML and BMRB/RDF and demonstrate their use in performing a federated SPARQL query linking the BMRB to other databases through standard semantic web technologies. This will facilitate data exchange across diverse information resources.


Assuntos
Ontologias Biológicas , Internet , Ressonância Magnética Nuclear Biomolecular , Proteínas/química , Semântica , Bases de Dados de Proteínas , Proteínas/metabolismo
18.
Proteins ; 59(3): 444-53, 2005 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-15789434

RESUMO

The Center for Eukaryotic Structural Genomics (CESG), as part of the Protein Structure Initiative (PSI), has established a high-throughput structure determination pipeline focused on eukaryotic proteins. NMR spectroscopy is an integral part of this pipeline, both as a method for structure determinations and as a means for screening proteins for stable structure. Because computational approaches have estimated that many eukaryotic proteins are highly disordered, about 1 year into the project, CESG began to use an algorithm (the Predictor of Naturally Disordered Regions, PONDR to avoid proteins that were likely to be disordered. We report a retrospective analysis of the effect of this filtering on the yield of viable structure determination candidates. In addition, we have used our current database of results on 70 protein targets from Arabidopsis thaliana and 1 from Caenorhabditis elegans, which were labeled uniformly with nitrogen-15 and screened for disorder by NMR spectroscopy, to compare the original algorithm with 13 other approaches for predicting disorder from sequence. Our study indicates that the efficiency of structural proteomics of eukaryotes can be improved significantly by removing targets predicted to be disordered by an algorithm chosen to provide optimal performance.


Assuntos
Proteínas/química , Proteoma , Algoritmos , Clonagem Molecular , Bases de Dados de Proteínas , Genômica , Espectroscopia de Ressonância Magnética , Proteínas/genética
19.
Proteins ; 59(4): 687-96, 2005 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-15815974

RESUMO

To address data management and data exchange problems in the nuclear magnetic resonance (NMR) community, the Collaborative Computing Project for the NMR community (CCPN) created a "Data Model" that describes all the different types of information needed in an NMR structural study, from molecular structure and NMR parameters to coordinates. This paper describes the development of a set of software applications that use the Data Model and its associated libraries, thus validating the approach. These applications are freely available and provide a pipeline for high-throughput analysis of NMR data. Three programs work directly with the Data Model: CcpNmr Analysis, an entirely new analysis and interactive display program, the CcpNmr FormatConverter, which allows transfer of data from programs commonly used in NMR to and from the Data Model, and the CLOUDS software for automated structure calculation and assignment (Carnegie Mellon University), which was rewritten to interact directly with the Data Model. The ARIA 2.0 software for structure calculation (Institut Pasteur) and the QUEEN program for validation of restraints (University of Nijmegen) were extended to provide conversion of their data to the Data Model. During these developments the Data Model has been thoroughly tested and used, demonstrating that applications can successfully exchange data via the Data Model. The software architecture developed by CCPN is now ready for new developments, such as integration with additional software applications and extensions of the Data Model into other areas of research.


Assuntos
Bases de Dados de Proteínas , Espectroscopia de Ressonância Magnética/métodos , Software , Gráficos por Computador , Espectroscopia de Ressonância Magnética/instrumentação , Modelos Teóricos
20.
Proteins ; 59(4): 662-72, 2005 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-15822098

RESUMO

State-of-the-art methods based on CNS and CYANA were used to recalculate the nuclear magnetic resonance (NMR) solution structures of 500+ proteins for which coordinates and NMR restraints are available from the Protein Data Bank. Curated restraints were obtained from the BioMagResBank FRED database. Although the original NMR structures were determined by various methods, they all were recalculated by CNS and CYANA and refined subsequently by restrained molecular dynamics (CNS) in a hydrated environment. We present an extensive analysis of the results, in terms of various quality indicators generated by PROCHECK and WHAT_CHECK. On average, the quality indicators for packing and Ramachandran appearance moved one standard deviation closer to the mean of the reference database. The structural quality of the recalculated structures is discussed in relation to various parameters, including number of restraints per residue, NOE completeness and positional root mean square deviation (RMSD). Correlations between pairs of these quality indicators were generally low; for example, there is a weak correlation between the number of restraints per residue and the Ramachandran appearance according to WHAT_CHECK (r = 0.31). The set of recalculated coordinates constitutes a unified database of protein structures in which potential user- and software-dependent biases have been kept as small as possible. The database can be used by the structural biology community for further development of calculation protocols, validation tools, structure-based statistical approaches and modeling. The RECOORD database of recalculated structures is publicly available from http://www.ebi.ac.uk/msd/recoord.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Conformação Proteica , Reprodutibilidade dos Testes , Estresse Mecânico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA