RESUMEN
Binding MOAD (Mother of All Databases) is a database of 9836 protein-ligand crystal structures. All biologically relevant ligands are annotated, and experimental binding-affinity data is reported when available. Binding MOAD has almost doubled in size since it was originally introduced in 2004, demonstrating steady growth with each annual update. Several technologies, such as natural language processing, help drive this constant expansion. Along with increasing data, Binding MOAD has improved usability. The website now showcases a faster, more featured viewer to examine the protein-ligand structures. Ligands have additional chemical data, allowing for cheminformatics mining. Lastly, logins are no longer necessary, and Binding MOAD is freely available to all at http://www.BindingMOAD.org.
Asunto(s)
Bases de Datos de Proteínas , Ligandos , Conformación Proteica , Sitios de Unión , Gráficos por Computador , Cristalografía por Rayos X , Internet , Proteínas/metabolismo , Interfaz Usuario-ComputadorRESUMEN
BACKGROUND: Molecular interaction Information is a key resource in modern biomedical research. Publicly available data have previously been provided in a broad array of diverse formats, making access to this very difficult. The publication and wide implementation of the Human Proteome Organisation Proteomics Standards Initiative Molecular Interactions (HUPO PSI-MI) format in 2004 was a major step towards the establishment of a single, unified format by which molecular interactions should be presented, but focused purely on protein-protein interactions. RESULTS: The HUPO-PSI has further developed the PSI-MI XML schema to enable the description of interactions between a wider range of molecular types, for example nucleic acids, chemical entities, and molecular complexes. Extensive details about each supported molecular interaction can now be captured, including the biological role of each molecule within that interaction, detailed description of interacting domains, and the kinetic parameters of the interaction. The format is supported by data management and analysis tools and has been adopted by major interaction data providers. Additionally, a simpler, tab-delimited format MITAB2.5 has been developed for the benefit of users who require only minimal information in an easy to access configuration. CONCLUSION: The PSI-MI XML2.5 and MITAB2.5 formats have been jointly developed by interaction data producers and providers from both the academic and commercial sector, and are already widely implemented and well supported by an active development community. PSI-MI XML2.5 enables the description of highly detailed molecular interaction data and facilitates data exchange between databases and users without loss of information. MITAB2.5 is a simpler format appropriate for fast Perl parsing or loading into Microsoft Excel.
Asunto(s)
Bases de Datos de Proteínas/normas , Procesamiento de Lenguaje Natural , Mapeo de Interacción de Proteínas/métodos , Proteómica/métodos , Biología Computacional , Gráficos por Computador , Sistemas de Administración de Bases de Datos , Proteómica/normas , Interfaz Usuario-ComputadorRESUMEN
The ever-increasing generation of, and corresponding interest in, molecular interaction data has lead to the establishment of a number of high-quality molecular interaction databases which manually curate interaction data extracted from the literature. In order to effectively share the curation load, and ensure that data is stored in and accessible from multiple sources, these databases have united to form the IMEx consortium. All of the IMEx databases also accept direct deposition of interaction data from authors prior to publication, thus both assisting the scientist in preparing the dataset for publication and ensuring that its subsequent representation in the public domain databases is fully accurate. This article walks the potential submitter through the various routes by which data may be deposited with the databases and describes the tools which have been developed to assist in this process.
Asunto(s)
Bases de Datos Genéticas/normas , Mapeo de Interacción de Proteínas/métodos , Proteómica/normas , Interfaz Usuario-Computador , Acceso a la Información , Internet , Edición/normas , Programas Informáticos , Vocabulario ControladoRESUMEN
The theme of the third annual Spring workshop of the HUPO-PSI was "proteomics and beyond" and its underlying goal was to reach beyond the boundaries of the proteomics community to interact with groups working on the similar issues of developing interchange standards and minimal reporting requirements. Significant developments in many of the HUPO-PSI XML interchange formats, minimal reporting requirements and accompanying controlled vocabularies were reported, with many of these now feeding into the broader efforts of the Functional Genomics Experiment (FuGE) data model and Functional Genomics Ontology (FuGO) ontologies.
Asunto(s)
Proteómica/normas , Animales , Bases de Datos de Proteínas/normas , Humanos , Mapeo de Interacción de Proteínas , Publicaciones/normas , Proyectos de Investigación/normas , Terminología como AsuntoRESUMEN
We have recently announced the largest database of protein-ligand complexes, Binding MOAD (Mother of All Databases). After the August 2004 update, Binding MOAD contains 6816 complexes. There are 2220 protein families and 3316 unique ligands. After searching 6000+ crystallography papers, we have obtained binding data for 1793 (27%) of the complexes. We have also created a non-redundant set of complexes with only one complex from each protein family; in that set, 630 (28%) of the unique complexes have binding data. Here, we present information about the data provided at the Binding MOAD website. We also present the results of mining Binding MOAD to map the degree of solvent exposure for binding sites. We have determined that most cavities and ligands (70-85%) are well buried in the complexes. This fits with the common paradigm that a large degree of contact between the ligand and protein is significant in molecular recognition. GoCAV and the GoCAV viewer are the tools we created for this study. To share our data and make our online dataset more useful to other research groups, we have integrated the viewer into the Binding MOAD website (www.BindingMOAD.org).