Search | VHL Regional Portal

1.

Restraint validation of biomolecular structures determined by NMR in the Protein Data Bank.

Baskaran, Kumaran; Ploskon, Eliza; Tejero, Roberto; Yokochi, Masashi; Harrus, Deborah; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Ramelot, Theresa A; Sekharan, Monica; Tolchard, James; Westbrook, John D; Bardiaux, Benjamin; Schwieters, Charles D; Patwardhan, Ardan; Velankar, Sameer; Burley, Stephen K; Kurisu, Genji; Hoch, Jeffrey C; Montelione, Gaetano T; Vuister, Geerten W; Young, Jasmine Y.

Structure ; 32(6): 824-837.e1, 2024 Jun 06.

Article in English | MEDLINE | ID: mdl-38490206

ABSTRACT

Biomolecular structure analysis from experimental NMR studies generally relies on restraints derived from a combination of experimental and knowledge-based data. A challenge for the structural biology community has been a lack of standards for representing these restraints, preventing the establishment of uniform methods of model-vs-data structure validation against restraints and limiting interoperability between restraint-based structure modeling programs. The NEF and NMR-STAR formats provide a standardized approach for representing commonly used NMR restraints. Using these restraint formats, a standardized validation system for assessing structural models of biopolymers against restraints has been developed and implemented in the wwPDB OneDep data deposition-validation-biocuration system. The resulting wwPDB restraint violation report provides a model vs. data assessment of biomolecule structures determined using distance and dihedral restraints, with extensions to other restraint types currently being implemented. These tools are useful for assessing NMR models, as well as for assessing biomolecular structure predictions based on distance restraints.

Subject(s)

Databases, Protein , Models, Molecular , Nuclear Magnetic Resonance, Biomolecular , Protein Conformation , Proteins , Nuclear Magnetic Resonance, Biomolecular/methods , Proteins/chemistry , Software

2.

Restraint Validation of Biomolecular Structures Determined by NMR in the Protein Data Bank.

Baskaran, Kumaran; Ploskon, Eliza; Tejero, Roberto; Yokochi, Masashi; Harrus, Deborah; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Ramelot, Theresa A; Sekharan, Monica; Tolchard, James; Westbrook, John D; Bardiaux, Benjamin; Schwieters, Charles D; Patwardhan, Ardan; Velankar, Sameer; Burley, Stephen K; Kurisu, Genji; Hoch, Jeffrey C; Montelione, Gaetano T; Vuister, Geerten W; Young, Jasmine Y.

bioRxiv ; 2024 Jan 22.

Article in English | MEDLINE | ID: mdl-38328042

ABSTRACT

Biomolecular structure analysis from experimental NMR studies generally relies on restraints derived from a combination of experimental and knowledge-based data. A challenge for the structural biology community has been a lack of standards for representing these restraints, preventing the establishment of uniform methods of model-vs-data structure validation against restraints and limiting interoperability between restraint-based structure modeling programs. The NMR exchange (NEF) and NMR-STAR formats provide a standardized approach for representing commonly used NMR restraints. Using these restraint formats, a standardized validation system for assessing structural models of biopolymers against restraints has been developed and implemented in the wwPDB OneDep data deposition-validation-biocuration system. The resulting wwPDB Restraint Violation Report provides a model vs. data assessment of biomolecule structures determined using distance and dihedral restraints, with extensions to other restraint types currently being implemented. These tools are useful for assessing NMR models, as well as for assessing biomolecular structure predictions based on distance restraints.

3.

Assessing and Maximizing the Quality of 3DEM Structure Data at the Worldwide Protein Data Bank.

Flatt, Justin W; Hudson, Brian P; Persikova, Irina; Liang, Yuhe; Shao, Chenghua; Peisach, Ezra; Young, Jasmine Y; Burley, Stephen K.

Microsc Microanal ; 29(Supplement_1): 948, 2023 Jul 22.

Article in English | MEDLINE | ID: mdl-37613801

4.

RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chao, Henry; Chen, Li; Craig, Paul A; Crichlow, Gregg V; Dalenberg, Kenneth; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai; Ghosh, Sutapa; Goodsell, David S; Green, Rachel Kramer; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Khokhriakov, Igor; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Webb, Ben; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zalevsky, Arthur; Zardecki, Christine.

Nucleic Acids Res ; 51(D1): D488-D508, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36420884

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center for the open-access PDB archive. As wwPDB-designated Archive Keeper, RCSB PDB is also responsible for PDB data security. Annually, RCSB PDB serves >10 000 depositors of three-dimensional (3D) biostructures working on all permanently inhabited continents. RCSB PDB delivers data from its research-focused RCSB.org web portal to many millions of PDB data consumers based in virtually every United Nations-recognized country, territory, etc. This Database Issue contribution describes upgrades to the research-focused RCSB.org web portal that created a one-stop-shop for open access to â¼200 000 experimentally-determined PDB structures of biological macromolecules alongside >1 000 000 incorporated Computed Structure Models (CSMs) predicted using artificial intelligence/machine learning methods. RCSB.org is a 'living data resource.' Every PDB structure and CSM is integrated weekly with related functional annotations from external biodata resources, providing up-to-date information for the entire corpus of 3D biostructure data freely available from RCSB.org with no usage limitations. Within RCSB.org, PDB structures and the CSMs are clearly identified as to their provenance and reliability. Both are fully searchable, and can be analyzed and visualized using the full complement of RCSB.org web portal capabilities.

Subject(s)

Artificial Intelligence , Databases, Protein , Proteins , Machine Learning , Protein Conformation , Proteins/chemistry , Reproducibility of Results

5.

RCSB Protein Data bank: Tools for visualizing and understanding biological macromolecules in 3D.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chao, Henry; Chen, Li; Craig, Paul A; Crichlow, Gregg V; Dalenberg, Kenneth; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai J; Ghosh, Sutapa; Goodsell, David S; Green, Rachel Kramer; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Khokhriakov, Igor; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Webb, Benjamin; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zalevsky, Arthur; Zardecki, Christine.

Protein Sci ; 31(12): e4482, 2022 12.

Article in English | MEDLINE | ID: mdl-36281733

ABSTRACT

Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premiere open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.

Subject(s)

Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Proteins/chemistry , Computational Biology/methods , Macromolecular Substances/chemistry

6.

RCSB Protein Data Bank: Celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chen, Li; Crichlow, Gregg V; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai J; Goodsell, David S; Ghosh, Sutapa; Kramer Green, Rachel; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zardecki, Christine.

Protein Sci ; 31(1): 187-208, 2022 01.

Article in English | MEDLINE | ID: mdl-34676613

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.

Subject(s)

Computational Biology/history , Databases, Protein/history , User-Computer Interface , Anniversaries and Special Events , History, 20th Century , History, 21st Century

7.

RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chen, Li; Crichlow, Gregg V; Christie, Cole H; Dalenberg, Kenneth; Di Costanzo, Luigi; Duarte, Jose M; Dutta, Shuchismita; Feng, Zukang; Ganesan, Sai; Goodsell, David S; Ghosh, Sutapa; Green, Rachel Kramer; Guranovic, Vladimir; Guzenko, Dmytro; Hudson, Brian P; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Namkoong, Harry; Peisach, Ezra; Persikova, Irina; Randle, Chris; Rose, Alexander; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Tao, Yi-Ping; Voigt, Maria; Westbrook, John D; Young, Jasmine Y; Zardecki, Christine; Zhuravleva, Marina.

Nucleic Acids Res ; 49(D1): D437-D451, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33211854

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), the US data center for the global PDB archive and a founding member of the Worldwide Protein Data Bank partnership, serves tens of thousands of data depositors in the Americas and Oceania and makes 3D macromolecular structure data available at no charge and without restrictions to millions of RCSB.org users around the world, including >660 000 educators, students and members of the curious public using PDB101.RCSB.org. PDB data depositors include structural biologists using macromolecular crystallography, nuclear magnetic resonance spectroscopy, 3D electron microscopy and micro-electron diffraction. PDB data consumers accessing our web portals include researchers, educators and students studying fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. During the past 2 years, the research-focused RCSB PDB web portal (RCSB.org) has undergone a complete redesign, enabling improved searching with full Boolean operator logic and more facile access to PDB data integrated with >40 external biodata resources. New features and resources are described in detail using examples that showcase recently released structures of SARS-CoV-2 proteins and host cell proteins relevant to understanding and addressing the COVID-19 global pandemic.

Subject(s)

Computational Biology/methods , Databases, Protein , Macromolecular Substances/chemistry , Protein Conformation , Proteins/chemistry , Bioengineering/methods , Biomedical Research/methods , Biotechnology/methods , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/virology , Humans , Macromolecular Substances/metabolism , Pandemics , Proteins/genetics , Proteins/metabolism , SARS-CoV-2/genetics , SARS-CoV-2/metabolism , SARS-CoV-2/physiology , Software , Viral Proteins/chemistry , Viral Proteins/genetics , Viral Proteins/metabolism

8.

RCSB Protein Data Bank: Enabling biomedical research and drug discovery.

Goodsell, David S; Zardecki, Christine; Di Costanzo, Luigi; Duarte, Jose M; Hudson, Brian P; Persikova, Irina; Segura, Joan; Shao, Chenghua; Voigt, Maria; Westbrook, John D; Young, Jasmine Y; Burley, Stephen K.

Protein Sci ; 29(1): 52-65, 2020 01.

Article in English | MEDLINE | ID: mdl-31531901

ABSTRACT

Analyses of publicly available structural data reveal interesting insights into the impact of the three-dimensional (3D) structures of protein targets important for discovery of new drugs (e.g., G-protein-coupled receptors, voltage-gated ion channels, ligand-gated ion channels, transporters, and E3 ubiquitin ligases). The Protein Data Bank (PDB) archive currently holds > 155,000 atomic-level 3D structures of biomolecules experimentally determined using crystallography, nuclear magnetic resonance spectroscopy, and electron microscopy. The PDB was established in 1971 as the first open-access, digital-data resource in biology, and is now managed by the Worldwide PDB partnership (wwPDB; wwPDB.org). US PDB operations are the responsibility of the Research Collaboratory for Structural Bioinformatics PDB (RCSB PDB). The RCSB PDB serves millions of RCSB.org users worldwide by delivering PDB data integrated with â¼40 external biodata resources, providing rich structural views of fundamental biology, biomedicine, and energy sciences. Recently published work showed that the PDB archival holdings facilitated discovery of â¼90% of the 210 new drugs approved by the US Food and Drug Administration 2010-2016. We review user-driven development of RCSB PDB services, examine growth of the PDB archive in terms of size and complexity, and present examples and opportunities for structure-guided drug discovery for challenging targets (e.g., integral membrane proteins).

Subject(s)

Computational Biology/methods , Databases, Protein , Proteins/chemistry , Crystallography , Drug Discovery , Magnetic Resonance Spectroscopy , Microscopy, Electron , Protein Conformation , User-Computer Interface

9.

Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB).

Adams, Paul D; Afonine, Pavel V; Baskaran, Kumaran; Berman, Helen M; Berrisford, John; Bricogne, Gerard; Brown, David G; Burley, Stephen K; Chen, Minyu; Feng, Zukang; Flensburg, Claus; Gutmanas, Aleksandras; Hoch, Jeffrey C; Ikegawa, Yasuyo; Kengaku, Yumiko; Krissinel, Eugene; Kurisu, Genji; Liang, Yuhe; Liebschner, Dorothee; Mak, Lora; Markley, John L; Moriarty, Nigel W; Murshudov, Garib N; Noble, Martin; Peisach, Ezra; Persikova, Irina; Poon, Billy K; Sobolev, Oleg V; Ulrich, Eldon L; Velankar, Sameer; Vonrhein, Clemens; Westbrook, John; Wojdyr, Marcin; Yokochi, Masashi; Young, Jasmine Y.

Acta Crystallogr D Struct Biol ; 75(Pt 4): 451-454, 2019 Apr 01.

Article in English | MEDLINE | ID: mdl-30988261

Subject(s)

Crystallography, X-Ray/methods , Database Management Systems/standards , Databases, Protein , Protein Conformation , Proteins/chemistry , Software , Humans

10.

Worldwide Protein Data Bank biocuration supporting open access to high-quality 3D structural biology data.

Young, Jasmine Y; Westbrook, John D; Feng, Zukang; Peisach, Ezra; Persikova, Irina; Sala, Raul; Sen, Sanchayita; Berrisford, John M; Swaminathan, G Jawahar; Oldfield, Thomas J; Gutmanas, Aleksandras; Igarashi, Reiko; Armstrong, David R; Baskaran, Kumaran; Chen, Li; Chen, Minyu; Clark, Alice R; Di Costanzo, Luigi; Dimitropoulos, Dimitris; Gao, Guanghua; Ghosh, Sutapa; Gore, Swanand; Guranovic, Vladimir; Hendrickx, Pieter M S; Hudson, Brian P; Ikegawa, Yasuyo; Kengaku, Yumiko; Lawson, Catherine L; Liang, Yuhe; Mak, Lora; Mukhopadhyay, Abhik; Narayanan, Buvaneswari; Nishiyama, Kayoko; Patwardhan, Ardan; Sahni, Gaurav; Sanz-García, Eduardo; Sato, Junko; Sekharan, Monica R; Shao, Chenghua; Smart, Oliver S; Tan, Lihua; van Ginkel, Glen; Yang, Huanwang; Zhuravleva, Marina A; Markley, John L; Nakamura, Haruki; Kurisu, Genji; Kleywegt, Gerard J; Velankar, Sameer; Berman, Helen M.

Database (Oxford) ; 20182018 01 01.

Article in English | MEDLINE | ID: mdl-29688351

ABSTRACT

Database URL: https://www.wwpdb.org/.

Subject(s)

Data Curation , Databases, Protein , Protein Conformation , Vocabulary, Controlled

11.

OneDep: Unified wwPDB System for Deposition, Biocuration, and Validation of Macromolecular Structures in the PDB Archive.

Young, Jasmine Y; Westbrook, John D; Feng, Zukang; Sala, Raul; Peisach, Ezra; Oldfield, Thomas J; Sen, Sanchayita; Gutmanas, Aleksandras; Armstrong, David R; Berrisford, John M; Chen, Li; Chen, Minyu; Di Costanzo, Luigi; Dimitropoulos, Dimitris; Gao, Guanghua; Ghosh, Sutapa; Gore, Swanand; Guranovic, Vladimir; Hendrickx, Pieter M S; Hudson, Brian P; Igarashi, Reiko; Ikegawa, Yasuyo; Kobayashi, Naohiro; Lawson, Catherine L; Liang, Yuhe; Mading, Steve; Mak, Lora; Mir, M Saqib; Mukhopadhyay, Abhik; Patwardhan, Ardan; Persikova, Irina; Rinaldi, Luana; Sanz-Garcia, Eduardo; Sekharan, Monica R; Shao, Chenghua; Swaminathan, G Jawahar; Tan, Lihua; Ulrich, Eldon L; van Ginkel, Glen; Yamashita, Reiko; Yang, Huanwang; Zhuravleva, Marina A; Quesada, Martha; Kleywegt, Gerard J; Berman, Helen M; Markley, John L; Nakamura, Haruki; Velankar, Sameer; Burley, Stephen K.

Structure ; 25(3): 536-545, 2017 03 07.

Article in English | MEDLINE | ID: mdl-28190782

ABSTRACT

OneDep, a unified system for deposition, biocuration, and validation of experimentally determined structures of biological macromolecules to the PDB archive, has been developed as a global collaboration by the worldwide PDB (wwPDB) partners. This new system was designed to ensure that the wwPDB could meet the evolving archiving requirements of the scientific community over the coming decades. OneDep unifies deposition, biocuration, and validation pipelines across all wwPDB, EMDB, and BMRB deposition sites with improved focus on data quality and completeness in these archives, while supporting growth in the number of depositions and increases in their average size and complexity. In this paper, we describe the design, functional operation, and supporting infrastructure of the OneDep system, and provide initial performance assessments.

Subject(s)

Proteins/chemistry , Data Curation , Databases, Protein , Internet , Models, Molecular , Nuclear Magnetic Resonance, Biomolecular , Protein Conformation , User-Computer Interface

12.

Small molecule annotation for the Protein Data Bank.

Sen, Sanchayita; Young, Jasmine; Berrisford, John M; Chen, Minyu; Conroy, Matthew J; Dutta, Shuchismita; Di Costanzo, Luigi; Gao, Guanghua; Ghosh, Sutapa; Hudson, Brian P; Igarashi, Reiko; Kengaku, Yumiko; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Mukhopadhyay, Abhik; Narayanan, Buvaneswari Coimbatore; Sahni, Gaurav; Sato, Junko; Sekharan, Monica; Shao, Chenghua; Tan, Lihua; Zhuravleva, Marina A.

Database (Oxford) ; 2014: bau116, 2014.

Article in English | MEDLINE | ID: mdl-25425036

ABSTRACT

The Protein Data Bank (PDB) is the single global repository for three-dimensional structures of biological macromolecules and their complexes, and its more than 100,000 structures contain more than 20,000 distinct ligands or small molecules bound to proteins and nucleic acids. Information about these small molecules and their interactions with proteins and nucleic acids is crucial for our understanding of biochemical processes and vital for structure-based drug design. Small molecules present in a deposited structure may be attached to a polymer or may occur as a separate, non-covalently linked ligand. During curation of a newly deposited structure by wwPDB annotation staff, each molecule is cross-referenced to the PDB Chemical Component Dictionary (CCD). If the molecule is new to the PDB, a dictionary description is created for it. The information about all small molecule components found in the PDB is distributed via the ftp archive as an external reference file. Small molecule annotation in the PDB also includes information about ligand-binding sites and about covalent and other linkages between ligands and macromolecules. During the remediation of the peptide-like antibiotics and inhibitors present in the PDB archive in 2011, it became clear that additional annotation was required for consistent representation of these molecules, which are quite often composed of several sequential subcomponents including modified amino acids and other chemical groups. The connectivity information of the modified amino acids is necessary for correct representation of these biologically interesting molecules. The combined information is made available via a new resource called the Biologically Interesting molecules Reference Dictionary, which is complementary to the CCD and is now routinely used for annotation of peptide-like antibiotics and inhibitors.

Subject(s)

Databases, Chemical , Databases, Protein , Small Molecule Libraries/chemistry , Anti-Bacterial Agents/chemistry , Anti-Bacterial Agents/pharmacology , Binding Sites , Data Mining , Glucose/chemistry , Glycopeptides/chemistry , Glycopeptides/pharmacology , Ligands , Models, Molecular , Reproducibility of Results , Small Molecule Libraries/pharmacology

13.

Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank.

Dutta, Shuchismita; Dimitropoulos, Dimitris; Feng, Zukang; Persikova, Irina; Sen, Sanchayita; Shao, Chenghua; Westbrook, John; Young, Jasmine; Zhuravleva, Marina A; Kleywegt, Gerard J; Berman, Helen M.

Biopolymers ; 101(6): 659-68, 2014 Jun.

Article in English | MEDLINE | ID: mdl-24173824

ABSTRACT

With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called "group" was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently.

Subject(s)

Anti-Bacterial Agents/pharmacology , Databases, Protein , Peptides/pharmacology , Anti-Bacterial Agents/chemistry , Enzyme Inhibitors/chemistry , Enzyme Inhibitors/pharmacology , Gramicidin/chemistry , Gramicidin/pharmacology , Pancreatic Elastase/antagonists & inhibitors , Peptides/chemistry , Thiostrepton/chemistry , Thiostrepton/pharmacology , Vancomycin/chemistry , Vancomycin/pharmacology

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL