Search | VHL Regional Portal

1.

RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chao, Henry; Chen, Li; Craig, Paul A; Crichlow, Gregg V; Dalenberg, Kenneth; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai; Ghosh, Sutapa; Goodsell, David S; Green, Rachel Kramer; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Khokhriakov, Igor; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Webb, Ben; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zalevsky, Arthur; Zardecki, Christine.

Nucleic Acids Res ; 51(D1): D488-D508, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36420884

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center for the open-access PDB archive. As wwPDB-designated Archive Keeper, RCSB PDB is also responsible for PDB data security. Annually, RCSB PDB serves >10 000 depositors of three-dimensional (3D) biostructures working on all permanently inhabited continents. RCSB PDB delivers data from its research-focused RCSB.org web portal to many millions of PDB data consumers based in virtually every United Nations-recognized country, territory, etc. This Database Issue contribution describes upgrades to the research-focused RCSB.org web portal that created a one-stop-shop for open access to â¼200 000 experimentally-determined PDB structures of biological macromolecules alongside >1 000 000 incorporated Computed Structure Models (CSMs) predicted using artificial intelligence/machine learning methods. RCSB.org is a 'living data resource.' Every PDB structure and CSM is integrated weekly with related functional annotations from external biodata resources, providing up-to-date information for the entire corpus of 3D biostructure data freely available from RCSB.org with no usage limitations. Within RCSB.org, PDB structures and the CSMs are clearly identified as to their provenance and reliability. Both are fully searchable, and can be analyzed and visualized using the full complement of RCSB.org web portal capabilities.

Subject(s)

Artificial Intelligence , Databases, Protein , Proteins , Machine Learning , Protein Conformation , Proteins/chemistry , Reproducibility of Results

2.

Protein Data Bank: A Comprehensive Review of 3D Structure Holdings and Worldwide Utilization by Researchers, Educators, and Students.

Burley, Stephen K; Berman, Helen M; Duarte, Jose M; Feng, Zukang; Flatt, Justin W; Hudson, Brian P; Lowe, Robert; Peisach, Ezra; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Westbrook, John D; Young, Jasmine Y; Zardecki, Christine.

Biomolecules ; 12(10)2022 10 04.

Article in English | MEDLINE | ID: mdl-36291635

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the United States National Science Foundation, National Institutes of Health, and Department of Energy, supports structural biologists and Protein Data Bank (PDB) data users around the world. The RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, serves as the US data center for the global PDB archive housing experimentally-determined three-dimensional (3D) structure data for biological macromolecules. As the wwPDB-designated Archive Keeper, RCSB PDB is also responsible for the security of PDB data and weekly update of the archive. RCSB PDB serves tens of thousands of data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) annually working on all permanently inhabited continents. RCSB PDB makes PDB data available from its research-focused web portal at no charge and without usage restrictions to many millions of PDB data consumers around the globe. It also provides educators, students, and the general public with an introduction to the PDB and related training materials through its outreach and education-focused web portal. This review article describes growth of the PDB, examines evolution of experimental methods for structure determination viewed through the lens of the PDB archive, and provides a detailed accounting of PDB archival holdings and their utilization by researchers, educators, and students worldwide.

Subject(s)

Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Computational Biology/methods , Proteins/chemistry , Students

3.

RCSB Protein Data bank: Tools for visualizing and understanding biological macromolecules in 3D.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chao, Henry; Chen, Li; Craig, Paul A; Crichlow, Gregg V; Dalenberg, Kenneth; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai J; Ghosh, Sutapa; Goodsell, David S; Green, Rachel Kramer; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Khokhriakov, Igor; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Webb, Benjamin; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zalevsky, Arthur; Zardecki, Christine.

Protein Sci ; 31(12): e4482, 2022 12.

Article in English | MEDLINE | ID: mdl-36281733

ABSTRACT

Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premiere open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.

Subject(s)

Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Proteins/chemistry , Computational Biology/methods , Macromolecular Substances/chemistry

4.

PDBx/mmCIF Ecosystem: Foundational Semantic Tools for Structural Biology.

Westbrook, John D; Young, Jasmine Y; Shao, Chenghua; Feng, Zukang; Guranovic, Vladimir; Lawson, Catherine L; Vallat, Brinda; Adams, Paul D; Berrisford, John M; Bricogne, Gerard; Diederichs, Kay; Joosten, Robbie P; Keller, Peter; Moriarty, Nigel W; Sobolev, Oleg V; Velankar, Sameer; Vonrhein, Clemens; Waterman, David G; Kurisu, Genji; Berman, Helen M; Burley, Stephen K; Peisach, Ezra.

J Mol Biol ; 434(11): 167599, 2022 06 15.

Article in English | MEDLINE | ID: mdl-35460671

ABSTRACT

PDBx/mmCIF, Protein Data Bank Exchange (PDBx) macromolecular Crystallographic Information Framework (mmCIF), has become the data standard for structural biology. With its early roots in the domain of small-molecule crystallography, PDBx/mmCIF provides an extensible data representation that is used for deposition, archiving, remediation, and public dissemination of experimentally determined three-dimensional (3D) structures of biological macromolecules by the Worldwide Protein Data Bank (wwPDB, wwpdb.org). Extensions of PDBx/mmCIF are similarly used for computed structure models by ModelArchive (modelarchive.org), integrative/hybrid structures by PDB-Dev (pdb-dev.wwpdb.org), small angle scattering data by Small Angle Scattering Biological Data Bank SASBDB (sasbdb.org), and for models computed generated with the AlphaFold 2.0 deep learning software suite (alphafold.ebi.ac.uk). Community-driven development of PDBx/mmCIF spans three decades, involving contributions from researchers, software and methods developers in structural sciences, data repository providers, scientific publishers, and professional societies. Having a semantically rich and extensible data framework for representing a wide range of structural biology experimental and computational results, combined with expertly curated 3D biostructure data sets in public repositories, accelerates the pace of scientific discovery. Herein, we describe the architecture of the PDBx/mmCIF data standard, tools used to maintain representations of the data standard, governance, and processes by which data content standards are extended, plus community tools/software libraries available for processing and checking the integrity of PDBx/mmCIF data. Use cases exemplify how the members of the Worldwide Protein Data Bank have used PDBx/mmCIF as the foundation for its pipeline for delivering Findable, Accessible, Interoperable, and Reusable (FAIR) data to many millions of users worldwide.

Subject(s)

Computational Biology , Crystallography , Databases, Protein , Software , Macromolecular Substances/chemistry , Molecular Biology , Protein Conformation , Semantics

5.

Simplified quality assessment for small-molecule ligands in the Protein Data Bank.

Shao, Chenghua; Westbrook, John D; Lu, Changpeng; Bhikadiya, Charmi; Peisach, Ezra; Young, Jasmine Y; Duarte, Jose M; Lowe, Robert; Wang, Sijian; Rose, Yana; Feng, Zukang; Burley, Stephen K.

Structure ; 30(2): 252-262.e4, 2022 02 03.

Article in English | MEDLINE | ID: mdl-35026162

ABSTRACT

More than 70% of the experimentally determined macromolecular structures in the Protein Data Bank (PDB) contain small-molecule ligands. Quality indicators of â¼643,000 ligands present in â¼106,000 PDB X-ray crystal structures have been analyzed. Ligand quality varies greatly with regard to goodness of fit between ligand structure and experimental data, deviations in bond lengths and angles from known chemical structures, and inappropriate interatomic clashes between the ligand and its surroundings. Based on principal component analysis, correlated quality indicators of ligand structure have been aggregated into two largely orthogonal composite indicators measuring goodness of fit to experimental data and deviation from ideal chemical structure. Ranking of the composite quality indicators across the PDB archive enabled construction of uniformly distributed composite ranking score. This score is implemented at RCSB.org to compare chemically identical ligands in distinct PDB structures with easy-to-interpret two-dimensional ligand quality plots, allowing PDB users to quickly assess ligand structure quality and select the best exemplars.

Subject(s)

Proteins/chemistry , Proteins/metabolism , Small Molecule Libraries/pharmacology , Databases, Protein , Ligands , Models, Molecular , Protein Conformation

6.

RCSB Protein Data Bank: Celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chen, Li; Crichlow, Gregg V; Duarte, Jose M; Dutta, Shuchismita; Fayazi, Maryam; Feng, Zukang; Flatt, Justin W; Ganesan, Sai J; Goodsell, David S; Ghosh, Sutapa; Kramer Green, Rachel; Guranovic, Vladimir; Henry, Jeremy; Hudson, Brian P; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Peisach, Ezra; Persikova, Irina; Piehl, Dennis W; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Vallat, Brinda; Voigt, Maria; Westbrook, John D; Whetstone, Shamara; Young, Jasmine Y; Zardecki, Christine.

Protein Sci ; 31(1): 187-208, 2022 01.

Article in English | MEDLINE | ID: mdl-34676613

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.

Subject(s)

Computational Biology/history , Databases, Protein/history , User-Computer Interface , Anniversaries and Special Events , History, 20th Century , History, 21st Century

7.

Modernized uniform representation of carbohydrate molecules in the Protein Data Bank.

Shao, Chenghua; Feng, Zukang; Westbrook, John D; Peisach, Ezra; Berrisford, John; Ikegawa, Yasuyo; Kurisu, Genji; Velankar, Sameer; Burley, Stephen K; Young, Jasmine Y.

Glycobiology ; 31(9): 1204-1218, 2021 09 20.

Article in English | MEDLINE | ID: mdl-33978738

ABSTRACT

Since 1971, the Protein Data Bank (PDB) has served as the single global archive for experimentally determined 3D structures of biological macromolecules made freely available to the global community according to the FAIR principles of Findability-Accessibility-Interoperability-Reusability. During the first 50 years of continuous PDB operations, standards for data representation have evolved to better represent rich and complex biological phenomena. Carbohydrate molecules present in more than 14,000 PDB structures have recently been reviewed and remediated to conform to a new standardized format. This machine-readable data representation for carbohydrates occurring in the PDB structures and the corresponding reference data improves the findability, accessibility, interoperability and reusability of structural information pertaining to these molecules. The PDB Exchange MacroMolecular Crystallographic Information File data dictionary now supports (i) standardized atom nomenclature that conforms to International Union of Pure and Applied Chemistry-International Union of Biochemistry and Molecular Biology (IUPAC-IUBMB) recommendations for carbohydrates, (ii) uniform representation of branched entities for oligosaccharides, (iii) commonly used linear descriptors of carbohydrates developed by the glycoscience community and (iv) annotation of glycosylation sites in proteins. For the first time, carbohydrates in PDB structures are consistently represented as collections of standardized monosaccharides, which precisely describe oligosaccharide structures and enable improved carbohydrate visualization, structure validation, robust quantitative and qualitative analyses, search for dendritic structures and classification. The uniform representation of carbohydrate molecules in the PDB described herein will facilitate broader usage of the resource by the glycoscience community and researchers studying glycoproteins.

Subject(s)

Carbohydrates , Proteins , Carbohydrates/chemistry , Databases, Protein , Proteins/chemistry

8.

Enhanced validation of small-molecule ligands and carbohydrates in the Protein Data Bank.

Feng, Zukang; Westbrook, John D; Sala, Raul; Smart, Oliver S; Bricogne, Gérard; Matsubara, Masaaki; Yamada, Issaku; Tsuchiya, Shinichiro; Aoki-Kinoshita, Kiyoko F; Hoch, Jeffrey C; Kurisu, Genji; Velankar, Sameer; Burley, Stephen K; Young, Jasmine Y.

Structure ; 29(4): 393-400.e1, 2021 04 01.

Article in English | MEDLINE | ID: mdl-33657417

ABSTRACT

The Worldwide Protein Data Bank (wwPDB) has provided validation reports based on recommendations from community Validation Task Forces for structures in the PDB since 2013. To further enhance validation of small molecules as recommended from the 2016 Ligand Validation Workshop, wwPDB, Global Phasing Ltd., and the Noguchi Institute, recently formed a public/private partnership to incorporate some of their software tools into the wwPDB validation package. Augmented wwPDB validation report features include: two-dimensional (2D) diagrams of small-molecule ligands and carbohydrates, highlighting geometric validation outcomes; 2D topological diagrams of oligosaccharides present in branched entities generated using 2D Symbol Nomenclature for Glycan representation; and views of 3D electron density maps for ligands and carbohydrates, illustrating the goodness-of-fit between the atomic structure and experimental data (X-ray crystallographic structures only). These improvements will impact confidence in ligand conformation and ligand-macromolecular interactions that will aid in understanding biochemical function and contribute to small-molecule drug discovery.

Subject(s)

Carbohydrates/chemistry , Databases, Protein/standards , Molecular Docking Simulation/methods , Proteomics/methods , Small Molecule Libraries/chemistry , Cheminformatics/methods , Databases, Chemical/standards , Humans , Ligands , Protein Binding , Proteome/chemistry , Proteome/metabolism

9.

RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences.

Burley, Stephen K; Bhikadiya, Charmi; Bi, Chunxiao; Bittrich, Sebastian; Chen, Li; Crichlow, Gregg V; Christie, Cole H; Dalenberg, Kenneth; Di Costanzo, Luigi; Duarte, Jose M; Dutta, Shuchismita; Feng, Zukang; Ganesan, Sai; Goodsell, David S; Ghosh, Sutapa; Green, Rachel Kramer; Guranovic, Vladimir; Guzenko, Dmytro; Hudson, Brian P; Lawson, Catherine L; Liang, Yuhe; Lowe, Robert; Namkoong, Harry; Peisach, Ezra; Persikova, Irina; Randle, Chris; Rose, Alexander; Rose, Yana; Sali, Andrej; Segura, Joan; Sekharan, Monica; Shao, Chenghua; Tao, Yi-Ping; Voigt, Maria; Westbrook, John D; Young, Jasmine Y; Zardecki, Christine; Zhuravleva, Marina.

Nucleic Acids Res ; 49(D1): D437-D451, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33211854

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), the US data center for the global PDB archive and a founding member of the Worldwide Protein Data Bank partnership, serves tens of thousands of data depositors in the Americas and Oceania and makes 3D macromolecular structure data available at no charge and without restrictions to millions of RCSB.org users around the world, including >660 000 educators, students and members of the curious public using PDB101.RCSB.org. PDB data depositors include structural biologists using macromolecular crystallography, nuclear magnetic resonance spectroscopy, 3D electron microscopy and micro-electron diffraction. PDB data consumers accessing our web portals include researchers, educators and students studying fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. During the past 2 years, the research-focused RCSB PDB web portal (RCSB.org) has undergone a complete redesign, enabling improved searching with full Boolean operator logic and more facile access to PDB data integrated with >40 external biodata resources. New features and resources are described in detail using examples that showcase recently released structures of SARS-CoV-2 proteins and host cell proteins relevant to understanding and addressing the COVID-19 global pandemic.

Subject(s)

Computational Biology/methods , Databases, Protein , Macromolecular Substances/chemistry , Protein Conformation , Proteins/chemistry , Bioengineering/methods , Biomedical Research/methods , Biotechnology/methods , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/virology , Humans , Macromolecular Substances/metabolism , Pandemics , Proteins/genetics , Proteins/metabolism , SARS-CoV-2/genetics , SARS-CoV-2/metabolism , SARS-CoV-2/physiology , Software , Viral Proteins/chemistry , Viral Proteins/genetics , Viral Proteins/metabolism

10.

Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB).

Adams, Paul D; Afonine, Pavel V; Baskaran, Kumaran; Berman, Helen M; Berrisford, John; Bricogne, Gerard; Brown, David G; Burley, Stephen K; Chen, Minyu; Feng, Zukang; Flensburg, Claus; Gutmanas, Aleksandras; Hoch, Jeffrey C; Ikegawa, Yasuyo; Kengaku, Yumiko; Krissinel, Eugene; Kurisu, Genji; Liang, Yuhe; Liebschner, Dorothee; Mak, Lora; Markley, John L; Moriarty, Nigel W; Murshudov, Garib N; Noble, Martin; Peisach, Ezra; Persikova, Irina; Poon, Billy K; Sobolev, Oleg V; Ulrich, Eldon L; Velankar, Sameer; Vonrhein, Clemens; Westbrook, John; Wojdyr, Marcin; Yokochi, Masashi; Young, Jasmine Y.

Acta Crystallogr D Struct Biol ; 75(Pt 4): 451-454, 2019 Apr 01.

Article in English | MEDLINE | ID: mdl-30988261

Subject(s)

Crystallography, X-Ray/methods , Database Management Systems/standards , Databases, Protein , Protein Conformation , Proteins/chemistry , Software , Humans

11.

RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy.

Burley, Stephen K; Berman, Helen M; Bhikadiya, Charmi; Bi, Chunxiao; Chen, Li; Di Costanzo, Luigi; Christie, Cole; Dalenberg, Ken; Duarte, Jose M; Dutta, Shuchismita; Feng, Zukang; Ghosh, Sutapa; Goodsell, David S; Green, Rachel K; Guranovic, Vladimir; Guzenko, Dmytro; Hudson, Brian P; Kalro, Tara; Liang, Yuhe; Lowe, Robert; Namkoong, Harry; Peisach, Ezra; Periskova, Irina; Prlic, Andreas; Randle, Chris; Rose, Alexander; Rose, Peter; Sala, Raul; Sekharan, Monica; Shao, Chenghua; Tan, Lihua; Tao, Yi-Ping; Valasatava, Yana; Voigt, Maria; Westbrook, John; Woo, Jesse; Yang, Huanwang; Young, Jasmine; Zhuravleva, Marina; Zardecki, Christine.

Nucleic Acids Res ; 47(D1): D464-D474, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30357411

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB, rcsb.org), the US data center for the global PDB archive, serves thousands of Data Depositors in the Americas and Oceania and makes 3D macromolecular structure data available at no charge and without usage restrictions to more than 1 million rcsb.org Users worldwide and 600 000 pdb101.rcsb.org education-focused Users around the globe. PDB Data Depositors include structural biologists using macromolecular crystallography, nuclear magnetic resonance spectroscopy and 3D electron microscopy. PDB Data Consumers include researchers, educators and students studying Fundamental Biology, Biomedicine, Biotechnology and Energy. Recent reorganization of RCSB PDB activities into four integrated, interdependent services is described in detail, together with tools and resources added over the past 2 years to RCSB PDB web portals in support of a 'Structural View of Biology.'

Subject(s)

Databases, Protein , Protein Conformation , Biomedical Research/education , Biotechnology/education , Data Curation , Software

12.

Worldwide Protein Data Bank biocuration supporting open access to high-quality 3D structural biology data.

Young, Jasmine Y; Westbrook, John D; Feng, Zukang; Peisach, Ezra; Persikova, Irina; Sala, Raul; Sen, Sanchayita; Berrisford, John M; Swaminathan, G Jawahar; Oldfield, Thomas J; Gutmanas, Aleksandras; Igarashi, Reiko; Armstrong, David R; Baskaran, Kumaran; Chen, Li; Chen, Minyu; Clark, Alice R; Di Costanzo, Luigi; Dimitropoulos, Dimitris; Gao, Guanghua; Ghosh, Sutapa; Gore, Swanand; Guranovic, Vladimir; Hendrickx, Pieter M S; Hudson, Brian P; Ikegawa, Yasuyo; Kengaku, Yumiko; Lawson, Catherine L; Liang, Yuhe; Mak, Lora; Mukhopadhyay, Abhik; Narayanan, Buvaneswari; Nishiyama, Kayoko; Patwardhan, Ardan; Sahni, Gaurav; Sanz-García, Eduardo; Sato, Junko; Sekharan, Monica R; Shao, Chenghua; Smart, Oliver S; Tan, Lihua; van Ginkel, Glen; Yang, Huanwang; Zhuravleva, Marina A; Markley, John L; Nakamura, Haruki; Kurisu, Genji; Kleywegt, Gerard J; Velankar, Sameer; Berman, Helen M.

Database (Oxford) ; 20182018 01 01.

Article in English | MEDLINE | ID: mdl-29688351

ABSTRACT

Database URL: https://www.wwpdb.org/.

Subject(s)

Data Curation , Databases, Protein , Protein Conformation , Vocabulary, Controlled

13.

RCSB Protein Data Bank: Sustaining a living digital data resource that enables breakthroughs in scientific research and biomedical education.

Burley, Stephen K; Berman, Helen M; Christie, Cole; Duarte, Jose M; Feng, Zukang; Westbrook, John; Young, Jasmine; Zardecki, Christine.

Protein Sci ; 27(1): 316-330, 2018 01.

Article in English | MEDLINE | ID: mdl-29067736

ABSTRACT

The Protein Data Bank (PDB) is one of two archival resources for experimental data central to biomedical research and education worldwide (the other key Primary Data Archive in biology being the International Nucleotide Sequence Database Collaboration). The PDB currently houses >134,000 atomic level biomolecular structures determined by crystallography, NMR spectroscopy, and 3D electron microscopy. It was established in 1971 as the first open-access, digital-data resource in biology, and is managed by the Worldwide Protein Data Bank partnership (wwPDB; wwpdb.org). US PDB operations are conducted by the RCSB Protein Data Bank (RCSB PDB; RCSB.org; Rutgers University and UC San Diego) and funded by NSF, NIH, and DoE. The RCSB PDB serves as the global Archive Keeper for the wwPDB. During calendar 2016, >591 million structure data files were downloaded from the PDB by Data Consumers working in every sovereign nation recognized by the United Nations. During this same period, the RCSB PDB processed >5300 new atomic level biomolecular structures plus experimental data and metadata coming into the archive from Data Depositors working in the Americas and Oceania. In addition, RCSB PDB served >1 million RCSB.org users worldwide with PDB data integrated with â¼40 external data resources providing rich structural views of fundamental biology, biomedicine, and energy sciences, and >600,000 PDB101.rcsb.org educational website users around the globe. RCSB PDB resources are described in detail together with metrics documenting the impact of access to PDB data on basic and applied research, clinical medicine, education, and the economy.

Subject(s)

Biomedical Research/education , Biomedical Research/methods , Databases, Protein , Biomedical Research/trends , Humans

14.

Validation of Structures in the Protein Data Bank.

Gore, Swanand; Sanz García, Eduardo; Hendrickx, Pieter M S; Gutmanas, Aleksandras; Westbrook, John D; Yang, Huanwang; Feng, Zukang; Baskaran, Kumaran; Berrisford, John M; Hudson, Brian P; Ikegawa, Yasuyo; Kobayashi, Naohiro; Lawson, Catherine L; Mading, Steve; Mak, Lora; Mukhopadhyay, Abhik; Oldfield, Thomas J; Patwardhan, Ardan; Peisach, Ezra; Sahni, Gaurav; Sekharan, Monica R; Sen, Sanchayita; Shao, Chenghua; Smart, Oliver S; Ulrich, Eldon L; Yamashita, Reiko; Quesada, Martha; Young, Jasmine Y; Nakamura, Haruki; Markley, John L; Berman, Helen M; Burley, Stephen K; Velankar, Sameer; Kleywegt, Gerard J.

Structure ; 25(12): 1916-1927, 2017 12 05.

Article in English | MEDLINE | ID: mdl-29174494

ABSTRACT

The Worldwide PDB recently launched a deposition, biocuration, and validation tool: OneDep. At various stages of OneDep data processing, validation reports for three-dimensional structures of biological macromolecules are produced. These reports are based on recommendations of expert task forces representing crystallography, nuclear magnetic resonance, and cryoelectron microscopy communities. The reports provide useful metrics with which depositors can evaluate the quality of the experimental data, the structural model, and the fit between them. The validation module is also available as a stand-alone web server and as a programmatically accessible web service. A growing number of journals require the official wwPDB validation reports (produced at biocuration) to accompany manuscripts describing macromolecular structures. Upon public release of the structure, the validation report becomes part of the public PDB archive. Geometric quality scores for proteins in the PDB archive have improved over the past decade.

Subject(s)

Databases, Protein/standards , Validation Studies as Topic , Sequence Analysis, Protein/methods , Sequence Analysis, Protein/standards

15.

OneDep: Unified wwPDB System for Deposition, Biocuration, and Validation of Macromolecular Structures in the PDB Archive.

Young, Jasmine Y; Westbrook, John D; Feng, Zukang; Sala, Raul; Peisach, Ezra; Oldfield, Thomas J; Sen, Sanchayita; Gutmanas, Aleksandras; Armstrong, David R; Berrisford, John M; Chen, Li; Chen, Minyu; Di Costanzo, Luigi; Dimitropoulos, Dimitris; Gao, Guanghua; Ghosh, Sutapa; Gore, Swanand; Guranovic, Vladimir; Hendrickx, Pieter M S; Hudson, Brian P; Igarashi, Reiko; Ikegawa, Yasuyo; Kobayashi, Naohiro; Lawson, Catherine L; Liang, Yuhe; Mading, Steve; Mak, Lora; Mir, M Saqib; Mukhopadhyay, Abhik; Patwardhan, Ardan; Persikova, Irina; Rinaldi, Luana; Sanz-Garcia, Eduardo; Sekharan, Monica R; Shao, Chenghua; Swaminathan, G Jawahar; Tan, Lihua; Ulrich, Eldon L; van Ginkel, Glen; Yamashita, Reiko; Yang, Huanwang; Zhuravleva, Marina A; Quesada, Martha; Kleywegt, Gerard J; Berman, Helen M; Markley, John L; Nakamura, Haruki; Velankar, Sameer; Burley, Stephen K.

Structure ; 25(3): 536-545, 2017 03 07.

Article in English | MEDLINE | ID: mdl-28190782

ABSTRACT

OneDep, a unified system for deposition, biocuration, and validation of experimentally determined structures of biological macromolecules to the PDB archive, has been developed as a global collaboration by the worldwide PDB (wwPDB) partners. This new system was designed to ensure that the wwPDB could meet the evolving archiving requirements of the scientific community over the coming decades. OneDep unifies deposition, biocuration, and validation pipelines across all wwPDB, EMDB, and BMRB deposition sites with improved focus on data quality and completeness in these archives, while supporting growth in the number of depositions and increases in their average size and complexity. In this paper, we describe the design, functional operation, and supporting infrastructure of the OneDep system, and provide initial performance assessments.

Subject(s)

Proteins/chemistry , Data Curation , Databases, Protein , Internet , Models, Molecular , Nuclear Magnetic Resonance, Biomolecular , Protein Conformation , User-Computer Interface

16.

The RCSB protein data bank: integrative view of protein, gene and 3D structural information.

Rose, Peter W; Prlic, Andreas; Altunkaya, Ali; Bi, Chunxiao; Bradley, Anthony R; Christie, Cole H; Costanzo, Luigi Di; Duarte, Jose M; Dutta, Shuchismita; Feng, Zukang; Green, Rachel Kramer; Goodsell, David S; Hudson, Brian; Kalro, Tara; Lowe, Robert; Peisach, Ezra; Randle, Christopher; Rose, Alexander S; Shao, Chenghua; Tao, Yi-Ping; Valasatava, Yana; Voigt, Maria; Westbrook, John D; Woo, Jesse; Yang, Huangwang; Young, Jasmine Y; Zardecki, Christine; Berman, Helen M; Burley, Stephen K.

Nucleic Acids Res ; 45(D1): D271-D281, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27794042

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB, http://rcsb.org), the US data center for the global PDB archive, makes PDB data freely available to all users, from structural biologists to computational biologists and beyond. New tools and resources have been added to the RCSB PDB web portal in support of a 'Structural View of Biology.' Recent developments have improved the User experience, including the high-speed NGL Viewer that provides 3D molecular visualization in any web browser, improved support for data file download and enhanced organization of website pages for query, reporting and individual structure exploration. Structure validation information is now visible for all archival entries. PDB data have been integrated with external biological resources, including chromosomal position within the human genome; protein modifications; and metabolic pathways. PDB-101 educational materials have been reorganized into a searchable website and expanded to include new features such as the Geis Digital Archive.

Subject(s)

Computational Biology/methods , Databases, Genetic , Proteins/chemistry , Proteins/genetics , Datasets as Topic , Metabolic Networks and Pathways , Models, Molecular , Protein Conformation , Proteins/metabolism , Software , Structure-Activity Relationship , User-Computer Interface , Web Browser

17.

Outcome of the First wwPDB/CCDC/D3R Ligand Validation Workshop.

Adams, Paul D; Aertgeerts, Kathleen; Bauer, Cary; Bell, Jeffrey A; Berman, Helen M; Bhat, Talapady N; Blaney, Jeff M; Bolton, Evan; Bricogne, Gerard; Brown, David; Burley, Stephen K; Case, David A; Clark, Kirk L; Darden, Tom; Emsley, Paul; Feher, Victoria A; Feng, Zukang; Groom, Colin R; Harris, Seth F; Hendle, Jorg; Holder, Thomas; Joachimiak, Andrzej; Kleywegt, Gerard J; Krojer, Tobias; Marcotrigiano, Joseph; Mark, Alan E; Markley, John L; Miller, Matthew; Minor, Wladek; Montelione, Gaetano T; Murshudov, Garib; Nakagawa, Atsushi; Nakamura, Haruki; Nicholls, Anthony; Nicklaus, Marc; Nolte, Robert T; Padyana, Anil K; Peishoff, Catherine E; Pieniazek, Susan; Read, Randy J; Shao, Chenghua; Sheriff, Steven; Smart, Oliver; Soisson, Stephen; Spurlino, John; Stouch, Terry; Svobodova, Radka; Tempel, Wolfram; Terwilliger, Thomas C; Tronrud, Dale.

Structure ; 24(4): 502-508, 2016 Apr 05.

Article in English | MEDLINE | ID: mdl-27050687

ABSTRACT

Crystallographic studies of ligands bound to biological macromolecules (proteins and nucleic acids) represent an important source of information concerning drug-target interactions, providing atomic level insights into the physical chemistry of complex formation between macromolecules and ligands. Of the more than 115,000 entries extant in the Protein Data Bank (PDB) archive, â¼75% include at least one non-polymeric ligand. Ligand geometrical and stereochemical quality, the suitability of ligand models for in silico drug discovery and design, and the goodness-of-fit of ligand models to electron-density maps vary widely across the archive. We describe the proceedings and conclusions from the first Worldwide PDB/Cambridge Crystallographic Data Center/Drug Design Data Resource (wwPDB/CCDC/D3R) Ligand Validation Workshop held at the Research Collaboratory for Structural Bioinformatics at Rutgers University on July 30-31, 2015. Experts in protein crystallography from academe and industry came together with non-profit and for-profit software providers for crystallography and with experts in computational chemistry and data archiving to discuss and make recommendations on best practices, as framed by a series of questions central to structural studies of macromolecule-ligand complexes. What data concerning bound ligands should be archived in the PDB? How should the ligands be best represented? How should structural models of macromolecule-ligand complexes be validated? What supplementary information should accompany publications of structural studies of biological macromolecules? Consensus recommendations on best practices developed in response to each of these questions are provided, together with some details regarding implementation. Important issues addressed but not resolved at the workshop are also enumerated.

Subject(s)

Databases, Protein , Proteins/chemistry , Crystallography, X-Ray , Data Curation , Guidelines as Topic , Ligands , Models, Molecular , Protein Conformation

18.

The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank.

Westbrook, John D; Shao, Chenghua; Feng, Zukang; Zhuravleva, Marina; Velankar, Sameer; Young, Jasmine.

Bioinformatics ; 31(8): 1274-8, 2015 Apr 15.

Article in English | MEDLINE | ID: mdl-25540181

ABSTRACT

UNLABELLED: The Chemical Component Dictionary (CCD) is a chemical reference data resource that describes all residue and small molecule components found in Protein Data Bank (PDB) entries. The CCD contains detailed chemical descriptions for standard and modified amino acids/nucleotides, small molecule ligands and solvent molecules. Each chemical definition includes descriptions of chemical properties such as stereochemical assignments, chemical descriptors, systematic chemical names and idealized coordinates. The content, preparation, validation and distribution of this CCD chemical reference dataset are described. AVAILABILITY AND IMPLEMENTATION: The CCD is updated regularly in conjunction with the scheduled weekly release of new PDB structure data. The CCD and amino acid variant reference datasets are hosted in the public PDB ftp repository at ftp://ftp.wwpdb.org/pub/pdb/data/monomers/components.cif.gz, ftp://ftp.wwpdb.org/pub/pdb/data/monomers/aa-variants-v1.cif.gz, and its mirror sites, and can be accessed from http://wwpdb.org. CONTACT: jwest@rcsb.rutgers.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Databases, Chemical , Databases, Protein , Dictionaries, Chemical as Topic , Macromolecular Substances/chemistry , Molecular Sequence Annotation , Internet , Ligands , User-Computer Interface

19.

Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank.

Dutta, Shuchismita; Dimitropoulos, Dimitris; Feng, Zukang; Persikova, Irina; Sen, Sanchayita; Shao, Chenghua; Westbrook, John; Young, Jasmine; Zhuravleva, Marina A; Kleywegt, Gerard J; Berman, Helen M.

Biopolymers ; 101(6): 659-68, 2014 Jun.

Article in English | MEDLINE | ID: mdl-24173824

ABSTRACT

With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called "group" was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently.

Subject(s)

Anti-Bacterial Agents/pharmacology , Databases, Protein , Peptides/pharmacology , Anti-Bacterial Agents/chemistry , Enzyme Inhibitors/chemistry , Enzyme Inhibitors/pharmacology , Gramicidin/chemistry , Gramicidin/pharmacology , Pancreatic Elastase/antagonists & inhibitors , Peptides/chemistry , Thiostrepton/chemistry , Thiostrepton/pharmacology , Vancomycin/chemistry , Vancomycin/pharmacology

20.

Chemical annotation of small and peptide-like molecules at the Protein Data Bank.

Young, Jasmine Y; Feng, Zukang; Dimitropoulos, Dimitris; Sala, Raul; Westbrook, John; Zhuravleva, Marina; Shao, Chenghua; Quesada, Martha; Peisach, Ezra; Berman, Helen M.

Database (Oxford) ; 2013: bat079, 2013.

Article in English | MEDLINE | ID: mdl-24291661

ABSTRACT

Over the past decade, the number of polymers and their complexes with small molecules in the Protein Data Bank archive (PDB) has continued to increase significantly. To support scientific advancements and ensure the best quality and completeness of the data files over the next 10 years and beyond, the Worldwide PDB partnership that manages the PDB archive is developing a new deposition and annotation system. This system focuses on efficient data capture across all supported experimental methods. The new deposition and annotation system is composed of four major modules that together support all of the processing requirements for a PDB entry. In this article, we describe one such module called the Chemical Component Annotation Tool. This tool uses information from both the Chemical Component Dictionary and Biologically Interesting molecule Reference Dictionary to aid in annotation. Benchmark studies have shown that the Chemical Component Annotation Tool provides significant improvements in processing efficiency and data quality. Database URL: http://wwpdb.org.

Subject(s)

Databases, Protein , Molecular Sequence Annotation , Peptides/chemistry , Computational Biology , Dictionaries as Topic , Reference Standards , Reproducibility of Results , Terminology as Topic

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL