Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
IUCrJ ; 11(Pt 3): 279-286, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38597878

ABSTRACT

The Protein Data Bank (PDB) was established as the first open-access digital data resource in biology and medicine in 1971 with seven X-ray crystal structures of proteins. Today, the PDB houses >210 000 experimentally determined, atomic level, 3D structures of proteins and nucleic acids as well as their complexes with one another and small molecules (e.g. approved drugs, enzyme cofactors). These data provide insights into fundamental biology, biomedicine, bioenergy and biotechnology. They proved particularly important for understanding the SARS-CoV-2 global pandemic. The US-funded Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) and other members of the Worldwide Protein Data Bank (wwPDB) partnership jointly manage the PDB archive and support >60 000 `data depositors' (structural biologists) around the world. wwPDB ensures the quality and integrity of the data in the ever-expanding PDB archive and supports global open access without limitations on data usage. The RCSB PDB research-focused web portal at https://www.rcsb.org/ (RCSB.org) supports millions of users worldwide, representing a broad range of expertise and interests. In addition to retrieving 3D structure data, PDB `data consumers' access comparative data and external annotations, such as information about disease-causing point mutations and genetic variations. RCSB.org also provides access to >1 000 000 computed structure models (CSMs) generated using artificial intelligence/machine-learning methods. To avoid doubt, the provenance and reliability of experimentally determined PDB structures and CSMs are identified. Related training materials are available to support users in their RCSB.org explorations.


Subject(s)
COVID-19 , Databases, Protein , Protein Conformation , SARS-CoV-2 , COVID-19/epidemiology , Humans , Computational Biology/methods , Proteins/chemistry
2.
J Mol Biol ; 435(14): 167994, 2023 07 15.
Article in English | MEDLINE | ID: mdl-36738985

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) provides open access to experimentally-determined three-dimensional (3D) structures of biomolecules. The RCSB PDB RCSB.org research-focused web portal is used annually by many millions of users around the world. They access biostructure information, run complex queries utilizing various search services (e.g., full-text, structural and chemical attribute, chemical, sequence, and structure similarity searches), and visualize macromolecules in 3D, all at no charge and with no limitations on data usage. Notwithstanding more than 24,000-fold growth of the PDB over the past five decades, experimentally-determined structures are only available for a small subset of the millions of proteins of known sequence. Recently developed machine learning software tools can predict 3D structures of proteins at accuracies comparable to lower-resolution experimental methods. The RCSB PDB now provides access to ∼1,000,000 Computed Structure Models (CSMs) of proteins coming from AlphaFold DB and the ModelArchive alongside ∼200,000 experimentally-determined PDB structures. Both CSMs and PDB structures are available on RCSB.org and via well-established RCSB PDB Data, Search, and 1D-Coordinates application programming interfaces (APIs). Simultaneous delivery of PDB data and CSMs provides users with access to complementary structural information across the human proteome and those of model organisms and selected pathogens. API enhancements are backwards-compatible and programmatic users can "opt in" to access CSMs with minimal effort. Herein, we describe modifications to RCSB PDB cyberinfrastructure required to support sixfold scaling of 3D biostructure data delivery and lay the groundwork for scaling to accommodate hundreds of millions of CSMs.


Subject(s)
Computational Biology , Databases, Protein , Humans , Computational Biology/methods , Protein Conformation , Proteome , Software
3.
J Mol Biol ; 435(14): 168021, 2023 07 15.
Article in English | MEDLINE | ID: mdl-36828268

ABSTRACT

ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled by solely computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange / macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally-determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described.


Subject(s)
Databases, Protein , Macromolecular Substances/chemistry , Protein Conformation , Software
4.
Nucleic Acids Res ; 51(D1): D488-D508, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36420884

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center for the open-access PDB archive. As wwPDB-designated Archive Keeper, RCSB PDB is also responsible for PDB data security. Annually, RCSB PDB serves >10 000 depositors of three-dimensional (3D) biostructures working on all permanently inhabited continents. RCSB PDB delivers data from its research-focused RCSB.org web portal to many millions of PDB data consumers based in virtually every United Nations-recognized country, territory, etc. This Database Issue contribution describes upgrades to the research-focused RCSB.org web portal that created a one-stop-shop for open access to ∼200 000 experimentally-determined PDB structures of biological macromolecules alongside >1 000 000 incorporated Computed Structure Models (CSMs) predicted using artificial intelligence/machine learning methods. RCSB.org is a 'living data resource.' Every PDB structure and CSM is integrated weekly with related functional annotations from external biodata resources, providing up-to-date information for the entire corpus of 3D biostructure data freely available from RCSB.org with no usage limitations. Within RCSB.org, PDB structures and the CSMs are clearly identified as to their provenance and reliability. Both are fully searchable, and can be analyzed and visualized using the full complement of RCSB.org web portal capabilities.


Subject(s)
Artificial Intelligence , Databases, Protein , Proteins , Machine Learning , Protein Conformation , Proteins/chemistry , Reproducibility of Results
5.
Biomolecules ; 12(10)2022 10 04.
Article in English | MEDLINE | ID: mdl-36291635

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the United States National Science Foundation, National Institutes of Health, and Department of Energy, supports structural biologists and Protein Data Bank (PDB) data users around the world. The RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, serves as the US data center for the global PDB archive housing experimentally-determined three-dimensional (3D) structure data for biological macromolecules. As the wwPDB-designated Archive Keeper, RCSB PDB is also responsible for the security of PDB data and weekly update of the archive. RCSB PDB serves tens of thousands of data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) annually working on all permanently inhabited continents. RCSB PDB makes PDB data available from its research-focused web portal at no charge and without usage restrictions to many millions of PDB data consumers around the globe. It also provides educators, students, and the general public with an introduction to the PDB and related training materials through its outreach and education-focused web portal. This review article describes growth of the PDB, examines evolution of experimental methods for structure determination viewed through the lens of the PDB archive, and provides a detailed accounting of PDB archival holdings and their utilization by researchers, educators, and students worldwide.


Subject(s)
Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Computational Biology/methods , Proteins/chemistry , Students
6.
Protein Sci ; 31(12): e4482, 2022 12.
Article in English | MEDLINE | ID: mdl-36281733

ABSTRACT

Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premiere open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.


Subject(s)
Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Proteins/chemistry , Computational Biology/methods , Macromolecular Substances/chemistry
7.
Protein Sci ; 31(1): 187-208, 2022 01.
Article in English | MEDLINE | ID: mdl-34676613

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.


Subject(s)
Computational Biology/history , Databases, Protein/history , User-Computer Interface , Anniversaries and Special Events , History, 20th Century , History, 21st Century
8.
Methods ; 138-139: 47-53, 2018 04 01.
Article in English | MEDLINE | ID: mdl-29366688

ABSTRACT

Cholesterol (Chol) is vital for cell function as it is essential to a myriad of biochemical and biophysical processes. The atomistic details of Chol's interactions with phospholipids and proteins is therefore of fundamental interest, and NMR offers unique opportunities to interrogate these properties at high resolution. Towards this end, here we describe approaches for examining the structure and dynamics of Chol in lipid bilayers using high levels of 13C enrichment in combination with magic-angle spinning (MAS) methods. We quantify the incorporation levels and demonstrate high sensitivity and resolution in 2D 13C-13C and 1H-13C spectra, enabling de novo assignments and site-resolved order parameter measurements obtained in a fraction of the time required for experiments with natural abundance sterols. We envision many potential future applications of these methods to study sterol interactions with drugs, lipids and proteins.


Subject(s)
Cholesterol/chemistry , Lipid Bilayers/chemistry , Magnetic Resonance Spectroscopy/methods , Carbon Isotopes , Cholesterol/analysis , Molecular Structure , Sensitivity and Specificity , Yeasts/metabolism
9.
ACS Omega ; 2(2): 712-720, 2017 Feb 28.
Article in English | MEDLINE | ID: mdl-28261692

ABSTRACT

Light-chain (AL)-associated amyloidosis is a systemic disorder involving the formation and deposition of immunoglobulin AL fibrils in various bodily organs. One severe instance of AL disease is exhibited by the patient-derived variable domain (VL) of the light chain AL-09, a 108 amino acid residue protein containing seven mutations relative to the corresponding germline protein, κI O18/O8 VL. Previous work has demonstrated that the thermodynamic stability of native AL-09 VL is greatly lowered by two of these mutations, Y87H and N34I, whereas a third mutation, K42Q, further increases the kinetics of fibril formation. However, detailed knowledge regarding the residues that are responsible for stabilizing the misfolded fibril structure is lacking. In this study, using solid-state NMR spectroscopy, we show that the majority of the AL-09 VL sequence is immobilized in the fibrils and that the N- and C-terminal portions of the sequence are particularly well-structured. Thus, AL-09 VL forms an extensively ordered and ß-strand-rich fibril structure. Furthermore, we demonstrate that the predominant ß-sheet secondary structure and rigidity observed for in vitro prepared AL-09 VL fibrils are qualitatively similar to those observed for AL fibrils extracted from postmortem human spleen tissue, suggesting that this conformation may be representative of a common feature of AL fibrils.

10.
Biomol NMR Assign ; 11(1): 45-50, 2017 04.
Article in English | MEDLINE | ID: mdl-27771830

ABSTRACT

Light chain (AL) amyloidosis is a systemic disease characterized by the formation of immunoglobulin light-chain fibrils in critical organs of the body. The light-chain protein AL-09 presents one severe case of cardiac AL amyloidosis, which contains seven mutations in the variable domain (VL) relative to its germline counterpart, κI O18/O8 VL. Three of these mutations are non-conservative-Y87H, N34I, and K42Q-and previous work has shown that they are responsible for significantly reducing the protein's thermodynamic stability, allowing fibril formation to occur with fast kinetics and across a wide-range of pH conditions. Currently, however, there is extremely limited structural information available which explicitly describes the residues that are involved in supporting the misfolded fibril structure. Here, we assign the site-specific 15N and 13C chemical shifts of the rigid residues of AL-09 VL fibrils by solid-state NMR, reporting on the regions of the protein involved in the fibril as well as the extent of secondary structure.


Subject(s)
Amyloid/chemistry , Immunoglobulin Light Chains/chemistry , Nuclear Magnetic Resonance, Biomolecular , Amino Acid Sequence , Models, Molecular , Protein Conformation, beta-Strand , Protein Domains
SELECTION OF CITATIONS
SEARCH DETAIL
...