Results 1 - 20 of 58
1.
J Mol Biol ; : 168546, 2024 Mar 18.
Article in English | MEDLINE | ID: mdl-38508301

ABSTRACT

IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB.

2.
Res Sq ; 2024 Jan 25.
Article in English | MEDLINE | ID: mdl-38343795

ABSTRACT

The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein/nucleic-acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: E. coli beta-galactosidase with inhibitor, SARS-CoV-2 RNA-dependent RNA polymerase with covalently bound nucleotide analog, and SARS-CoV-2 ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. We found that (1) the quality of submitted ligand models and surrounding atoms varied, as judged by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics, and contact scores, and (2) a composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.

3.
IUCrJ ; 11(Pt 2): 140-151, 2024 Mar 01.
Article in English | MEDLINE | ID: mdl-38358351

ABSTRACT

In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for the deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 47 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and the resulting consensus recommendations. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.


Subject(s)
Data Curation , Cryoelectron Microscopy/methods
4.
ArXiv ; 2024 Feb 02.
Article in English | MEDLINE | ID: mdl-38076521

ABSTRACT

In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 47 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and consensus recommendations resulting from the workshop. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.

5.
Nucleic Acids Res ; 52(D1): D245-D254, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37953312

ABSTRACT

The Nucleic Acid Knowledgebase (nakb.org) is a new data resource, updated weekly, for experimentally determined 3D structures containing DNA and/or RNA nucleic acid polymers and their biological assemblies. NAKB indexes nucleic acid-containing structures derived from all major structure determination methods (X-ray, NMR and EM), including all held by the Protein Data Bank (PDB). As the planned successor to the Nucleic Acid Database (NDB), NAKB's design preserves all functionality of the NDB and provides novel nucleic acid-centric content, including structural and functional annotations, as well as annotations from and links to external resources. A variety of custom interactive tools have been developed to enable rapid exploration and drill-down of NAKB's content.


Subject(s)
Nucleic Acid Conformation , Nucleic Acids , DNA/chemistry , Knowledge Bases , Nucleic Acids/genetics , RNA/chemistry
6.
Nucleic Acids Res ; 51(D1): D488-D508, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36420884

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center for the open-access PDB archive. As wwPDB-designated Archive Keeper, RCSB PDB is also responsible for PDB data security. Annually, RCSB PDB serves >10 000 depositors of three-dimensional (3D) biostructures working on all permanently inhabited continents. RCSB PDB delivers data from its research-focused RCSB.org web portal to many millions of PDB data consumers based in virtually every United Nations-recognized country, territory, etc. This Database Issue contribution describes upgrades to the research-focused RCSB.org web portal that created a one-stop-shop for open access to ∼200 000 experimentally-determined PDB structures of biological macromolecules alongside >1 000 000 incorporated Computed Structure Models (CSMs) predicted using artificial intelligence/machine learning methods. RCSB.org is a 'living data resource.' Every PDB structure and CSM is integrated weekly with related functional annotations from external biodata resources, providing up-to-date information for the entire corpus of 3D biostructure data freely available from RCSB.org with no usage limitations. Within RCSB.org, PDB structures and the CSMs are clearly identified as to their provenance and reliability. Both are fully searchable, and can be analyzed and visualized using the full complement of RCSB.org web portal capabilities.


Subject(s)
Artificial Intelligence , Databases, Protein , Proteins , Machine Learning , Protein Conformation , Proteins/chemistry , Reproducibility of Results
7.
Biophys Rev ; 14(6): 1281-1301, 2022 Dec.
Article in English | MEDLINE | ID: mdl-36474933

ABSTRACT

As a discipline, structural biology has been transformed by the three-dimensional electron microscopy (3DEM) "Resolution Revolution" made possible by convergence of robust cryo-preservation of vitrified biological materials, sample handling systems, and measurement stages operating at liquid nitrogen temperature, improvements in electron optics that preserve phase information at the atomic level, direct electron detectors (DEDs), high-speed computing with graphics processing units, and rapid advances in data acquisition and processing software. 3DEM structure information (atomic coordinates and related metadata) is archived in the open-access Protein Data Bank (PDB), which currently holds more than 11,000 3DEM structures of proteins and nucleic acids, and their complexes with one another and small-molecule ligands (~6% of the archive). Underlying experimental data (3DEM density maps and related metadata) are stored in the Electron Microscopy Data Bank (EMDB), which currently holds more than 21,000 3DEM density maps. After describing the history of the PDB and the Worldwide Protein Data Bank (wwPDB) partnership, which jointly manages both the PDB and EMDB archives, this review examines the origins of the resolution revolution and analyzes its impact on structural biology viewed through the lens of PDB holdings. Six areas of focus exemplifying the impact of 3DEM across the biosciences are discussed in detail (icosahedral viruses, ribosomes, integral membrane proteins, SARS-CoV-2 spike proteins, cryogenic electron tomography, and integrative structure determination combining 3DEM with complementary biophysical measurement techniques), followed by a review of 3DEM structure validation by the wwPDB that underscores the importance of community engagement.

8.
Protein Sci ; 31(12): e4482, 2022 12.
Article in English | MEDLINE | ID: mdl-36281733

ABSTRACT

Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premier open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.


Subject(s)
Computational Biology , Proteins , Humans , Protein Conformation , Databases, Protein , Proteins/chemistry , Computational Biology/methods , Macromolecular Substances/chemistry
9.
Life (Basel) ; 12(4)2022 Apr 06.
Article in English | MEDLINE | ID: mdl-35455031

ABSTRACT

In this review, we describe the creation of the Nucleic Acid Database (NDB) at Rutgers University and how it became a testbed for the current infrastructure of the RCSB Protein Data Bank. We describe some of the special features of the NDB and how it has been used to enable research. Plans for the next phase as the Nucleic Acid Knowledgebase (NAKB) are summarized.

10.
J Mol Biol ; 434(11): 167599, 2022 06 15.
Article in English | MEDLINE | ID: mdl-35460671

ABSTRACT

PDBx/mmCIF, Protein Data Bank Exchange (PDBx) macromolecular Crystallographic Information Framework (mmCIF), has become the data standard for structural biology. With its early roots in the domain of small-molecule crystallography, PDBx/mmCIF provides an extensible data representation that is used for deposition, archiving, remediation, and public dissemination of experimentally determined three-dimensional (3D) structures of biological macromolecules by the Worldwide Protein Data Bank (wwPDB, wwpdb.org). Extensions of PDBx/mmCIF are similarly used for computed structure models by ModelArchive (modelarchive.org), integrative/hybrid structures by PDB-Dev (pdb-dev.wwpdb.org), small angle scattering data by the Small Angle Scattering Biological Data Bank (SASBDB, sasbdb.org), and for models generated with the AlphaFold 2.0 deep learning software suite (alphafold.ebi.ac.uk). Community-driven development of PDBx/mmCIF spans three decades, involving contributions from researchers, software and methods developers in structural sciences, data repository providers, scientific publishers, and professional societies. Having a semantically rich and extensible data framework for representing a wide range of structural biology experimental and computational results, combined with expertly curated 3D biostructure data sets in public repositories, accelerates the pace of scientific discovery. Herein, we describe the architecture of the PDBx/mmCIF data standard, tools used to maintain representations of the data standard, governance, and processes by which data content standards are extended, plus community tools/software libraries available for processing and checking the integrity of PDBx/mmCIF data. Use cases exemplify how the members of the Worldwide Protein Data Bank have used PDBx/mmCIF as the foundation for its pipeline for delivering Findable, Accessible, Interoperable, and Reusable (FAIR) data to many millions of users worldwide.


Subject(s)
Computational Biology , Crystallography , Databases, Protein , Software , Macromolecular Substances/chemistry , Molecular Biology , Protein Conformation , Semantics
11.
Protein Sci ; 31(1): 187-208, 2022 01.
Article in English | MEDLINE | ID: mdl-34676613

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.


Subject(s)
Computational Biology/history , Databases, Protein/history , User-Computer Interface , Anniversaries and Special Events , History, 20th Century , History, 21st Century
12.
Acta Crystallogr D Struct Biol ; 77(Pt 12): 1486-1496, 2021 Dec 01.
Article in English | MEDLINE | ID: mdl-34866606

ABSTRACT

Structures of many complex biological assemblies are increasingly determined using integrative approaches, in which data from multiple experimental methods are combined. A standalone system, called PDB-Dev, has been developed for archiving integrative structures and making them publicly available. Here, the data standards and software tools that support PDB-Dev are described along with the new and updated components of the PDB-Dev data-collection, processing and archiving infrastructure. Following the FAIR (Findable, Accessible, Interoperable and Reusable) principles, PDB-Dev ensures that the results of integrative structure determinations are freely accessible to everyone.


Subject(s)
Databases, Protein , Information Storage and Retrieval/methods , Protein Conformation , Proteins/chemistry
13.
Acta Crystallogr F Struct Biol Commun ; 77(Pt 7): 215-225, 2021 Jul 01.
Article in English | MEDLINE | ID: mdl-34196612

ABSTRACT

The crystal structures of domain-swapped tryptophan repressor (TrpR) variant Val58Ile before and after soaking with the physiological ligand L-tryptophan (L-Trp) indicate that L-Trp occupies the same location in the domain-swapped form as in native dimeric TrpR and makes equivalent residue contacts. This result is unexpected because the ligand binding-site residues arise from three separate polypeptide chains in the domain-swapped form. This work represents the first published structure of a domain-swapped form of TrpR with L-Trp bound. The presented structures also show that the protein amino-terminus, whether or not it bears a disordered extension of about 20 residues, is accessible in the large solvent channels of the domain-swapped crystal form, as in the structures reported previously in this form for TrpR without N-terminal extensions. These findings inspire the exploration of L-Trp analogs and N-terminal modifications as labels to orient guest proteins that cannot otherwise be crystallized in the solvent channels of crystalline domain-swapped TrpR hosts for potential diffraction analysis.


Subject(s)
Bacterial Proteins/chemistry , Isoleucine/chemistry , Repressor Proteins/chemistry , Tryptophan/chemistry , Valine/chemistry , X-Ray Diffraction/methods , Amino Acid Sequence , Bacterial Proteins/genetics , Crystallography, X-Ray/methods , Escherichia coli/genetics , Isoleucine/genetics , Protein Domains/genetics , Protein Structure, Secondary , Repressor Proteins/genetics , Tryptophan/genetics , Valine/genetics
14.
Acta Crystallogr D Struct Biol ; 77(Pt 4): 471-485, 2021 Apr 01.
Article in English | MEDLINE | ID: mdl-33825708

ABSTRACT

Soaking small molecules into the solvent channels of protein crystals is the most common method of obtaining crystalline complexes with ligands such as substrates or inhibitors. The solvent channels of some protein crystals are large enough to allow the incorporation of macromolecules, but soaking of protein guests into protein crystals has not been reported. Such protein host crystals (here given the name hostals) incorporating guest proteins may be useful for a wide range of applications in biotechnology, for example as cargo systems or for diffraction studies analogous to the crystal sponge method. The present study takes advantage of crystals of the Escherichia coli tryptophan repressor protein (ds-TrpR) that are extensively domain-swapped and suitable for incorporating guest proteins by diffusion, as they are robust and have large solvent channels. Confocal fluorescence microscopy is used to follow the migration of cytochrome c and fluorophore-labeled calmodulin into the solvent channels of ds-TrpR crystals. The guest proteins become uniformly distributed in the crystal within weeks and enriched within the solvent channels. X-ray diffraction studies on host crystals with high concentrations of incorporated guests demonstrate that diffraction limits of ∼2.5 Å can still be achieved. Weak electron density is observed in the solvent channels, but the guest-protein structures could not be determined by conventional crystallographic methods. Additional approaches that increase the ordering of guests in the host crystal are discussed that may support protein structure determination using the hostal system in the future. This host system may also be useful for biotechnological applications where crystallographic order of the guest is not required.


Subject(s)
Ligands , Models, Molecular , Proteins/chemistry , Solvents/chemistry , Crystallography, X-Ray/methods , Molecular Structure , X-Ray Diffraction
15.
J Biol Chem ; 296: 100560, 2021.
Article in English | MEDLINE | ID: mdl-33744287

ABSTRACT

Cryogenic electron microscopy (cryo-EM) methods began to be used in the mid-1970s to study thin and periodic arrays of proteins. Following a half-century of development in cryo-specimen preparation, instrumentation, data collection, data processing, and modeling software, cryo-EM has become a routine method for solving structures from large biological assemblies to small biomolecules at near to true atomic resolution. This review explores the critical roles played by the Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB) in partnership with the community to develop the necessary infrastructure to archive cryo-EM maps and associated models. Public access to cryo-EM structure data has in turn facilitated better understanding of structure-function relationships and advancement of image processing and modeling tool development. The partnership between the global cryo-EM community and PDB and EMDB leadership has synergistically shaped the standards for metadata, one-stop deposition of maps and models, and validation metrics to assess the quality of cryo-EM structures. The advent of cryo-electron tomography (cryo-ET) for in situ molecular cell structures at a broad resolution range and their correlations with other imaging data introduce new data archival challenges in terms of data size and complexity in the years to come.


Subject(s)
Cryoelectron Microscopy/methods , Databases, Protein , Proteins/chemistry , Crystallography, X-Ray , Protein Conformation , Proteins/ultrastructure
16.
Nat Methods ; 18(2): 156-164, 2021 02.
Article in English | MEDLINE | ID: mdl-33542514

ABSTRACT

This paper describes outcomes of the 2019 Cryo-EM Model Challenge. The goals were to (1) assess the quality of models that can be produced from cryogenic electron microscopy (cryo-EM) maps using current modeling software, (2) evaluate reproducibility of modeling results from different software developers and users and (3) compare performance of current metrics used for model evaluation, particularly Fit-to-Map metrics, with focus on near-atomic resolution. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived by 13 participating teams from four benchmark maps, including three forming a resolution series (1.8 to 3.1 Å). The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual experiments and structure data archives such as the Protein Data Bank. We recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed cryo-EM map density.


Subject(s)
Cryoelectron Microscopy/methods , Models, Molecular , Crystallography, X-Ray , Protein Conformation , Proteins/chemistry
17.
Nucleic Acids Res ; 49(D1): D437-D451, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33211854

ABSTRACT

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), the US data center for the global PDB archive and a founding member of the Worldwide Protein Data Bank partnership, serves tens of thousands of data depositors in the Americas and Oceania and makes 3D macromolecular structure data available at no charge and without restrictions to millions of RCSB.org users around the world, including >660 000 educators, students and members of the curious public using PDB101.RCSB.org. PDB data depositors include structural biologists using macromolecular crystallography, nuclear magnetic resonance spectroscopy, 3D electron microscopy and micro-electron diffraction. PDB data consumers accessing our web portals include researchers, educators and students studying fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. During the past 2 years, the research-focused RCSB PDB web portal (RCSB.org) has undergone a complete redesign, enabling improved searching with full Boolean operator logic and more facile access to PDB data integrated with >40 external biodata resources. New features and resources are described in detail using examples that showcase recently released structures of SARS-CoV-2 proteins and host cell proteins relevant to understanding and addressing the COVID-19 global pandemic.


Subject(s)
Computational Biology/methods , Databases, Protein , Macromolecular Substances/chemistry , Protein Conformation , Proteins/chemistry , Bioengineering/methods , Biomedical Research/methods , Biotechnology/methods , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/virology , Humans , Macromolecular Substances/metabolism , Pandemics , Proteins/genetics , Proteins/metabolism , SARS-CoV-2/genetics , SARS-CoV-2/metabolism , SARS-CoV-2/physiology , Software , Viral Proteins/chemistry , Viral Proteins/genetics , Viral Proteins/metabolism
18.
Biochemistry ; 59(48): 4523-4532, 2020 12 08.
Article in English | MEDLINE | ID: mdl-33205945

ABSTRACT

We demonstrate here that the α subunit C-terminal domain of Escherichia coli RNA polymerase (αCTD) recognizes the upstream promoter (UP) DNA element via its characteristic minor groove shape and electrostatic potential. In two compositionally distinct crystallized assemblies, a pair of αCTD subunits bind in tandem to the UP element consensus A-tract that is 6 bp in length (A6-tract), each with their arginine 265 guanidinium group inserted into the minor groove. The A6-tract minor groove is significantly narrowed in these crystal structures, as well as in computationally predicted structures of free and bound DNA duplexes derived by Monte Carlo and molecular dynamics simulations, respectively. The negative electrostatic potential of free A6-tract DNA is substantially enhanced compared to that of generic DNA. Shortening the A-tract by 1 bp is shown to "knock out" binding of the second αCTD through widening of the minor groove. Furthermore, in computationally derived structures with arginine 265 mutated to alanine in either αCTD, either with or without the "knockout" DNA mutation, contact with the DNA is perturbed, highlighting the importance of arginine 265 in achieving αCTD-DNA binding. These results demonstrate that the importance of the DNA shape in sequence-dependent recognition of DNA by RNA polymerase is comparable to that of certain transcription factors.


Subject(s)
DNA, Bacterial/chemistry , DNA, Bacterial/metabolism , DNA-Directed RNA Polymerases/chemistry , DNA-Directed RNA Polymerases/metabolism , Escherichia coli Proteins/chemistry , Escherichia coli Proteins/metabolism , Binding Sites , Crystallography, X-Ray , Cyclic AMP Receptor Protein/chemistry , Cyclic AMP Receptor Protein/genetics , Cyclic AMP Receptor Protein/metabolism , DNA, Bacterial/genetics , DNA-Directed RNA Polymerases/genetics , Escherichia coli/genetics , Escherichia coli/metabolism , Escherichia coli Proteins/genetics , Gene Knockout Techniques , Genes, Bacterial , Models, Molecular , Mutation , Nucleic Acid Conformation , Promoter Regions, Genetic , Protein Domains , Static Electricity
19.
IUCrJ ; 7(Pt 4): 630-638, 2020 Jul 01.
Article in English | MEDLINE | ID: mdl-32695409

ABSTRACT

The Protein Data Bank (PDB) has grown from a small data resource for crystallographers to a worldwide resource serving structural biology. The history of the growth of the PDB and the role that the community has played in developing standards and policies are described. This article also illustrates how other biophysics communities are collaborating with the worldwide PDB to create a network of interoperating data resources. This network will expand the capabilities of structural biology and enable the determination and archiving of increasingly complex structures.

20.
Struct Dyn ; 7(1): 014701, 2020 Jan.
Article in English | MEDLINE | ID: mdl-32002441

ABSTRACT

Electron cryo-microscopy (cryo-EM) is increasingly being used to determine 3D structures of a broad spectrum of biological specimens from molecules to cells. Anticipating this progress in the early 2000s, an international collaboration of scientists with expertise in both cryo-EM and structure data archiving was established (EMDataResource, previously known as EMDataBank). The major goals of the collaboration have been twofold: to develop the necessary infrastructure for archiving cryo-EM-derived density maps and models, and to promote development of cryo-EM structure validation standards. We describe how cryo-EM data archiving and validation have been developed and jointly coordinated for the Electron Microscopy Data Bank and Protein Data Bank archives over the past two decades, as well as the impact of evolving technology on data standards. Just as for X-ray crystallography and nuclear magnetic resonance, engaging the scientific community via workshops and challenging activities has played a central role in developing recommendations and requirements for the cryo-EM structure data archives.
