Your browser doesn't support javascript.
loading
Cryo2StructData: A Large Labeled Cryo-EM Density Map Dataset for AI-based Modeling of Protein Structures.
Giri, Nabin; Wang, Liguo; Cheng, Jianlin.
Afiliação
  • Giri N; Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, 65211, USA.
  • Wang L; Roy Blunt NextGen Precision Health, University of Missouri, Columbia, MO, 65211, USA.
  • Cheng J; Laboratory for BioMolecular Structure (LBMS), Brookhaven National Laboratory, Upton, NY, 11973, USA.
Sci Data ; 11(1): 458, 2024 May 06.
Article em En | MEDLINE | ID: mdl-38710720
ABSTRACT
The advent of single-particle cryo-electron microscopy (cryo-EM) has brought forth a new era of structural biology, enabling the routine determination of large biological molecules and their complexes at atomic resolution. The high-resolution structures of biological macromolecules and their complexes significantly expedite biomedical research and drug discovery. However, automatically and accurately building atomic models from high-resolution cryo-EM density maps is still time-consuming and challenging when template-based models are unavailable. Artificial intelligence (AI) methods such as deep learning trained on limited amount of labeled cryo-EM density maps generate inaccurate atomic models. To address this issue, we created a dataset called Cryo2StructData consisting of 7,600 preprocessed cryo-EM density maps whose voxels are labelled according to their corresponding known atomic structures for training and testing AI methods to build atomic models from cryo-EM density maps. Cryo2StructData is larger than existing, publicly available datasets for training AI methods to build atomic protein structures from cryo-EM density maps. We trained and tested deep learning models on Cryo2StructData to validate its quality showing that it is ready for being used to train and test AI methods for building atomic models.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Proteínas / Microscopia Crioeletrônica Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Proteínas / Microscopia Crioeletrônica Idioma: En Ano de publicação: 2024 Tipo de documento: Article