CEM500K, a large-scale heterogeneous unlabeled cellular electron microscopy image dataset for deep learning.
Conrad, Ryan; Narayan, Kedar.
Affiliation
  • Conrad R; Center for Molecular Microscopy, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, United States.
  • Narayan K; Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, United States.
Elife; 10, 2021 04 08.
Article in En | MEDLINE | ID: mdl-33830015
ABSTRACT
Automated segmentation of cellular electron microscopy (EM) datasets remains a challenge. Supervised deep learning (DL) methods that rely on region-of-interest (ROI) annotations yield models that fail to generalize to unrelated datasets. Newer unsupervised DL algorithms require relevant pre-training images; however, pre-training on currently available EM datasets is computationally expensive and shows little value for unseen biological contexts, as these datasets are large and homogeneous. To address this issue, we present CEM500K, a nimble 25 GB dataset of 0.5 × 10^6 unique 2D cellular EM images curated from nearly 600 three-dimensional (3D) and 10,000 two-dimensional (2D) images from >100 unrelated imaging projects. We show that models pre-trained on CEM500K learn features that are biologically relevant and resilient to meaningful image augmentations. Critically, we evaluate transfer learning from these pre-trained models on six publicly available and one newly derived benchmark segmentation task and report state-of-the-art results on each. We release the CEM500K dataset, pre-trained models and curation pipeline for model building and further expansion by the EM community. Data and code are available at https://www.ebi.ac.uk/pdbe/emdb/empiar/entry/10592/ and https://git.io/JLLTz.

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Image Processing, Computer-Assisted / Microscopy, Electron / Deep Learning Language: En Journal: Elife Year of publication: 2021 Document type: Article Country of affiliation: United States
