ABSTRACT
Accurately quantifying cellular morphology at scale could substantially empower existing single-cell approaches. However, measuring cell morphology remains an active field of research, which has inspired multiple computer vision algorithms over the years. Here, we show that DINO, a vision-transformer-based, self-supervised algorithm, has a remarkable ability to learn rich representations of cellular morphology without manual annotations or any other type of supervision. We evaluate DINO on a wide variety of tasks across three publicly available imaging datasets of diverse specifications and biological focus. We find that DINO encodes meaningful features of cellular morphology at multiple scales, from subcellular and single-cell resolution to multi-cellular and aggregated experimental groups. Importantly, DINO successfully uncovers a hierarchy of biological and technical factors of variation in imaging datasets. These results show that DINO can support the study of unknown biological variation, including single-cell heterogeneity and relationships between samples, making it an excellent tool for image-based biological discovery.
ABSTRACT
We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data augmentation and, optionally, distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove the priors that come from employing a labelled dataset. Finally, by adapting our model to machine translation, we also achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.
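The alternation of (i) a cross-patch linear layer and (ii) a per-patch feed-forward network can be sketched as follows. This is a minimal NumPy illustration under our own simplifications, not the paper's implementation: the function and weight names are ours, the paper's Affine normalisation layers are omitted, and ReLU stands in for the GELU activation used in ResMLP.

```python
import numpy as np

def resmlp_block(x, w_patch, w1, w2):
    """One simplified ResMLP block (sketch, assumptions noted in the lead-in).

    x       : (num_patches, channels) token matrix for one image.
    w_patch : (num_patches, num_patches) cross-patch linear layer,
              applied identically to every channel.
    w1, w2  : weights of the per-patch two-layer feed-forward network
              that mixes channels independently for each patch.
    """
    # (i) cross-patch sublayer: patches interact, channels stay independent
    x = x + w_patch @ x
    # (ii) per-patch feed-forward: channels interact, patches stay independent
    h = np.maximum(x @ w1, 0.0)  # ReLU here; the paper uses GELU
    x = x + h @ w2
    return x

# Usage: shapes are preserved, as required for stacking residual blocks.
rng = np.random.default_rng(0)
num_patches, channels, hidden = 16, 8, 32
tokens = rng.standard_normal((num_patches, channels))
out = resmlp_block(
    tokens,
    0.01 * rng.standard_normal((num_patches, num_patches)),
    0.01 * rng.standard_normal((channels, hidden)),
    0.01 * rng.standard_normal((hidden, channels)),
)
print(out.shape)  # (16, 8)
```

Because each sublayer is residual and shape-preserving, blocks of this form can be stacked to arbitrary depth before a final pooling and linear classifier.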