Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 45
Filtrar
1.
Sci Rep ; 14(1): 6858, 2024 03 21.
Artículo en Inglés | MEDLINE | ID: mdl-38514690

RESUMEN

The ability to understand and manipulate numbers and quantities emerges during childhood, but the mechanism through which humans acquire and develop this ability is still poorly understood. We explore this question through a model, assuming that the learner is able to pick up and place small objects from, and to, locations of its choosing, and will spontaneously engage in such undirected manipulation. We further assume that the learner's visual system will monitor the changing arrangements of objects in the scene and will learn to predict the effects of each action by comparing perception with a supervisory signal from the motor system. We model perception using standard deep networks for feature extraction and classification. Our main finding is that, from learning the task of action prediction, an unexpected image representation emerges exhibiting regularities that foreshadow the perception and representation of numbers and quantity. These include distinct categories for zero and the first few natural numbers, a strict ordering of the numbers, and a one-dimensional signal that correlates with numerical quantity. As a result, our model acquires the ability to estimate numerosity, i.e. the number of objects in the scene, as well as subitization, i.e. the ability to recognize at a glance the exact number of objects in small scenes. Remarkably, subitization and numerosity estimation extrapolate to scenes containing many objects, far beyond the three objects used during training. We conclude that important aspects of a facility with numbers and quantities may be learned with supervision from a simple pre-training task. Our observations suggest that cross-modal learning is a powerful learning mechanism that may be harnessed in artificial intelligence.


Asunto(s)
Inteligencia Artificial , Cognición , Humanos , Encéfalo , Aprendizaje , Percepción Visual
2.
Elife ; 122024 Feb 29.
Artículo en Inglés | MEDLINE | ID: mdl-38420996

RESUMEN

An animal entering a new environment typically faces three challenges: explore the space for resources, memorize their locations, and navigate towards those targets as needed. Here we propose a neural algorithm that can solve all these problems and operates reliably in diverse and complex environments. At its core, the mechanism makes use of a behavioral module common to all motile animals, namely the ability to follow an odor to its source. We show how the brain can learn to generate internal "virtual odors" that guide the animal to any location of interest. This endotaxis algorithm can be implemented with a simple 3-layer neural circuit using only biologically realistic structures and learning rules. Several neural components of this scheme are found in brains from insects to humans. Nature may have evolved a general mechanism for search and navigation on the ancient backbone of chemotaxis.


Asunto(s)
Algoritmos , Objetivos , Animales , Humanos , Aprendizaje , Odorantes
3.
bioRxiv ; 2024 Mar 07.
Artículo en Inglés | MEDLINE | ID: mdl-38045277

RESUMEN

Cells are a fundamental unit of biological organization, and identifying them in imaging data - cell segmentation - is a critical task for various cellular imaging experiments. While deep learning methods have led to substantial progress on this problem, most models in use are specialist models that work well for specific domains. Methods that have learned the general notion of "what is a cell" and can identify them across different domains of cellular imaging data have proven elusive. In this work, we present CellSAM, a foundation model for cell segmentation that generalizes across diverse cellular imaging data. CellSAM builds on top of the Segment Anything Model (SAM) by developing a prompt engineering approach for mask generation. We train an object detector, CellFinder, to automatically detect cells and prompt SAM to generate segmentations. We show that this approach allows a single model to achieve human-level performance for segmenting images of mammalian cells (in tissues and cell culture), yeast, and bacteria collected across various imaging modalities. We show that CellSAM has strong zero-shot performance and can be improved with a few examples via few-shot learning. We also show that CellSAM can unify bioimaging analysis workflows such as spatial transcriptomics and cell tracking. A deployed version of CellSAM is available at https://cellsam.deepcell.org/.

4.
bioRxiv ; 2023 Dec 07.
Artículo en Inglés | MEDLINE | ID: mdl-38045312

RESUMEN

Artificial activation of anatomically localized, genetically defined hypothalamic neuron populations is known to trigger distinct innate behaviors, suggesting a hypothalamic nucleus-centered organization of behavior control. To assess whether the encoding of behavior is similarly anatomically confined, we performed simultaneous neuron recordings across twenty hypothalamic regions in freely moving animals. Here we show that distinct but anatomically distributed neuron ensembles encode the social and fear behavior classes, primarily through mixed selectivity. While behavior class-encoding ensembles were spatially distributed, individual ensembles exhibited strong localization bias. Encoding models identified that behavior actions, but not motion-related variables, explained a large fraction of hypothalamic neuron activity variance. These results identify unexpected complexity in the hypothalamic encoding of instincts and provide a foundation for understanding the role of distributed neural representations in the expression of behaviors driven by hardwired circuits.

5.
Artículo en Inglés | MEDLINE | ID: mdl-36628357

RESUMEN

We propose a method for learning the posture and structure of agents from unlabelled behavioral videos. Starting from the observation that behaving agents are generally the main sources of movement in behavioral videos, our method, Behavioral Keypoint Discovery (B-KinD), uses an encoder-decoder architecture with a geometric bottleneck to reconstruct the spatiotemporal difference between video frames. By focusing only on regions of movement, our approach works directly on input videos without requiring manual annotations. Experiments on a variety of agent types (mouse, fly, human, jellyfish, and trees) demonstrate the generality of our approach and reveal that our discovered keypoints represent semantically meaningful body parts, which achieve state-of-the-art performance on keypoint regression among self-supervised methods. Additionally, B-KinD achieve comparable performance to supervised keypoints on downstream tasks, such as behavior classification, suggesting that our method can dramatically reduce model training costs vis-a-vis supervised methods.

6.
Elife ; 102021 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-34846301

RESUMEN

The study of naturalistic social behavior requires quantification of animals' interactions. This is generally done through manual annotation-a highly time-consuming and tedious process. Recent advances in computer vision enable tracking the pose (posture) of freely behaving animals. However, automatically and accurately classifying complex social behaviors remains technically challenging. We introduce the Mouse Action Recognition System (MARS), an automated pipeline for pose estimation and behavior quantification in pairs of freely interacting mice. We compare MARS's annotations to human annotations and find that MARS's pose estimation and behavior classification achieve human-level performance. We also release the pose and annotation datasets used to train MARS to serve as community benchmarks and resources. Finally, we introduce the Behavior Ensemble and Neural Trajectory Observatory (BENTO), a graphical user interface for analysis of multimodal neuroscience datasets. Together, MARS and BENTO provide an end-to-end pipeline for behavior data extraction and analysis in a package that is user-friendly and easily modifiable.


Asunto(s)
Algoritmos , Reconocimiento de Normas Patrones Automatizadas/métodos , Conducta Social , Programas Informáticos , Animales , Conducta Animal , Ratones
7.
Elife ; 102021 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-34196271

RESUMEN

Animals learn certain complex tasks remarkably fast, sometimes after a single experience. What behavioral algorithms support this efficiency? Many contemporary studies based on two-alternative-forced-choice (2AFC) tasks observe only slow or incomplete learning. As an alternative, we study the unconstrained behavior of mice in a complex labyrinth and measure the dynamics of learning and the behaviors that enable it. A mouse in the labyrinth makes ~2000 navigation decisions per hour. The animal explores the maze, quickly discovers the location of a reward, and executes correct 10-bit choices after only 10 reward experiences - a learning rate 1000-fold higher than in 2AFC experiments. Many mice improve discontinuously from one minute to the next, suggesting moments of sudden insight about the structure of the labyrinth. The underlying search algorithm does not require a global memory of places visited and is largely explained by purely local turning rules.


Asunto(s)
Conducta Exploratoria/fisiología , Aprendizaje por Laberinto/fisiología , Algoritmos , Animales , Femenino , Masculino , Ratones , Ratones Endogámicos C57BL , Grabación en Video
8.
IEEE Trans Med Robot Bionics ; 3(1): 2-10, 2021 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-33644703

RESUMEN

Effectiveness of computer vision techniques has been demonstrated through a number of applications, both within and outside healthcare. The operating room environment specifically is a setting with rich data sources compatible with computational approaches and high potential for direct patient benefit. The aim of this review is to summarize major topics in computer vision for surgical domains. The major capabilities of computer vision are described as an aid to surgical teams to improve performance and contribute to enhanced patient safety. Literature was identified through leading experts in the fields of surgery, computational analysis and modeling in medicine, and computer vision in healthcare. The literature supports the application of computer vision principles to surgery. Potential applications within surgery include operating room vigilance, endoscopic vigilance, and individual and team-wide behavioral analysis. To advance the field, we recommend collecting and publishing carefully annotated datasets. Doing so will enable the surgery community to collectively define well-specified common objectives for automated systems, spur academic research, mobilize industry, and provide benchmarks with which we can track progress. Leveraging computer vision approaches through interdisciplinary collaboration and advanced approaches to data acquisition, modeling, interpretation, and integration promises a powerful impact on patient safety, public health, and financial costs.

9.
Artículo en Inglés | MEDLINE | ID: mdl-36544482

RESUMEN

Specialized domain knowledge is often necessary to accurately annotate training sets for in-depth analysis, but can be burdensome and time-consuming to acquire from domain experts. This issue arises prominently in automated behavior analysis, in which agent movements or actions of interest are detected from video tracking data. To reduce annotation effort, we present TREBA: a method to learn annotation-sample efficient trajectory embedding for behavior analysis, based on multi-task self-supervised learning. The tasks in our method can be efficiently engineered by domain experts through a process we call "task programming", which uses programs to explicitly encode structured knowledge from domain experts. Total domain expert effort can be reduced by exchanging data annotation time for the construction of a small number of programmed tasks. We evaluate this trade-off using data from behavioral neuroscience, in which specialized domain knowledge is used to identify behaviors. We present experimental results in three datasets across two domains: mice and fruit flies. Using embeddings from TREBA, we reduce annotation burden by up to a factor of 10 without compromising accuracy compared to state-of-the-art features. Our results thus suggest that task programming and self-supervision can be an effective way to reduce annotation effort for domain experts.

10.
Adv Neural Inf Process Syst ; 2021(DB1): 1-15, 2021 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38706835

RESUMEN

Multi-agent behavior modeling aims to understand the interactions that occur between agents. We present a multi-agent dataset from behavioral neuroscience, the Caltech Mouse Social Interactions (CalMS21) Dataset. Our dataset consists of trajectory data of social interactions, recorded from videos of freely behaving mice in a standard resident-intruder assay. To help accelerate behavioral studies, the CalMS21 dataset provides benchmarks to evaluate the performance of automated behavior classification methods in three settings: (1) for training on large behavioral datasets all annotated by a single annotator, (2) for style transfer to learn inter-annotator differences in behavior definitions, and (3) for learning of new behaviors of interest given limited training data. The dataset consists of 6 million frames of unlabeled tracked poses of interacting mice, as well as over 1 million frames with tracked poses and corresponding frame-level behavior annotations. The challenge of our dataset is to be able to classify behaviors accurately using both labeled and unlabeled tracking data, as well as being able to generalize to new settings.

11.
Neuron ; 104(1): 11-24, 2019 10 09.
Artículo en Inglés | MEDLINE | ID: mdl-31600508

RESUMEN

The brain is worthy of study because it is in charge of behavior. A flurry of recent technical advances in measuring and quantifying naturalistic behaviors provide an important opportunity for advancing brain science. However, the problem of understanding unrestrained behavior in the context of neural recordings and manipulations remains unsolved, and developing approaches to addressing this challenge is critical. Here we discuss considerations in computational neuroethology-the science of quantifying naturalistic behaviors for understanding the brain-and propose strategies to evaluate progress. We point to open questions that require resolution and call upon the broader systems neuroscience community to further develop and leverage measures of naturalistic, unrestrained behavior, which will enable us to more effectively probe the richness and complexity of the brain.


Asunto(s)
Conducta Animal/fisiología , Encéfalo/fisiología , Aprendizaje Automático , Animales , Fenómenos Electrofisiológicos , Etología , Neurociencias
12.
Front Neurosci ; 11: 468, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28894413

RESUMEN

Perceptual decisions requiring the comparison of spatially distributed stimuli that are fixated sequentially might be influenced by fluctuations in visual attention. We used two psychophysical tasks with human subjects to investigate the extent to which visual attention influences simple perceptual choices, and to test the extent to which the attentional Drift Diffusion Model (aDDM) provides a good computational description of how attention affects the underlying decision processes. We find evidence for sizable attentional choice biases and that the aDDM provides a reasonable quantitative description of the relationship between fluctuations in visual attention, choices and reaction times. We also find that exogenous manipulations of attention induce choice biases consistent with the predictions of the model.

13.
Bioinspir Biomim ; 11(4): 046008, 2016 07 18.
Artículo en Inglés | MEDLINE | ID: mdl-27427952

RESUMEN

Brains and sensory systems evolved to guide motion. Central to this task is controlling the approach to stationary obstacles and detecting moving organisms. Looming has been proposed as the main monocular visual cue for detecting the approach of other animals and avoiding collisions with stationary obstacles. Elegant neural mechanisms for looming detection have been found in the brain of insects and vertebrates. However, looming has not been analyzed in the context of collisions between two moving animals. We propose an alternative strategy, generalized regressive motion (GRM), which is consistent with recently observed behavior in fruit flies. Geometric analysis proves that GRM is a reliable cue to collision among conspecifics, whereas agent-based modeling suggests that GRM is a better cue than looming as a means to detect approach, prevent collisions and maintain mobility.


Asunto(s)
Prevención de Accidentes , Señales (Psicología) , Vuelo Animal/fisiología , Percepción de Movimiento/fisiología , Visión Monocular/fisiología , Animales , Encéfalo , Drosophila/fisiología
14.
Sensors (Basel) ; 16(4)2016 Apr 06.
Artículo en Inglés | MEDLINE | ID: mdl-27058543

RESUMEN

Novel image sensors transduce the stream of photons directly into asynchronous electrical pulses, rather than forming an image. Classical approaches to vision start from a good quality image and therefore it is tempting to consider image reconstruction as a first step to image analysis. We propose that, instead, one should focus on the task at hand (e.g., detection, tracking or control) and design algorithms that compute the relevant variables (class, position, velocity) directly from the stream of photons. We discuss three examples of such computer vision algorithms and test them on simulated data from photon-counting sensors. Such algorithms work just-in-time, i.e., they complete classification, search and tracking with high accuracy as soon as the information is sufficient, which is typically before there are enough photons to form a high-quality image. We argue that this is particularly useful when the photons are few or expensive, e.g., in astronomy, biological imaging, surveillance and night vision.

15.
J Vis ; 15(16): 9, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26675879

RESUMEN

Searching for objects among clutter is a key ability of the visual system. Speed and accuracy are the crucial performance criteria. How can the brain trade off these competing quantities for optimal performance in different tasks? Can a network of spiking neurons carry out such computations, and what is its architecture? We propose a new model that takes input from V1-type orientation-selective spiking neurons and detects a target in the shortest time that is compatible with a given acceptable error rate. Subject to the assumption that the output of the primary visual cortex comprises Poisson neurons with known properties, our model is an ideal observer. The model has only five free parameters: the signal-to-noise ratio in a hypercolumn, the costs of false-alarm and false-reject errors versus the cost of time, and two parameters accounting for nonperceptual delays. Our model postulates two gain-control mechanisms--one local to hypercolumns and one global to the visual field--to handle variable scene complexity. Error rate and response time predictions match psychophysics data as we vary stimulus discriminability, scene complexity, and the uncertainty associated with each of these quantities. A five-layer spiking network closely approximates the optimal model, suggesting that known cortical mechanisms are sufficient for implementing visual search efficiently.


Asunto(s)
Neuronas/fisiología , Corteza Visual/fisiología , Percepción Visual/fisiología , Humanos , Modelos Neurológicos , Orientación/fisiología , Psicofísica , Tiempo de Reacción , Relación Señal-Ruido , Campos Visuales/fisiología
16.
Proc Natl Acad Sci U S A ; 112(38): E5351-60, 2015 Sep 22.
Artículo en Inglés | MEDLINE | ID: mdl-26354123

RESUMEN

A lack of automated, quantitative, and accurate assessment of social behaviors in mammalian animal models has limited progress toward understanding mechanisms underlying social interactions and their disorders such as autism. Here we present a new integrated hardware and software system that combines video tracking, depth sensing, and machine learning for automatic detection and quantification of social behaviors involving close and dynamic interactions between two mice of different coat colors in their home cage. We designed a hardware setup that integrates traditional video cameras with a depth camera, developed computer vision tools to extract the body "pose" of individual animals in a social context, and used a supervised learning algorithm to classify several well-described social behaviors. We validated the robustness of the automated classifiers in various experimental settings and used them to examine how genetic background, such as that of Black and Tan Brachyury (BTBR) mice (a previously reported autism model), influences social behavior. Our integrated approach allows for rapid, automated measurement of social behaviors across diverse experimental designs and also affords the ability to develop new, objective behavioral metrics.


Asunto(s)
Conducta Animal , Procesamiento de Imagen Asistido por Computador , Aprendizaje Automático , Conducta Social , Grabación en Video , Algoritmos , Animales , Computadores , Femenino , Masculino , Ratones , Ratones Endogámicos BALB C , Ratones Endogámicos C57BL , Modelos Animales , Variaciones Dependientes del Observador , Reconocimiento de Normas Patrones Automatizadas , Reproducibilidad de los Resultados , Programas Informáticos
17.
Curr Biol ; 25(11): 1401-15, 2015 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-25981791

RESUMEN

The neural circuit mechanisms underlying emotion states remain poorly understood. Drosophila offers powerful genetic approaches for dissecting neural circuit function, but whether flies exhibit emotion-like behaviors has not been clear. We recently proposed that model organisms may express internal states displaying "emotion primitives," which are general characteristics common to different emotions, rather than specific anthropomorphic emotions such as "fear" or "anxiety." These emotion primitives include scalability, persistence, valence, and generalization to multiple contexts. Here, we have applied this approach to determine whether flies' defensive responses to moving overhead translational stimuli ("shadows") are purely reflexive or may express underlying emotion states. We describe a new behavioral assay in which flies confined in an enclosed arena are repeatedly exposed to an overhead translational stimulus. Repetitive stimuli promoted graded (scalable) and persistent increases in locomotor velocity and hopping, and occasional freezing. The stimulus also dispersed feeding flies from a food resource, suggesting both negative valence and context generalization. Strikingly, there was a significant delay before the flies returned to the food following stimulus-induced dispersal, suggestive of a slowly decaying internal defensive state. The length of this delay was increased when more stimuli were delivered for initial dispersal. These responses can be mathematically modeled by assuming an internal state that behaves as a leaky integrator of stimulus exposure. Our results suggest that flies' responses to repetitive visual threat stimuli express an internal state exhibiting canonical emotion primitives, possibly analogous to fear in mammals. The mechanistic basis of this state can now be investigated in a genetically tractable insect species.


Asunto(s)
Nivel de Alerta , Conducta Animal , Drosophila/fisiología , Animales , Emociones , Locomoción , Masculino , Percepción Visual
18.
Clin Neurophysiol ; 126(8): 1548-56, 2015 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-25434753

RESUMEN

OBJECTIVES: To measure the inter-expert and intra-expert agreement in sleep spindle scoring, and to quantify how many experts are needed to build a reliable dataset of sleep spindle scorings. METHODS: The EEG dataset was comprised of 400 randomly selected 115s segments of stage 2 sleep from 110 sleeping subjects in the general population (57±8, range: 42-72 years). To assess expert agreement, a total of 24 Registered Polysomnographic Technologists (RPSGTs) scored spindles in a subset of the EEG dataset at a single electrode location (C3-M2). Intra-expert and inter-expert agreements were calculated as F1-scores, Cohen's kappa (κ), and intra-class correlation coefficient (ICC). RESULTS: We found an average intra-expert F1-score agreement of 72±7% (κ: 0.66±0.07). The average inter-expert agreement was 61±6% (κ: 0.52±0.07). Amplitude and frequency of discrete spindles were calculated with higher reliability than the estimation of spindle duration. Reliability of sleep spindle scoring can be improved by using qualitative confidence scores, rather than a dichotomous yes/no scoring system. CONCLUSIONS: We estimate that 2-3 experts are needed to build a spindle scoring dataset with 'substantial' reliability (κ: 0.61-0.8), and 4 or more experts are needed to build a dataset with 'almost perfect' reliability (κ: 0.81-1). SIGNIFICANCE: Spindle scoring is a critical part of sleep staging, and spindles are believed to play an important role in development, aging, and diseases of the nervous system.


Asunto(s)
Nivel de Alerta/fisiología , Electroencefalografía/métodos , Variaciones Dependientes del Observador , Fases del Sueño/fisiología , Adulto , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Polisomnografía , Reproducibilidad de los Resultados
19.
Neuron ; 84(1): 18-31, 2014 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-25277452

RESUMEN

The new field of "Computational Ethology" is made possible by advances in technology, mathematics, and engineering that allow scientists to automate the measurement and the analysis of animal behavior. We explore the opportunities and long-term directions of research in this area.


Asunto(s)
Conducta Animal/fisiología , Biología Computacional/métodos , Biología Computacional/tendencias , Etología/métodos , Etología/tendencias , Animales , Inteligencia Artificial/tendencias , Red Nerviosa/fisiología
20.
PLoS One ; 9(8): e105626, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25162609

RESUMEN

How animals use sensory information to weigh the risks vs. benefits of behavioral decisions remains poorly understood. Inter-male aggression is triggered when animals perceive both the presence of an appetitive resource, such as food or females, and of competing conspecific males. How such signals are detected and integrated to control the decision to fight is not clear. For instance, it is unclear whether food increases aggression directly, or as a secondary consequence of increased social interactions caused by attraction to food. Here we use the vinegar fly, Drosophila melanogaster, to investigate the manner by which food influences aggression. We show that food promotes aggression in flies, and that it does so independently of any effect on frequency of contact between males, increase in locomotor activity or general enhancement of social interactions. Importantly, the level of aggression depends on the absolute amount of food, rather than on its surface area or concentration. When food resources exceed a certain level, aggression is diminished, suggestive of reduced competition. Finally, we show that detection of sugar via Gr5a+ gustatory receptor neurons (GRNs) is necessary for food-promoted aggression. These data demonstrate that food exerts a specific effect to promote aggression in male flies, and that this effect is mediated, at least in part, by sweet-sensing GRNs.


Asunto(s)
Agresión/fisiología , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/genética , Receptores de Superficie Celular/metabolismo , Células Receptoras Sensoriales/metabolismo , Gusto/fisiología , Animales , Proteínas de Drosophila/genética , Drosophila melanogaster/metabolismo , Conducta Alimentaria/fisiología , Alimentos , Expresión Génica , Masculino , Receptores de Superficie Celular/genética , Sacarosa/química , Sacarosa/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...