Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Biol Cybern ; 117(4-5): 299-329, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37306782

RESUMO

Advanced computer vision mechanisms have been inspired by neuroscientific findings. However, with the focus on improving benchmark achievements, technical solutions have been shaped by application and engineering constraints. This includes the training of neural networks which led to the development of feature detectors optimally suited to the application domain. However, the limitations of such approaches motivate the need to identify computational principles, or motifs, in biological vision that can enable further foundational advances in machine vision. We propose to utilize structural and functional principles of neural systems that have been largely overlooked. They potentially provide new inspirations for computer vision mechanisms and models. Recurrent feedforward, lateral, and feedback interactions characterize general principles underlying processing in mammals. We derive a formal specification of core computational motifs that utilize these principles. These are combined to define model mechanisms for visual shape and motion processing. We demonstrate how such a framework can be adopted to run on neuromorphic brain-inspired hardware platforms and can be extended to automatically adapt to environment statistics. We argue that the identified principles and their formalization inspires sophisticated computational mechanisms with improved explanatory scope. These and other elaborated, biologically inspired models can be employed to design computer vision solutions for different tasks and they can be used to advance neural network architectures of learning.


Assuntos
Computadores , Redes Neurais de Computação , Animais , Visão Ocular , Encéfalo , Aprendizagem , Mamíferos
2.
Sensors (Basel) ; 23(9)2023 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-37177655

RESUMO

Conventional processing of sensory input often relies on uniform sampling leading to redundant information and unnecessary resource consumption throughout the entire processing pipeline. Neuromorphic computing challenges these conventions by mimicking biology and employing distributed event-based hardware. Based on the task of lateral auditory sound source localization (SSL), we propose a generic approach to map biologically inspired neural networks to neuromorphic hardware. First, we model the neural mechanisms of SSL based on the interaural level difference (ILD). Afterward, we identify generic computational motifs within the model and transform them into spike-based components. A hardware-specific step then implements them on neuromorphic hardware. We exemplify our approach by mapping the neural SSL model onto two platforms, namely the IBM TrueNorth Neurosynaptic System and SpiNNaker. Both implementations have been tested on synthetic and real-world data in terms of neural tunings and readout characteristics. For synthetic stimuli, both implementations provide a perfect readout (100% accuracy). Preliminary real-world experiments yield accuracies of 78% (TrueNorth) and 13% (SpiNNaker), RMSEs of 41∘ and 39∘, and MAEs of 18∘ and 29∘, respectively. Overall, the proposed mapping approach allows for the successful implementation of the same SSL model on two different neuromorphic architectures paving the way toward more hardware-independent neural SSL.


Assuntos
Algoritmos , Redes Neurais de Computação , Computadores , Encéfalo , Percepção Auditiva
3.
PLoS Comput Biol ; 16(7): e1008020, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32678847

RESUMO

Adaptation to statistics of sensory inputs is an essential ability of neural systems and extends their effective operational range. Having a broad operational range facilitates to react to sensory inputs of different granularities, thus is a crucial factor for survival. The computation of auditory cues for spatial localization of sound sources, particularly the interaural level difference (ILD), has long been considered as a static process. Novel findings suggest that this process of ipsi- and contra-lateral signal integration is highly adaptive and depends strongly on recent stimulus statistics. Here, adaptation aids the encoding of auditory perceptual space of various granularities. To investigate the mechanism of auditory adaptation in binaural signal integration in detail, we developed a neural model architecture for simulating functions of lateral superior olive (LSO) and medial nucleus of the trapezoid body (MNTB) composed of single compartment conductance-based neurons. Neurons in the MNTB serve as an intermediate relay population. Their signal is integrated by the LSO population on a circuit level to represent excitatory and inhibitory interactions of input signals. The circuit incorporates an adaptation mechanism operating at the synaptic level based on local inhibitory feedback signals. The model's predictive power is demonstrated in various simulations replicating physiological data. Incorporating the innovative adaptation mechanism facilitates a shift in neural responses towards the most effective stimulus range based on recent stimulus history. The model demonstrates that a single LSO neuron quickly adapts to these stimulus statistics and, thus, can encode an extended range of ILDs in the ipsilateral hemisphere. Most significantly, we provide a unique measurement of the adaptation efficacy of LSO neurons. Prerequisite of normal function is an accurate interaction of inhibitory and excitatory signals, a precise encoding of time and a well-tuned local feedback circuit. We suggest that the mechanisms of temporal competitive-cooperative interaction and the local feedback mechanism jointly sensitize the circuit to enable a response shift towards contra-lateral and ipsi-lateral stimuli, respectively.


Assuntos
Biologia Computacional , Neurônios/fisiologia , Núcleo Olivar/fisiologia , Sinapses/fisiologia , Corpo Trapezoide/fisiologia , Estimulação Acústica , Potenciais de Ação , Algoritmos , Animais , Vias Auditivas/fisiologia , Limiar Auditivo , Simulação por Computador , Sinais (Psicologia) , Gerbillinae , Humanos , Modelos Neurológicos , Distribuição Normal , Receptores de GABA/fisiologia , Reprodutibilidade dos Testes , Som , Localização de Som , Complexo Olivar Superior/fisiologia
4.
Magn Reson Med ; 75(2): 789-800, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25761576

RESUMO

PURPOSE: To investigate the combination of Golden Angle Radial Sparse SENSE reconstruction with image-based self-gating (SG) for deriving high-quality TPM data from radial golden angle (GA) k-space data. METHODS: In 10 healthy volunteers, a self-gated radial GA TPM sequence (TPMSG ) was compared with a prospectively triggered radial TPM acquisition with conventional respiratory (RNAV) compensation (TPMref ). Image quality and velocities were compared for different regularization strengths λ in the CS reconstruction. RESULTS: Acquisitions and retrospective self-gating was successful in all cases. Contrast in TPMSG was superior to TPMref , because the blood saturation bands could be applied with full thickness without interference with the RNAV. Velocities from both acquisitions visually showed the same motion patterns and were quantitatively highly similar (correlation 0.81-0.97 and RMSE 0.08-0.21 cm/s). Strong temporal regularization ( λ∈0.3,0.4) led to reduced velocity peaks in TPMSG . For λ=0.2, image sharpness as well as velocity peaks of TPMSG were comparable to TPMRef . CONCLUSION: The combination of Golden Angle Radial Sparse SENSE with image-based self-gating allows measurement of velocities of the myocardium with superior black-blood contrast and full coverage of the cardiac cycle.


Assuntos
Técnicas de Imagem de Sincronização Cardíaca/métodos , Cardiomiopatias/fisiopatologia , Interpretação de Imagem Assistida por Computador/métodos , Imagem Cinética por Ressonância Magnética/métodos , Técnicas de Imagem de Sincronização Respiratória/métodos , Adulto , Idoso , Feminino , Voluntários Saudáveis , Humanos , Aumento da Imagem/métodos , Masculino
5.
PLoS Comput Biol ; 11(10): e1004489, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26496502

RESUMO

The processing of a visual stimulus can be subdivided into a number of stages. Upon stimulus presentation there is an early phase of feedforward processing where the visual information is propagated from lower to higher visual areas for the extraction of basic and complex stimulus features. This is followed by a later phase where horizontal connections within areas and feedback connections from higher areas back to lower areas come into play. In this later phase, image elements that are behaviorally relevant are grouped by Gestalt grouping rules and are labeled in the cortex with enhanced neuronal activity (object-based attention in psychology). Recent neurophysiological studies revealed that reward-based learning influences these recurrent grouping processes, but it is not well understood how rewards train recurrent circuits for perceptual organization. This paper examines the mechanisms for reward-based learning of new grouping rules. We derive a learning rule that can explain how rewards influence the information flow through feedforward, horizontal and feedback connections. We illustrate the efficiency with two tasks that have been used to study the neuronal correlates of perceptual organization in early visual cortex. The first task is called contour-integration and demands the integration of collinear contour elements into an elongated curve. We show how reward-based learning causes an enhancement of the representation of the to-be-grouped elements at early levels of a recurrent neural network, just as is observed in the visual cortex of monkeys. The second task is curve-tracing where the aim is to determine the endpoint of an elongated curve composed of connected image elements. If trained with the new learning rule, neural networks learn to propagate enhanced activity over the curve, in accordance with neurophysiological data. We close the paper with a number of model predictions that can be tested in future neurophysiological and computational studies.


Assuntos
Retroalimentação Fisiológica/fisiologia , Modelos Neurológicos , Rede Nervosa/fisiologia , Reforço Psicológico , Córtex Visual/fisiologia , Percepção Visual/fisiologia , Animais , Simulação por Computador , Macaca , Memória/fisiologia
6.
Magn Reson Med ; 73(1): 292-8, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24478142

RESUMO

PURPOSE: To compare the applicability of different self-gating (SG) strategies for respiratory SG in cardiac MRI in combination with iteratively reconstructed (k-t SPARSE SENSE) cine data with low and high temporal resolution. METHODS: Eleven SG variants were compared in five volunteers by assessment of the resulting image sharpness compared with nongated reconstructions. Promising SG techniques were applied for high temporal resolution reconstructions of the heart function. RESULTS: SG was successful in all volunteers with image-based SG and the ∑||p|| technique. These approaches were also superior to gating from the respiratory bellows signal on average. Combination with k-t SPARSE SENSE enabled high temporally resolved visualization of the heart motion with free breathing. CONCLUSION: Respiratory SG can be applied for improving image sharpness. Combining SG with iterative reconstruction allows generation of high temporal resolution cine data, which reveal more details of cardiac motion.


Assuntos
Coração/anatomia & histologia , Aumento da Imagem/métodos , Interpretação de Imagem Assistida por Computador/métodos , Imagem Cinética por Ressonância Magnética/métodos , Reconhecimento Automatizado de Padrão/métodos , Técnicas de Imagem de Sincronização Respiratória/métodos , Adulto , Algoritmos , Feminino , Humanos , Masculino , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Processamento de Sinais Assistido por Computador
7.
Neural Comput ; 26(12): 2735-89, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-25248083

RESUMO

Evidence suggests that the brain uses an operational set of canonical computations like normalization, input filtering, and response gain enhancement via reentrant feedback. Here, we propose a three-stage columnar architecture of cascaded model neurons to describe a core circuit combining signal pathways of feedforward and feedback processing and the inhibitory pooling of neurons to normalize the activity. We present an analytical investigation of such a circuit by first reducing its detail through the lumping of initial feedforward response filtering and reentrant modulating signal amplification. The resulting excitatory-inhibitory pair of neurons is analyzed in a 2D phase-space. The inhibitory pool activation is treated as a separate mechanism exhibiting different effects. We analyze subtractive as well as divisive (shunting) interaction to implement center-surround mechanisms that include normalization effects in the characteristics of real neurons. Different variants of a core model architecture are derived and analyzed--in particular, individual excitatory neurons (without pool inhibition), the interaction with an inhibitory subtractive or divisive (i.e., shunting) pool, and the dynamics of recurrent self-excitation combined with divisive inhibition. The stability and existence properties of these model instances are characterized, which serve as guidelines to adjust these properties through proper model parameterization. The significance of the derived results is demonstrated by theoretical predictions of response behaviors in the case of multiple interacting hypercolumns in a single and in multiple feature dimensions. In numerical simulations, we confirm these predictions and provide some explanations for different neural computational properties. Among those, we consider orientation contrast-dependent response behavior, different forms of attentional modulation, contrast element grouping, and the dynamic adaptation of the silent surround in extraclassical receptive field configurations, using only slight variations of the same core reference model.


Assuntos
Simulação por Computador , Retroalimentação , Modelos Neurológicos , Rede Nervosa/fisiologia , Neurônios/fisiologia , Algoritmos , Humanos , Vias Neurais/fisiologia , Dinâmica não Linear , Estimulação Luminosa , Percepção Visual
8.
Neural Comput ; 25(9): 2421-49, 2013 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23663150

RESUMO

Visual navigation requires the estimation of self-motion as well as the segmentation of objects from the background. We suggest a definition of local velocity gradients to compute types of self-motion, segment objects, and compute local properties of optical flow fields, such as divergence, curl, and shear. Such velocity gradients are computed as velocity differences measured locally tangent and normal to the direction of flow. Then these differences are rotated according to the local direction of flow to achieve independence of that direction. We propose a bio-inspired model for the computation of these velocity gradients for video sequences. Simulation results show that local gradients encode ordinal surface depth, assuming self-motion in a rigid scene or object motions in a nonrigid scene. For translational self-motion velocity, gradients can be used to distinguish between static and moving objects. The information about ordinal surface depth and self-motion can help steering control for visual navigation.


Assuntos
Simulação por Computador , Modelos Neurológicos , Percepção de Movimento/fisiologia , Fluxo Óptico/fisiologia , Animais , Humanos , Propriedades de Superfície
9.
NMR Biomed ; 24(1): 17-24, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20672389

RESUMO

The MRI-based evaluation of the quantity and regional distribution of adipose tissue is one objective measure in the investigation of obesity. The aim of this article was to report a comprehensive and automatic analytical method for the determination of the volumes of subcutaneous fat tissue (SFT) and visceral fat tissue (VFT) in either the whole human body or selected slices or regions of interest. Using an MRI protocol in an examination position that was convenient for volunteers and patients with severe diseases, 22 healthy subjects were examined. The software platform was able to merge MRI scans of several body regions acquired in separate acquisitions. Through a cascade of image processing steps, SFT and VFT volumes were calculated. Whole-body SFT and VFT distributions, as well as fat distributions of defined body slices, were analysed in detail. Complete three-dimensional datasets were analysed in a reproducible manner with as few operator-dependent interventions as possible. In order to determine the SFT volume, the ARTIS (Adapted Rendering for Tissue Intensity Segmentation) algorithm was introduced. The advantage of the ARTIS algorithm was the delineation of SFT volumes in regions in which standard region grow techniques fail. Using the ARTIS algorithm, an automatic SFT volume detection was feasible. MRI data analysis was able to determine SFT and VFT volume percentages using new analytical strategies. With the techniques described, it was possible to detect changes in SFT and VFT percentages of the whole body and selected regions. The techniques presented in this study are likely to be of use in obesity-related investigations, as well as in the examination of longitudinal changes in weight during various medical conditions.


Assuntos
Tecido Adiposo/anatomia & histologia , Distribuição da Gordura Corporal/métodos , Imageamento por Ressonância Magnética/métodos , Idoso , Difusão , Feminino , Saúde , Humanos , Interpretação de Imagem Assistida por Computador , Gordura Intra-Abdominal/anatomia & histologia , Masculino , Reprodutibilidade dos Testes , Gordura Subcutânea/anatomia & histologia
10.
Neural Comput ; 23(11): 2868-914, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21851277

RESUMO

Motion transparency occurs when multiple coherent motions are perceived in one spatial location. Imagine, for instance, looking out of the window of a bus on a bright day, where the world outside the window is passing by and movements of passengers inside the bus are reflected in the window. The overlay of both motions at the window leads to motion transparency, which is challenging to process. Noisy and ambiguous motion signals can be reduced using a competition mechanism for all encoded motions in one spatial location. Such a competition, however, leads to the suppression of multiple peak responses that encode different motions, as only the strongest response tends to survive. As a solution, we suggest a local center-surround competition for population-encoded motion directions and speeds. Similar motions are supported, and dissimilar ones are separated, by representing them as multiple activations, which occurs in the case of motion transparency. Psychophysical findings, such as motion attraction and repulsion for motion transparency displays, can be explained by this local competition. Besides this local competition mechanism, we show that feedback signals improve the processing of motion transparency. A discrimination task for transparent versus opaque motion is simulated, where motion transparency is generated by superimposing large field motion patterns of either varying size or varying coherence of motion. The model's perceptual thresholds with and without feedback are calculated. We demonstrate that initially weak peak responses can be enhanced and stabilized through modulatory feedback signals from higher stages of processing.


Assuntos
Algoritmos , Encéfalo/fisiologia , Modelos Neurológicos , Modelos Teóricos , Percepção de Movimento/fisiologia , Animais , Humanos
11.
Front Neurorobot ; 14: 29, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32499692

RESUMO

While interacting with the world our senses and nervous system are constantly challenged to identify the origin and coherence of sensory input signals of various intensities. This problem becomes apparent when stimuli from different modalities need to be combined, e.g., to find out whether an auditory stimulus and a visual stimulus belong to the same object. To cope with this problem, humans and most other animal species are equipped with complex neural circuits to enable fast and reliable combination of signals from various sensory organs. This multisensory integration starts in the brain stem to facilitate unconscious reflexes and continues on ascending pathways to cortical areas for further processing. To investigate the underlying mechanisms in detail, we developed a canonical neural network model for multisensory integration that resembles neurophysiological findings. For example, the model comprises multisensory integration neurons that receive excitatory and inhibitory inputs from unimodal auditory and visual neurons, respectively, as well as feedback from cortex. Such feedback projections facilitate multisensory response enhancement and lead to the commonly observed inverse effectiveness of neural activity in multisensory neurons. Two versions of the model are implemented, a rate-based neural network model for qualitative analysis and a variant that employs spiking neurons for deployment on a neuromorphic processing. This dual approach allows to create an evaluation environment with the ability to test model performances with real world inputs. As a platform for deployment we chose IBM's neurosynaptic chip TrueNorth. Behavioral studies in humans indicate that temporal and spatial offsets as well as reliability of stimuli are critical parameters for integrating signals from different modalities. The model reproduces such behavior in experiments with different sets of stimuli. In particular, model performance for stimuli with varying spatial offset is tested. In addition, we demonstrate that due to the emergent properties of network dynamics model performance is close to optimal Bayesian inference for integration of multimodal sensory signals. Furthermore, the implementation of the model on a neuromorphic processing chip enables a complete neuromorphic processing cascade from sensory perception to multisensory integration and the evaluation of model performance for real world inputs.

12.
J Vis ; 9(9): 2.1-14, 2009 Aug 10.
Artigo em Inglês | MEDLINE | ID: mdl-19761335

RESUMO

A patient (HJA) with bilateral occipital lobe damage to ventral cortical areas V2, V3 and V4 was tested on a texture segmentation task involving texture bar detection in an array of oriented lines. Performance detecting a target shape was assessed as the orientations of the background lines had increasing orientation noise. Control participants found the task easier when the background lines had the same orientation or only slightly shifted in orientation. HJA was poor with all backgrounds but particularly so when the background lines had the same or almost the same orientations. The results suggest that V1 alone is not sufficient to perform easy texture segmentation, even when the background of the display is a homogeneous texture. Ventral extra-striate cortical areas are needed in order to detect texture boundaries. We suggest that extra-striate visual areas enhance the borders between the target and background, while also playing a role in reducing the signal from homogeneous texture backgrounds.


Assuntos
Infarto da Artéria Cerebral Posterior/fisiopatologia , Reconhecimento Visual de Modelos/fisiologia , Lobo Temporal/fisiologia , Córtex Visual/fisiologia , Vias Visuais/fisiologia , Idoso , Idoso de 80 Anos ou mais , Agnosia/patologia , Agnosia/fisiopatologia , Retroalimentação Fisiológica , Feminino , Humanos , Infarto da Artéria Cerebral Posterior/patologia , Masculino , Orientação/fisiologia , Estimulação Luminosa/métodos , Prosopagnosia/patologia , Prosopagnosia/fisiopatologia , Lobo Temporal/patologia , Córtex Visual/patologia , Vias Visuais/citologia
13.
Artigo em Inglês | MEDLINE | ID: mdl-30814934

RESUMO

Adaptation is a mechanism by which cortical neurons adjust their responses according to recently viewed stimuli. Visual information is processed in a circuit formed by feedforward (FF) and feedback (FB) synaptic connections of neurons in different cortical layers. Here, the functional role of FF-FB streams and their synaptic dynamics in adaptation to natural stimuli is assessed in psychophysics and neural model. We propose a cortical model which predicts psychophysically observed motion adaptation aftereffects (MAE) after exposure to geometrically distorted natural image sequences. The model comprises direction selective neurons in V1 and MT connected by recurrent FF and FB dynamic synapses. Psychophysically plausible model MAEs were obtained from synaptic changes within neurons tuned to salient direction signals of the broadband natural input. It is conceived that, motion disambiguation by FF-FB interactions is critical to encode this salient information. Moreover, only FF-FB dynamic synapses operating at distinct rates predicted psychophysical MAEs at different adaptation time-scales which could not be accounted for by single rate dynamic synapses in either of the streams. Recurrent FF-FB pathways thereby play a role during adaptation in a natural environment, specifically in inducing multilevel cortical plasticity to salient information and in mediating adaptation at different time-scales.


Assuntos
Adaptação Fisiológica/fisiologia , Córtex Cerebral/citologia , Modelos Neurológicos , Rede Nervosa/fisiologia , Neurônios/fisiologia , Animais , Humanos , Psicofísica , Sinapses/fisiologia
14.
J Vis ; 8(8): 8.1-25, 2008 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-18831631

RESUMO

Physiological and psychophysical studies have demonstrated the importance of colinearity in visual processing. Motivated by these empirical findings we present a novel computational model of recurrent long-range processing in the primary visual cortex. Unlike other models we restrict the long-range interaction to cells of parallel orientation with colinear aligned receptive fields. We also employ a recurrent interaction using modulatory feedback, in accordance with empirical findings. Self-normalizing shunting equations guarantee the saturation of activities after a few recurrent cycles. The primary computational goal of the model is to evaluate local, often noisy orientation measurements within a more global context and to selectively enhance coherent activity by excitatory, modulating feedback. All model simulations were done with the same set of parameters. We show that the model qualitatively reproduces empirical data of response facilitation and suppression for a single bar element depending on the local surround outside the classical receptive field (M. K. Kapadia, M. Ito, C. D. Gilbert, & G. Westheimer, 1995). Next we evaluate the model performance for the processing of artificial and natural images. We quantitatively evaluate the model using two measures of contour saliency and orientation significance. We show that both measures monotonically increase during the recurrent interaction and saturate after a small number of recurrent cycles. The model clarifies how basic tasks of early vision can be accomplished within a single, biologically plausible architecture.


Assuntos
Percepção de Forma/fisiologia , Modelos Neurológicos , Modelos Psicológicos , Córtex Visual/fisiologia , Simulação por Computador , Retroalimentação , Humanos , Neurônios/fisiologia , Estimulação Luminosa/métodos , Córtex Visual/citologia , Vias Visuais/fisiologia
15.
Front Psychol ; 9: 1455, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30210382

RESUMO

Human articulated motion can be readily recognized robustly even from impoverished so-called point-light displays. Such sequence information is processed by separate visual processing channels recruiting different stages at low and intermediate levels of the cortical visual processing hierarchy. The different contributions that motion and form information make to form articulated, or biological, motion perception are still under investigation. Here we investigate experimentally whether and how specific spatio-temporal features, such as extrema in the motion energy or maximum limb expansion, indicated by the lateral and longitudinal extension, constrain the formation of the representations of articulated body motion. In order to isolate the relevant stimulus properties we suggest a novel masking technique, which allows to selectively impair the ankle information of the body configuration while keeping the motion of the point-light locations intact. Our results provide evidence that maxima in feature channel representations, e.g., the lateral or longitudinal extension, define elemental features to specify key poses of biological motion patterns. These findings provide support for models which aim at automatically building visual representations for the cortical processing of articulated motion by identifying temporally localized events in a continuous input stream.

16.
Biosystems ; 89(1-3): 208-15, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-17280774

RESUMO

We utilize a model of motion perception to link a physiological study of feature attention in cortical motion processing to a psychophysical experiment of motion perception. We explain effects of feature attention by modulatory excitation of neural activity patterns in a framework of biased competition. Our model allows us to qualitatively replicate physiological data concerning attentional modulation and to generate model behavior in a decision experiment that is consistent with psychophysical observations. Furthermore, our investigation makes predictions for future psychophysical experiments.


Assuntos
Modelos Neurológicos , Movimento (Física) , Neurônios/fisiologia , Percepção Visual
17.
IEEE Trans Pattern Anal Mach Intell ; 29(2): 246-60, 2007 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17170478

RESUMO

We have previously developed a neurodynamical model of motion segregation in cortical visual area V1 and MT of the dorsal stream. The model explains how motion ambiguities caused by the motion aperture problem can be solved for coherently moving objects of arbitrary size by means of cortical mechanisms. The major bottleneck in the development of a reliable biologically inspired technical system with real-time motion analysis capabilities based on this neural model is the amount of memory necessary for the representation of neural activation in velocity space. We propose a sparse coding framework for neural motion activity patterns and suggest a means by which initial activities are detected efficiently. We realize neural mechanisms such as shunting inhibition and feedback modulation in the sparse framework to implement an efficient algorithmic version of our neural model of cortical motion segregation. We demonstrate that the algorithm behaves similarly to the original neural model and is able to extract image motion from real world image sequences. Our investigation transfers a neuroscience model of cortical motion computation to achieve technologically demanding constraints such as real-time performance and hardware implementation. In addition, the proposed biologically inspired algorithm provides a tool for modeling investigations to achieve acceptable simulation time.


Assuntos
Inteligência Artificial , Biomimética/métodos , Encéfalo/fisiologia , Interpretação de Imagem Assistida por Computador/métodos , Percepção de Movimento/fisiologia , Movimento (Física) , Reconhecimento Automatizado de Padrão/métodos , Algoritmos , Humanos , Aumento da Imagem/métodos , Rede Nervosa/fisiologia
18.
Front Neurorobot ; 11: 13, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28381998

RESUMO

Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on the recognition of the action of a human agent based on a biologically inspired visual architecture of analyzing articulated movements. The proposed processing architecture builds upon coarsely segregated streams of sensory processing along different pathways which separately process form and motion information (Layher et al., 2014). Action recognition is performed in an event-based scheme by identifying representations of characteristic pose configurations (key poses) in an image sequence. In line with perceptual studies, key poses are selected unsupervised utilizing a feature-driven criterion which combines extrema in the motion energy with the horizontal and the vertical extendedness of a body shape. Per class representations of key pose frames are learned using a deep convolutional neural network consisting of 15 convolutional layers. The network is trained using the energy-efficient deep neuromorphic networks (Eedn) framework (Esser et al., 2016), which realizes the mapping of the trained synaptic weights onto the IBM Neurosynaptic System platform (Merolla et al., 2014). After the mapping, the trained network achieves real-time capabilities for processing input streams and classify input images at about 1,000 frames per second while the computational stages only consume about 70 mW of energy (without spike transduction). Particularly regarding mobile robotic systems, a low energy profile might be crucial in a variety of application scenarios. Cross-validation results are reported for two different datasets and compared to state-of-the-art action recognition approaches. The results demonstrate, that (I) the presented approach is on par with other key pose based methods described in the literature, which select key pose frames by optimizing classification accuracy, (II) compared to the training on the full set of frames, representations trained on key pose frames result in a higher confidence in class assignments, and (III) key pose representations show promising generalization capabilities in a cross-dataset evaluation.

19.
Front Psychol ; 8: 1303, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28798717

RESUMO

The importance of emotions experienced by learners during their interaction with multimedia learning systems, such as serious games, underscores the need to identify sources of information that allow the recognition of learners' emotional experience without interrupting the learning process. Bodily expression is gaining in attention as one of these sources of information. However, to date, the question of how bodily expression can convey different emotions has largely been addressed in research relying on acted emotion displays. Following a more contextualized approach, the present study aims to identify features of bodily expression (i.e., posture and activity of the upper body and the head) that relate to genuine emotional experience during interaction with a serious game. In a multimethod approach, 70 undergraduates played a serious game relating to financial education while their bodily expression was captured using an off-the-shelf depth-image sensor (Microsoft Kinect). In addition, self-reports of experienced enjoyment, boredom, and frustration were collected repeatedly during gameplay, to address the dynamic changes in emotions occurring in educational tasks. Results showed that, firstly, the intensities of all emotions indeed changed significantly over the course of the game. Secondly, by using generalized estimating equations, distinct features of bodily expression could be identified as significant indicators for each emotion under investigation. A participant keeping their head more turned to the right was positively related to frustration being experienced, whereas keeping their head more turned to the left was positively related to enjoyment. Furthermore, having their upper body positioned more closely to the gaming screen was also positively related to frustration. Finally, increased activity of a participant's head emerged as a significant indicator of boredom being experienced. These results confirm the value of bodily expression as an indicator of emotional experience in multimedia learning systems. Furthermore, the findings may guide developers of emotion recognition procedures by focusing on the identified features of bodily expression.

20.
Vision Res ; 46(17): 2659-74, 2006 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16603218

RESUMO

Recent evidence suggests that object surfaces and their properties are represented at early stages in the visual system of primates. Most likely invariant surface properties are extracted to endow primates with robust object recognition capabilities. In real-world scenes, luminance gradients are often superimposed on surfaces. We argue that gradients should also be represented in the visual system, since they encode highly variable information, such as shading, focal blur, and penumbral blur. We present a neuronal architecture which was designed and optimized for segregating and representing luminance gradients in real-world images. Our architecture in addition provides a novel theory for Mach bands, whereby corresponding psychophysical data are predicted consistently.


Assuntos
Sensibilidades de Contraste/fisiologia , Modelos Neurológicos , Modelos Psicológicos , Reconhecimento Visual de Modelos/fisiologia , Humanos , Iluminação , Psicofísica , Células Ganglionares da Retina/fisiologia , Limiar Sensorial/fisiologia , Propriedades de Superfície
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA