RESUMEN
The recent flowering of Bayesian approaches invites the re-examination of classic issues in behavior, even in areas as venerable as Pavlovian conditioning. A statistical account can offer a new, principled interpretation of behavior, and previous experiments and theories can inform many unexplored aspects of the Bayesian enterprise. Here we consider one such issue: the finding that surprising events provoke animals to learn faster. We suggest that, in a statistical account of conditioning, surprise signals change and therefore uncertainty and the need for new learning. We discuss inference in a world that changes and show how experimental results involving surprise can be interpreted from this perspective, and also how, thus understood, these phenomena help constrain statistical theories of animal and human learning.
Asunto(s)
Teorema de Bayes , Conducta Animal , Condicionamiento Clásico , Cambio Social , Animales , Aprendizaje por Asociación , Extinción Psicológica , Humanos , Inhibición Psicológica , Solución de Problemas , Esquema de Refuerzo , Medio Social , Procesos Estocásticos , IncertidumbreRESUMEN
Colorectal cancer (CRC) is the third cause of cancer death worldwide. Currently, the standard approach to reduce CRC-related mortality is to perform regular screening in search for polyps and colonoscopy is the screening tool of choice. The main limitations of this screening procedure are polyp miss rate and the inability to perform visual assessment of polyp malignancy. These drawbacks can be reduced by designing decision support systems (DSS) aiming to help clinicians in the different stages of the procedure by providing endoluminal scene segmentation. Thus, in this paper, we introduce an extended benchmark of colonoscopy image segmentation, with the hope of establishing a new strong benchmark for colonoscopy image analysis research. The proposed dataset consists of 4 relevant classes to inspect the endoluminal scene, targeting different clinical needs. Together with the dataset and taking advantage of advances in semantic segmentation literature, we provide new baselines by training standard fully convolutional networks (FCNs). We perform a comparative study to show that FCNs significantly outperform, without any further postprocessing, prior results in endoluminal scene segmentation, especially with respect to polyp segmentation and localization.
Asunto(s)
Colonoscopía , Neoplasias Colorrectales/diagnóstico por imagen , Procesamiento de Imagen Asistido por Computador , Algoritmos , Benchmarking , Técnicas de Apoyo para la Decisión , Humanos , Redes Neurales de la ComputaciónRESUMEN
In this paper, we present a fully automatic brain tumor segmentation method based on Deep Neural Networks (DNNs). The proposed networks are tailored to glioblastomas (both low and high grade) pictured in MR images. By their very nature, these tumors can appear anywhere in the brain and have almost any kind of shape, size, and contrast. These reasons motivate our exploration of a machine learning solution that exploits a flexible, high capacity DNN while being extremely efficient. Here, we give a description of different model choices that we've found to be necessary for obtaining competitive performance. We explore in particular different architectures based on Convolutional Neural Networks (CNN), i.e. DNNs specifically adapted to image data. We present a novel CNN architecture which differs from those traditionally used in computer vision. Our CNN exploits both local features as well as more global contextual features simultaneously. Also, different from most traditional uses of CNNs, our networks use a final layer that is a convolutional implementation of a fully connected layer which allows a 40 fold speed up. We also describe a 2-phase training procedure that allows us to tackle difficulties related to the imbalance of tumor labels. Finally, we explore a cascade architecture in which the output of a basic CNN is treated as an additional source of information for a subsequent CNN. Results reported on the 2013 BRATS test data-set reveal that our architecture improves over the currently published state-of-the-art while being over 30 times faster.
Asunto(s)
Neoplasias Encefálicas/diagnóstico por imagen , Glioblastoma/diagnóstico por imagen , Aprendizaje Automático , Imagen por Resonancia Magnética/métodos , Redes Neurales de la Computación , Humanos , Procesamiento de Imagen Asistido por Computador/métodosRESUMEN
The ICML 2013 Workshop on Challenges in Representation Learning(1) focused on three challenges: the black box learning challenge, the facial expression recognition challenge, and the multimodal learning challenge. We describe the datasets created for these challenges and summarize the results of the competitions. We provide suggestions for organizers of future challenges and some comments on what kind of knowledge can be gained from machine learning competitions.
Asunto(s)
Algoritmos , Inteligencia Artificial , Identificación Biométrica/métodos , HumanosRESUMEN
The spike-and-slab restricted Boltzmann machine (ssRBM) is defined to have both a real-valued "slab" variable and a binary "spike" variable associated with each unit in the hidden layer. The model uses its slab variables to model the conditional covariance of the observation-thought to be important in capturing the statistical properties of natural images. In this paper, we present the canonical ssRBM framework together with some extensions. These extensions highlight the flexibility of the spike-and-slab RBM as a platform for exploring more sophisticated probabilistic models of high dimensional data in general and natural image data in particular. Here, we introduce the subspace-ssRBM focused on the task of learning invariant features. We highlight the behaviour of the ssRBM and its extensions through experiments with the MNIST digit recognition task and the CIFAR-10 object classification task.
RESUMEN
The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind the data. Although specific domain knowledge can be used to help design representations, learning with generic priors can also be used, and the quest for AI is motivating the design of more powerful representation-learning algorithms implementing such priors. This paper reviews recent work in the area of unsupervised feature learning and deep learning, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks. This motivates longer term unanswered questions about the appropriate objectives for learning good representations, for computing representations (i.e., inference), and the geometrical connections between representation learning, density estimation, and manifold learning.
Asunto(s)
Inteligencia Artificial/tendencias , Algoritmos , Humanos , Redes Neurales de la ComputaciónRESUMEN
We describe the use of two spike-and-slab models for modeling real-valued data, with an emphasis on their applications to object recognition. The first model, which we call spike-and-slab sparse coding (S3C), is a preexisting model for which we introduce a faster approximate inference algorithm. We introduce a deep variant of S3C, which we call the partially directed deep Boltzmann machine (PD-DBM) and extend our S3C inference algorithm for use on this model. We describe learning procedures for each. We demonstrate that our inference procedure for S3C enables scaling the model to unprecedented large problem sizes, and demonstrate that using S3C as a feature extractor results in very good object recognition performance, particularly when the number of labeled examples is low. We show that the PD-DBM generates better samples than its shallow counterpart, and that unlike DBMs or DBNs, the PD-DBM may be trained successfully without greedy layerwise training.
RESUMEN
We describe a computational model of epileptiform activity mimicking the activity exhibited by an animal model of epilepsy in vitro. The computational model permits generation of synthetic data to assist in the evaluation of new algorithms for epilepsy treatment via adaptive neurostimulation. The model implements both single-compartment pyramidal neurons and fast-spiking interneurons, arranged in a one-dimensional network using both excitatory and inhibitory synapses. The model tracks changes in extracellular ion concentrations, which determine the reversal potentials of membrane currents. Changes in simulated ion concentration provide positive feedback which drives the system towards the epileptiform state. One mechanism of positive feedback explored by this model is the conversion of pyramidal cells from regular spiking to intrinsic bursting as extracellular potassium concentration increases. One of the main contributions of this work is the development of a slow depression mechanism that enforces seizure termination. The network spontaneously leaves the seizure-like state as the slow depression variable decreases. This is one of the first detailed computational models of epileptiform activity, which exhibits realistic transitions between inter-seizure and seizure states, and back, with state durations similar to the in vitro model. We validate the computational model by comparing its state durations to those of the biological model. We also show that electrical stimulation of the computational model achieves seizure suppression comparable to that observed in the in vitro model.
Asunto(s)
Simulación por Computador , Modelos Animales de Enfermedad , Corteza Entorrinal/fisiopatología , Epilepsia/patología , Hipocampo/fisiopatología , 4-Aminopiridina/farmacología , Potenciales de Acción/fisiología , Animales , Corteza Entorrinal/patología , Hipocampo/patología , Técnicas In Vitro , Inhibición Psicológica , Masculino , Modelos Neurológicos , Neuronas/efectos de los fármacos , Neuronas/metabolismo , Neuronas/fisiología , Potasio/metabolismo , Bloqueadores de los Canales de Potasio/farmacología , Ratas , Sinapsis/fisiologíaRESUMEN
Although the responses of dopamine neurons in the primate midbrain are well characterized as carrying a temporal difference (TD) error signal for reward prediction, existing theories do not offer a credible account of how the brain keeps track of past sensory events that may be relevant to predicting future reward. Empirically, these shortcomings of previous theories are particularly evident in their account of experiments in which animals were exposed to variation in the timing of events. The original theories mispredicted the results of such experiments due to their use of a representational device called a tapped delay line. Here we propose that a richer understanding of history representation and a better account of these experiments can be given by considering TD algorithms for a formal setting that incorporates two features not originally considered in theories of the dopaminergic response: partial observability (a distinction between the animal's sensory experience and the true underlying state of the world) and semi-Markov dynamics (an explicit account of variation in the intervals between events). The new theory situates the dopaminergic system in a richer functional and anatomical context, since it assumes (in accord with recent computational theories of cortex) that problems of partial observability and stimulus history are solved in sensory cortex using statistical modeling and inference and that the TD system predicts reward using the results of this inference rather than raw sensory data. It also accounts for a range of experimental data, including the experiments involving programmed temporal variability and other previously unmodeled dopaminergic response phenomena, which we suggest are related to subjective noise in animals' interval timing. Finally, it offers new experimental predictions and a rich theoretical framework for designing future experiments.