Búsqueda | Portal de Búsqueda de la BVS España

1.

Frequency Propagation: Multimechanism Learning in Nonlinear Physical Networks.

Anisetti, Vidyesh Rao; Kandala, Ananth; Scellier, Benjamin; Schwarz, J M.

Neural Comput ; 36(4): 596-620, 2024 Mar 21.

Artículo en Inglés | MEDLINE | ID: mdl-38457749

RESUMEN

We introduce frequency propagation, a learning algorithm for nonlinear physical networks. In a resistive electrical circuit with variable resistors, an activation current is applied at a set of input nodes at one frequency and an error current is applied at a set of output nodes at another frequency. The voltage response of the circuit to these boundary currents is the superposition of an activation signal and an error signal whose coefficients can be read in different frequencies of the frequency domain. Each conductance is updated proportionally to the product of the two coefficients. The learning rule is local and proved to perform gradient descent on a loss function. We argue that frequency propagation is an instance of a multimechanism learning strategy for physical networks, be it resistive, elastic, or flow networks. Multimechanism learning strategies incorporate at least two physical quantities, potentially governed by independent physical mechanisms, to act as activation and error signals in the training process. Locally available information about these two signals is then used to update the trainable parameters to perform gradient descent. We demonstrate how earlier work implementing learning via chemical signaling in flow networks (Anisetti, Scellier, et al., 2023) also falls under the rubric of multimechanism learning.

2.

Equivalence of Equilibrium Propagation and Recurrent Backpropagation.

Scellier, Benjamin; Bengio, Yoshua.

Neural Comput ; 31(2): 312-329, 2019 02.

Artículo en Inglés | MEDLINE | ID: mdl-30576611

RESUMEN

Recurrent backpropagation and equilibrium propagation are supervised learning algorithms for fixed-point recurrent neural networks, which differ in their second phase. In the first phase, both algorithms converge to a fixed point that corresponds to the configuration where the prediction is made. In the second phase, equilibrium propagation relaxes to another nearby fixed point corresponding to smaller prediction error, whereas recurrent backpropagation uses a side network to compute error derivatives iteratively. In this work, we establish a close connection between these two algorithms. We show that at every moment in the second phase, the temporal derivatives of the neural activities in equilibrium propagation are equal to the error derivatives computed iteratively by recurrent backpropagation in the side network. This work shows that it is not required to have a side network for the computation of error derivatives and supports the hypothesis that in biological neural networks, temporal derivatives of neural activities may code for error signals.

3.

Contrastive learning through non-equilibrium memory.

Falk, Martin; Strupp, Adam; Scellier, Benjamin; Murugan, Arvind.

ArXiv ; 2023 Dec 29.

Artículo en Inglés | MEDLINE | ID: mdl-38235062

RESUMEN

Learning algorithms based on backpropagation have enabled transformative technological advances but alternatives based on local energy-based rules offer benefits in terms of biological plausibility and decentralized training. A broad class of such local learning rules involve \textit{contrasting} a clamped configuration with the free, spontaneous behavior of the system. However, comparisons of clamped and free configurations require explicit memory or switching between Hebbian and anti-Hebbian modes. Here, we show how a simple form of implicit non-equilibrium memory in the update dynamics of each ``synapse'' of a network naturally allows for contrastive learning. During training, free and clamped behaviors are shown in sequence over time using a sawtooth-like temporal protocol that breaks the symmetry between those two behaviors when combined with non-equilibrium update dynamics at each synapse. We show that the needed dynamics is implicit in integral feedback control, broadening the range of physical and biological systems naturally capable of contrastive learning. Finally, we show that non-equilibrium dissipation improves learning quality and determine the Landauer energy cost of contrastive learning through physical dynamics.

4.

Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing Its Gradient Estimator Bias.

Laborieux, Axel; Ernoult, Maxence; Scellier, Benjamin; Bengio, Yoshua; Grollier, Julie; Querlioz, Damien.

Front Neurosci ; 15: 633674, 2021.

Artículo en Inglés | MEDLINE | ID: mdl-33679315

RESUMEN

Equilibrium Propagation is a biologically-inspired algorithm that trains convergent recurrent neural networks with a local learning rule. This approach constitutes a major lead to allow learning-capable neuromophic systems and comes with strong theoretical guarantees. Equilibrium propagation operates in two phases, during which the network is let to evolve freely and then "nudged" toward a target; the weights of the network are then updated based solely on the states of the neurons that they connect. The weight updates of Equilibrium Propagation have been shown mathematically to approach those provided by Backpropagation Through Time (BPTT), the mainstream approach to train recurrent neural networks, when nudging is performed with infinitely small strength. In practice, however, the standard implementation of Equilibrium Propagation does not scale to visual tasks harder than MNIST. In this work, we show that a bias in the gradient estimate of equilibrium propagation, inherent in the use of finite nudging, is responsible for this phenomenon and that canceling it allows training deep convolutional neural networks. We show that this bias can be greatly reduced by using symmetric nudging (a positive nudging and a negative one). We also generalize Equilibrium Propagation to the case of cross-entropy loss (by opposition to squared error). As a result of these advances, we are able to achieve a test error of 11.7% on CIFAR-10, which approaches the one achieved by BPTT and provides a major improvement with respect to the standard Equilibrium Propagation that gives 86% test error. We also apply these techniques to train an architecture with unidirectional forward and backward connections, yielding a 13.2% test error. These results highlight equilibrium propagation as a compelling biologically-plausible approach to compute error gradients in deep neuromorphic systems.

5.

A deep learning framework for neuroscience.

Richards, Blake A; Lillicrap, Timothy P; Beaudoin, Philippe; Bengio, Yoshua; Bogacz, Rafal; Christensen, Amelia; Clopath, Claudia; Costa, Rui Ponte; de Berker, Archy; Ganguli, Surya; Gillon, Colleen J; Hafner, Danijar; Kepecs, Adam; Kriegeskorte, Nikolaus; Latham, Peter; Lindsay, Grace W; Miller, Kenneth D; Naud, Richard; Pack, Christopher C; Poirazi, Panayiota; Roelfsema, Pieter; Sacramento, João; Saxe, Andrew; Scellier, Benjamin; Schapiro, Anna C; Senn, Walter; Wayne, Greg; Yamins, Daniel; Zenke, Friedemann; Zylberberg, Joel; Therien, Denis; Kording, Konrad P.

Nat Neurosci ; 22(11): 1761-1770, 2019 11.

Artículo en Inglés | MEDLINE | ID: mdl-31659335

RESUMEN

Systems neuroscience seeks explanations for how the brain implements a wide variety of perceptual, cognitive and motor tasks. Conversely, artificial intelligence attempts to design computational systems based on the tasks they will have to solve. In artificial neural networks, the three components specified by design are the objective functions, the learning rules and the architectures. With the growing success of deep learning, which utilizes brain-inspired architectures, these three designed components have increasingly become central to how we model, engineer and optimize complex artificial learning systems. Here we argue that a greater focus on these components would also benefit systems neuroscience. We give examples of how this optimization-based framework can drive theoretical and experimental progress in neuroscience. We contend that this principled perspective on systems neuroscience will help to generate more rapid progress.

Asunto(s)

Inteligencia Artificial , Aprendizaje Profundo , Redes Neurales de la Computación , Animales , Encéfalo/fisiología , Humanos

6.

Equilibrium Propagation: Bridging the Gap between Energy-Based Models and Backpropagation.

Scellier, Benjamin; Bengio, Yoshua.

Front Comput Neurosci ; 11: 24, 2017.

Artículo en Inglés | MEDLINE | ID: mdl-28522969

RESUMEN

We introduce Equilibrium Propagation, a learning framework for energy-based models. It involves only one kind of neural computation, performed in both the first phase (when the prediction is made) and the second phase of training (after the target or prediction error is revealed). Although this algorithm computes the gradient of an objective function just like Backpropagation, it does not need a special computation or circuit for the second phase, where errors are implicitly propagated. Equilibrium Propagation shares similarities with Contrastive Hebbian Learning and Contrastive Divergence while solving the theoretical issues of both algorithms: our algorithm computes the gradient of a well-defined objective function. Because the objective function is defined in terms of local perturbations, the second phase of Equilibrium Propagation corresponds to only nudging the prediction (fixed point or stationary distribution) toward a configuration that reduces prediction error. In the case of a recurrent multi-layer supervised network, the output units are slightly nudged toward their target in the second phase, and the perturbation introduced at the output layer propagates backward in the hidden layers. We show that the signal "back-propagated" during this second phase corresponds to the propagation of error derivatives and encodes the gradient of the objective function, when the synaptic update corresponds to a standard form of spike-timing dependent plasticity. This work makes it more plausible that a mechanism similar to Backpropagation could be implemented by brains, since leaky integrator neural computation performs both inference and error back-propagation in our model. The only local difference between the two phases is whether synaptic changes are allowed or not. We also show experimentally that multi-layer recurrently connected networks with 1, 2, and 3 hidden layers can be trained by Equilibrium Propagation on the permutation-invariant MNIST task.

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA