Pesquisa | Portal de Pesquisa da BVS

Human-level control through deep reinforcement learning.

Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A; Veness, Joel; Bellemare, Marc G; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K; Ostrovski, Georg; Petersen, Stig; Beattie, Charles; Sadik, Amir; Antonoglou, Ioannis; King, Helen; Kumaran, Dharshan; Wierstra, Daan; Legg, Shane; Hassabis, Demis.

Nature ; 518(7540): 529-33, 2015 Feb 26.

Artigo em Inglês | MEDLINE | ID: mdl-25719670

RESUMO

The theory of reinforcement learning provides a normative account, deeply rooted in psychological and neuroscientific perspectives on animal behaviour, of how agents may optimize their control of an environment. To use reinforcement learning successfully in situations approaching real-world complexity, however, agents are confronted with a difficult task: they must derive efficient representations of the environment from high-dimensional sensory inputs, and use these to generalize past experience to new situations. Remarkably, humans and other animals seem to solve this problem through a harmonious combination of reinforcement learning and hierarchical sensory processing systems, the former evidenced by a wealth of neural data revealing notable parallels between the phasic signals emitted by dopaminergic neurons and temporal difference reinforcement learning algorithms. While reinforcement learning agents have achieved some successes in a variety of domains, their applicability has previously been limited to domains in which useful features can be handcrafted, or to domains with fully observed, low-dimensional state spaces. Here we use recent advances in training deep neural networks to develop a novel artificial agent, termed a deep Q-network, that can learn successful policies directly from high-dimensional sensory inputs using end-to-end reinforcement learning. We tested this agent on the challenging domain of classic Atari 2600 games. We demonstrate that the deep Q-network agent, receiving only the pixels and the game score as inputs, was able to surpass the performance of all previous algorithms and achieve a level comparable to that of a professional human games tester across a set of 49 games, using the same algorithm, network architecture and hyperparameters. This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.

Assuntos

Inteligência Artificial , Reforço Psicológico , Jogos de Vídeo , Algoritmos , Humanos , Modelos Psicológicos , Redes Neurais de Computação , Recompensa

Overcoming catastrophic forgetting in neural networks.

Kirkpatrick, James; Pascanu, Razvan; Rabinowitz, Neil; Veness, Joel; Desjardins, Guillaume; Rusu, Andrei A; Milan, Kieran; Quan, John; Ramalho, Tiago; Grabska-Barwinska, Agnieszka; Hassabis, Demis; Clopath, Claudia; Kumaran, Dharshan; Hadsell, Raia.

Proc Natl Acad Sci U S A ; 114(13): 3521-3526, 2017 03 28.

Artigo em Inglês | MEDLINE | ID: mdl-28292907

RESUMO

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.

Assuntos

Redes Neurais de Computação , Algoritmos , Inteligência Artificial , Simulação por Computador , Humanos , Aprendizagem , Memória , Rememoração Mental

Reply to Huszár: The elastic weight consolidation penalty is empirically valid.

Proc Natl Acad Sci U S A ; 115(11): E2498, 2018 03 13.

Artigo em Inglês | MEDLINE | ID: mdl-29463734

Assuntos

Aprendizado de Máquina

Imagine all the people: how the brain creates and uses personality models to predict behavior.

Hassabis, Demis; Spreng, R Nathan; Rusu, Andrei A; Robbins, Clifford A; Mar, Raymond A; Schacter, Daniel L.

Cereb Cortex ; 24(8): 1979-87, 2014 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-23463340

RESUMO

The behaviors of other people are often central to envisioning the future. The ability to accurately predict the thoughts and actions of others is essential for successful social interactions, with far-reaching consequences. Despite its importance, little is known about how the brain represents people in order to predict behavior. In this functional magnetic resonance imaging study, participants learned the unique personality of 4 protagonists and imagined how each would behave in different scenarios. The protagonists' personalities were composed of 2 traits: Agreeableness and Extraversion. Which protagonist was being imagined was accurately inferred based solely on activity patterns in the medial prefrontal cortex using multivariate pattern classification, providing novel evidence that brain activity can reveal whom someone is thinking about. Lateral temporal and posterior cingulate cortex discriminated between different degrees of agreeableness and extraversion, respectively. Functional connectivity analysis confirmed that regions associated with trait-processing and individual identities were functionally coupled. Activity during the imagination task, and revealed by functional connectivity, was consistent with the default network. Our results suggest that distinct regions code for personality traits, and that the brain combines these traits to represent individuals. The brain then uses this "personality model" to predict the behavior of others in novel situations.

Assuntos

Encéfalo/fisiologia , Imaginação/fisiologia , Personalidade , Percepção Social , Mapeamento Encefálico , Feminino , Humanos , Imageamento por Ressonância Magnética , Masculino , Testes Neuropsicológicos , Teoria da Mente/fisiologia , Pensamento/fisiologia , Adulto Jovem

Embracing Change: Continual Learning in Deep Neural Networks.

Hadsell, Raia; Rao, Dushyant; Rusu, Andrei A; Pascanu, Razvan.

Trends Cogn Sci ; 24(12): 1028-1040, 2020 12.

Artigo em Inglês | MEDLINE | ID: mdl-33158755

RESUMO

Artificial intelligence research has seen enormous progress over the past few decades, but it predominantly relies on fixed datasets and stationary environments. Continual learning is an increasingly relevant area of study that asks how artificial systems might learn sequentially, as biological systems do, from a continuous stream of correlated data. In the present review, we relate continual learning to the learning dynamics of neural networks, highlighting the potential it has to considerably improve data efficiency. We further consider the many new biologically inspired approaches that have emerged in recent years, focusing on those that utilize regularization, modularity, memory, and meta-learning, and highlight some of the most promising and impactful directions.

Assuntos

Inteligência Artificial , Redes Neurais de Computação , Humanos , Aprendizagem , Memória

Neural scene representation and rendering.

Eslami, S M Ali; Jimenez Rezende, Danilo; Besse, Frederic; Viola, Fabio; Morcos, Ari S; Garnelo, Marta; Ruderman, Avraham; Rusu, Andrei A; Danihelka, Ivo; Gregor, Karol; Reichert, David P; Buesing, Lars; Weber, Theophane; Vinyals, Oriol; Rosenbaum, Dan; Rabinowitz, Neil; King, Helen; Hillier, Chloe; Botvinick, Matt; Wierstra, Daan; Kavukcuoglu, Koray; Hassabis, Demis.

Science ; 360(6394): 1204-1210, 2018 Jun 15.

Artigo em Inglês | MEDLINE | ID: mdl-29903970

RESUMO

Scene representation-the process of converting visual sensory data into concise descriptions-is a requirement for intelligent behavior. Recent work has shown that neural networks excel at this task when provided with large, labeled datasets. However, removing the reliance on human labeling remains an important open problem. To this end, we introduce the Generative Query Network (GQN), a framework within which machines learn to represent scenes using only their own sensors. The GQN takes as input images of a scene taken from different viewpoints, constructs an internal representation, and uses this representation to predict the appearance of that scene from previously unobserved viewpoints. The GQN demonstrates representation learning without human labels or domain knowledge, paving the way toward machines that autonomously learn to understand the world around them.

Assuntos

Aprendizado de Máquina , Redes Neurais de Computação , Visão Ocular

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA