Pesquisa | BVS - MINISTÉRIO DA SAÚDE

1.

Hippocampal representations of alternative possibilities are flexibly generated to meet cognitive demands.

Comrie, Alison E; Monroe, Emily J; Kahn, Ari E; Denovellis, Eric L; Joshi, Abhilasha; Guidera, Jennifer A; Krausz, Timothy A; Berke, Joshua D; Daw, Nathaniel D; Frank, Loren M.

bioRxiv ; 2024 Sep 23.

Artigo em Inglês | MEDLINE | ID: mdl-39386651

RESUMO

The cognitive ability to go beyond the present to consider alternative possibilities, including potential futures and counterfactual pasts, can support adaptive decision making. Complex and changing real-world environments, however, have many possible alternatives. Whether and how the brain can select among them to represent alternatives that meet current cognitive needs remains unknown. We therefore examined neural representations of alternative spatial locations in the rat hippocampus during navigation in a complex patch foraging environment with changing reward probabilities. We found representations of multiple alternatives along paths ahead and behind the animal, including in distant alternative patches. Critically, these representations were modulated in distinct patterns across successive trials: alternative paths were represented proportionate to their evolving relative value and predicted subsequent decisions, whereas distant alternatives were prevalent during value updating. These results demonstrate that the brain modulates the generation of alternative possibilities in patterns that meet changing cognitive needs for adaptive behavior.

2.

Humans actively reconfigure neural task states.

Ritz, Harrison; Jha, Aditi; Pillow, Jonathan; Daw, Nathaniel D; Cohen, Jonathan D.

bioRxiv ; 2024 Oct 13.

Artigo em Inglês | MEDLINE | ID: mdl-39416099

RESUMO

The ability to switch between different tasks is a core component of adaptive cognition, but a mechanistic understanding of this capacity has remained elusive. Longstanding questions over whether task switching requires active preparation remain hotly contested, in large part due to the difficulty of inferring preparatory dynamics from behavior or time-locked neuroimaging. We make progress on this debate by quantifying neural task representations using high-dimensional linear dynamical systems fit to human electroencephalographic recordings. We find that these dynamical systems have high predictive accuracy and reveal neural signatures of active preparation that are shared with task-optimized neural networks. These findings inform a classic debate about how we control our cognition, and offer a promising new paradigm for neuroimaging analysis.

3.

Pre-existing visual responses in a projection-defined dopamine population explain individual learning trajectories.

Pan-Vazquez, Alejandro; Sanchez Araujo, Yoel; McMannon, Brenna; Louka, Miranta; Bandi, Akhil; Haetzel, Laura; Faulkner, Mayo; Pillow, Jonathan W; Daw, Nathaniel D; Witten, Ilana B.

Curr Biol ; 2024 Oct 15.

Artigo em Inglês | MEDLINE | ID: mdl-39413788

RESUMO

A key challenge of learning a new task is that the environment is high dimensional-there are many different sensory features and possible actions, with typically only a small reward-relevant subset. Although animals can learn to perform complex tasks that involve arbitrary associations between stimuli, actions, and rewards,1,2,3,4,5,6 a consistent and striking result across varied experimental paradigms is that in initially acquiring such tasks, large differences between individuals are apparent in the learning process.7,8,9,10,11,12 What neural mechanisms contribute to initial task acquisition, and why do some individuals learn a new task much more quickly than others? To address these questions, we recorded longitudinally from dopaminergic (DA) axon terminals in mice learning a visual decision-making task.7 Across striatum, DA responses tracked idiosyncratic and side-specific learning trajectories, consistent with widespread reward prediction error coding across DA terminals. However, even before any rewards were delivered, contralateral-side-specific visual responses were present in DA terminals, primarily in the dorsomedial striatum (DMS). These pre-existing responses predicted the extent of learning for contralateral stimuli. Moreover, activation of these terminals improved contralateral performance. Thus, the initial conditions of a projection-specific and feature-specific DA signal help explain individual learning trajectories. More broadly, this work suggests that functional heterogeneity across DA projections may serve to bias target regions toward learning about different subsets of task features, providing a potential mechanism to address the dimensionality of the initial task learning problem.

4.

Computational processes of simultaneous learning of stochasticity and volatility in humans.

Piray, Payam; Daw, Nathaniel D.

Nat Commun ; 15(1): 9073, 2024 Oct 21.

Artigo em Inglês | MEDLINE | ID: mdl-39433765

RESUMO

Making adaptive decisions requires predicting outcomes, and this in turn requires adapting to uncertain environments. This study explores computational challenges in distinguishing two types of noise influencing predictions: volatility and stochasticity. Volatility refers to diffusion noise in latent causes, requiring a higher learning rate, while stochasticity introduces moment-to-moment observation noise and reduces learning rate. Dissociating these effects is challenging as both increase the variance of observations. Previous research examined these factors mostly separately, but it remains unclear whether and how humans dissociate them when they are played off against one another. In two large-scale experiments, through a behavioral prediction task and computational modeling, we report evidence of humans dissociating volatility and stochasticity solely based on their observations. We observed contrasting effects of volatility and stochasticity on learning rates, consistent with statistical principles. These results are consistent with a computational model that estimates volatility and stochasticity by balancing their dueling effects.

Assuntos

Aprendizagem , Processos Estocásticos , Humanos , Aprendizagem/fisiologia , Masculino , Feminino , Adulto , Adulto Jovem , Tomada de Decisões/fisiologia , Simulação por Computador , Incerteza

5.

The role of training variability for model-based and model-free learning of an arbitrary visuomotor mapping.

Velázquez-Vargas, Carlos A; Daw, Nathaniel D; Taylor, Jordan A.

PLoS Comput Biol ; 20(9): e1012471, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39331685

RESUMO

A fundamental feature of the human brain is its capacity to learn novel motor skills. This capacity requires the formation of vastly different visuomotor mappings. Using a grid navigation task, we investigated whether training variability would enhance the flexible use of a visuomotor mapping (key-to-direction rule), leading to better generalization performance. Experiments 1 and 2 show that participants trained to move between multiple start-target pairs exhibited greater generalization to both distal and proximal targets compared to participants trained to move between a single pair. This finding suggests that limited variability can impair decisions even in simple tasks without planning. In addition, during the training phase, participants exposed to higher variability were more inclined to choose options that, counterintuitively, moved the cursor away from the target while minimizing its actual distance under the constrained mapping, suggesting a greater engagement in model-based computations. In Experiments 3 and 4, we showed that the limited generalization performance in participants trained with a single pair can be enhanced by a short period of variability introduced early in learning or by incorporating stochasticity into the visuomotor mapping. Our computational modeling analyses revealed that a hybrid model between model-free and model-based computations with different mixing weights for the training and generalization phases, best described participants' data. Importantly, the differences in the model-based weights between our experimental groups, paralleled the behavioral findings during training and generalization. Taken together, our results suggest that training variability enables the flexible use of the visuomotor mapping, potentially by preventing the consolidation of habits due to the continuous demand to change responses.

Assuntos

Aprendizagem , Desempenho Psicomotor , Humanos , Masculino , Aprendizagem/fisiologia , Feminino , Desempenho Psicomotor/fisiologia , Adulto , Adulto Jovem , Biologia Computacional , Modelos Neurológicos , Destreza Motora/fisiologia , Simulação por Computador , Encéfalo/fisiologia

6.

Major depression symptom severity associations with willingness to exert effort and patch foraging strategy.

Bustamante, Laura A; Barch, Deanna M; Solis, Johanne; Oshinowo, Temitope; Grahek, Ivan; Konova, Anna B; Daw, Nathaniel D; Cohen, Jonathan D.

medRxiv ; 2024 Sep 17.

Artigo em Inglês | MEDLINE | ID: mdl-38947009

RESUMO

Background: Individuals with major depressive disorder (MDD) can experience reduced motivation and cognitive function, leading to challenges with goal-directed behavior. When selecting goals, people maximize 'expected value' by selecting actions that maximize potential reward while minimizing associated costs, including effort 'costs' and the opportunity cost of time. In MDD, differential weighing of costs and benefits are theorized mechanisms underlying changes in goal-directed cognition and may contribute to symptom heterogeneity. Methods: We used the Effort Foraging Task to quantify cognitive and physical effort costs, and patch leaving thresholds in low effort conditions (reflecting perceived opportunity cost of time) and investigated their shared versus distinct relationships to clinical features in participants with MDD (N=52, 43 in-episode) and comparisons (N=27). Results: Contrary to our predictions, none of the decision-making measures differed with MDD diagnosis. However, each of the measures were related to symptom severity, over and above effects of ability (i.e., performance). Greater anxiety symptoms were selectively associated with lower cognitive effort cost (i.e. greater willingness to exert effort). Anhedonia and behavioral apathy were associated with increased physical effort costs. Finally, greater overall depression was related to decreased patch leaving thresholds. Conclusions: Markers of effort-based decision-making may inform understanding of MDD heterogeneity. Increased willingness to exert cognitive effort may contribute to anxiety symptoms such as worry. Decreased leaving thresholds associations with symptom severity is consistent with reward rate-based accounts of reduced vigor in MDD. Future research should address subtypes of depression with or without anxiety, which may relate differentially to cognitive effort decisions.

7.

A feature-specific prediction error model explains dopaminergic heterogeneity.

Lee, Rachel S; Sagiv, Yotam; Engelhard, Ben; Witten, Ilana B; Daw, Nathaniel D.

Nat Neurosci ; 27(8): 1574-1586, 2024 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-38961229

RESUMO

The hypothesis that midbrain dopamine (DA) neurons broadcast a reward prediction error (RPE) is among the great successes of computational neuroscience. However, recent results contradict a core aspect of this theory: specifically that the neurons convey a scalar, homogeneous signal. While the predominant family of extensions to the RPE model replicates the classic model in multiple parallel circuits, we argue that these models are ill suited to explain reports of heterogeneity in task variable encoding across DA neurons. Instead, we introduce a complementary 'feature-specific RPE' model, positing that individual ventral tegmental area DA neurons report RPEs for different aspects of an animal's moment-to-moment situation. Further, we show how our framework can be extended to explain patterns of heterogeneity in action responses reported among substantia nigra pars compacta DA neurons. This theory reconciles new observations of DA heterogeneity with classic ideas about RPE coding while also providing a new perspective of how the brain performs reinforcement learning in high-dimensional environments.

Assuntos

Neurônios Dopaminérgicos , Modelos Neurológicos , Recompensa , Área Tegmentar Ventral , Neurônios Dopaminérgicos/fisiologia , Animais , Área Tegmentar Ventral/fisiologia , Dopamina/metabolismo , Humanos , Reforço Psicológico

8.

Magnetic voluntary head-fixation in transgenic rats enables lifespan imaging of hippocampal neurons.

Rich, P Dylan; Thiberge, Stephan Yves; Scott, Benjamin B; Guo, Caiying; Tervo, D Gowanlock R; Brody, Carlos D; Karpova, Alla Y; Daw, Nathaniel D; Tank, David W.

Nat Commun ; 15(1): 4154, 2024 May 16.

Artigo em Inglês | MEDLINE | ID: mdl-38755205

RESUMO

The precise neural mechanisms within the brain that contribute to the remarkable lifetime persistence of memory are not fully understood. Two-photon calcium imaging allows the activity of individual cells to be followed across long periods, but conventional approaches require head-fixation, which limits the type of behavior that can be studied. We present a magnetic voluntary head-fixation system that provides stable optical access to the brain during complex behavior. Compared to previous systems that used mechanical restraint, there are no moving parts and animals can engage and disengage entirely at will. This system is failsafe, easy for animals to use and reliable enough to allow long-term experiments to be routinely performed. Animals completed hundreds of trials per session of an odor discrimination task that required 2-4 s fixations. Together with a reflectance fluorescence collection scheme that increases two-photon signal and a transgenic Thy1-GCaMP6f rat line, we are able to reliably image the cellular activity in the hippocampus during behavior over long periods (median 6 months), allowing us track the same neurons over a large fraction of animals' lives (up to 19 months).

Assuntos

Hipocampo , Neurônios , Ratos Transgênicos , Animais , Hipocampo/citologia , Neurônios/metabolismo , Ratos , Masculino , Cálcio/metabolismo , Cabeça/diagnóstico por imagem , Magnetismo , Odorantes/análise , Feminino

9.

Prioritizing replay when future goals are unknown.

Sagiv, Yotam; Akam, Thomas; Witten, Ilana B; Daw, Nathaniel D.

bioRxiv ; 2024 Mar 04.

Artigo em Inglês | MEDLINE | ID: mdl-38496674

RESUMO

Although hippocampal place cells replay nonlocal trajectories, the computational function of these events remains controversial. One hypothesis, formalized in a prominent reinforcement learning account, holds that replay plans routes to current goals. However, recent puzzling data appear to contradict this perspective by showing that replayed destinations lag current goals. These results may support an alternative hypothesis that replay updates route information to build a "cognitive map." Yet no similar theory exists to formalize this view, and it is unclear how such a map is represented or what role replay plays in computing it. We address these gaps by introducing a theory of replay that learns a map of routes to candidate goals, before reward is available or when its location may change. Our work extends the planning account to capture a general map-building function for replay, reconciling it with data, and revealing an unexpected relationship between the seemingly distinct hypotheses.

10.

Dynamic reinforcement learning reveals time-dependent shifts in strategy during reward learning.

Venditto, Sarah Jo C; Miller, Kevin J; Brody, Carlos D; Daw, Nathaniel D.

bioRxiv ; 2024 Oct 07.

Artigo em Inglês | MEDLINE | ID: mdl-38464244

RESUMO

Different brain systems have been hypothesized to subserve multiple "experts" that compete to generate behavior. In reinforcement learning, two general processes, one model-free (MF) and one model-based (MB), are often modeled as a mixture of agents (MoA) and hypothesized to capture differences between automaticity vs. deliberation. However, shifts in strategy cannot be captured by a static MoA. To investigate such dynamics, we present the mixture-of-agents hidden Markov model (MoA-HMM), which simultaneously learns inferred action values from a set of agents and the temporal dynamics of underlying "hidden" states that capture shifts in agent contributions over time. Applying this model to a multi-step, reward-guided task in rats reveals a progression of within-session strategies: a shift from initial MB exploration to MB exploitation, and finally to reduced engagement. The inferred states predict changes in both response time and OFC neural encoding during the task, suggesting that these states are capturing real shifts in dynamics.

11.

Learning attentional templates for value-based decision-making.

Jahn, Caroline I; Markov, Nikola T; Morea, Britney; Daw, Nathaniel D; Ebitz, R Becket; Buschman, Timothy J.

Cell ; 187(6): 1476-1489.e21, 2024 Mar 14.

Artigo em Inglês | MEDLINE | ID: mdl-38401541

RESUMO

Attention filters sensory inputs to enhance task-relevant information. It is guided by an "attentional template" that represents the stimulus features that are currently relevant. To understand how the brain learns and uses templates, we trained monkeys to perform a visual search task that required them to repeatedly learn new attentional templates. Neural recordings found that templates were represented across the prefrontal and parietal cortex in a structured manner, such that perceptually neighboring templates had similar neural representations. When the task changed, a new attentional template was learned by incrementally shifting the template toward rewarded features. Finally, we found that attentional templates transformed stimulus features into a common value representation that allowed the same decision-making mechanisms to deploy attention, regardless of the identity of the template. Altogether, our results provide insight into the neural mechanisms by which the brain learns to control attention and how attention can be flexibly deployed across tasks.

Assuntos

Atenção , Tomada de Decisões , Aprendizagem , Lobo Parietal , Recompensa , Animais , Haplorrinos

12.

Building compositional tasks with shared neural subspaces.

Tafazoli, Sina; Bouchacourt, Flora M; Ardalan, Adel; Markov, Nikola T; Uchimura, Motoaki; Mattar, Marcelo G; Daw, Nathaniel D; Buschman, Timothy J.

bioRxiv ; 2024 Mar 22.

Artigo em Inglês | MEDLINE | ID: mdl-38352540

RESUMO

Cognition is remarkably flexible; we are able to rapidly learn and perform many different tasks1. Theoretical modeling has shown artificial neural networks trained to perform multiple tasks will re-use representations2 and computational components3 across tasks. By composing tasks from these sub-components, an agent can flexibly switch between tasks and rapidly learn new tasks4. Yet, whether such compositionality is found in the brain is unknown. Here, we show the same subspaces of neural activity represent task-relevant information across multiple tasks, with each task compositionally combining these subspaces in a task-specific manner. We trained monkeys to switch between three compositionally related tasks. Neural recordings found task-relevant information about stimulus features and motor actions were represented in subspaces of neural activity that were shared across tasks. When monkeys performed a task, neural representations in the relevant shared sensory subspace were transformed to the relevant shared motor subspace. Subspaces were flexibly engaged as monkeys discovered the task in effect; their internal belief about the current task predicted the strength of representations in task-relevant subspaces. In sum, our findings suggest that the brain can flexibly perform multiple tasks by compositionally combining task-relevant neural representations across tasks.

13.

An item response theory analysis of the matrix reasoning item bank (MaRs-IB).

Zorowitz, Samuel; Chierchia, Gabriele; Blakemore, Sarah-Jayne; Daw, Nathaniel D.

Behav Res Methods ; 56(3): 1104-1122, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-37020082

RESUMO

Matrix reasoning tasks are among the most widely used measures of cognitive ability in the behavioral sciences, but the lack of matrix reasoning tests in the public domain complicates their use. Here, we present an extensive investigation and psychometric validation of the matrix reasoning item bank (MaRs-IB), an open-access set of matrix reasoning items. In a first study, we calibrate the psychometric functioning of the items in the MaRs-IB in a large sample of adult participants (N = 1501). Using additive multilevel item structure models, we establish that the MaRs-IB has many desirable psychometric properties: its items span a wide range of difficulty, possess medium-to-large levels of discrimination, and exhibit robust associations between item complexity and difficulty. However, we also find that item clones are not always psychometrically equivalent and cannot be assumed to be exchangeable. In a second study, we demonstrate how experimenters can use the estimated item parameters to design new matrix reasoning tests using optimal item assembly. Specifically, we design and validate two new sets of test forms in an independent sample of adults (N = 600). We find these new tests possess good reliability and convergent validity with an established measure of matrix reasoning. We hope that the materials and results made available here will encourage experimenters to use the MaRs-IB in their research.

Assuntos

Cognição , Resolução de Problemas , Adulto , Humanos , Reprodutibilidade dos Testes , Psicometria , Inquéritos e Questionários

14.

Effort Foraging Task reveals positive correlation between individual differences in the cost of cognitive and physical effort in humans.

Bustamante, Laura A; Oshinowo, Temitope; Lee, Jeremy R; Tong, Elizabeth; Burton, Allison R; Shenhav, Amitai; Cohen, Jonathan D; Daw, Nathaniel D.

Proc Natl Acad Sci U S A ; 120(50): e2221510120, 2023 Dec 12.

Artigo em Inglês | MEDLINE | ID: mdl-38064507

RESUMO

Effort-based decisions, in which people weigh potential future rewards against effort costs required to achieve those rewards involve both cognitive and physical effort, though the mechanistic relationship between them is not yet understood. Here, we use an individual differences approach to isolate and measure the computational processes underlying effort-based decisions and test the association between cognitive and physical domains. Patch foraging is an ecologically valid reward rate maximization problem with well-developed theoretical tools. We developed the Effort Foraging Task, which embedded cognitive or physical effort into patch foraging, to quantify the cost of both cognitive and physical effort indirectly, by their effects on foraging choices. Participants chose between harvesting a depleting patch, or traveling to a new patch that was costly in time and effort. Participants' exit thresholds (reflecting the reward they expected to receive by harvesting when they chose to travel to a new patch) were sensitive to cognitive and physical effort demands, allowing us to quantify the perceived effort cost in monetary terms. The indirect sequential choice style revealed effort-seeking behavior in a minority of participants (preferring high over low effort) that has apparently been missed by many previous approaches. Individual differences in cognitive and physical effort costs were positively correlated, suggesting that these are perceived and processed in common. We used canonical correlation analysis to probe the relationship of task measures to self-reported affect and motivation, and found correlations of cognitive effort with anxiety, cognitive function, behavioral activation, and self-efficacy, but no similar correlations with physical effort.

Assuntos

Tomada de Decisões , Esforço Físico , Humanos , Tomada de Decisões/fisiologia , Esforço Físico/fisiologia , Individualidade , Cognição/fisiologia , Recompensa , Motivação

15.

Proactive and reactive construction of memory-based preferences.

Nicholas, Jonathan; Daw, Nathaniel D; Shohamy, Daphna.

bioRxiv ; 2023 Dec 11.

Artigo em Inglês | MEDLINE | ID: mdl-38106137

RESUMO

We are often faced with decisions we have never encountered before, requiring us to infer possible outcomes before making a choice. Computational theories suggest that one way to make these types of decisions is by accessing and linking related experiences stored in memory. Past work has shown that such memory-based preference construction can occur at a number of different timepoints relative to the moment a decision is made. Some studies have found that memories are integrated at the time a decision is faced (reactively) while others found that memory integration happens earlier, when memories are encoded (proactively). Here we offer a resolution to this inconsistency. We demonstrate behavioral and neural evidence for both strategies and for how they tradeoff rationally depending on the associative structure of memory. Using fMRI to decode patterns of brain responses unique to categories of images in memory, we found that proactive memory access is more common and allows more efficient inference. However, participants also use reactive access when choice options are linked to more numerous memory associations. Together, these results indicate that the brain judiciously conducts proactive inference by accessing memories ahead of time in conditions when this strategy is most favorable.

16.

Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning.

Kumar, Sreejan; Dasgupta, Ishita; Daw, Nathaniel D; Cohen, Jonathan D; Griffiths, Thomas L.

PLoS Comput Biol ; 19(8): e1011316, 2023 08.

Artigo em Inglês | MEDLINE | ID: mdl-37624841

RESUMO

The ability to acquire abstract knowledge is a hallmark of human intelligence and is believed by many to be one of the core differences between humans and neural network models. Agents can be endowed with an inductive bias towards abstraction through meta-learning, where they are trained on a distribution of tasks that share some abstract structure that can be learned and applied. However, because neural networks are hard to interpret, it can be difficult to tell whether agents have learned the underlying abstraction, or alternatively statistical patterns that are characteristic of that abstraction. In this work, we compare the performance of humans and agents in a meta-reinforcement learning paradigm in which tasks are generated from abstract rules. We define a novel methodology for building "task metamers" that closely match the statistics of the abstract tasks but use a different underlying generative process, and evaluate performance on both abstract and metamer tasks. We find that humans perform better at abstract tasks than metamer tasks whereas common neural network architectures typically perform worse on the abstract tasks than the matched metamers. This work provides a foundation for characterizing differences between humans and machine learning that can be used in future work towards developing machines with more human-like behavior.

Assuntos

Formação de Conceito , Aprendizado de Máquina , Humanos , Inteligência , Conhecimento , Redes Neurais de Computação

17.

Dual credit assignment processes underlie dopamine signals in a complex spatial environment.

Krausz, Timothy A; Comrie, Alison E; Kahn, Ari E; Frank, Loren M; Daw, Nathaniel D; Berke, Joshua D.

Neuron ; 111(21): 3465-3478.e7, 2023 11 01.

Artigo em Inglês | MEDLINE | ID: mdl-37611585

RESUMO

Animals frequently make decisions based on expectations of future reward ("values"). Values are updated by ongoing experience: places and choices that result in reward are assigned greater value. Yet, the specific algorithms used by the brain for such credit assignment remain unclear. We monitored accumbens dopamine as rats foraged for rewards in a complex, changing environment. We observed brief dopamine pulses both at reward receipt (scaling with prediction error) and at novel path opportunities. Dopamine also ramped up as rats ran toward reward ports, in proportion to the value at each location. By examining the evolution of these dopamine place-value signals, we found evidence for two distinct update processes: progressive propagation of value along taken paths, as in temporal difference learning, and inference of value throughout the maze, using internal models. Our results demonstrate that within rich, naturalistic environments dopamine conveys place values that are updated via multiple, complementary learning algorithms.

Assuntos

Tomada de Decisões , Dopamina , Ratos , Animais , Recompensa , Encéfalo

18.

Consciousness beyond the human case.

LeDoux, Joseph; Birch, Jonathan; Andrews, Kristin; Clayton, Nicola S; Daw, Nathaniel D; Frith, Chris; Lau, Hakwan; Peters, Megan A K; Schneider, Susan; Seth, Anil; Suddendorf, Thomas; Vandekerckhove, Marie M P.

Curr Biol ; 33(16): R832-R840, 2023 08 21.

Artigo em Inglês | MEDLINE | ID: mdl-37607474

RESUMO

There is growing interest in the relationship been AI and consciousness. Joseph LeDoux and Jonathan Birch thought it would be a good moment to put some of the big questions in this area to some leading experts. The challenge of addressing the questions they raised was taken up by Kristin Andrews, Nicky Clayton, Nathaniel Daw, Chris Frith, Hakwan Lau, Megan Peters, Susan Schneider, Anil Seth, Thomas Suddendorf, and Marie Vanderkerckhoeve.

Assuntos

Betula , Estado de Consciência , Humanos

19.

Humans decompose tasks by trading off utility and computational cost.

Correa, Carlos G; Ho, Mark K; Callaway, Frederick; Daw, Nathaniel D; Griffiths, Thomas L.

PLoS Comput Biol ; 19(6): e1011087, 2023 06.

Artigo em Inglês | MEDLINE | ID: mdl-37262023

RESUMO

Human behavior emerges from planning over elaborate decompositions of tasks into goals, subgoals, and low-level actions. How are these decompositions created and used? Here, we propose and evaluate a normative framework for task decomposition based on the simple idea that people decompose tasks to reduce the overall cost of planning while maintaining task performance. Analyzing 11,117 distinct graph-structured planning tasks, we find that our framework justifies several existing heuristics for task decomposition and makes predictions that can be distinguished from two alternative normative accounts. We report a behavioral study of task decomposition (N = 806) that uses 30 randomly sampled graphs, a larger and more diverse set than that of any previous behavioral study on this topic. We find that human responses are more consistent with our framework for task decomposition than alternative normative accounts and are most consistent with a heuristic-betweenness centrality-that is justified by our approach. Taken together, our results suggest the computational cost of planning is a key principle guiding the intelligent structuring of goal-directed behavior.

Assuntos

Heurística , Humanos , Objetivos , Comportamento

20.

Dual credit assignment processes underlie dopamine signals in a complex spatial environment.

Krausz, Timothy A; Comrie, Alison E; Frank, Loren M; Daw, Nathaniel D; Berke, Joshua D.

bioRxiv ; 2023 Mar 19.

Artigo em Inglês | MEDLINE | ID: mdl-36993482

RESUMO

Dopamine in the nucleus accumbens helps motivate behavior based on expectations of future reward ("values"). These values need to be updated by experience: after receiving reward, the choices that led to reward should be assigned greater value. There are multiple theoretical proposals for how this credit assignment could be achieved, but the specific algorithms that generate updated dopamine signals remain uncertain. We monitored accumbens dopamine as freely behaving rats foraged for rewards in a complex, changing environment. We observed brief pulses of dopamine both when rats received reward (scaling with prediction error), and when they encountered novel path opportunities. Furthermore, dopamine ramped up as rats ran towards reward ports, in proportion to the value at each location. By examining the evolution of these dopamine place-value signals, we found evidence for two distinct update processes: progressive propagation along taken paths, as in temporal-difference learning, and inference of value throughout the maze, using internal models. Our results demonstrate that within rich, naturalistic environments dopamine conveys place values that are updated via multiple, complementary learning algorithms.

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA