Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Nat Rev Neurosci ; 21(10): 576-586, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32873936

RESUMEN

Reinforcement learning (RL) is a framework of particular importance to psychology, neuroscience and machine learning. Interactions between these fields, as promoted through the common hub of RL, has facilitated paradigm shifts that relate multiple levels of analysis in a singular framework (for example, relating dopamine function to a computationally defined RL signal). Recently, more sophisticated RL algorithms have been proposed to better account for human learning, and in particular its oft-documented reliance on two separable systems: a model-based (MB) system and a model-free (MF) system. However, along with many benefits, this dichotomous lens can distort questions, and may contribute to an unnecessarily narrow perspective on learning and decision-making. Here, we outline some of the consequences that come from overconfidently mapping algorithms, such as MB versus MF RL, with putative cognitive processes. We argue that the field is well positioned to move beyond simplistic dichotomies, and we propose a means of refocusing research questions towards the rich and complex components that comprise learning and decision-making.


Asunto(s)
Encéfalo/fisiología , Toma de Decisiones/fisiología , Modelos Neurológicos , Refuerzo en Psicología , Algoritmos , Animales , Dopamina/fisiología , Humanos , Memoria/fisiología , Recompensa
2.
Annu Rev Psychol ; 68: 73-100, 2017 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-27687119

RESUMEN

In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.


Asunto(s)
Toma de Decisiones/fisiología , Aprendizaje/fisiología , Recompensa , Encéfalo/fisiología , Condicionamiento Psicológico/fisiología , Objetivos , Humanos , Neurociencias , Refuerzo en Psicología
3.
Cogn Affect Behav Neurosci ; 14(2): 715-28, 2014 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-24481852

RESUMEN

Patients with schizophrenia (SZ) show cognitive impairments on a wide range of tasks, with clear deficiencies in tasks reliant on prefrontal cortex function and less consistently observed impairments in tasks recruiting the striatum. This study leverages tasks hypothesized to differentially recruit these neural structures to assess relative deficiencies of each. Forty-eight patients and 38 controls completed two reinforcement learning tasks hypothesized to interrogate prefrontal and striatal functions and their interaction. In each task, participants learned reward discriminations by trial and error and were tested on novel stimulus combinations to assess learned values. In the task putatively assessing fronto-striatal interaction, participants were (inaccurately) instructed that one of the stimuli was valuable. Consistent with prior reports and a model of confirmation bias, this manipulation resulted in overvaluation of the instructed stimulus after its true value had been experienced. Patients showed less susceptibility to this confirmation bias effect than did controls. In the choice bias task hypothesized to more purely assess striatal function, biases in endogenously and exogenously chosen actions were assessed. No group differences were observed. In the subset of participants who showed learning in both tasks, larger group differences were observed in the confirmation bias task than in the choice bias task. In the confirmation bias task, patients also showed impairment in the task conditions with no prior instruction. This deficit was most readily observed on the most deterministic discriminations. Taken together, these results suggest impairments in fronto-striatal interaction in SZ, rather than in striatal function per se.


Asunto(s)
Sesgo , Trastornos del Conocimiento/etiología , Toma de Decisiones/fisiología , Aprendizaje por Probabilidad , Refuerzo en Psicología , Esquizofrenia/complicaciones , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Pruebas Neuropsicológicas , Escalas de Valoración Psiquiátrica , Estadísticas no Paramétricas , Adulto Joven
4.
Nat Commun ; 15(1): 4436, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38789415

RESUMEN

To navigate our complex social world, it is crucial to deploy multiple learning strategies, such as learning from directly experiencing action outcomes or from observing other people's behavior. Despite the prevalence of experiential and observational learning in humans and other social animals, it remains unclear how people favor one strategy over the other depending on the environment, and how individuals vary in their strategy use. Here, we describe an arbitration mechanism in which the prediction errors associated with each learning strategy influence their weight over behavior. We designed an online behavioral task to test our computational model, and found that while a substantial proportion of participants relied on the proposed arbitration mechanism, there was some meaningful heterogeneity in how people solved this task. Four other groups were identified: those who used a fixed mixture between the two strategies, those who relied on a single strategy and non-learners with irrelevant strategies. Furthermore, groups were found to differ on key behavioral signatures, and on transdiagnostic symptom dimensions, in particular autism traits and anxiety. Together, these results demonstrate how large heterogeneous datasets and computational methods can be leveraged to better characterize individual differences.


Asunto(s)
Aprendizaje , Humanos , Femenino , Masculino , Aprendizaje/fisiología , Adulto , Adulto Joven , Ansiedad/psicología , Adolescente , Negociación , Simulación por Computador
5.
Nat Commun ; 15(1): 2162, 2024 Mar 09.
Artículo en Inglés | MEDLINE | ID: mdl-38461343

RESUMEN

The value and uncertainty associated with choice alternatives constitute critical features relevant for decisions. However, the manner in which reward and risk representations are temporally organized in the brain remains elusive. Here we leverage the spatiotemporal precision of intracranial electroencephalography, along with a simple card game designed to elicit the unfolding computation of a set of reward and risk variables, to uncover this temporal organization. Reward outcome representations across wide-spread regions follow a sequential order along the anteroposterior axis of the brain. In contrast, expected value can be decoded from multiple regions at the same time, and error signals in both reward and risk domains reflect a mixture of sequential and parallel encoding. We further highlight the role of the anterior insula in generalizing between reward prediction error and risk prediction error codes. Together our results emphasize the importance of neural dynamics for understanding value-based decisions under uncertainty.


Asunto(s)
Encéfalo , Recompensa , Humanos , Encéfalo/diagnóstico por imagen
6.
Nat Hum Behav ; 7(6): 970-985, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-36959327

RESUMEN

Adaptive behaviour in real-world environments requires that choices integrate several variables, including the novelty of the options under consideration, their expected value and uncertainty in value estimation. Here, to probe how integration over decision variables occurs during decision-making, we recorded neurons from the human pre-supplementary motor area (preSMA), ventromedial prefrontal cortex and dorsal anterior cingulate. Unlike the other areas, preSMA neurons not only represented separate pre-decision variables for each choice option but also encoded an integrated utility signal for each choice option and, subsequently, the decision itself. Post-decision encoding of variables for the chosen option was more widely distributed and especially prominent in the ventromedial prefrontal cortex. Our findings position the human preSMA as central to the implementation of value-based decisions.


Asunto(s)
Conducta de Elección , Corteza Motora , Humanos , Conducta de Elección/fisiología , Corteza Prefrontal/fisiología , Giro del Cíngulo/fisiología , Neuronas/fisiología
7.
Elife ; 122023 08 16.
Artículo en Inglés | MEDLINE | ID: mdl-37585251

RESUMEN

Across the lifespan, individuals frequently choose between exploiting known rewarding options or exploring unknown alternatives. A large body of work has suggested that children may explore more than adults. However, because novelty and reward uncertainty are often correlated, it is unclear how they differentially influence decision-making across development. Here, children, adolescents, and adults (ages 8-27 years, N = 122) completed an adapted version of a recently developed value-guided decision-making task that decouples novelty and uncertainty. In line with prior studies, we found that exploration decreased with increasing age. Critically, participants of all ages demonstrated a similar bias to select choice options with greater novelty, whereas aversion to reward uncertainty increased into adulthood. Computational modeling of participant choices revealed that whereas adolescents and adults demonstrated attenuated uncertainty aversion for more novel choice options, children's choices were not influenced by reward uncertainty.


Asunto(s)
Conducta Exploratoria , Recompensa , Adolescente , Adulto , Niño , Humanos , Simulación por Computador , Toma de Decisiones , Incertidumbre , Adulto Joven
8.
bioRxiv ; 2023 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-37214975

RESUMEN

The value and uncertainty associated with choice alternatives constitute critical features along which decisions are made. While the neural substrates supporting reward and risk processing have been investigated, the temporal organization by which these computations are encoded remains elusive. Here we leverage the high spatiotemporal precision of intracranial electroencephalography (iEEG) to uncover how representations of decision-related computations unfold in time. We present evidence of locally distributed representations of reward and risk variables that are temporally organized across multiple regions of interest. Reward outcome representations across wide-spread regions follow a temporally cascading order along the anteroposterior axis of the brain. In contrast, expected value can be decoded from multiple regions at the same time, and error signals in both reward and risk domains reflect a mixture of sequential and parallel encoding. We highlight the role of the anterior insula in generalizing between reward prediction error (RePE) and risk prediction error (RiPE), within which the encoding of RePE in the distributed iEEG signal predicts RiPE. Together our results emphasize the utility of uncovering temporal dynamics in the human brain for understanding how computational processes critical for value-based decisions under uncertainty unfold.

9.
J Child Psychol Psychiatry ; 53(12): 1259-67, 2012 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-22780332

RESUMEN

BACKGROUND: Although impaired social-emotional ability is a hallmark of autism spectrum disorder (ASD), the perceptual skills and mediating strategies contributing to the social deficits of autism are not well understood. A perceptual skill that is fundamental to effective social communication is the ability to accurately perceive and interpret facial emotions. To evaluate the expression processing of participants with ASD, we designed the Let's Face It! Emotion Skills Battery (LFI! Battery), a computer-based assessment composed of three subscales measuring verbal and perceptual skills implicated in the recognition of facial emotions. METHODS: We administered the LFI! Battery to groups of participants with ASD and typically developing control (TDC) participants that were matched for age and IQ. RESULTS: On the Name Game labeling task, participants with ASD (N = 68) performed on par with TDC individuals (N = 66) in their ability to name the facial emotions of happy, sad, disgust and surprise and were only impaired in their ability to identify the angry expression. On the Matchmaker Expression task that measures the recognition of facial emotions across different facial identities, the ASD participants (N = 66) performed reliably worse than TDC participants (N = 67) on the emotions of happy, sad, disgust, frighten and angry. In the Parts-Wholes test of perceptual strategies of expression, the TDC participants (N = 67) displayed more holistic encoding for the eyes than the mouths in expressive faces whereas ASD participants (N = 66) exhibited the reverse pattern of holistic recognition for the mouth and analytic recognition of the eyes. CONCLUSION: In summary, findings from the LFI! Battery show that participants with ASD were able to label the basic facial emotions (with the exception of angry expression) on par with age- and IQ-matched TDC participants. However, participants with ASD were impaired in their ability to generalize facial emotions across different identities and showed a tendency to recognize the mouth feature holistically and the eyes as isolated parts.


Asunto(s)
Trastornos Generalizados del Desarrollo Infantil/psicología , Emociones , Expresión Facial , Pruebas Neuropsicológicas/estadística & datos numéricos , Reconocimiento en Psicología , Percepción Visual , Adolescente , Adulto , Análisis de Varianza , Niño , Preescolar , Femenino , Humanos , Masculino , Adulto Joven
10.
Neuron ; 110(16): 2691-2702.e8, 2022 08 17.
Artículo en Inglés | MEDLINE | ID: mdl-35809575

RESUMEN

Both novelty and uncertainty are potent features guiding exploration; however, they are often experimentally conflated, and an understanding of how they interact to regulate the balance between exploration and exploitation has proved elusive. Using a task designed to decouple the influence of novelty and uncertainty, we identify separable mechanisms through which exploration is directed. We show that uncertainty-directed exploration is sensitive to the prospective benefit offered by new information, whereas novelty-directed exploration is maintained regardless of its potential advantage. Using a computational framework in conjunction with fMRI, we show that uncertainty-directed choice is rooted in an adaptive bias indexing the prospective utility of exploration. In contrast, novelty persistently promotes exploration by optimistically inflating reward expectations while simultaneously dampening uncertainty signals. Our results identify separable neural substrates charged with balancing the explore/exploit trade-off to foster a manageable decomposition of an otherwise intractable problem.


Asunto(s)
Conducta Exploratoria , Recompensa , Encéfalo/diagnóstico por imagen , Toma de Decisiones , Conducta Exploratoria/fisiología , Cabeza , Humanos , Incertidumbre
11.
J Child Psychol Psychiatry ; 51(8): 944-52, 2010 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-20646129

RESUMEN

BACKGROUND: An emerging body of evidence indicates that relative to typically developing children, children with autism are selectively impaired in their ability to recognize facial identity. A critical question is whether face recognition skills can be enhanced through a direct training intervention. METHODS: In a randomized clinical trial, children diagnosed with autism spectrum disorder were pre-screened with a battery of subtests (the Let's Face It! Skills battery) examining face and object processing abilities. Participants who were significantly impaired in their face processing abilities were assigned to either a treatment or a waitlist group. Children in the treatment group (N = 42) received 20 hours of face training with the Let's Face It! (LFI!) computer-based intervention. The LFI! program is comprised of seven interactive computer games that target the specific face impairments associated with autism, including the recognition of identity across image changes in expression, viewpoint and features, analytic and holistic face processing strategies and attention to information in the eye region. Time 1 and Time 2 performance for the treatment and waitlist groups was assessed with the Let's Face It! Skills battery. RESULTS: The main finding was that relative to the control group (N = 37), children in the face training group demonstrated reliable improvements in their analytic recognition of mouth features and holistic recognition of a face based on its eyes features. CONCLUSION: These results indicate that a relatively short-term intervention program can produce measurable improvements in the face recognition skills of children with autism. As a treatment for face processing deficits, the Let's Face It! program has advantages of being cost-free, adaptable to the specific learning needs of the individual child and suitable for home and school applications.


Asunto(s)
Síndrome de Asperger/terapia , Trastornos Generalizados del Desarrollo Infantil/terapia , Cara , Reconocimiento Visual de Modelos , Terapia Asistida por Computador , Juegos de Video , Atención , Niño , Aprendizaje Discriminativo , Expresión Facial , Femenino , Estudios de Seguimiento , Humanos , Masculino , Memoria a Corto Plazo , Enmascaramiento Perceptual , Retención en Psicología
12.
Int J Psychophysiol ; 132(Pt B): 243-251, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-29208491

RESUMEN

The reward positivity is a component of the event-related brain potential (ERP) sensitive to neural mechanisms of reward processing. Multiple studies have demonstrated that reward positivity amplitude indices a reward prediction error signal that is fundamental to theories of reinforcement learning. However, whether this ERP component is also sensitive to richer forms of performance information important for supervised learning is less clear. To investigate this question, we recorded the electroencephalogram from participants engaged in a time estimation task in which the type of error information conveyed by feedback stimuli was systematically varied across conditions. Consistent with our predictions, we found that reward positivity amplitude decreased in relation to increasing information content of the feedback, and that reward positivity amplitude was unrelated to trial-to-trial behavioral adjustments in task performance. By contrast, a series of exploratory analyses revealed frontal-central and posterior ERP components immediately following the reward positivity that related to these processes. Taken in the context of the wider literature, these results suggest that the reward positivity is produced by a neural mechanism that motivates task performance, whereas the later ERP components apply the feedback information according to principles of supervised learning.


Asunto(s)
Corteza Cerebral/fisiología , Potenciales Evocados/fisiología , Retroalimentación Psicológica/fisiología , Desempeño Psicomotor/fisiología , Refuerzo en Psicología , Adulto , Electroencefalografía , Humanos , Recompensa , Adulto Joven
13.
Neuron ; 83(3): 551-7, 2014 Aug 06.
Artículo en Inglés | MEDLINE | ID: mdl-25066083

RESUMEN

Humans exhibit a preference for options they have freely chosen over equally valued options they have not; however, the neural mechanism that drives this bias and its functional significance have yet to be identified. Here, we propose a model in which choice biases arise due to amplified positive reward prediction errors associated with free choice. Using a novel variant of a probabilistic learning task, we show that choice biases are selective to options that are predominantly associated with positive outcomes. A polymorphism in DARPP-32, a gene linked to dopaminergic striatal plasticity and individual differences in reinforcement learning, was found to predict the effect of choice as a function of value. We propose that these choice biases are the behavioral byproduct of a credit assignment mechanism responsible for ensuring the effective delivery of dopaminergic reinforcement learning signals broadcast to the striatum.


Asunto(s)
Conducta de Elección/fisiología , Aprendizaje/fisiología , Refuerzo en Psicología , Cognición/fisiología , Dopamina/metabolismo , Humanos , Individualidad , Modelos Psicológicos , Desempeño Psicomotor/fisiología , Recompensa
14.
Brain Res ; 1365: 18-34, 2010 Dec 13.
Artículo en Inglés | MEDLINE | ID: mdl-20875804

RESUMEN

A number of hypotheses have suggested that the principal neurological dysfunction responsible for the behavioural symptoms associated with Attention-Deficit/Hyperactive Disorder (ADHD) is likely rooted in abnormal phasic signals coded by the firing rate of midbrain dopamine neurons. We present a formal investigation of the impact atypical phasic dopamine signals have on behaviour by applying a TD(λ) reinforcement learning model to simulations of operant conditioning tasks that have been argued to quantify the hyperactive, inattentive and impulsive behaviour associated with ADHD. The results presented here suggest that asymmetrically effective dopamine signals encoded by a punctate increase or decrease in dopamine levels provide the best account for the behaviour of children with ADHD as well as an animal model of ADHD, the spontaneously hypertensive rat (SHR). The biological sources of this asymmetry are considered, as are other computational models of ADHD.


Asunto(s)
Trastorno por Déficit de Atención con Hiperactividad/fisiopatología , Trastorno por Déficit de Atención con Hiperactividad/psicología , Simulación por Computador , Modelos Psicológicos , Recompensa , Animales , Trastorno por Déficit de Atención con Hiperactividad/diagnóstico , Niño , Modelos Animales de Enfermedad , Dopamina/fisiología , Humanos , Vías Nerviosas/fisiología , Desempeño Psicomotor/fisiología , Ratas , Ratas Endogámicas SHR , Ratas Endogámicas WKY
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA