Integrating Reward Information for Prospective Behavior.
Hall-McMaster, Sam; Stokes, Mark G; Myers, Nicholas E.
Affiliation
  • Hall-McMaster S; Department of Experimental Psychology, University of Oxford, United Kingdom, OX2 6GG hall-mcmaster@mpib-berlin.mpg.de.
  • Stokes MG; Wellcome Centre for Integrative Neuroimaging, University of Oxford, United Kingdom, OX3 9DU.
  • Myers NE; Department of Experimental Psychology, University of Oxford, United Kingdom, OX2 6GG.
J Neurosci; 42(9): 1804-1819, 2022 03 02.
Article in English | MEDLINE | ID: mdl-35042770
Value-based decision-making is often studied in a static context, where participants decide which option to select from those currently available. However, everyday life often involves an additional dimension: deciding when to select to maximize reward. Recent evidence suggests that agents track the latent reward of an option, updating changes in their latent reward estimate, to achieve appropriate selection timing (latent reward tracking). However, this strategy can be difficult to distinguish from one in which the optimal selection time is estimated in advance, allowing an agent to wait a predetermined amount of time before selecting, without needing to monitor an option's latent reward (distance-to-goal tracking). Here, we show that these strategies can in principle be dissociated. Human brain activity was recorded using electroencephalography (EEG), while female and male participants performed a novel decision task. Participants were shown an option and decided when to select it, as its latent reward changed from trial-to-trial. While the latent reward was uncued, it could be estimated using cued information about the option's starting value and value growth rate. We then used representational similarity analysis (RSA) to assess whether EEG signals more closely resembled latent reward tracking or distance-to-goal tracking. This approach successfully dissociated the strategies in this task. Starting value and growth rate were translated into a distance-to-goal signal, far in advance of selecting the option. Latent reward could not be independently decoded. These results demonstrate the feasibility of using high temporal resolution neural recordings to identify internally computed decision variables in the human brain.

SIGNIFICANCE STATEMENT: Reward-seeking behavior involves acting at the right time. However, the external world does not always tell us when an action is most rewarding, necessitating internal representations that guide action timing. Specifying this internal neural representation is challenging because it might stem from a variety of strategies, many of which make similar predictions about brain activity. This study used a novel approach to test whether alternative strategies could be dissociated in principle. Using representational similarity analysis (RSA), we were able to distinguish between candidate internal representations for selection timing. This shows how pattern analysis methods can be used to measure latent decision information in noninvasive neural data.
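
As an illustration of the general RSA logic described above, the following is a minimal sketch on simulated data, not the authors' analysis pipeline. Everything specific in it is an assumption made for the example: the nine hypothetical conditions, linear reward growth (`latent = start + rate * t`), the fixed `goal` threshold, the definition of distance-to-goal as the time remaining until the threshold is reached, and the choice of Euclidean distance and ordinary least-squares regression to compare representational dissimilarity matrices (RDMs).

```python
import numpy as np

def model_rdm(values):
    """Model RDM: pairwise absolute differences between condition values."""
    v = np.asarray(values, dtype=float)
    return np.abs(v[:, None] - v[None, :])

def neural_rdm(patterns):
    """Neural RDM: Euclidean distance between condition patterns
    (patterns has shape conditions x channels)."""
    diff = patterns[:, None, :] - patterns[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def upper(rdm):
    """Vectorise the upper triangle of an RDM, excluding the diagonal."""
    i, j = np.triu_indices(rdm.shape[0], k=1)
    return rdm[i, j]

def rsa_betas(neural, models):
    """Regress the z-scored neural RDM on all z-scored model RDMs at once,
    so each beta reflects variance not explained by the other models."""
    def z(x):
        return (x - x.mean()) / x.std()
    y = z(upper(neural))
    X = np.column_stack([np.ones(y.size)] + [z(upper(m)) for m in models])
    coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coefs[1:]  # drop the intercept

# Hypothetical design: 3 starting values x 3 growth rates (assumed numbers).
starts = np.array([10, 10, 10, 30, 30, 30, 50, 50, 50], dtype=float)
rates = np.array([1, 2, 4, 1, 2, 4, 1, 2, 4], dtype=float)
t, goal = 5.0, 100.0  # assumed time point and reward threshold

latent = starts + rates * t            # assumed linear latent reward
distance = (goal - starts) / rates - t  # assumed time left until the goal

# Simulate 32-channel patterns that encode distance-to-goal plus noise.
rng = np.random.default_rng(0)
weights = rng.normal(size=32)
patterns = np.outer(distance, weights) + rng.normal(size=(9, 32))

betas = rsa_betas(neural_rdm(patterns), [model_rdm(latent), model_rdm(distance)])
print("beta latent reward:   ", betas[0])
print("beta distance-to-goal:", betas[1])
```

Because the simulated patterns are built from the distance-to-goal variable, the second beta should dominate. Entering both model RDMs in one regression is one way to ask whether a variable explains neural dissimilarity independently of the other, which is the kind of question the abstract raises about latent reward.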

Full text: 1 | Database: MEDLINE | Main subject: Reward / Cues | Study type: Observational_studies / Prognostic_studies / Risk_factors_studies | Limits: Female / Humans / Male | Language: En | Journal: J Neurosci | Publication year: 2022 | Document type: Article
