Your browser doesn't support javascript.
loading
Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task.
Horvath, Lilla; Colcombe, Stanley; Milham, Michael; Ray, Shruti; Schwartenbeck, Philipp; Ostwald, Dirk.
Afiliación
  • Horvath L; Computational Cognitive Neuroscience, Freie Universität Berlin, Berlin, Germany.
  • Colcombe S; Center for Biomedical Imaging and Neuromodulation, Nathan Kline Institute, Orangeburg, NY USA.
  • Milham M; Center for Biomedical Imaging and Neuromodulation, Nathan Kline Institute, Orangeburg, NY USA.
  • Ray S; Department of Biomedical Engineering, New Jersey Institute of Technology, Newark, NJ USA.
  • Schwartenbeck P; Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany.
  • Ostwald D; Institute of Psychology, Otto-von-Guericke Universität Magdeburg, Magdeburg, Germany.
Comput Brain Behav ; 4(4): 442-462, 2021.
Article en En | MEDLINE | ID: mdl-34368622
ABSTRACT
Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and obtained choice data on this task from 24 participants. To arbitrate between different decision-making strategies that participants may use on this task, we developed a set of probabilistic agent-based behavioral models, including exploitative and explorative Bayesian agents, as well as heuristic control agents. Upon validating the model and parameter recovery properties of our model set and summarizing the participants' choice data in a descriptive way, we used a maximum likelihood approach to evaluate the participants' choice data from the perspective of our model set. In brief, we provide quantitative evidence that participants employ a belief state-based hybrid explorative-exploitative strategy on the information-selective symmetric reversal bandit task, lending further support to the finding that humans are guided by their subjective uncertainty when solving exploration-exploitation dilemmas. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s42113-021-00112-3.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Comput Brain Behav Año: 2021 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Comput Brain Behav Año: 2021 Tipo del documento: Article País de afiliación: Alemania