Value-free reinforcement learning: policy optimization as a minimal model of operant behavior.
Curr Opin Behav Sci
; 41: 114-121, 2021 Oct.
Article
in En
| MEDLINE
| ID: mdl-36341023
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Type of study:
Prognostic_studies
Language:
En
Journal:
Curr Opin Behav Sci
Year:
2021
Document type:
Article
Affiliation country:
United States
Country of publication:
Netherlands