Your browser doesn't support javascript.
loading
Emergence of cooperation under punishment: A reinforcement learning perspective.
Zhao, Chenyang; Zheng, Guozhong; Zhang, Chun; Zhang, Jiqiang; Chen, Li.
Afiliación
  • Zhao C; School of Physics and Information Technology, Shaanxi Normal University, Xi'an 710061, People's Republic of China.
  • Zheng G; School of Physics and Information Technology, Shaanxi Normal University, Xi'an 710061, People's Republic of China.
  • Zhang C; School of Science, Xi'an Shiyou University, Xi'an 710065, People's Republic of China.
  • Zhang J; School of Physics, Ningxia University, Yinchuan 750021, People's Republic of China.
  • Chen L; School of Physics and Information Technology, Shaanxi Normal University, Xi'an 710061, People's Republic of China.
Chaos ; 34(7)2024 Jul 01.
Article en En | MEDLINE | ID: mdl-38985966
ABSTRACT
Punishment is a common tactic to sustain cooperation and has been extensively studied for a long time. While most of previous game-theoretic work adopt the imitation learning framework where players imitate the strategies of those who are better off, the learning logic in the real world is often much more complex. In this work, we turn to the reinforcement learning paradigm, where individuals make their decisions based upon their experience and long-term returns. Specifically, we investigate the prisoners' dilemma game with a Q-learning algorithm, and cooperators probabilistically pose punishment on defectors in their neighborhood. Unexpectedly, we find that punishment could lead to either continuous or discontinuous cooperation phase transitions, and the nucleation process of cooperation clusters is reminiscent of the liquid-gas transition. The analysis of a Q-table reveals the evolution of the underlying "psychologic" changes, which explains the nucleation process and different levels of cooperation. The uncovered first-order phase transition indicates that great care needs to be taken when implementing the punishment compared to the continuous scenario.

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: Chaos Asunto de la revista: CIENCIA Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: Chaos Asunto de la revista: CIENCIA Año: 2024 Tipo del documento: Article