Continual Reinforcement Learning for Quadruped Robot Locomotion.

Gai, Sibo; Lyu, Shangke; Zhang, Hongyin; Wang, Donglin

Gai, Sibo; Lyu, Shangke; Zhang, Hongyin; Wang, Donglin.

Afiliación

Gai S; School of Computer Science, Fudan University, Shanghai 200433, China.
Lyu S; School of Engineer, Westlake Univercity, Hangzhou 310030, China.
Zhang H; School of Engineer, Westlake Univercity, Hangzhou 310030, China.
Wang D; School of Engineer, Westlake Univercity, Hangzhou 310030, China.

Entropy (Basel) ; 26(1)2024 Jan 22.

Article en En | MEDLINE | ID: mdl-38275501

ABSTRACT

ABSTRACT

The ability to learn continuously is crucial for a robot to achieve a high level of intelligence and autonomy. In this paper, we consider continual reinforcement learning (RL) for quadruped robots, which includes the ability to continuously learn sub-sequential tasks (plasticity) and maintain performance on previous tasks (stability). The policy obtained by the proposed method enables robots to learn multiple tasks sequentially, while overcoming both catastrophic forgetting and loss of plasticity. At the same time, it achieves the above goals with as little modification to the original RL learning process as possible. The proposed method uses the Piggyback algorithm to select protected parameters for each task, and reinitializes the unused parameters to increase plasticity. Meanwhile, we encourage the policy network exploring by encouraging the entropy of the soft network of the policy network. Our experiments show that traditional continual learning algorithms cannot perform well on robot locomotion problems, and our algorithm is more stable and less disruptive to the RL training progress. Several robot locomotion experiments validate the effectiveness of our method.

Palabras clave

continual learning; entropy; plasticity; quadruped robot locomotion; reinforcement learning

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Entropy (Basel) Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google