Curiosity model policy optimization for robotic manipulator tracking control with input saturation in uncertain environment.

Wang, Tu; Wang, Fujie; Xie, Zhongye; Qin, Feiyan

Wang, Tu; Wang, Fujie; Xie, Zhongye; Qin, Feiyan.

Afiliación

Wang T; College of Computer Science and Technology, Dongguan University of Technology, Dongguan, China.
Wang F; College of Outstanding Engineers, Dongguan University of Technology, Dongguan, China.
Xie Z; College of Outstanding Engineers, Dongguan University of Technology, Dongguan, China.
Qin F; College of Outstanding Engineers, Dongguan University of Technology, Dongguan, China.

Front Neurorobot ; 18: 1376215, 2024.

Article en En | MEDLINE | ID: mdl-38989482

ABSTRACT

ABSTRACT

In uncertain environments with robot input saturation, both model-based reinforcement learning (MBRL) and traditional controllers struggle to perform control tasks optimally. In this study, an algorithmic framework of Curiosity Model Policy Optimization (CMPO) is proposed by combining curiosity and model-based approach, where tracking errors are reduced via training agents on control gains for traditional model-free controllers. To begin with, a metric for judging positive and negative curiosity is proposed. Constrained optimization is employed to update the curiosity ratio, which improves the efficiency of agent training. Next, the novelty distance buffer ratio is defined to reduce bias between the environment and the model. Finally, CMPO is simulated with traditional controllers and baseline MBRL algorithms in the robotic environment designed with non-linear rewards. The experimental results illustrate that the algorithm achieves superior tracking performance and generalization capabilities.

Palabras clave

buffer schedule; input saturation; intrinsic motivation; model-based reinforcement learning; robotic manipulator; uncertain environment

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Front Neurorobot Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google