Improved Robot Path Planning Method Based on Deep Reinforcement Learning.
Han, Huiyan; Wang, Jiaqi; Kuang, Liqun; Han, Xie; Xue, Hongxin.
Affiliation
  • Han H; School of Computer Science and Technology, North University of China, Taiyuan 030051, China.
  • Wang J; Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China.
  • Kuang L; Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China.
  • Han X; School of Computer Science and Technology, North University of China, Taiyuan 030051, China.
  • Xue H; Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China.
Sensors (Basel) ; 23(12)2023 Jun 15.
Article in English | MEDLINE | ID: mdl-37420785
With the advancement of robotics, the field of path planning is currently experiencing a period of prosperity. Researchers strive to address this nonlinear problem and have achieved remarkable results by applying the Deep Reinforcement Learning (DRL) algorithm DQN (Deep Q-Network). However, persistent challenges remain, including the curse of dimensionality, difficulty of model convergence, and sparse rewards. To tackle these problems, this paper proposes an enhanced DDQN (Double DQN) path planning approach, in which dimensionality-reduced information is fed into a two-branch network that incorporates expert knowledge, and an optimized reward function guides the training process. The data generated during the training phase are first discretized into corresponding low-dimensional spaces. An "expert experience" module is introduced into the Epsilon-Greedy algorithm to accelerate the model's early-stage training. To handle navigation and obstacle avoidance separately, a dual-branch network structure is presented. We further optimize the reward function so that the agent receives prompt feedback from the environment after each action. Experiments conducted in both virtual and real-world environments demonstrate that the enhanced algorithm accelerates model convergence, improves training stability, and generates a smooth, shorter, collision-free path.
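As a rough illustration of the mechanisms named in the abstract (not the authors' implementation, which is not reproduced in this record), the Python sketch below shows the standard Double DQN target computation and an epsilon-greedy action selector that can defer to an expert-suggested action early in training; the expert_action and expert_prob parameters are illustrative assumptions about how an "expert experience" module might bias exploration.

import random
import numpy as np

def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN target: the online network selects the next action,
    the target network evaluates it, reducing Q-value overestimation."""
    if done:
        return reward
    best_action = int(np.argmax(next_q_online))          # selection by online net
    return reward + gamma * next_q_target[best_action]   # evaluation by target net

def select_action(q_values, epsilon, expert_action=None, expert_prob=0.0):
    """Epsilon-greedy selection; with probability expert_prob an exploratory
    move is replaced by a (hypothetical) expert-suggested action, one plausible
    reading of the paper's 'expert experience' module."""
    if random.random() < epsilon:
        if expert_action is not None and random.random() < expert_prob:
            return expert_action
        return random.randrange(len(q_values))
    return int(np.argmax(q_values))

# Toy usage: Q-value vectors standing in for network outputs over 4 discrete moves.
q_online_next = np.array([0.1, 0.5, 0.3, 0.2])
q_target_next = np.array([0.2, 0.4, 0.6, 0.1])
y = double_dqn_target(reward=-0.05, next_q_online=q_online_next,
                      next_q_target=q_target_next, gamma=0.99)
a = select_action(np.array([0.3, 0.9, 0.1, 0.4]), epsilon=0.8,
                  expert_action=1, expert_prob=0.5)
print(f"target={y:.3f}, action={a}")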
Full text: 1 Database: MEDLINE Main subject: Robotics Language: English Journal: Sensors (Basel) Year: 2023 Document type: Article Country of affiliation: China