Your browser doesn't support javascript.
loading
Reinforcement Learning With Vision-Proprioception Model for Robot Planar Pushing.
Cong, Lin; Liang, Hongzhuo; Ruppel, Philipp; Shi, Yunlei; Görner, Michael; Hendrich, Norman; Zhang, Jianwei.
Afiliación
  • Cong L; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Liang H; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Ruppel P; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Shi Y; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Görner M; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Hendrich N; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
  • Zhang J; TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.
Front Neurorobot ; 16: 829437, 2022.
Article en En | MEDLINE | ID: mdl-35308311
ABSTRACT
We propose a vision-proprioception model for planar object pushing, efficiently integrating all necessary information from the environment. A Variational Autoencoder (VAE) is used to extract compact representations from the task-relevant part of the image. With the real-time robot state obtained easily from the hardware system, we fuse the latent representations from the VAE and the robot end-effector position together as the state of a Markov Decision Process. We use Soft Actor-Critic to train the robot to push different objects from random initial poses to target positions in simulation. Hindsight Experience replay is applied during the training process to improve the sample efficiency. Experiments demonstrate that our algorithm achieves a pushing performance superior to a state-based baseline model that cannot be generalized to a different object and outperforms state-of-the-art policies which operate on raw image observations. At last, we verify that our trained model has a good generalization ability to unseen objects in the real world.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Front Neurorobot Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Front Neurorobot Año: 2022 Tipo del documento: Article País de afiliación: Alemania
...