Search | BVS España Search Portal

Exploration-Based Planning for Multiple-Target Search with Real-Drone Results.

Yousuf, Bilal; Lendek, Zsófia; Busoniu, Lucian.

Sensors (Basel); 24(9), 2024 Apr 30.

Article in English | MEDLINE | ID: mdl-38732973

ABSTRACT

Consider a drone that aims to find an unknown number of static targets at unknown positions as quickly as possible. A multi-target particle filter uses imperfect measurements of the target positions to update an intensity function that represents the expected number of targets. We propose a novel receding-horizon planner that selects the next position of the drone by maximizing an objective that combines exploration and target refinement. Confidently localized targets are saved and removed from consideration along with their future measurements. A controller with an obstacle-avoidance component is used to reach the desired waypoints. We demonstrate the performance of our approach through a series of simulations as well as via a real-robot experiment in which a Parrot Mambo drone searches from a constant altitude for targets located on the floor. Target measurements are obtained on-board the drone using segmentation in the camera image, while planning is done off-board. The sensor model is adapted to the application. Both in the simulations and in the experiments, the novel framework works better than the lawnmower and active-search baselines.
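For readers unfamiliar with the lawnmower baseline named in the abstract above, the following is a minimal sketch of a back-and-forth sweep over a rectangular area. The function name `lawnmower_waypoints`, its parameters, and the rectangular search area are assumptions made for illustration; the abstract does not describe how the paper's baseline is implemented.

```python
def lawnmower_waypoints(width, height, spacing):
    """Return (x, y) waypoints for a back-and-forth sweep of a rectangle.

    Hypothetical helper: the area is assumed to be the rectangle
    [0, width] x [0, height], swept in rows that are `spacing` apart.
    """
    if width <= 0 or height < 0 or spacing <= 0:
        raise ValueError("width and spacing must be positive, height non-negative")

    # Use an integer row index so row positions do not accumulate float error.
    n_rows = int(height // spacing) + 1
    waypoints = []
    for row in range(n_rows):
        y = row * spacing
        if row % 2 == 0:
            # Even rows are flown left to right.
            waypoints.append((0.0, y))
            waypoints.append((width, y))
        else:
            # Odd rows are flown right to left.
            waypoints.append((width, y))
            waypoints.append((0.0, y))
    return waypoints


path = lawnmower_waypoints(4.0, 2.0, 1.0)
```

Such a path covers the area uniformly regardless of what has been measured so far, which is why it serves as a natural reference point for planners that adapt to measurements.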

A Simulator and First Reinforcement Learning Results for Underwater Mapping.

Rosynski, Matthias; Busoniu, Lucian.

Sensors (Basel); 22(14), 2022 Jul 19.

Article in English | MEDLINE | ID: mdl-35891061

ABSTRACT

Underwater mapping with mobile robots has a wide range of applications, and good models are lacking for key parts of the problem, such as sensor behavior. The specific focus here is the huge environmental problem of underwater litter, in the context of the Horizon 2020 SeaClear project, where a team of robots is being developed to map and collect such litter. No reinforcement-learning solution to underwater mapping has been proposed thus far, even though the framework is well suited for robot control in unknown settings. As a key contribution, this paper therefore makes a first attempt to apply deep reinforcement learning (DRL) to this problem by exploiting two state-of-the-art algorithms and making a number of mapping-specific improvements. Since DRL often requires millions of samples to work, a fast simulator is required, and another key contribution is to develop such a simulator from scratch for mapping seafloor objects with an underwater vehicle possessing a sonar-like sensor. Extensive numerical experiments on a range of algorithm variants show that the best DRL method collects litter significantly faster than a baseline lawn mower trajectory.

Subject(s)

Algorithms; Learning

Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards.

Mezei, Ady-Daniel; Tamás, Levente; Busoniu, Lucian.

Sensors (Basel); 20(9), 2020 Apr 27.

Article in English | MEDLINE | ID: mdl-32349393

ABSTRACT

We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorrect object class. The objective is to sort the sequence of objects in a minimal number of observation and decision steps. We describe this task in the framework of partially observable Markov decision processes, and we propose a reward function that explicitly takes into account the information gain of the viewpoint selection actions applied. The DESPOT algorithm is applied to solve the problem, automatically obtaining a sequence of observation viewpoints and class decision actions. Observations are made either only for the object on the first position of the conveyor belt or for multiple adjacent positions at once. The performance of the single- and multiple-position variants is compared, and the impact of including the information gain is analyzed. Real-life experiments with a Baxter robot and an industrial conveyor belt are provided.
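The idea of rewarding information gain can be illustrated with a small sketch: a belief over object classes is updated with Bayes' rule after one observation, and the gain is measured as the resulting drop in Shannon entropy. This is one common definition and an assumption here, since the abstract does not give the paper's exact reward; the function names and the two-class sensor with 80% accuracy are likewise hypothetical.

```python
import math


def entropy(belief):
    """Shannon entropy in bits of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in belief if p > 0.0)


def bayes_update(belief, likelihood):
    """Posterior over classes given P(observation | class) for each class."""
    unnormalized = [p * l for p, l in zip(belief, likelihood)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the belief")
    return [u / total for u in unnormalized]


def information_gain(belief, likelihood):
    """Entropy reduction produced by one realized observation (assumed reward)."""
    return entropy(belief) - entropy(bayes_update(belief, likelihood))


# Hypothetical example: two classes, uniform prior, and a sensor that
# reports the true class 80% of the time. The sensor reports class 0.
prior = [0.5, 0.5]
likelihood_report_class0 = [0.8, 0.2]
posterior = bayes_update(prior, likelihood_report_class0)
gain = information_gain(prior, likelihood_report_class0)
```

For a single realized observation this quantity can be negative when the observation contradicts a confident belief; a planner would typically use its expectation over possible observations.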

Vision and Control for UAVs: A Survey of General Methods and of Inexpensive Platforms for Infrastructure Inspection.

Máthé, Koppány; Busoniu, Lucian.

Sensors (Basel); 15(7): 14887-916, 2015 Jun 25.

Article in English | MEDLINE | ID: mdl-26121608

ABSTRACT

Unmanned aerial vehicles (UAVs) have gained significant attention in recent years. Low-cost platforms using inexpensive sensor payloads have been shown to provide satisfactory flight and navigation capabilities. In this report, we survey vision and control methods that can be applied to low-cost UAVs, and we list some popular inexpensive platforms and application fields where they are useful. We also highlight the sensor suites used where this information is available. We overview, among others, feature detection and tracking, optical flow and visual servoing, low-level stabilization and high-level planning methods. We then list popular low-cost UAVs, selecting mainly quadrotors. We discuss applications, restricting our focus to the field of infrastructure inspection. Finally, as an example, we formulate two use-cases for railway inspection, a less explored application field, and illustrate the usage of the vision and control techniques reviewed by selecting appropriate ones to tackle these use-cases. To select vision methods, we run a thorough set of experimental evaluations.

Predicting Intention of Motion During Rehabilitation Tasks of the Upper-Extremity.

Natsakis, Tassos; Busoniu, Lucian.

Annu Int Conf IEEE Eng Med Biol Soc; 2021: 6037-6040, 2021 Nov.

Article in English | MEDLINE | ID: mdl-34892493

ABSTRACT

Rehabilitation promoting "assistance-as-needed" is considered a promising scheme of active rehabilitation, since it can promote neuroplasticity faster and thus reduce the time needed until restoration. To implement such schemes using robotic devices, it is crucial to be able to predict the patient's intention of motion accurately and in real time. In this study, we present an intention-of-motion model trained on healthy volunteers. The model is trained using kinematics and muscle activation time series data, and returns future predicted values for the kinematics. We also present an analysis of how sensitive the accuracy of the model is to the amount of training data and to the length of the prediction horizon. We demonstrate that the model is able to reliably predict the kinematics of volunteers who were not involved in its training. The model is tested with three types of motion inspired by rehabilitation tasks. In all cases, the model predicts the arm kinematics with a Root Mean Square Error (RMSE) below 0.12 m. Being a non-person-specific model, it could be used to predict kinematics even for patients who are not able to perform any motion without assistance. The resulting kinematics, even if not fully representative of the specific patient, might be a preferable input for a robotic rehabilitator compared with the predefined trajectories currently in use.
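As a reminder of what the RMSE figure above measures, here is a minimal sketch that computes the root mean square error between predicted and measured values along one coordinate. The function name and the sample numbers are hypothetical, and the abstract does not state how the paper aggregates errors over coordinates or time steps.

```python
import math


def rmse(predicted, actual):
    """Root mean square error between two equal-length sequences of numbers."""
    if len(predicted) != len(actual) or len(predicted) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical positions in metres along one axis.
predicted_positions = [0.10, 0.20, 0.30, 0.40]
measured_positions = [0.12, 0.18, 0.33, 0.36]
error_m = rmse(predicted_positions, measured_positions)
```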

Subject(s)

Intention; Upper Extremity; Biomechanical Phenomena; Humans; Motion; Range of Motion, Articular

Efficient model learning methods for actor-critic control.

Grondman, Ivo; Vaandrager, Maarten; Busoniu, Lucian; Babuska, Robert; Schuitema, Erik.

IEEE Trans Syst Man Cybern B Cybern; 42(3): 591-602, 2012 Jun.

Article in English | MEDLINE | ID: mdl-22156998

ABSTRACT

We propose two new actor-critic algorithms for reinforcement learning. Both algorithms use local linear regression (LLR) to learn approximations of the functions involved. A crucial feature of the algorithms is that they also learn a process model, and this, in combination with LLR, provides an efficient policy update for faster learning. The first algorithm uses a novel model-based update rule for the actor parameters. The second algorithm does not use an explicit actor but learns a reference model which represents a desired behavior, from which desired control actions can be calculated using the inverse of the learned process model. The two novel methods and a standard actor-critic algorithm are applied to the pendulum swing-up problem, in which the novel methods achieve faster learning than the standard algorithm.

Subject(s)

Algorithms; Artificial Intelligence; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated/methods; Computer Simulation

Cross-entropy optimization of control policies with adaptive basis functions.

Busoniu, Lucian; Ernst, Damien; De Schutter, Bart; Babuska, Robert.

IEEE Trans Syst Man Cybern B Cybern; 41(1): 196-209, 2011 Feb.

Article in English | MEDLINE | ID: mdl-20570774

ABSTRACT

This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-loop policy that can be represented using a given number of basis functions (BFs), where a discrete action is assigned to each BF. The type of the BFs and their number are specified in advance and determine the complexity of the representation. Considerable flexibility is achieved by optimizing the locations and shapes of the BFs, together with the action assignments. The optimization is carried out with the cross-entropy method and evaluates the policies by their empirical return from a representative set of initial states. The return for each representative state is estimated using Monte Carlo simulations. The resulting algorithm for cross-entropy policy search with adaptive BFs is extensively evaluated in problems with two to six state variables, for which it reliably obtains good policies with only a small number of BFs. In these experiments, cross-entropy policy search requires vastly fewer BFs than value-function techniques with equidistant BFs, and outperforms policy search with a competing optimization algorithm called DIRECT.
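The cross-entropy method mentioned above can be sketched on a toy problem: sample candidate parameter vectors from a Gaussian, keep the best-scoring fraction, and refit the Gaussian to those elite samples. This is a generic illustration only. In the paper the parameters describe basis functions and action assignments and the score is an empirical return, whereas here the score is a hypothetical quadratic function, and all names and default values are assumptions.

```python
import random
import statistics


def cross_entropy_maximize(score, dim, n_samples=100, n_elite=10,
                           n_iters=50, seed=0):
    """Maximize `score` over R^dim with a diagonal-Gaussian cross-entropy method."""
    rng = random.Random(seed)
    mean = [0.0] * dim   # assumed initial sampling mean
    std = [2.0] * dim    # assumed initial sampling spread
    for _ in range(n_iters):
        # Draw candidate solutions from the current sampling distribution.
        samples = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(n_samples)]
        # Keep the candidates with the highest scores.
        samples.sort(key=score, reverse=True)
        elite = samples[:n_elite]
        # Refit the distribution to the elite set; the small constant
        # keeps the standard deviation strictly positive.
        mean = [statistics.fmean(x[i] for x in elite) for i in range(dim)]
        std = [statistics.pstdev([x[i] for x in elite]) + 1e-6
               for i in range(dim)]
    return mean


def toy_score(x):
    """Hypothetical objective with its maximum at (1, -2)."""
    return -((x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2)


best = cross_entropy_maximize(toy_score, dim=2)
```

Because the method only needs score evaluations, it applies equally when the score is a noisy Monte Carlo estimate of a policy's return, as in the setting the abstract describes.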
