Your browser doesn't support javascript.
loading
Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties.
Article en En | MEDLINE | ID: mdl-38781066
ABSTRACT
Numerous real-world decision or control problems involve multiple conflicting objectives whose relative importance (preference) is required to be weighed in different scenarios. While Pareto optimality is desired, environmental uncertainties (e.g., environmental changes or observational noises) may mislead the agent into performing suboptimal policies. In this article, we present a novel multiobjective optimization paradigm, robust multiobjective reinforcement learning (RMORL) considering environmental uncertainties, to train a single model that can approximate robust Pareto-optimal policies across the entire preference space. To enhance policy robustness against environmental changes, an environmental disturbance is modeled as an adversarial agent across the entire preference space via incorporating a zero-sum game into a multiobjective Markov decision process (MOMDP). Additionally, we devise an adversarial defense technique against observational perturbations, which ensures that policy variations, perturbed by adversarial attacks on state observations, remain within bounds under any specified preferences. The proposed technique is assessed in five multiobjective environments with continuous action spaces, showcasing its effectiveness through comparisons with competitive baselines, which encompass classical and state-of-the-art schemes.

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: IEEE Trans Neural Netw Learn Syst / IEEE trans. neural netw. learn. syst. (Online) / IEEE transactions on neural networks and learning systems (Online) Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Idioma: En Revista: IEEE Trans Neural Netw Learn Syst / IEEE trans. neural netw. learn. syst. (Online) / IEEE transactions on neural networks and learning systems (Online) Año: 2024 Tipo del documento: Article