Your browser doesn't support javascript.
loading
Reinforcement learning-based consensus control for MASs with intermittent constraints.
Luo, Ao; Zhou, Qi; Ren, Hongru; Ma, Hui; Lu, Renquan.
Afiliação
  • Luo A; School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China.
  • Zhou Q; School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China. Electronic address: zhouqi2009@gmail.com.
  • Ren H; School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China.
  • Ma H; School of Mathematics and Statistics, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China.
  • Lu R; School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China.
Neural Netw ; 172: 106105, 2024 Apr.
Article em En | MEDLINE | ID: mdl-38232428
ABSTRACT
In this article, an adaptive optimal consensus control problem is studied for multiagent systems in the strict-feedback structure with intermittent constraints (the constraints appear intermittently). More specifically, by designing a novel switch-like function and an improved coordinate transformation, the constrained states are converted into unconstrained states, and the problem of intermittent constraints is resolved without requiring "feasibility conditions". In addition, using the composite learning algorithm and neural networks to construct the identifier, a simplified identifier-actor-critic-based reinforcement learning strategy is proposed to obtain the approximate optimal controller under the framework of backstepping. Meanwhile, with the aid of the nonlinear dynamic surface control technique, the issue of "explosion of complexity" in backstepping is removed, and the requirements for filter parameters are loosened. Based on Lyapunov stability theory, it is demonstrated that all signals in the closed-loop system are bounded. Finally, two simulation examples are used to verify the effectiveness of the proposed method.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Redes Neurais de Computação Idioma: En Revista: Neural Netw Assunto da revista: NEUROLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Redes Neurais de Computação Idioma: En Revista: Neural Netw Assunto da revista: NEUROLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China