Data-Driven H∞ Optimal Output Feedback Control for Linear Discrete-Time Systems Based on Off-Policy Q-Learning.
IEEE Trans Neural Netw Learn Syst; 34(7): 3553-3567, 2023 Jul.
Article in English | MEDLINE | ID: mdl-34662280
ABSTRACT
This article develops two novel output feedback (OPFB) Q-learning algorithms, on-policy Q-learning and off-policy Q-learning, to solve the H∞ static OPFB control problem for linear discrete-time (DT) systems. The primary contribution of the proposed algorithms lies in a newly developed OPFB control design for completely unknown systems. Under the premise that the disturbance attenuation condition is satisfied, conditions for the existence of the optimal OPFB solution are given. The convergence of the proposed Q-learning methods, as well as the differences and the equivalence of the two algorithms, are rigorously proven. Moreover, in view of the effects of the probing noise required for persistence of excitation (PE), the proposed off-policy Q-learning method has the advantage of being immune to probing noise, thereby avoiding a biased solution. Simulation results are presented to verify the effectiveness of the proposed approaches.
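
To illustrate the off-policy idea described in the abstract, the following Python sketch implements a generic off-policy Q-learning iteration for the linear-quadratic zero-sum (H∞) game. It is a minimal sketch under stated assumptions, not the paper's algorithm: the system matrices A, B, E, the weights, and the attenuation level gamma are hypothetical placeholders, and it uses full-state feedback rather than the paper's static OPFB formulation.

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical system (unknown to the learner; used only to generate data).
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])   # control input matrix
E = np.array([[0.1], [0.0]])   # disturbance input matrix
n, m, q = 2, 1, 1

Qc = np.eye(n)        # state weight
R = np.eye(m)         # control weight
gamma = 5.0           # assumed disturbance attenuation level

K = np.zeros((m, n))  # target control gain (admissible since A is stable)
L = np.zeros((q, n))  # target worst-case disturbance gain

# --- Collect data once under the behavior policy (off-policy setting) ---
N = 400
X, U, W, Xn = [], [], [], []
x = rng.standard_normal(n)
for _ in range(N):
    u = -K @ x + 0.5 * rng.standard_normal(m)   # probing noise for PE
    w = 0.2 * rng.standard_normal(q)            # exploratory disturbance
    xn = A @ x + B @ u + E @ w
    X.append(x); U.append(u); W.append(w); Xn.append(xn)
    x = xn if np.linalg.norm(xn) < 1e3 else rng.standard_normal(n)

# --- Off-policy Q-learning: alternate policy evaluation and improvement ---
d = n + m + q
for it in range(30):
    Phi, r = [], []
    for x, u, w, xn in zip(X, U, W, Xn):
        z = np.concatenate([x, u, w])                 # behavior actions
        zn = np.concatenate([xn, -K @ xn, -L @ xn])   # target policies at x_{k+1}
        Phi.append(np.kron(z, z) - np.kron(zn, zn))
        r.append(x @ Qc @ x + u @ R @ u - gamma**2 * (w @ w))
    # Least-squares fit of the quadratic Q-function kernel H from data.
    h, *_ = np.linalg.lstsq(np.array(Phi), np.array(r), rcond=None)
    H = h.reshape(d, d); H = 0.5 * (H + H.T)

    Hxu, Hxw = H[:n, n:n+m], H[:n, n+m:]
    Huu, Huw, Hww = H[n:n+m, n:n+m], H[n:n+m, n+m:], H[n+m:, n+m:]
    # Coupled minimizer (control) and maximizer (disturbance) of z' H z.
    K_new = np.linalg.solve(Huu - Huw @ np.linalg.solve(Hww, Huw.T),
                            Hxu.T - Huw @ np.linalg.solve(Hww, Hxw.T))
    L_new = np.linalg.solve(Hww - Huw.T @ np.linalg.solve(Huu, Huw),
                            Hxw.T - Huw.T @ np.linalg.solve(Huu, Hxu.T))
    done = np.linalg.norm(K_new - K) + np.linalg.norm(L_new - L) < 1e-8
    K, L = K_new, L_new
    if done:
        break

print("learned control gain K:", K)
print("learned worst-case disturbance gain L:", L)

Because the dataset is collected once under the behavior policy and reused across all evaluation steps, the probing noise enters only the regression data and never the target policies being evaluated, which is consistent with the unbiasedness advantage the abstract claims for the off-policy method.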
Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subjects: Neural Networks, Computer / Nonlinear Dynamics | Language: English | Journal: IEEE Trans Neural Netw Learn Syst | Publication year: 2023 | Document type: Article