Reinforcement Learning for Blast Furnace Ironmaking Operation With Safety and Partial Observation Considerations.

Jiang, Ke; Jiang, Zhaohui; Jiang, Xudong; Xie, Yongfang; Gui, Weihua

Jiang, Ke; Jiang, Zhaohui; Jiang, Xudong; Xie, Yongfang; Gui, Weihua.

IEEE Trans Neural Netw Learn Syst ; 35(3): 3077-3090, 2024 Mar.

Article em En | MEDLINE | ID: mdl-38231813

ABSTRACT

ABSTRACT

Making proper decision online in complex environment during the blast furnace (BF) operation is a key factor in achieving long-term success and profitability in the steel manufacturing industry. Regulatory lags, ore source uncertainty, and continuous decision requirement make it a challenging task. Recently, reinforcement learning (RL) has demonstrated state-of-the-art performance in various sequential decision-making problems. However, the strict safety requirements make it impossible to explore optimal decisions through online trial and error. Therefore, this article proposes a novel offline RL approach designed to ensure safety, maximize return, and address issues of partially observed states. Specifically, it utilizes an off-policy actor-critic framework to infer the optimal decision from expert operation trajectories. The "actor" in this framework is jointly trained by the supervision and evaluation signals to make decision with low risk and high return. Furthermore, we investigate a recurrent version of the actor and critic networks to better capture the complete observations, which solves the partially observed Markov decision process (POMDP) arising from sensor limitations. Verification within the BF smelting process demonstrates the improvements of the proposed algorithm in performance, i.e., safety and return.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Revista: IEEE Trans Neural Netw Learn Syst Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google