Your browser doesn't support javascript.
loading
Systematic Performance Evaluation of Reinforcement Learning Algorithms Applied to Wastewater Treatment Control Optimization.
Croll, Henry C; Ikuma, Kaoru; Ong, Say Kee; Sarkar, Soumik.
Afiliação
  • Croll HC; Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States.
  • Ikuma K; Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States.
  • Ong SK; Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States.
  • Sarkar S; Department of Mechanical Engineering, Iowa State University, Ames, Iowa 50011, United States.
Environ Sci Technol ; 57(46): 18382-18390, 2023 Nov 21.
Article em En | MEDLINE | ID: mdl-37405782
ABSTRACT
Treatment of wastewater using activated sludge relies on several complex, nonlinear processes. While activated sludge systems can provide high levels of treatment, including nutrient removal, operating these systems is often challenging and energy intensive. Significant research investment has been made in recent years into improving control optimization of such systems, through both domain knowledge and, more recently, machine learning. This study leverages a novel interface between a common process modeling software and a Python reinforcement learning environment to evaluate four common reinforcement learning algorithms for their ability to minimize treatment energy use while maintaining effluent compliance within the Benchmark Simulation Model No. 1 (BSM1) simulation. Three of the algorithms tested, deep Q-learning, proximal policy optimization, and synchronous advantage actor critic, generally performed poorly over the scenarios tested in this study. In contrast, the twin delayed deep deterministic policy gradient (TD3) algorithm consistently produced a high level of control optimization while maintaining the treatment requirements. Under the best selection of state observation features, TD3 control optimization reduced aeration and pumping energy requirements by 14.3% compared to the BSM1 benchmark control, outperforming the advanced domain-based control strategy of ammonia-based aeration control, although future work is necessary to improve robustness of RL implementation.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Esgotos / Purificação da Água Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Esgotos / Purificação da Água Idioma: En Ano de publicação: 2023 Tipo de documento: Article