Your browser doesn't support javascript.
loading
Machine learning approaches to predict the apparent rate constants for aqueous organic compounds by ferrate.
Zheng, Shan-Shan; Guo, Wan-Qian; Lu, Hao; Si, Qi-Shi; Liu, Bang-Hai; Wang, Hua-Zhe; Zhao, Qi; Jia, Wen-Rui; Yu, Tai-Ping.
Afiliación
  • Zheng SS; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Guo WQ; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China. Electronic address: hitgwq@yeah.net.
  • Lu H; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Si QS; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Liu BH; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Wang HZ; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Zhao Q; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Jia WR; State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150090, China.
  • Yu TP; Yangtze Ecology and Environment Co. Ltd., Wuhan, 430062, China.
J Environ Manage ; 329: 116904, 2023 Mar 01.
Article en En | MEDLINE | ID: mdl-36528943
The apparent second-order rate constant with hexavalent ferrate (Fe(VI)) (kFe(VI)) is a key indicator to evaluate the removal efficiency of a molecule by Fe(VI) oxidation. kFe(VI) is often determined by experiment, but such measurements can hardly catch up with the rapid growth of organic compounds (OCs). To address this issue, in this study, a total of 437 experimental second-order kFe(VI) rate constants at a range of conditions (pH and temperature) were used to train four machine learning (ML) algorithms (lasso regression (LR), ridge regression (RR), extreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM)). Using the Morgan fingerprint (MF)) of a range of organic compounds (OCs) as the input, the performance of the four algorithms was comprehensively compared with respect to the coefficient of determination (R2) and root-mean-square error (RMSE). It is shown that the RR, XGBoost, and LightGBM models displayed generally acceptable performance kFe(VI) (R2test > 0.7). In addition, the shapely additive explanation (SHAP) and feature importance methods were employed to interpret the XGBoost/LightGBM and RR models, respectively. The results showed that the XGBoost/LightGBM and RR models suggestd pH as the most important predictor and the tree-based models elucidate how electron-donating and electron-withdrawing groups influence the reactivity of the Fe(VI) species. In addition, the RR model share eight common features, including pH, with the two tree-based models. This work provides a fast and acceptable method for predicting kFe(VI) values and can help researchers better understand the degradation behavior of OCs by Fe(VI) oxidation from the perspective of molecular structure.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Contaminantes Químicos del Agua / Hierro Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Environ Manage Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Contaminantes Químicos del Agua / Hierro Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Environ Manage Año: 2023 Tipo del documento: Article País de afiliación: China