RESUMEN
With rapid industrialization and urbanization, air pollution has become an increasingly severe problem. As a key indicator of air quality, accurate prediction of the air quality index (AQI) is essential for policymakers to establish effective early warning management mechanisms and adjust living plans. In this work, a hybrid multi-scale fusion prediction paradigm is proposed for the complex AQI time series prediction. First, an initial decomposition and integration of the original data is performed by combining the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and sample entropy (SE). Then, the subsequences, divided into high-frequency and low-frequency groups, are applied to different processing methods. Among them, the variational mode decomposition (VMD) is chosen to perform a secondary decomposition of the high-frequency sequence groups and integrated by using K-means clustering with sample entropy. Finally, multi-scale fusion training of sequence prediction results with different frequencies by using long short-term memory (LSTM) yields more accurate results with R2 of 0.9715, RMSE of 2.0327, MAE of 0.0154, and MAPE of 0.0488. Furthermore, validation of the AQI datasets acquired from four different cities demonstrates that the new paradigm is more robust and generalizable as compared to other baseline methods. Therefore, this model not only holds potential value in developing AQI prediction models but also serves as a valuable reference for future research on AQI control strategies.