Combining machine learning models through multiple data division methods for PM<sub>2.5</sub> forecasting in Northern Xinjiang, China.

Ren, Miaomiao; Sun, Wei; Chen, Shu

Combining machine learning models through multiple data division methods for PM_2.5 forecasting in Northern Xinjiang, China.

Ren, Miaomiao; Sun, Wei; Chen, Shu.

Afiliação

Ren M; School of Geography and Planning, Sun Yat-Sen University, Guangzhou, 510275, Guangdong, China.
Sun W; School of Resources and Environmental Science, Xinjiang University, Urumqi, 830046, Xinjiang, China.
Chen S; School of Geography and Planning, Sun Yat-Sen University, Guangzhou, 510275, Guangdong, China. sunwei29@mail.sysu.edu.cn.

Environ Monit Assess ; 193(8): 476, 2021 Jul 07.

Article em En | MEDLINE | ID: mdl-34232403

RESUMO

In this study, daily average PM2.5 forecasting models were developed and applied in the Northern Xinjiang, China, through combining the back propagation artificial neural network (BPANN) and multiple linear regression (MLR) with another BPANN model. The meteorological (daily average precipitation, pressure, relative humidity, temperature, and wind speed, daily maximum wind speed and sunshine hours on the same day) and air pollutant data (daily PM2.5, PM10, SO2, CO, NO2, and O3 concentrations on the previous day) in January and August of each year from 2015 to 2019 were used as candidate inputs. The optimal member and combining models were evaluated through the leave-one-out cross-validation (LOOCV), fivefold cross-validation, and hold-out methods. Twelve member models with optimal or sub-optimal performance were further used to develop the combining models. The performances of the BPANN and MLR member models were different using three data division methods. The models were evaluated more comprehensively through the LOOCV. The performances of the combining models were generally better than the member models. For both member and combining models, the PM2.5 forecasting model performance in August was generally better than in January. The correlation coefficient (R) for the validation set of the optimal combination model was about 0.87 in January and 0.946 in August. These results showed that combining linear and nonlinear models through multiple data division methods would be an effective tool to forecast PM2.5 concentrations.

Assuntos

Poluentes Atmosféricos; Poluição do Ar; Poluentes Atmosféricos/análise; China; Monitoramento Ambiental; Previsões; Aprendizado de Máquina; Material Particulado/análise

Palavras-chave

Air quality forecasting; Artificial neural network; Combining model; Cross-validation; PM2.5 concentration

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Poluentes Atmosféricos / Poluição do Ar Tipo de estudo: Prognostic_studies País como assunto: Asia Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google