RESUMO
The groundwater salinization process complexity and the lack of data on its controlling factors are the main challenges for accurate predictions and mapping of aquifer salinity. For this purpose, effective machine learning (ML) methodologies are employed for effective modeling and mapping of groundwater salinity (GWS) in the Mio-Pliocene aquifer in the Sidi Okba region, Algeria, based on limited dataset of electrical conductivity (EC) measurements and readily available digital elevation model (DEM) derivatives. The dataset was randomly split into training (70%) and testing (30%) sets, and three wrapper selection methods, recursive feature elimination (RFE), forward feature selection (FFS), and backward feature selection (BFS) are applied to train the data. The resulting combinations are used as inputs for five ML models, namely random forest (RF), hybrid neuro-fuzzy inference system (HyFIS), K-nearest neighbors (KNN), cubist regression model (CRM), and support vector machine (SVM). The best-performing model is identified and applied to predict and map GWS across the entire study area. It is highlighted that the applied methods yield input variation combinations as critical factors that are often overlocked by many researchers, which substantially impacts the models' accuracy. Among different alternatives the RF model emerged as the most effective for predicting and mapping GWS in the study area, which led to the high performance in both the training (RMSE = 1.016, R = 0.854, and MAE = 0.759) and testing (RMSE = 1.069, R = 0.831, and MAE = 0.921) phases. The generated digital map highlighted the alarming situation regarding excessive GWS levels in the study area, particularly in zones of low elevations and far from the Foum Elgherza dam and Elbiraz wadi. Overall, this study represents a significant advancement over previous approaches, offering enhanced predictive performance for GWS with the minimum number of input variables.
Assuntos
Água Subterrânea , Aprendizado de Máquina , Salinidade , Argélia , Água Subterrânea/química , Monitoramento Ambiental/métodos , Máquina de Vetores de Suporte , Modelos TeóricosRESUMO
Biskra region currently shows signs of stress and a high risk of groundwater contamination by various chemicals and pesticides. For this purpose, a modified integrated susceptibility index (SI) is coupled with remote sensing (RS) and WetSpass model to assess the sensitivity of the groundwater and the risk of pollution in the most exploited aquifer (Quaternary aquifer) in the study area. The results of the modified SI model show that a major part of the aquifer is at risk of contamination if the farmers do not implement good agricultural practices. Four sensitivity levels are considered, reflecting a vulnerability rating that ranges from low to very high. The very high category is observed in the agricultural areas with an estimated pollution index ranging from 84 to 90.57, while a large part of the aquifer shows a high vulnerability to contamination (64 < SI ≤ 84). This category is found in areas characterized by the dominance of bare soil. In urban areas, the vulnerability level decreases to low category (37 < SI ≤ 45). However, the area of forests is classified as moderate to vulnerability (45 < SI ≤ 64). The different statistical and GIS methods confirm the reliability of the obtained SI map. The combination of the SI method with WetSpass model and RS can give a reliable map to help and assist the authorities and decision-makers in groundwater resources planning and the implementation of monitoring programs and networks to control the quality of groundwater in arid environments.