RESUMEN
Accurate and reliable estimation of Reference Evapotranspiration (ETo) is crucial for water resources management, hydrological processes, and agricultural production. The FAO-56 Penman-Monteith (FAO-56PM) approach is recommended as the standard model for ETo estimation; nevertheless, the absence of comprehensive meteorological variables at many global locations frequently restricts its implementation. This study compares shallow learning (SL) and deep learning (DL) models for estimating daily ETo against the FAO-56PM approach based on various statistic metrics and graphic tool over a coastal Red Sea region, Sudan. A novel approach of the SL model, the Catboost Regressor (CBR) and three DL models: 1D-Convolutional Neural Networks (1D-CNN), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU) were adopted and coupled with a semi-supervised pseudo-labeling (PL) technique. Six scenarios were developed regarding different input combinations of meteorological variables such as air temperature (Tmin, Tmax, and Tmean), wind speed (U2), relative humidity (RH), sunshine hours duration (SSH), net radiation (Rn), and saturation vapor pressure deficit (es-ea). The results showed that the PL technique reduced the systematic error of SL and DL models during training for all the scenarios. The input combination of Tmin, Tmax, Tmean, and RH reflected higher performance than other combinations for all employed models. The CBR-PL model demonstrated good generalization abilities to predict daily ETo and was the overall superior model in the testing phase according to prediction accuracy, stability analysis, and less computation cost compared to DL models. Thus, the relatively simple CBR-PL model is highly recommended as a promising tool for predicting daily ETo in coastal regions worldwide which have limited climate data.
Asunto(s)
Aprendizaje Profundo , Redes Neurales de la Computación , Clima , Viento , TemperaturaRESUMEN
Significant progress has been made in designing advanced membranes; however, persistent challenges remain due to their reduced permeation rates and a propensity for substantial fouling. These factors continue to pose significant barriers to the effective utilization of membranes in the separation of oil-in-water emulsions. Metal-organic frameworks (MOFs) are considered promising materials for such applications; however, they encounter three key challenges when applied to the separation of oil from water: (a) lack of water stability; (b) difficulty in producing defect-free membranes; and (c) unresolved issue of stabilizing the MOF separating layer on the ceramic membrane (CM) support. In this study, a defect-free hydrolytically stable zirconium-based MOF separating layer was formed through a two-step method: first, by in situ growth of UiO-66-NH2 MOF into the voids of polydopamine (PDA)-functionalized CM during the solvothermal process, and then by facilitating the self-assembly of UiO-66-NH2 with PDA using a pressurized dead-end assembly. A stable MOF separating layer was attained by enriching the ceramic support with amines and hydroxyl groups using PDA, which assisted in the assembly and stabilization of UiO-66-NH2. The PDA-s-UiO-66-NH2-CM membrane displayed air superhydrophilicity and underwater superoleophobicity, demonstrating its oil resistance and high antifouling behavior. The PDA-s-UiO-66-NH2-CM membrane has shown exceptionally high permeability and separation capacity for challenging oil-in-water emulsions. This is attributed to numerous nanochannels from the membrane and its high resistance to oil adhesion. The membranes showed excellent stability over 15 continuous test cycles, which indicates that the developed MOFs separating layers have a low tendency to be clogged by oil droplets during separation. Machine learning-based Gaussian process regression (GPR) models as nonparametric kernel-based probabilistic models were employed to predict the performance efficiency of the PDA-s-UiO-66-NH2-CM membrane in oil-in-water separation. The outcomes were compared with the support vector machine (SVM) and decision tree (DT) algorithm. This efficiency includes various metrics related to its separation accuracy, and the models were developed through feature engineering to identify and utilize the most significant factors affecting the membrane's performance. The results proved the reliability of GPR optimization with the highest prediction accuracy in the validation phase. The average percentage increase of the GPR model compared to the SVM and DT model was 6.11 and 42.94%, respectively.
RESUMEN
Reliable modeling of oily wastewater emphasizes the paramount importance of sustainable and health-conscious wastewater management practices, which directly aligns with the Sustainable Development Goals (SDG) while also meeting the guidelines of the World Health Organization (WHO). This research explores the efficiency of utilizing polypyrrole-coated ceramic-polymeric membranes to model oily wastewater separation efficiency (SE) and permeate flux (PF) based on established experimental procedures. In this area, computational simulation still needs to be explored. The study developed predictive regression models, including robust linear regression (RLR), stepwise linear regression (SWR) and linear regression (LR) for the ceramic-polymeric porous membrane, aiming to interpret its complex performance across diverse conditions and, thus, develop its utility in oily wastewater treatment applications. Subsequently, a novel, simple average ensemble paradigm was explored to reduce errors and improve prediction skills. Prior to the development of the model, stability and reliability analysis of the data was conducted based on Philip Perron tests with the Bartlett kernel estimation method. The accuracy of the SE exhibited a high consistency, averaging 99.92% with minimal variability (standard deviation of 0.026%), potentially simplifying its prediction compared to PF. The modes were validated and evaluated using metrics like MAE, RMSE, Speed, and MSE, in addition to 2D graphical and cumulative distribution function graphs. The LR model emerged as the best with the lowest RMSE =0.21951, indicating superior prediction accuracy, followed closely by RLR with an RMSE = 0.22359. SWLR, while having the highest RMSE = 0.34573, marked its dominance in prediction speed with 110 observations per second. Notably, the RLR model justified a reduction in error by approximately 35.29% compared to SWLR. Moreover, the training efficiency of the LR model exceeded, demanding a mere 2.9252 s, marking a reduction of about 32.54% compared to SWLR. The improved simple ensemble learning proved merit over the three models regarding error accuracy. This study emphasizes the essential role of soft-computing learning in optimizing the design and performance of ceramic-polymeric membranes.
Asunto(s)
Cerámica , Membranas Artificiales , Polímeros , Pirroles , Aguas Residuales , Polímeros/química , Aguas Residuales/química , Pirroles/química , Cerámica/química , Modelos Lineales , Purificación del Agua/métodos , Porosidad , Reproducibilidad de los Resultados , Simulación por ComputadorRESUMEN
Predicting the efficacy of micropollutant separation through functionalized membranes is an arduous endeavor. The challenge stems from the complex interactions between the physicochemical properties of the micropollutants and the basic principles underlying membrane filtration. This study aimed to compare the effectiveness of a modest dataset on various machine learning tools (ML) tools in predicting micropollutant removal efficiency for functionalized reverse osmosis (RO) and nanofiltration (NF) membranes. The inherent attributes of both the micropollutants and the membranes are utilized as input factors. The chosen ML tools are supervised algorithm (adaptive network-based fuzzy inference system (NF), linear regression framework (linear regression (LR)), stepwise linear regression (SLR) and multivariate linear regression (MVR)), and unsupervised algorithm (support vector machine (SVM) and ensemble boosted tree (BT)). The feature engineering and parametric dependency analysis revealed that characteristics of micropollutants, such as maximum projection diameter (MaxP), minimal projection diameter (MinP), molecular weight (MW), and compound size (CS), exhibited a notably positive impact on the correlation with removal efficiency. Model combination with key variables demonstrated high prediction accuracy in both supervised and unsupervised ML for micropollutant removal efficiency. An NF-grid partitioning (NF-GP) model achieved the highest accuracy with an R 2 value of 0.965, accompanied by low error metrics, specifically an RMSE and MAE of 3.65. It is owed to the handling of the complex spatial and temporal aspects of micropollutant data through division into consistent subsets facilitating improved identification of rejection efficiency and relationships. The inclusion of inputs with both negative and positive correlations introduces variability, amplifies the system responsiveness, and impedes the precision of predictive models. This study identified key micropollutant properties, including MaxP, MinP, MW, and CS, as crucial factors for efficient micropollutant rejection during real-time filtration applications. It also allowed the design of pore size of self-prepared membranes for the enhanced separation of micropollutants from wastewater.
RESUMEN
Addressing global freshwater scarcity requires innovative technological solutions, among which desalination through thin-film composite polyamide membranes stands out. The performance of these membranes plays a vital role in desalination, necessitating advanced predictive modeling for optimization. This study harnesses machine learning (ML) algorithms, including support vector machine (SVM), neural networks (NN), linear regression (LR), and multivariate linear regression (MLR), alongside their ensemble techniques to predict and enhance average water flux (AWF) and average salt rejection (ASR) essential metrics of desalination efficiency. To ensure model interpretability and feature importance analysis, SHapley Additive exPlanations (SHAP) were employed, providing both global and local insights into feature contributions. Initially, the individual models were validated, with NN demonstrating superior performance for both AWF and ASR, achieving the lowest mean absolute error (MAE = 0.001) and root mean squared error (RMSE = 0.0111) for AWF and an MAE = 0.0107 and RMSE = 0.0982 for ASR. The accuracy of predictions improved significantly with ensemble models, as evidenced by the near-perfect Nash-Sutcliffe efficiency (NSE) values. Specifically, the NN ensemble (NN-E) and Linear Regression ensemble (LR-E) reached an MAE and RMSE of 0.001 and 0.0111, respectively, for AWF. For ASR, NN-E reduced the MAE to 0.0013 and the RMSE to 0.0089, while LR-E maintained competitive performance with an MAE of 0.0133 and an RMSE of 0.0936. SHAP analysis revealed that features such as MDP and TMC were critical drivers of performance, with MDP showing the most significant positive impact on ASR. These findings demonstrate the dominance of ensemble methods over individual algorithms in predicting key desalination parameters. The enhanced precision in estimating AWF and ASR offered by these neuro-intelligent ensembles, combined with the interpretability provided by SHAP analysis, can lead to significant environmental and operational improvements in membrane performance, optimizing resource usage and minimizing ecological impacts. This study paves the way for integrating intelligent ML ensembles and SHAP-based interpretability into the practical field of membrane technology, marking a step forward toward sustainable and efficient desalination processes.
RESUMEN
The rising heavy metal (HM) pollution in coastal aquifers in rapidly urbanizing areas such as Dammam leads to significant risks to public health and environmental sustainability, challenging compliance with Environmental Protection Agency (EPA) guidelines, World Health Organization (WHO) standards, and Sustainable Development Goals (SDGs) related to clean water and life on land. This study developed the predictive-based monitoring of HM concentrations, including cadmium (Cd), chromium (Cr), and mercury (Hg) in the coastal aquifers of Dammam, influenced by industrial, agricultural, and urban activities. For this purpose, dynamic system identification and machine learning (ML) models integrated with three ensemble techniques, namely, simple averaging (SAE), weighted averaging (WAE), and neuro-ensemble (N-ESB), were employed to enhance the accuracy, reliability, and efficiency of environmental monitoring systems. The experimental data were calibrated and validated in addition to k-fold cross-validation to ensure the predictive skills of the models. The methodology integrates extensive data collection across varied land uses in Dammam and accurate model calibration and validation phases to develop highly accurate predictive models. The findings proved that the N-ESB and Hammerstein-Wiener (HW) models surpassed other models in predicting the concentrations of all HM. For Cd, the N-ESB model achieved a root mean square error (RMSE = 0.0010 mg/kg). Similarly, Cr demonstrated superior performance (RMSE = 0.0179 mg/kg). Further numerical results indicated that the HW algorithm proved the most effective for Hg, with RMSE = 0.0000 mg/kg. The quantitative comparison suggested that the N-ESB model's consistently high performance and low error rates make it an optimal choice for real-time, precise monitoring and management of HM pollution in coastal aquifers. The outcomes of this research highlighted the importance of integrating advanced predictive modeling techniques in environmental science, providing significant and practical implications for policymaking and ecological management.
Asunto(s)
Monitoreo del Ambiente , Agua Subterránea , Aprendizaje Automático , Metales Pesados , Contaminantes Químicos del Agua , Metales Pesados/análisis , Monitoreo del Ambiente/métodos , Contaminantes Químicos del Agua/análisis , Agua Subterránea/química , HumanosRESUMEN
This study presents an innovative approach for predicting water and groundwater quality indices (WQI and GWQI) in the Eastern Province of Saudi Arabia, addressing critical challenges of scarcity and pollution in arid regions. Recent literature highlights the increasing attention towards WQI based on water pollution index (WPI) and GWQI as essential tools for simplifying complex hydrogeological data, thereby facilitating effective groundwater management and protection. Unlike previous works, the present research introduces a novel hybrid method that integrates non-parametric kernel Gaussian learning (GPR), adaptive neuro-fuzzy inference system (ANFIS), and decision tree (DT) algorithms. This approach marks the first application of a non-parametric kernel for groundwater quality pollution index prediction in Saudi Arabia, offering a significant advancement in the field. Through laboratory analysis and the combination of various machine learning (ML) techniques, this study enhances prediction capabilities, particularly for unmonitored sites in arid and semi-arid regions. The study's objectives include feature engineering based on dependency sensitivity analysis to identify the most influential variables affecting WQI and GWQI, and the development of predictive models using ANFIS, GPR, and DT for both indices. Furthermore, it aims to assess the impact of different data portions on WQI and GWQI predictions, exploring data divisions such as (70% / 30%), (60% / 40%), and (80% / 20%) for training and testing phase, respectively. By filling a critical gap in water resource management, this research offers significant implications for the prediction of water quality in regions facing similar environmental challenges. Through its innovative methodology and comprehensive analysis, this study contributes to the broader effort of managing and protecting water resources in arid and semi-arid areas. The result proved that GPR-M1 exhibited exceptional testing phase accuracy with RMSE = 0.0169 for GWQI. Similarly, for WPI, the ANFIS-M1 achieved high testing predictive skills with RMSE = 0.0401. The results emphasize the critical role of data quality and quantity in training for enhancing model robustness and prediction precision in water quality assessment.
RESUMEN
Artificial intelligence (AI) is being employed in brine mining to enhance the extraction of lithium, vital for the manufacturing of lithium-ion batteries, through improved recovery efficiencies and the reduction of energy consumption. An innovative approach was proposed combining Emotional Neural Networks (ENN) and Random Forest (RF) algorithms to elucidate the adsorption energy (AE) (kcal mol-1) of Li+ ions by utilizing crown ether (CE)-incorporated honeycomb 2D nanomaterials. The screening and feature engineering analysis of honeycomb-patterned 2D materials and individual CE were conducted through Density Functional Theory (DFT) and Gaussian 16 simulations. The selected honeycomb-patterned 2D materials encompass graphene, silicene, and hexagonal boron nitride, while the specific CEs evaluated are 15-crown-5 and 18-crown-6. The crown-passivated 2D surfaces held a significant adsorption site through van der Waals forces for efficient recovery of Li+ ions. ENN predicted the targeted adsorption sites with high precision and minimal deviation. The eTAI (XAI) based Shapley Additive exPlanations (SHAP) was also explored for insight into the feature importance of CE embedded 2D nanomaterials for the recovery of Li+ ions. The extreme gradient boosting algorithm (XGBoost) model demonstrated a RT-2-MAPE = 0.4618% and ENN-2-MAPE = 0.4839% for the feature engineering analysis. This research would be an insight into the AI-driven nanotechnology that presents a viable and sustainable approach for the extraction of natural resources through the application of brine mining.
RESUMEN
This study proposes different standalone models viz: Elman neural network (ENN), Boosted Tree algorithm (BTA), and f relevance vector machine (RVM) for modeling arsenic (As (mg/kg)) and zinc (Zn (mg/kg)) in marine sediments owing to anthropogenic activities. A heuristic algorithm based on the potential of RVM and a flower pollination algorithm (RVM-FPA) was developed to improve the prediction performance. Several evaluation indicators and graphical methods coupled with visualized cumulative probability function (CDF) were used to evaluate the accuracy of the models. Akaike (AIC) and Schwarz (SCI) information criteria based on Dickey-Fuller (ADF) and Philip Perron (PP) tests were introduced to check the reliability and stationarity of the data. The prediction performance in the verification phase indicated that RVM-M2 (PBAIS = -o.0465, MAE = 0.0335) and ENN-M2 (PBAIS = 0.0043, MAE = 0.0322) emerged as the best model for As (mg/kg) and Zn (mg/kg), respectively. In contrast with the standalone approaches, the simulated hybrid RVM-FPA proved merit and the most reliable, with a 5 % and 18 % predictive increase for As (mg/kg) and Zn (mg/kg), respectively. The study's findings validated the potential for estimating complex HMs through intelligent data-driven models and heuristic optimization. The study also generated valuable insights that can inform the decision-makers and stockholders for environmental management strategies.
Asunto(s)
Algoritmos , Metales Pesados , Reproducibilidad de los Resultados , Aprendizaje Automático , Sedimentos GeológicosRESUMEN
River water quality management and monitoring are essential responsibilities for communities near rivers. Government decision-makers should monitor important quality factors like temperature, dissolved oxygen (DO), pH, and biochemical oxygen demand (BOD). Among water quality parameters, the BOD throughout 5 days is an important index that must be detected by devoting a significant amount of time and effort, which is a source of significant concern in both academic and commercial settings. The traditional experimental and statistical methods cannot give enough accuracy or solve the problem for a long time to detect something. This study used a unique hybrid model called MVMD-LWLR, which introduced an innovative method for forecasting BOD in the Klang River, Malaysia. The hybrid model combines a locally weighted linear regression (LWLR) model with a wavelet-based kernel function, along with multivariate variational mode decomposition (MVMD) for the decomposition of input variables. In addition, categorical boosting (Catboost) feature selection was used to discover and extract significant input variables. This combination of MVMD-LWLR and Catboost is the first use of such a complete model for predicting BOD levels in the given river environment. In addition, an optimization process was used to improve the performance of the model. This process utilized the gradient-based optimization (GBO) approach to fine-tune the parameters and better the overall accuracy of predicting BOD levels. To assess the robustness of the proposed method, we compared it to other popular models such as kernel ridge (KRidge) regression, LASSO, elastic net, and gaussian process regression (GPR). Several metrics, comprising root-mean-square error (RMSE), R (correlation coefficient), U95% (uncertainty coefficient at 95% level), and NSE (Nash-Sutcliffe efficiency), as well as visual interpretation, were used to evaluate the predictive efficacy of hybrid models. Extensive testing revealed that, in forecasting the BOD parameter, the MVMD-LWLR model outperformed its competitors. Consequently, for BOD forecasting, the suggested MVMD-LWLR optimized with the GBO algorithm yields encouraging and reliable results, with increased forecasting accuracy and minimal error.
Asunto(s)
Ríos , Calidad del Agua , Modelos Lineales , Ríos/química , Malasia , Monitoreo del Ambiente/métodos , PredicciónRESUMEN
This study presented a detailed investigation into the performance of a plate-frame water gap membrane distillation (WGMD) system for the desalination of untreated real seawater. One approach to improving the performance of WGMD is through the proper selection of cooling plate material, which plays a vital role in enhancing the gap vapor condensation process. Hence, the influence of different cooling plate materials was examined and discussed. Furthermore, two different hydrophobic micro-porous polymeric membranes of similar mean pore sizes were utilized in the study. The influence of key operating parameters, including the feed water temperature and flow rate, was examined against the system vapor flux and gained output ratio (GOR). In addition, the used membranes were characterized by means of different techniques in terms of surface morphology, liquid entry pressure, water contact angle, pore size distribution, and porosity. Findings revealed that, at all conditions, the PTFE membrane exhibits superior vapor flux and energy efficiency (GOR), with 9.36% to 14.36% higher flux at a 0.6 to 1.2 L/min feed flow rate when compared to the PVDF membrane. The copper plate, which has the highest thermal conductivity, attained the highest vapor flux, while the acrylic plate, which has an extra-low thermal conductivity, recorded the lowest vapor flux. The increasing order of GOR values for different cooling plates is acrylic < HDPE < copper < aluminum < brass < stainless steel. Results also indicated that increasing the feed temperature increases the vapor flux almost exponentially to a maximum flux value of 30.36 kg/m2hr. The system GOR also improves in a decreasing pattern to a maximum value of 0.4049. Moreover, a long-term test showed that the PTFE membrane, which exhibits superior hydrophobicity, registered better salt rejection stability. The use of copper as a cooling plate material for better system performance is recommended, while cooling plate materials with very low thermal conductivities, such as a low thermally conducting polymer, are discouraged.