Your browser doesn't support javascript.
loading
Scrutinizing different predictive modeling validation methodologies and data-partitioning strategies: new insights using groundwater modeling case study.
Lal, Alvin; Sharan, Ashneel; Sharma, Krishneel; Ram, Arishma; Roy, Dilip Kumar; Datta, Bithin.
Afiliação
  • Lal A; Global Centre for Environmental Remediation, College of Engineering, Science and Environment, The University of Newcastle, Callaghan, New South Wales, Australia.
  • Sharan A; CRC for Contamination Assessment and Remediation of the Environment (crcCARE), The University of Newcastle, Callaghan, New South Wales, 2308, Australia.
  • Sharma K; Dicipline of Civil Engineering, College of Science & Engineering, James Cook University, Townsville, Australia. ashneel.sharan@my.jcu.edu.au.
  • Ram A; C&R Consulting, Geochemical and Hydrobiological Solutions Pty Ltd, Aitkenvale, Queensland, 4814, Australia. ashneel.sharan@my.jcu.edu.au.
  • Roy DK; School of Environmental and Life Sciences, College of Engineering, Science and Environment, The University of Newcastle, Callaghan, New South Wales, Australia.
  • Datta B; School of Agriculture, Geography, Environment, Ocean and Natural Sciences, University of the South Pacific, Laucala Campus, Suva, Fiji.
Environ Monit Assess ; 196(7): 623, 2024 Jun 17.
Article em En | MEDLINE | ID: mdl-38880864
ABSTRACT
Groundwater salinity is a critical factor affecting water quality and ecosystem health, with implications for various sectors including agriculture, industry, and public health. Hence, the reliability and accuracy of groundwater salinity predictive models are paramount for effective decision-making in managing groundwater resources. This pioneering study presents the validation of a predictive model aimed at forecasting groundwater salinity levels using three different validation methods and various data partitioning strategies. This study tests three different data validation methodologies with different data-partitioning strategies while developing a group method of data handling (GMDH)-based model for predicting groundwater salinity concentrations in a coastal aquifer system. The three different methods are the hold-out strategy (last and random selection), k-fold cross-validation, and the leave-one-out method. In addition, various combinations of data-partitioning strategies are also used while using these three validation methodologies. The prediction model's validation results are assessed using various statistical indices such as root mean square error (RMSE), means squared error (MSE), and coefficient of determination (R2). The results indicate that for monitoring wells 1, 2, and 3, the hold-out (random) with 40% data partitioning strategy gave the most accurate predictive model in terms of RMSE statistical indices. Also, the results suggested that the GMDH-based models behave differently with different validation methodologies and data-partitioning strategies giving better salinity predictive capabilities. In general, the results justify that various model validation methodologies and data-partitioning strategies yield different results due to their inherent differences in how they partition the data, assess model performance, and handle sources of bias and variance. Therefore, it is important to use them in conjunction to obtain a comprehensive understanding of the groundwater salinity prediction model's behavior and performance.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Água Subterrânea / Monitoramento Ambiental / Salinidade Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Água Subterrânea / Monitoramento Ambiental / Salinidade Idioma: En Ano de publicação: 2024 Tipo de documento: Article