Your browser doesn't support javascript.
loading
A comparative analysis of feature selection models for spatial analysis of floods using hybrid metaheuristic and machine learning models.
Sarwar, Javeria; Khan, Saud Ahmed; Azmat, Muhammad; Khan, Faridoon.
Afiliação
  • Sarwar J; Pakistan Institute of Development Economics, Islamabad, Pakistan.
  • Khan SA; Allama Iqbal Open University, Islamabad, Pakistan.
  • Azmat M; Pakistan Institute of Development Economics, Islamabad, Pakistan.
  • Khan F; Institute of Geographical Information Systems (IGIS), School of Civil & Environmental Engineering (SCEE), National University of Sciences and Technology, Islamabad, Pakistan.
Environ Sci Pollut Res Int ; 31(23): 33495-33514, 2024 May.
Article em En | MEDLINE | ID: mdl-38684613
ABSTRACT
The research aims to propose a feature selection model for hydraulic analysis as such a model has not been proposed previously. For this purpose, hybrids of three metaheuristic algorithms, particle swarm optimization (PSO), ant colony optimization (ACO), and genetic algorithm (GA) with two machine learning models which are support vector machine (SVM) and K-nearest neighbor (KNN) are employed. The dataset considered was hydraulic having an association with flood and possessed topographic, geo-environmental, and human-induced variables. The dataset considered had multicollinearity heteroscedasticity and autocorrelation problems. The metaheuristic algorithms were evaluated by varying the number of population size. Among them, PSO performed better by providing an appropriate number of features with a lower number of iterations. We have analyzed the performance of SVM with different kernels; linear, radial basis function (RBF), sigmoid, and polynomial, as the original SVM is designed only for linear datasets but the hydraulic dataset possesses non-linear characteristics as well. The performance of different kernels in terms of their accuracies is evaluated and recorded. This study showed that RBF performed the best and sigmoid showed the least accuracy for GA, PSO, and ACO algorithms. The performance of KNN is evaluated in terms of accuracies by varying the K-values. It was found that KNN shows low accuracy with a small K-value which then attained a maximum level by increasing K-values, and it finally started decreasing, explicitly, by further enhancing K-values. While comparing the performance of hybrids of GA, PSO, and ACO with SVM and KNN, it was analyzed that KNN performed better with these meta-heuristics with PSO-KNN which performed the best among the baseline models. Thus, the study proposes that PSO-KNN can be utilized as a feature selection technique to obtain optimal data subsets for hydraulic modeling and analysis.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Inundações / Máquina de Vetores de Suporte / Aprendizado de Máquina Idioma: En Revista: Environ Sci Pollut Res Int Assunto da revista: SAUDE AMBIENTAL / TOXICOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Paquistão

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Inundações / Máquina de Vetores de Suporte / Aprendizado de Máquina Idioma: En Revista: Environ Sci Pollut Res Int Assunto da revista: SAUDE AMBIENTAL / TOXICOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Paquistão