Your browser doesn't support javascript.
loading
Feature selection for a continental-scale geospatial model of environmental sound levels.
Pedersen, Katrina; Transtrum, Mark K; Gee, Kent L; Lympany, Shane V; James, Michael M; Salton, Alexandria R.
Afiliación
  • Pedersen K; Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA.
  • Transtrum MK; Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA.
  • Gee KL; Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA.
  • Lympany SV; Blue Ridge Research and Consulting, LLC, Asheville, North Carolina 28801, USA.
  • James MM; Blue Ridge Research and Consulting, LLC, Asheville, North Carolina 28801, USA.
  • Salton AR; Blue Ridge Research and Consulting, LLC, Asheville, North Carolina 28801, USA.
J Acoust Soc Am ; 154(2): 1168-1178, 2023 Aug 01.
Article en En | MEDLINE | ID: mdl-37610283
ABSTRACT
Modeling environmental sound levels over continental scales is difficult due to the variety of geospatial environments. Moreover, current continental-scale models depend upon machine learning and therefore face additional challenges due to limited acoustic training data. In previous work, an ensemble of machine learning models was used to predict environmental sound levels in the contiguous United States using a training set composed of 51 geospatial layers (downselected from 120) and acoustic data from 496 geographic sites from Pedersen, Transtrum, Gee, Lympany, James, and Salton [JASA Express Lett. 1(12), 122401 (2021)]. In this paper, the downselection process, which is based on factors such as data quality and inter-feature correlations, is described in further detail. To investigate additional dimensionality reduction, four different feature selection methods are applied to the 51 layers. Leave-one-out median absolute deviation cross-validation errors suggest that the number of geospatial features can be reduced to 15 without significant degradation of the model's predictive error. However, ensemble predictions demonstrate that feature selection results are sensitive to variations in details of the problem formulation and, therefore, should elicit some skepticism. These results suggest that more sophisticated dimensionality reduction techniques are necessary for problems with limited training data and different training and testing distributions.

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Año: 2023 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Año: 2023 Tipo del documento: Article