Your browser doesn't support javascript.
loading
Predicting spatiotemporally-resolved mean air temperature over Sweden from satellite data using an ensemble model.
Jin, Zhihao; Ma, Yiqun; Chu, Lingzhi; Liu, Yang; Dubrow, Robert; Chen, Kai.
Affiliation
  • Jin Z; Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA; Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, CT, USA.
  • Ma Y; Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA; Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, CT, USA.
  • Chu L; Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA; Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, CT, USA.
  • Liu Y; Gangarosa Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA.
  • Dubrow R; Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA; Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, CT, USA.
  • Chen K; Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA; Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, CT, USA. Electronic address: kai.chen@yale.edu.
Environ Res ; 204(Pt A): 111960, 2022 03.
Article in En | MEDLINE | ID: mdl-34464620
ABSTRACT
Mapping of air temperature (Ta) at high spatiotemporal resolution is critical to reducing exposure assessment errors in epidemiological studies on the health effects of air temperature. In this study, we applied a three-stage ensemble model to estimate daily mean Ta from satellite-based land surface temperature (Ts) over Sweden during 2001-2019 at a high spatial resolution of 1 × 1 km2. The ensemble model incorporated four base models, including a generalized additive model (GAM), a generalized additive mixed model (GAMM), and two machine learning models (random forest [RF] and extreme gradient boosting [XGBoost]), and allowed the weights for each model to vary over space, with the best-performing model for each grid cell assigned the highest weight. Various spatial predictors were included as adjustment variables in all the base models, including land cover type, normalized difference vegetation index (NDVI), and elevation. The ensemble model showed high performance with an overall R2 of 0.98 and a root mean square error of 1.38 °C in the ten-fold cross-validation, and outperformed each of the four base models. Although each base model performed well, the two machine learning models (RF [R2 = 0.97], XGBoost [R2 = 0.98]) had better performance than the two regression models (GAM [R2 = 0.95], GAMM [R2 = 0.96]). In the machine learning models, Ts was the dominant predictor of Ta, followed by day of year, NDVI, latitude, elevation, and longitude. The highly spatiotemporally-resolved Ta can improve temperature exposure assessment in future epidemiological studies.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Environmental Monitoring / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Country/Region as subject: Europa Language: En Journal: Environ Res Year: 2022 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Environmental Monitoring / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Country/Region as subject: Europa Language: En Journal: Environ Res Year: 2022 Document type: Article Affiliation country: