Your browser doesn't support javascript.
loading
Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city.
Yuchi, Weiran; Gombojav, Enkhjargal; Boldbaatar, Buyantushig; Galsuren, Jargalsaikhan; Enkhmaa, Sarangerel; Beejin, Bolor; Naidan, Gerel; Ochir, Chimedsuren; Legtseg, Bayarkhuu; Byambaa, Tsogtbaatar; Barn, Prabjit; Henderson, Sarah B; Janes, Craig R; Lanphear, Bruce P; McCandless, Lawrence C; Takaro, Tim K; Venners, Scott A; Webster, Glenys M; Allen, Ryan W.
Affiliation
  • Yuchi W; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Gombojav E; School of Public Health, Mongolian National University of Medical Sciences, Zorig Street, Ulaanbaatar, 14210, Mongolia.
  • Boldbaatar B; School of Public Health, Mongolian National University of Medical Sciences, Zorig Street, Ulaanbaatar, 14210, Mongolia.
  • Galsuren J; School of Public Health, Mongolian National University of Medical Sciences, Zorig Street, Ulaanbaatar, 14210, Mongolia.
  • Enkhmaa S; Institute of Meteorology and Environmental Monitoring, Ministry of Environment of Mongolia, Mongolia.
  • Beejin B; Mongolian National Center for Public Health, Olympic Street 2, Ulaanbaatar, Mongolia.
  • Naidan G; School of Public Health, Mongolian National University of Medical Sciences, Zorig Street, Ulaanbaatar, 14210, Mongolia.
  • Ochir C; School of Public Health, Mongolian National University of Medical Sciences, Zorig Street, Ulaanbaatar, 14210, Mongolia.
  • Legtseg B; Sukhbaatar District Health Center, 11 Horoo, Tsagdaagiin Gudamj, Sukhbaatar District, Ulaanbaatar, Mongolia.
  • Byambaa T; Ministry of Health of Mongolia, Olympic Street-2, Government Building VIII, Sukhbaatar District, Ulaanbaatar, Mongolia.
  • Barn P; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Henderson SB; Environmental Health Services, British Columbia Centre for Disease Control, 655 W. 12th Ave, Vancouver, BC, V5T 4R4, Canada.
  • Janes CR; School of Public Health and Health Systems, University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada.
  • Lanphear BP; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • McCandless LC; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Takaro TK; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Venners SA; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Webster GM; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada.
  • Allen RW; Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada. Electronic address: allenr@sfu.ca.
Environ Pollut ; 245: 746-753, 2019 Feb.
Article in En | MEDLINE | ID: mdl-30500754
BACKGROUND: Indoor and outdoor fine particulate matter (PM2.5) are both leading risk factors for death and disease, but making indoor measurements is often infeasible for large study populations. METHODS: We developed models to predict indoor PM2.5 concentrations for pregnant women who were part of a randomized controlled trial of portable air cleaners in Ulaanbaatar, Mongolia. We used multiple linear regression (MLR) and random forest regression (RFR) to model indoor PM2.5 concentrations with 447 independent 7-day PM2.5 measurements and 87 potential predictor variables obtained from outdoor monitoring data, questionnaires, home assessments, and geographic data sets. We also developed blended models that combined the MLR and RFR approaches. All models were evaluated in a 10-fold cross-validation. RESULTS: The predictors in the MLR model were season, outdoor PM2.5 concentration, the number of air cleaners deployed, and the density of gers (traditional felt-lined yurts) surrounding the apartments. MLR and RFR had similar performance in cross-validation (R2 = 50.2%, R2 = 48.9% respectively). The blended MLR model that included RFR predictions had the best performance (cross validation R2 = 81.5%). Intervention status alone explained only 6.0% of the variation in indoor PM2.5 concentrations. CONCLUSIONS: We predicted a moderate amount of variation in indoor PM2.5 concentrations using easily obtained predictor variables and the models explained substantially more variation than intervention status alone. While RFR shows promise for modelling indoor concentrations, our results highlight the importance of out-of-sample validation when evaluating model performance. We also demonstrate the improved performance of blended MLR/RFR models in predicting indoor air pollution.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Air Pollution, Indoor / Maternal Exposure / Particulate Matter / Models, Theoretical Type of study: Clinical_trials / Evaluation_studies / Prognostic_studies / Risk_factors_studies Limits: Female / Humans / Pregnancy Country/Region as subject: Asia Language: En Journal: Environ Pollut Journal subject: SAUDE AMBIENTAL Year: 2019 Document type: Article Affiliation country: Canada Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Air Pollution, Indoor / Maternal Exposure / Particulate Matter / Models, Theoretical Type of study: Clinical_trials / Evaluation_studies / Prognostic_studies / Risk_factors_studies Limits: Female / Humans / Pregnancy Country/Region as subject: Asia Language: En Journal: Environ Pollut Journal subject: SAUDE AMBIENTAL Year: 2019 Document type: Article Affiliation country: Canada Country of publication: United kingdom