Your browser doesn't support javascript.
loading
Forecasting daily COVID-19 cases with gradient boosted regression trees and other methods: evidence from U.S. cities.
Sen, Anindya; Stevens, Nathaniel T; Tran, N Ken; Agarwal, Rishav R; Zhang, Qihuang; Dubin, Joel A.
Afiliação
  • Sen A; Department of Economics, University of Waterloo, Waterloo, ON, Canada.
  • Stevens NT; Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.
  • Tran NK; School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada.
  • Agarwal RR; Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada.
  • Zhang Q; Department of Epidemiology, Biostatistics and Occupational Health, McGill College, Montreal, QC, Canada.
  • Dubin JA; Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.
Front Public Health ; 11: 1259410, 2023.
Article em En | MEDLINE | ID: mdl-38146480
ABSTRACT

Introduction:

There is a vast literature on the performance of different short-term forecasting models for country specific COVID-19 cases, but much less research with respect to city level cases. This paper employs daily case counts for 25 Metropolitan Statistical Areas (MSAs) in the U.S. to evaluate the efficacy of a variety of statistical forecasting models with respect to 7 and 28-day ahead predictions.

Methods:

This study employed Gradient Boosted Regression Trees (GBRT), Linear Mixed Effects (LME), Susceptible, Infectious, or Recovered (SIR), and Seasonal Autoregressive Integrated Moving Average (SARIMA) models to generate daily forecasts of COVID-19 cases from November 2020 to March 2021.

Results:

Consistent with other research that have employed Machine Learning (ML) based methods, we find that Median Absolute Percentage Error (MAPE) values for both 7-day ahead and 28-day ahead predictions from GBRTs are lower than corresponding values from SIR, Linear Mixed Effects (LME), and Seasonal Autoregressive Integrated Moving Average (SARIMA) specifications for the majority of MSAs during November-December 2020 and January 2021. GBRT and SARIMA models do not offer high-quality predictions for February 2021. However, SARIMA generated MAPE values for 28-day ahead predictions are slightly lower than corresponding GBRT estimates for March 2021.

Discussion:

The results of this research demonstrate that basic ML models can lead to relatively accurate forecasts at the local level, which is important for resource allocation decisions and epidemiological surveillance by policymakers.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: COVID-19 Limite: Humans Idioma: En Revista: Front Public Health Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá País de publicação: Suíça

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: COVID-19 Limite: Humans Idioma: En Revista: Front Public Health Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá País de publicação: Suíça