Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-Implementation Guidelines.

Trella, Anna L; Zhang, Kelly W; Nahum-Shani, Inbal; Shetty, Vivek; Doshi-Velez, Finale; Murphy, Susan A

Affiliation

Trella AL; School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02420, USA.
Zhang KW; School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02420, USA.
Nahum-Shani I; Institute for Social Research, University of Michigan, Ann Arbor, MI 48109, USA.
Shetty V; Schools of Dentistry & Engineering, University of California, Los Angeles, CA 90095, USA.
Doshi-Velez F; School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02420, USA.
Murphy SA; School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02420, USA.

Algorithms; 15(8), 2022 Aug.

Article in English | MEDLINE | ID: mdl-36713810

ABSTRACT

Online reinforcement learning (RL) algorithms are increasingly used to personalize digital interventions in the fields of mobile health and online education. Common challenges in designing and testing an RL algorithm in these settings include ensuring the RL algorithm can learn and run stably under real-time constraints, and accounting for the complexity of the environment, e.g., a lack of accurate mechanistic models for the user dynamics. To guide how one can tackle these challenges, we extend the PCS (predictability, computability, stability) framework, a data science framework that incorporates best practices from machine learning and statistics in supervised learning, to the design of RL algorithms for the digital interventions setting. Furthermore, we provide guidelines on how to design simulation environments, a crucial tool for evaluating candidate RL algorithms using the PCS framework. We show how we used the PCS framework to design an RL algorithm for Oralytics, a mobile health study aiming to improve users' tooth-brushing behaviors through the personalized delivery of intervention messages. Oralytics will go into the field in late 2022.

Keywords

algorithm design; algorithm evaluation; mobile health; online learning; reinforcement learning (RL)

Full text: 1 | Database: MEDLINE | Study type: Guideline / Prognostic_studies | Language: En | Journal: Algorithms | Publication year: 2022 | Document type: Article | Country of affiliation: United States
