Skip to main content
Top
Published in: BMC Public Health 1/2024

Open Access 01-12-2024 | COVID-19 | Research

A dynamic approach to support outbreak management using reinforcement learning and semi-connected SEIQR models

Authors: Yamin Kao, Po-Jui Chu, Pai-Chien Chou, Chien-Chang Chen

Published in: BMC Public Health | Issue 1/2024

Login to get access

Abstract

Background

Containment measures slowed the spread of COVID-19 but led to a global economic crisis. We establish a reinforcement learning (RL) algorithm that balances disease control and economic activities.

Methods

To train the RL agent, we design an RL environment with 4 semi-connected regions to represent the COVID-19 epidemic in Tokyo, Osaka, Okinawa, and Hokkaido, Japan. Every region is governed by a Susceptible-Exposed-Infected-Quarantined-Removed (SEIQR) model and has a transport hub to connect with other regions. The allocation of the synthetic population and inter-regional traveling is determined by population-weighted density. The agent learns the best policy from interacting with the RL environment, which involves obtaining daily observations, performing actions on individual movement and screening, and receiving feedback from the reward function. After training, we implement the agent into RL environments describing the actual epidemic waves of the four regions to observe the agent’s performance.

Results

For all epidemic waves covered by our study, the trained agent reduces the peak number of infectious cases and shortens the epidemics (from 165 to 35 cases and 148 to 131 days for the 5th wave). The agent is generally strict on screening but easy on movement, except for Okinawa, where the agent is easy on both actions. Action timing analyses indicate that restriction on movement is elevated when the number of exposed or infectious cases remains high or infectious cases increase rapidly, and stringency on screening is eased when the number of exposed or infectious cases drops quickly or to a regional low. For Okinawa, action on screening is tightened when the number of exposed or infectious cases increases rapidly.

Conclusions

Our experiments exhibit the potential of the RL in assisting policy-making and how the semi-connected SEIQR models establish an interactive environment for imitating cross-regional human flows.
Appendix
Available only for authorised users
Literature
1.
go back to reference Deb P, Furceri D, Ostry JD, Tawk N. The effect of containment measures on the COVID-19 pandemic. Covid Econ. 2020;19:53–86. Deb P, Furceri D, Ostry JD, Tawk N. The effect of containment measures on the COVID-19 pandemic. Covid Econ. 2020;19:53–86.
26.
go back to reference Barto A, Thomas P, Sutton R. Published. Some recent applications of reinforcement learning. Proceedings of the Eighteenth Yale Workshop on Adaptive and Learning Systems. 2017. Accessed 21 June 2023. Barto A, Thomas P, Sutton R. Published. Some recent applications of reinforcement learning. Proceedings of the Eighteenth Yale Workshop on Adaptive and Learning Systems. 2017. Accessed 21 June 2023.
34.
go back to reference Portal Site of Official Statistics of Japan. 2015 population census: basic complete tabulation on population and households of Japan. https://www.e-stat.go.jp/en/stat-search/files?page=1&toukei=00200521&tstat=000001080615. Updated 18 Jan 2019. Accessed 21 June 2023. Portal Site of Official Statistics of Japan. 2015 population census: basic complete tabulation on population and households of Japan. https://​www.​e-stat.​go.​jp/​en/​stat-search/​files?​page=​1&​toukei=​00200521&​tstat=​000001080615.​ Updated 18 Jan 2019. Accessed 21 June 2023.
35.
go back to reference Authority GI. June, Japan. The 2020 planimetric reports on the land area by prefectures and municipalities in Japan. https://www.gsi.go.jp/KOKUJYOHO/OLD-MENCHO-title.htm . Published 22 Dec 2020. Accessed 21 2023. Authority GI. June, Japan. The 2020 planimetric reports on the land area by prefectures and municipalities in Japan. https://​www.​gsi.​go.​jp/​KOKUJYOHO/​OLD-MENCHO-title.​htm . Published 22 Dec 2020. Accessed 21 2023.
40.
go back to reference Kochenderfer MJ, Wheeler TA, Wray KH. Algorithms for decision making. Cambridge: MIT Press; 2022. Kochenderfer MJ, Wheeler TA, Wray KH. Algorithms for decision making. Cambridge: MIT Press; 2022.
44.
go back to reference Summers J, Cheng H-Y, Lin H-H, et al. Potential lessons from the Taiwan and New Zealand health responses to the COVID-19 pandemic. Lancet Reg Health West Pac. 2020;100044. 10.1016/j.lanwpc.2020.100044. Summers J, Cheng H-Y, Lin H-H, et al. Potential lessons from the Taiwan and New Zealand health responses to the COVID-19 pandemic. Lancet Reg Health West Pac. 2020;100044. 10.1016/j.lanwpc.2020.100044.
Metadata
Title
A dynamic approach to support outbreak management using reinforcement learning and semi-connected SEIQR models
Authors
Yamin Kao
Po-Jui Chu
Pai-Chien Chou
Chien-Chang Chen
Publication date
01-12-2024
Publisher
BioMed Central
Keyword
COVID-19
Published in
BMC Public Health / Issue 1/2024
Electronic ISSN: 1471-2458
DOI
https://doi.org/10.1186/s12889-024-18251-0

Other articles of this Issue 1/2024

BMC Public Health 1/2024 Go to the issue