Skip to main content
Top
Published in: BMC Public Health 1/2024

Open Access 01-12-2024 | COVID-19 | Research

Using meta-learning to recommend an appropriate time-series forecasting model

Authors: Nasrin Talkhi, Narges Akhavan Fatemi, Mehdi Jabbari Nooghabi, Ehsan Soltani, Azadeh Jabbari Nooghabi

Published in: BMC Public Health | Issue 1/2024

Login to get access

Abstract

Background

There are various forecasting algorithms available for univariate time series, ranging from simple to sophisticated and computational. In practice, selecting the most appropriate algorithm can be difficult, because there are too many algorithms. Although expert knowledge is required to make an informed decision, sometimes it is not feasible due to the lack of such resources as time, money, and manpower.

Methods

In this study, we used coronavirus disease 2019 (COVID-19) data, including the absolute numbers of confirmed, death and recovered cases per day in 187 countries from February 20, 2020, to May 25, 2021. Two popular forecasting models, including Auto-Regressive Integrated Moving Average (ARIMA) and exponential smoothing state-space model with Trigonometric seasonality, Box-Cox transformation, ARMA errors, Trend, and Seasonal components (TBATS) were used to forecast the data. Moreover, the data were evaluated by the root mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and symmetric mean absolute percentage error (SMAPE) criteria to label time series. The various characteristics of each time series based on the univariate time series structure were extracted as meta-features. After that, three machine-learning classification algorithms, including support vector machine (SVM), decision tree (DT), random forest (RF), and artificial neural network (ANN) were used as meta-learners to recommend an appropriate forecasting model.

Results

The finding of the study showed that the DT model had a better performance in the classification of time series. The accuracy of DT in the training and testing phases was 87.50% and 82.50%, respectively. The sensitivity of the DT algorithm in the training phase was 86.58% and its specificity was 88.46%. Moreover, the sensitivity and specificity of the DT algorithm in the testing phase were 73.33% and 88%, respectively.

Conclusion

In general, the meta-learning approach was able to predict the appropriate forecasting model (ARIMA and TBATS) based on some time series features. Considering some characteristics of the desired COVID-19 time series, the ARIMA or TBATS forecasting model might be recommended to forecast the death, confirmed, and recovered trend cases of COVID-19 by the DT model.
Literature
1.
go back to reference Zhu N, et al. A novel coronavirus from patients with pneumonia in China, 2019. New England journal of medicine; 2020. Zhu N, et al. A novel coronavirus from patients with pneumonia in China, 2019. New England journal of medicine; 2020.
2.
go back to reference Brem A, Viardot E, Nylund PA. Implications of the coronavirus (COVID-19) outbreak for innovation: which technologies will improve our lives? Technol Forecast Soc Chang. 2021;163:120451.CrossRef Brem A, Viardot E, Nylund PA. Implications of the coronavirus (COVID-19) outbreak for innovation: which technologies will improve our lives? Technol Forecast Soc Chang. 2021;163:120451.CrossRef
3.
go back to reference Yadav M, Perumal M, Srinivas M. Analysis on novel coronavirus (COVID-19) using machine learning methods. Volume 139. Chaos, Solitons & Fractals; 2020;110050. Yadav M, Perumal M, Srinivas M. Analysis on novel coronavirus (COVID-19) using machine learning methods. Volume 139. Chaos, Solitons & Fractals; 2020;110050.
4.
go back to reference Pontoh RS, et al. Covid-19 modelling in South Korea using a Time Series Approach. Int J Adv Sci Technol. 2020;29(7):1620–32. Pontoh RS, et al. Covid-19 modelling in South Korea using a Time Series Approach. Int J Adv Sci Technol. 2020;29(7):1620–32.
5.
go back to reference Belhadi A, et al. Manufacturing and service supply chain resilience to the COVID-19 outbreak: lessons learned from the automobile and airline industries. Technol Forecast Soc Chang. 2021;163:120447.CrossRef Belhadi A, et al. Manufacturing and service supply chain resilience to the COVID-19 outbreak: lessons learned from the automobile and airline industries. Technol Forecast Soc Chang. 2021;163:120447.CrossRef
6.
go back to reference Ballı S. Data analysis of Covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods. Chaos Solitons Fractals. 2021;142:110512.CrossRefPubMed Ballı S. Data analysis of Covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods. Chaos Solitons Fractals. 2021;142:110512.CrossRefPubMed
10.
go back to reference Morris M, et al. Neural network models for influenza forecasting with associated uncertainty using web search activity trends. PLoS Comput Biol. 2023;19(8):e1011392.CrossRefPubMedPubMedCentral Morris M, et al. Neural network models for influenza forecasting with associated uncertainty using web search activity trends. PLoS Comput Biol. 2023;19(8):e1011392.CrossRefPubMedPubMedCentral
11.
go back to reference Ristic B, Dawson P. Real-time forecasting of an epidemic outbreak: Ebola 2014/2015 case study. in 2016 19th International Conference on Information Fusion (FUSION). 2016. Ristic B, Dawson P. Real-time forecasting of an epidemic outbreak: Ebola 2014/2015 case study. in 2016 19th International Conference on Information Fusion (FUSION). 2016.
12.
go back to reference Tsan YT et al. The prediction of influenza-like illness and respiratory Disease using LSTM and ARIMA. Int J Environ Res Public Health, 2022;19(3). Tsan YT et al. The prediction of influenza-like illness and respiratory Disease using LSTM and ARIMA. Int J Environ Res Public Health, 2022;19(3).
13.
go back to reference Srinivas M, Lin YY, Liao HYM. Deep dictionary learning for fine-grained image classification. in 2017 IEEE International Conference on Image Processing (ICIP). 2017. Srinivas M, Lin YY, Liao HYM. Deep dictionary learning for fine-grained image classification. in 2017 IEEE International Conference on Image Processing (ICIP). 2017.
14.
go back to reference Lemke C, Gabrys B. Meta-learning for time series forecasting and forecast combination. Neurocomputing. 2010;73(10):2006–16. Lemke C, Gabrys B. Meta-learning for time series forecasting and forecast combination. Neurocomputing. 2010;73(10):2006–16.
15.
go back to reference Makridakis S, Wheelwright S, Hyndman R. Forecasting: Methods and Applications, third ed., John Wiley, New York, 1998. 1998, New York: John Wiley. Makridakis S, Wheelwright S, Hyndman R. Forecasting: Methods and Applications, third ed., John Wiley, New York, 1998. 1998, New York: John Wiley.
16.
go back to reference Wang X, Smith-Miles K, Hyndman R. Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series. Neurocomputing. 2009;72(10):2581–94.CrossRef Wang X, Smith-Miles K, Hyndman R. Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series. Neurocomputing. 2009;72(10):2581–94.CrossRef
17.
go back to reference Prudêncio R, Ludermir T. Using machine learning techniques to combine forecasting methods. in Australasian Joint Conference on Artificial Intelligence. 2004. Springer. Prudêncio R, Ludermir T. Using machine learning techniques to combine forecasting methods. in Australasian Joint Conference on Artificial Intelligence. 2004. Springer.
18.
go back to reference Malki Z, et al. ARIMA models for predicting the end of COVID-19 pandemic and the risk of second rebound. Neural Comput Appl. 2021;33(7):2929–48.CrossRef Malki Z, et al. ARIMA models for predicting the end of COVID-19 pandemic and the risk of second rebound. Neural Comput Appl. 2021;33(7):2929–48.CrossRef
19.
go back to reference Malki Z, et al. The COVID-19 pandemic: prediction study based on machine learning models. Environ Sci Pollut Res. 2021;28(30):40496–506.CrossRef Malki Z, et al. The COVID-19 pandemic: prediction study based on machine learning models. Environ Sci Pollut Res. 2021;28(30):40496–506.CrossRef
20.
go back to reference Malki Z, et al. Association between weather data and COVID-19 pandemic predicting mortality rate: machine learning approaches. Chaos Solitons Fractals. 2020;138:110137.CrossRef Malki Z, et al. Association between weather data and COVID-19 pandemic predicting mortality rate: machine learning approaches. Chaos Solitons Fractals. 2020;138:110137.CrossRef
21.
go back to reference Khanna A et al. Data Analytics and Management: Proceedings of ICDAM. 2021: Springer. Khanna A et al. Data Analytics and Management: Proceedings of ICDAM. 2021: Springer.
22.
go back to reference Talkhi N, et al. Modeling and forecasting number of confirmed and death caused COVID-19 in IRAN: a comparison of time series forecasting methods. Biomed Signal Process Control. 2021;66:102494.CrossRef Talkhi N, et al. Modeling and forecasting number of confirmed and death caused COVID-19 in IRAN: a comparison of time series forecasting methods. Biomed Signal Process Control. 2021;66:102494.CrossRef
23.
go back to reference Nishiura H, et al. The rate of Underascertainment of Novel Coronavirus (2019-nCoV) infection: estimation using Japanese passengers data on evacuation flights. J Clin Med. 2020;9(2):419.CrossRef Nishiura H, et al. The rate of Underascertainment of Novel Coronavirus (2019-nCoV) infection: estimation using Japanese passengers data on evacuation flights. J Clin Med. 2020;9(2):419.CrossRef
24.
go back to reference Moftakhar L, Seif M, Safe MS. Exponentially Increasing Trend of Infected Patients with COVID-19 in Iran: A Comparison of Neural Network and ARIMA Forecasting Models Iranian Journal of Public Health, 2020;49(Supple 1). Moftakhar L, Seif M, Safe MS. Exponentially Increasing Trend of Infected Patients with COVID-19 in Iran: A Comparison of Neural Network and ARIMA Forecasting Models Iranian Journal of Public Health, 2020;49(Supple 1).
25.
go back to reference Yonar H, et al. Modeling and forecasting for the number of cases of the COVID-19 pandemic with the curve estimation models, the Box-Jenkins and Exponential Smoothing methods. Eurasian J Med Oncol. 2020;4(2):160–5. Yonar H, et al. Modeling and forecasting for the number of cases of the COVID-19 pandemic with the curve estimation models, the Box-Jenkins and Exponential Smoothing methods. Eurasian J Med Oncol. 2020;4(2):160–5.
26.
go back to reference Ceylan Z. Estimation of COVID-19 prevalence in Italy, Spain, and France. Sci Total Environ. 2020;729:138817.CrossRef Ceylan Z. Estimation of COVID-19 prevalence in Italy, Spain, and France. Sci Total Environ. 2020;729:138817.CrossRef
27.
go back to reference Papastefanopoulos V, Linardatos P, Kotsiantis S. COVID-19: a comparison of Time Series methods to Forecast percentage of active cases per Population. Appl Sci. 2020;10(11):3880.CrossRef Papastefanopoulos V, Linardatos P, Kotsiantis S. COVID-19: a comparison of Time Series methods to Forecast percentage of active cases per Population. Appl Sci. 2020;10(11):3880.CrossRef
28.
go back to reference Almasarweh M, Alwadi S. ARIMA model in predicting banking stock market data. Mod Appl Sci. 2018;12(11):4. Almasarweh M, Alwadi S. ARIMA model in predicting banking stock market data. Mod Appl Sci. 2018;12(11):4.
29.
go back to reference Cryer JD, Chan KS. Time Series Analysis: With Applications in R. Vol. 2nd edition. 2008: Springer-Verlag New York. Cryer JD, Chan KS. Time Series Analysis: With Applications in R. Vol. 2nd edition. 2008: Springer-Verlag New York.
31.
go back to reference De Livera AM, Hyndman RJ, Snyder RD. Forecasting Time Series with Complex Seasonal patterns using exponential smoothing. J Am Stat Assoc. 2011;106(496):1513–27.CrossRef De Livera AM, Hyndman RJ, Snyder RD. Forecasting Time Series with Complex Seasonal patterns using exponential smoothing. J Am Stat Assoc. 2011;106(496):1513–27.CrossRef
32.
go back to reference Ma S, Fildes R. Retail sales forecasting with meta-learning. Eur J Oper Res. 2021;288(1):111–28.CrossRef Ma S, Fildes R. Retail sales forecasting with meta-learning. Eur J Oper Res. 2021;288(1):111–28.CrossRef
33.
go back to reference Tanaka S et al. A clinical prediction rule for predicting a delay in quality of life recovery at 1 month after total knee arthroplasty: a decision tree model. J Orthop Sci, 2020. Tanaka S et al. A clinical prediction rule for predicting a delay in quality of life recovery at 1 month after total knee arthroplasty: a decision tree model. J Orthop Sci, 2020.
34.
go back to reference Vilalta R, Drissi Y. A Perspective View and Survey of Meta-Learning. Artif Intell Rev. 2002;18(2):77–95.CrossRef Vilalta R, Drissi Y. A Perspective View and Survey of Meta-Learning. Artif Intell Rev. 2002;18(2):77–95.CrossRef
35.
go back to reference Ali AR, Gabrys B, Budka M. Cross-domain Meta-learning for time-series forecasting. Procedia Comput Sci. 2018;126:9–18.CrossRef Ali AR, Gabrys B, Budka M. Cross-domain Meta-learning for time-series forecasting. Procedia Comput Sci. 2018;126:9–18.CrossRef
36.
go back to reference Prudêncio RBC, Ludermir TB. Meta-learning approaches to selecting time series models. Neurocomputing. 2004;61:121–37.CrossRef Prudêncio RBC, Ludermir TB. Meta-learning approaches to selecting time series models. Neurocomputing. 2004;61:121–37.CrossRef
37.
go back to reference Yang L, et al. A regression tree approach using mathematical programming. Expert Syst Appl. 2017;78:347–57.CrossRef Yang L, et al. A regression tree approach using mathematical programming. Expert Syst Appl. 2017;78:347–57.CrossRef
39.
go back to reference Niazkar HR, Niazkar M. Application of artificial neural networks to predict the COVID-19 outbreak. Global Health Research and Policy. 2020;5(1):50.CrossRef Niazkar HR, Niazkar M. Application of artificial neural networks to predict the COVID-19 outbreak. Global Health Research and Policy. 2020;5(1):50.CrossRef
40.
go back to reference Yoon J. Forecasting of real GDP growth using machine learning models: gradient boosting and Random Forest Approach. Comput Econ. 2021;57(1):247–65.CrossRef Yoon J. Forecasting of real GDP growth using machine learning models: gradient boosting and Random Forest Approach. Comput Econ. 2021;57(1):247–65.CrossRef
41.
go back to reference Xue L, et al. A data-driven shale gas production forecasting method based on the multi-objective random forest regression. J Petrol Sci Eng. 2021;196:107801.CrossRef Xue L, et al. A data-driven shale gas production forecasting method based on the multi-objective random forest regression. J Petrol Sci Eng. 2021;196:107801.CrossRef
42.
go back to reference Makridakis S. Forecasting: its role and value for planning and strategy. Int J Forecast. 1996;12(4):513–37.CrossRef Makridakis S. Forecasting: its role and value for planning and strategy. Int J Forecast. 1996;12(4):513–37.CrossRef
43.
go back to reference Doupe P, Faghmous J, Basu S. Machine Learning for Health Services Researchers. Value in Health. 2019;22(7):808–15.CrossRef Doupe P, Faghmous J, Basu S. Machine Learning for Health Services Researchers. Value in Health. 2019;22(7):808–15.CrossRef
44.
go back to reference Shailaja K, Seetharamulu B, Jabbar MA. Machine Learning in Healthcare: A Review. in 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA). 2018. Shailaja K, Seetharamulu B, Jabbar MA. Machine Learning in Healthcare: A Review. in 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA). 2018.
45.
go back to reference Yang Q et al. Research on COVID-19 based on ARIMA model∆—Taking Hubei, China as an example to see the epidemic in Italy. J Infect Public Health, 2020. Yang Q et al. Research on COVID-19 based on ARIMA model∆—Taking Hubei, China as an example to see the epidemic in Italy. J Infect Public Health, 2020.
46.
go back to reference Farooq J, Bazaz MA. A deep learning algorithm for modeling and forecasting of COVID-19 in five worst affected states of India. Alexandria Eng J. 2021;60(1):587–96.CrossRef Farooq J, Bazaz MA. A deep learning algorithm for modeling and forecasting of COVID-19 in five worst affected states of India. Alexandria Eng J. 2021;60(1):587–96.CrossRef
47.
go back to reference Christie N, Basri MH. Personal Protective Equipment Demand Forecasting and Inventory Management during COVID-19 Case Study: Public Hospital at Bandung, Indonesia, in international conference on management, economics & finance. 2021. Christie N, Basri MH. Personal Protective Equipment Demand Forecasting and Inventory Management during COVID-19 Case Study: Public Hospital at Bandung, Indonesia, in international conference on management, economics & finance. 2021.
48.
go back to reference Rostami-Tabar B, Rendon-Sanchez JF. Forecasting COVID-19 daily cases using phone call data. Appl Soft Comput. 2021;100:106932.CrossRef Rostami-Tabar B, Rendon-Sanchez JF. Forecasting COVID-19 daily cases using phone call data. Appl Soft Comput. 2021;100:106932.CrossRef
49.
go back to reference Khan F, Saeed A, Ali S. Modelling and forecasting of new cases, deaths and recover cases of COVID-19 by using Vector Autoregressive model in Pakistan. Volume 140. Chaos, Solitons & Fractals; 2020;110189. Khan F, Saeed A, Ali S. Modelling and forecasting of new cases, deaths and recover cases of COVID-19 by using Vector Autoregressive model in Pakistan. Volume 140. Chaos, Solitons & Fractals; 2020;110189.
Metadata
Title
Using meta-learning to recommend an appropriate time-series forecasting model
Authors
Nasrin Talkhi
Narges Akhavan Fatemi
Mehdi Jabbari Nooghabi
Ehsan Soltani
Azadeh Jabbari Nooghabi
Publication date
01-12-2024
Publisher
BioMed Central
Keyword
COVID-19
Published in
BMC Public Health / Issue 1/2024
Electronic ISSN: 1471-2458
DOI
https://doi.org/10.1186/s12889-023-17627-y

Other articles of this Issue 1/2024

BMC Public Health 1/2024 Go to the issue