Skip to main content
Top
Published in: BMC Public Health 1/2024

Open Access 01-12-2024 | Malaria | Research

Risk assessment of imported malaria in China: a machine learning perspective

Authors: Shuo Yang, Ruo-yang Li, Shu-ning Yan, Han-yin Yang, Zi-you Cao, Li Zhang, Jing-bo Xue, Zhi-gui Xia, Shang Xia, Bin Zheng

Published in: BMC Public Health | Issue 1/2024

Login to get access

Abstract

Background

Following China’s official designation as malaria-free country by WHO, the imported malaria has emerged as a significant determinant impacting the malaria reestablishment within China. The objective of this study is to explore the application prospects of machine learning algorithms in imported malaria risk assessment of China.

Methods

The data of imported malaria cases in China from 2011 to 2019 was provided by China CDC; historical epidemic data of malaria endemic country was obtained from World Malaria Report, and the other data used in this study are open access data. All the data processing and model construction based on R, and map visualization used ArcGIS software.

Results

A total of 27,088 malaria cases imported into China from 85 countries between 2011 and 2019. After data preprocessing and classification, clean dataset has 765 rows (85 * 9) and 11 cols. Six machine learning models was constructed based on the training set, and Random Forest model demonstrated the best performance in model evaluation. According to RF, the highest feature importance were the number of malaria deaths and Indigenous malaria cases. The RF model demonstrated high accuracy in forecasting risk for the year 2019, achieving commendable accuracy rate of 95.3%. This result aligns well with the observed outcomes, indicating the model’s reliability in predicting risk levels.

Conclusions

Machine learning algorithms have reliable application prospects in risk assessment of imported malaria in China. This study provides a new methodological reference for the risk assessment and control strategies adjusting of imported malaria in China.
Literature
1.
go back to reference Cao CL, Guo JG. Challenge and strategy of prevention and control of important parasitic diseases under the Belt and Road Initiative[J]. Chin J Schistosomiasis Control. 2018;30(02):111–6. (in Chinese). Cao CL, Guo JG. Challenge and strategy of prevention and control of important parasitic diseases under the Belt and Road Initiative[J]. Chin J Schistosomiasis Control. 2018;30(02):111–6. (in Chinese).
3.
go back to reference World Health Organization., World Malaria Report(2020). World Health Organization., World Malaria Report(2020).
4.
go back to reference Bitoh T, Fueda K, Ohmae H, et al. Risk analysis of the re-emergence of Plasmodium Vivax malaria in Japan using a stochastic transmission model[J]. Volume 16. Environmental Health & Preventive Medicine; 2011. pp. 171–7. 3. Bitoh T, Fueda K, Ohmae H, et al. Risk analysis of the re-emergence of Plasmodium Vivax malaria in Japan using a stochastic transmission model[J]. Volume 16. Environmental Health & Preventive Medicine; 2011. pp. 171–7. 3.
8.
go back to reference Uddin S, Khan A, Hossain ME, et al. Comparing different supervised machine learning algorithms for disease prediction[J]. BMC Med Inf Decis Mak. 2019;19(1):1–16. Uddin S, Khan A, Hossain ME, et al. Comparing different supervised machine learning algorithms for disease prediction[J]. BMC Med Inf Decis Mak. 2019;19(1):1–16.
10.
go back to reference Peiffer-Smadja N, Rawson TM, Ahmad R, Buchard A, Georgiou P, Lescure FX, Birgand G, Holmes AH. Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin Microbiol Infect. 2020;26(5):584–595. https://doi.org/10.1016/j.cmi.2019.09.009. Epub 2019 Sep 17. Erratum in: Clin Microbiol Infect. 2020;26(8):1118. PMID: 31539636. Peiffer-Smadja N, Rawson TM, Ahmad R, Buchard A, Georgiou P, Lescure FX, Birgand G, Holmes AH. Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin Microbiol Infect. 2020;26(5):584–595. https://​doi.​org/​10.​1016/​j.​cmi.​2019.​09.​009. Epub 2019 Sep 17. Erratum in: Clin Microbiol Infect. 2020;26(8):1118. PMID: 31539636.
11.
go back to reference Lo Vercio L, Amador K, Bannister JJ, Crites S, Gutierrez A, MacDonald ME, Moore J, Mouches P, Rajashekar D, Schimert S, Subbanna N, Tuladhar A, Wang N, Wilms M, Winder A, Forkert ND. Supervised machine learning tools: a tutorial for clinicians. J Neural Eng. 2020;17(6). https://doi.org/10.1088/1741-2552/abbff2. PMID: 33036008. Lo Vercio L, Amador K, Bannister JJ, Crites S, Gutierrez A, MacDonald ME, Moore J, Mouches P, Rajashekar D, Schimert S, Subbanna N, Tuladhar A, Wang N, Wilms M, Winder A, Forkert ND. Supervised machine learning tools: a tutorial for clinicians. J Neural Eng. 2020;17(6). https://​doi.​org/​10.​1088/​1741-2552/​abbff2. PMID: 33036008.
13.
go back to reference Ikerionwu C, Ugwuishiwu C, Okpala I, James I, Okoronkwo M, Nnadi C, Orji U, Ebem D, Ike A. Application of machine and deep learning algorithms in optical microscopic detection of Plasmodium: a malaria diagnostic tool for the future. Photodiagnosis Photodyn Ther. 2022;40:103198. Epub 2022 Nov 12. PMID: 36379305.CrossRefPubMed Ikerionwu C, Ugwuishiwu C, Okpala I, James I, Okoronkwo M, Nnadi C, Orji U, Ebem D, Ike A. Application of machine and deep learning algorithms in optical microscopic detection of Plasmodium: a malaria diagnostic tool for the future. Photodiagnosis Photodyn Ther. 2022;40:103198. Epub 2022 Nov 12. PMID: 36379305.CrossRefPubMed
15.
16.
20.
go back to reference Sow B, Mukhtar H, Ahmad HF, Suguri H. Assessing the relative importance of social determinants of health in malaria and anemia classification based on machine learning techniques. Inf Health Soc Care. 2020;45(3):229–41. Epub 2019 Mar 27. PMID: 30917718.CrossRef Sow B, Mukhtar H, Ahmad HF, Suguri H. Assessing the relative importance of social determinants of health in malaria and anemia classification based on machine learning techniques. Inf Health Soc Care. 2020;45(3):229–41. Epub 2019 Mar 27. PMID: 30917718.CrossRef
22.
go back to reference Arowolo MO, Awotunde JB, Ayegba P. Shakirat Oluwatosin Haroon-Sulyman. Relevant gene selection using ANOVA-ant colony optimisation approach for malaria vector data classification. IJMIC. 2022;41(1/2):12–21.CrossRef Arowolo MO, Awotunde JB, Ayegba P. Shakirat Oluwatosin Haroon-Sulyman. Relevant gene selection using ANOVA-ant colony optimisation approach for malaria vector data classification. IJMIC. 2022;41(1/2):12–21.CrossRef
23.
go back to reference Arowolo M, Olaolu. Marion Olubunmi Adebiyi, and Ayodele Ariyo Adebiyi. Enhanced dimensionality reduction methods for classifying malaria vector dataset using decision tree. Sains Malaysiana. 2021;50(9):2579–89.CrossRef Arowolo M, Olaolu. Marion Olubunmi Adebiyi, and Ayodele Ariyo Adebiyi. Enhanced dimensionality reduction methods for classifying malaria vector dataset using decision tree. Sains Malaysiana. 2021;50(9):2579–89.CrossRef
24.
go back to reference Yin JH, Zhang L, Yi BY, Zhou SS, Xia ZG. Imported malaria from land bordering countries in China: a challenge in preventing the reestablishment of malaria transmission. Travel Med Infect Dis. 2023 May-Jun;53:102575. https://doi.org/10.1016/j.tmaid.2023.102575. Epub 2023 Apr 24. PMID: 37100163; PMCID: PMC10250815. Yin JH, Zhang L, Yi BY, Zhou SS, Xia ZG. Imported malaria from land bordering countries in China: a challenge in preventing the reestablishment of malaria transmission. Travel Med Infect Dis. 2023 May-Jun;53:102575. https://​doi.​org/​10.​1016/​j.​tmaid.​2023.​102575. Epub 2023 Apr 24. PMID: 37100163; PMCID: PMC10250815.
26.
go back to reference Feng J et al. Imported malaria cases—China, 2012–2018. China CDC Weekly 2.17 (2020): 277. Feng J et al. Imported malaria cases—China, 2012–2018. China CDC Weekly 2.17 (2020): 277.
27.
go back to reference Yin JH, Xia ZG. Consolidating the achievements of elimination and preventing reestablishment of transmission: main challenges and priorities of malaria prevention and control in post-elimination era in China. J Trop Dis Parasitol. 2022;20(5):241–4. 299 (in Chinese). Yin JH, Xia ZG. Consolidating the achievements of elimination and preventing reestablishment of transmission: main challenges and priorities of malaria prevention and control in post-elimination era in China. J Trop Dis Parasitol. 2022;20(5):241–4. 299 (in Chinese).
29.
go back to reference Belgiu M, Drăguţ L. Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogrammetry Remote Sens. 2016;114:24–31.CrossRef Belgiu M, Drăguţ L. Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogrammetry Remote Sens. 2016;114:24–31.CrossRef
30.
go back to reference Rodriguez-Galiano V, Francisco, et al. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogrammetry Remote Sens. 2012;67:93–104.CrossRef Rodriguez-Galiano V, Francisco, et al. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogrammetry Remote Sens. 2012;67:93–104.CrossRef
31.
go back to reference Parmar A, Katariya R, Patel V. A review on random forest: An ensemble classifier. International conference on intelligent data communication technologies and internet of things (ICICI) 2018. Springer International Publishing, 2019. Parmar A, Katariya R, Patel V. A review on random forest: An ensemble classifier. International conference on intelligent data communication technologies and internet of things (ICICI) 2018. Springer International Publishing, 2019.
32.
go back to reference Chen T. and Carlos Guestrin. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016. Chen T. and Carlos Guestrin. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016.
33.
go back to reference Ogunleye A, Qing-Guo W. XGBoost model for chronic kidney disease diagnosis. IEEE/ACM Trans Comput Biol Bioinf. 2019;17(6):2131–40.CrossRef Ogunleye A, Qing-Guo W. XGBoost model for chronic kidney disease diagnosis. IEEE/ACM Trans Comput Biol Bioinf. 2019;17(6):2131–40.CrossRef
34.
go back to reference Menze BH, et al. A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data. BMC Bioinformatics. 2009;10:1–16.CrossRef Menze BH, et al. A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data. BMC Bioinformatics. 2009;10:1–16.CrossRef
35.
go back to reference Han H, Guo X. and Hua Yu. Variable selection using mean decrease accuracy and mean decrease gini based on random forest. 2016 7th ieee international conference on software engineering and service science (icsess). IEEE, 2016. Han H, Guo X. and Hua Yu. Variable selection using mean decrease accuracy and mean decrease gini based on random forest. 2016 7th ieee international conference on software engineering and service science (icsess). IEEE, 2016.
36.
go back to reference Zheng H, Yuan J, Chen L. Short-term load forecasting using EMD-LSTM neural networks with a Xgboost algorithm for feature importance evaluation. Energies 10.8 (2017): 1168. Zheng H, Yuan J, Chen L. Short-term load forecasting using EMD-LSTM neural networks with a Xgboost algorithm for feature importance evaluation. Energies 10.8 (2017): 1168.
37.
go back to reference Zien A et al. The feature importance ranking measure. Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2009, Bled, Slovenia, September 7–11, 2009, Proceedings, Part II 20. Springer Berlin Heidelberg, 2009. Zien A et al. The feature importance ranking measure. Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2009, Bled, Slovenia, September 7–11, 2009, Proceedings, Part II 20. Springer Berlin Heidelberg, 2009.
38.
go back to reference Li Z, et al. Epidemiologic features of overseas imported malaria in the people’s Republic of China. Malar J. 2016;15:1–9. Li Z, et al. Epidemiologic features of overseas imported malaria in the people’s Republic of China. Malar J. 2016;15:1–9.
39.
go back to reference Cao J, et al. Sustained challenge to malaria elimination in China: imported malaria. Chin J Parasitol Parasitic Dis. 2018;36(2):93–6.MathSciNet Cao J, et al. Sustained challenge to malaria elimination in China: imported malaria. Chin J Parasitol Parasitic Dis. 2018;36(2):93–6.MathSciNet
40.
go back to reference Feng J, et al. Analysis of malaria epidemiological characteristics in the people’s Republic of China, 2004–2013. The American. J Trop Med Hygiene. 2015;93(2):293.CrossRef Feng J, et al. Analysis of malaria epidemiological characteristics in the people’s Republic of China, 2004–2013. The American. J Trop Med Hygiene. 2015;93(2):293.CrossRef
41.
go back to reference ZHANG, Rongbing et al. Epidemiological characteristics of overseas imported dengue fever and malaria cases in Yunnan Province from 2015 to 2021. J Prev Med (2023): 141–3. ZHANG, Rongbing et al. Epidemiological characteristics of overseas imported dengue fever and malaria cases in Yunnan Province from 2015 to 2021. J Prev Med (2023): 141–3.
Metadata
Title
Risk assessment of imported malaria in China: a machine learning perspective
Authors
Shuo Yang
Ruo-yang Li
Shu-ning Yan
Han-yin Yang
Zi-you Cao
Li Zhang
Jing-bo Xue
Zhi-gui Xia
Shang Xia
Bin Zheng
Publication date
01-12-2024
Publisher
BioMed Central
Keyword
Malaria
Published in
BMC Public Health / Issue 1/2024
Electronic ISSN: 1471-2458
DOI
https://doi.org/10.1186/s12889-024-17929-9

Other articles of this Issue 1/2024

BMC Public Health 1/2024 Go to the issue