Published in: BMC Cardiovascular Disorders 1/2024

Open Access 01-12-2024 | Research

Machine learning approach for predicting cardiovascular disease in Bangladesh: evidence from a cross-sectional study in 2023

Authors: Sorif Hossain, Mohammad Kamrul Hasan, Mohammad Omar Faruk, Nelufa Aktar, Riyadh Hossain, Kabir Hossain


Abstract

Background

Cardiovascular disorders (CVDs) are the leading cause of death worldwide. Low- and middle-income countries (LMICs), such as Bangladesh, are also affected by several types of CVD, including heart failure and stroke. In Bangladesh, the leading cause of death has recently shifted from severe infections and parasitic illnesses to CVDs.

Materials and methods

The study dataset comprised the medical records of 391 CVD patients, selected by simple random sampling and collected between August 2022 and April 2023. For comparison, 260 records were collected from individuals without CVD. Cross-tabulations and chi-square tests were used to assess the association between CVD and the explanatory variables. Logistic Regression, Naïve Bayes, Decision Tree, AdaBoost, Random Forest, Bagging Tree, and Ensemble learning classifiers were used to predict CVD. Performance was evaluated using accuracy, sensitivity, specificity, and the area under the receiver operating characteristic curve (AU-ROC).
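
The chi-square test of association mentioned above can be illustrated with a minimal sketch. It assumes a single binary explanatory variable cross-tabulated against CVD status, uses the uncorrected Pearson statistic, and runs on hypothetical counts that are not taken from the study. The function name `chi_square_2x2` is likewise an illustration, not the authors' code.

```python
from math import erfc, sqrt


def chi_square_2x2(table):
    """Pearson chi-square test (no continuity correction) for a 2x2 crosstab.

    `table` is ((a, b), (c, d)): rows are levels of the explanatory
    variable, columns are CVD / no CVD. Assumes every expected count
    is non-zero.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected

    # With 1 degree of freedom the chi-square tail probability
    # equals erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(statistic / 2))
    return statistic, p_value


# Hypothetical counts, for illustration only (not study data).
example_table = ((30, 10), (20, 40))
stat, p = chi_square_2x2(example_table)
print(f"chi-square = {stat:.3f}, p = {p:.5f}")
```

A small p-value would suggest that the explanatory variable and CVD status are associated. Variables with more than two levels need the general statistic with (rows - 1) x (columns - 1) degrees of freedom.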

Results

Random Forest had the highest precision among the six classifiers reported here. Precision was as follows: Logistic Regression (93.67%), Naïve Bayes (94.87%), Decision Tree (96.1%), AdaBoost (94.94%), Random Forest (96.15%), and Bagging Tree (94.87%). The Random Forest classifier also showed the best overall balance between correct and incorrect predictions: with 98.04% accuracy, it combined the best precision (96.15%) with perfect recall (100%) and a high F1 score (97.7%). In contrast, the Logistic Regression model achieved the lowest accuracy (95.42%). The Random Forest classifier also achieved the highest AUC value (0.989).
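
For readers less familiar with these metrics, the sketch below shows how accuracy, precision, recall (sensitivity), specificity, and F1 score follow from the four cells of a binary confusion matrix. The counts are hypothetical and chosen only to make the arithmetic easy to follow. They are not the study's confusion matrix, and `classification_metrics` is an illustrative name.

```python
def classification_metrics(tp, fp, fn, tn):
    """Derive common binary-classification metrics from confusion counts.

    tp/fp/fn/tn are true positives, false positives, false negatives,
    and true negatives. Assumes all denominators are non-zero.
    """
    precision = tp / (tp + fp)      # share of predicted CVD cases that are real
    recall = tp / (tp + fn)         # sensitivity: share of real cases detected
    specificity = tn / (tn + fp)    # share of non-CVD cases correctly cleared
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical counts, for illustration only (not study data).
metrics = classification_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Because F1 is the harmonic mean of precision and recall, it always lies between the two and is pulled toward the lower one.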

Conclusion

This research focused mainly on identifying the factors that most strongly affect patients with CVD and on predicting CVD risk. It is strongly advised that the Random Forest technique be implemented in systems for predicting cardiac disease. This research may change clinical practice by giving doctors a new instrument for determining a patient’s CVD prognosis.
Metadata
Title
Machine learning approach for predicting cardiovascular disease in Bangladesh: evidence from a cross-sectional study in 2023
Authors
Sorif Hossain
Mohammad Kamrul Hasan
Mohammad Omar Faruk
Nelufa Aktar
Riyadh Hossain
Kabir Hossain
Publication date
01-12-2024
Publisher
BioMed Central
Published in
BMC Cardiovascular Disorders / Issue 1/2024
Electronic ISSN: 1471-2261
DOI
https://doi.org/10.1186/s12872-024-03883-2