Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2021

Open Access 01-12-2021 | Research

Application of multi-label classification models for the diagnosis of diabetic complications

Authors: Liang Zhou, Xiaoyuan Zheng, Di Yang, Ying Wang, Xuesong Bai, Xinhua Ye

Published in: BMC Medical Informatics and Decision Making | Issue 1/2021

Login to get access

Abstract

Background

Early diagnosis for the diabetes complications is clinically demanding with great significancy. Regarding the complexity of diabetes complications, we applied a multi-label classification (MLC) model to predict four diabetic complications simultaneously using data in the modern electronic health records (EHRs), and leveraged the correlations between the complications to further improve the prediction accuracy.

Methods

We obtained the demographic characteristics and laboratory data from the EHRs for patients admitted to Changzhou No. 2 People’s Hospital, the affiliated hospital of Nanjing Medical University in China from May 2013 to June 2020. The data included 93 biochemical indicators and 9,765 patients. We used the Pearson correlation coefficient (PCC) to analyze the correlations between different diabetic complications from a statistical perspective. We used an MLC model, based on the Random Forest (RF) technique, to leverage these correlations and predict four complications simultaneously. We explored four different MLC models; a Label Power Set (LP), Classifier Chains (CC), Ensemble Classifier Chains (ECC), and Calibrated Label Ranking (CLR). We used traditional Binary Relevance (BR) as a comparison. We used 11 different performance metrics and the area under the receiver operating characteristic curve (AUROC) to evaluate these models. We analyzed the weights of the learned model and illustrated (1) the top 10 key indicators of different complications and (2) the correlations between different diabetic complications.

Results

The MLC models including CC, ECC and CLR outperformed the traditional BR method in most performance metrics; the ECC models performed the best in Hamming loss (0.1760), Accuracy (0.7020), F1_Score (0.7855), Precision (0.8649), F1_micro (0.8078), F1_macro (0.7773), Recall_micro (0.8631), Recall_macro (0.8009), and AUROC (0.8231). The two diabetic complication correlation matrices drawn from the PCC analysis and the MLC models were consistent with each other and indicated that the complications correlated to different extents. The top 10 key indicators given by the model are valuable in medical application.

Conclusions

Our MLC model can effectively utilize the potential correlation between different diabetic complications to further improve the prediction accuracy. This model should be explored further in other complex diseases with multiple complications.
Appendix
Available only for authorised users
Literature
1.
go back to reference An Y, Zhang P, Wang J, et al. Cardiovascular and all-cause mortality over a 23-year period among chinese with newly diagnosed diabetes in the da qing igt and diabetes study. Diabetes Care. 2015;38(7):1365–71.CrossRef An Y, Zhang P, Wang J, et al. Cardiovascular and all-cause mortality over a 23-year period among chinese with newly diagnosed diabetes in the da qing igt and diabetes study. Diabetes Care. 2015;38(7):1365–71.CrossRef
2.
go back to reference Hu H, Sawhney M, Shi L, et al. A systematic review of the direct economic burden of type 2 diabetes in china. Diabetes Ther. 2015;6(1):7–16.CrossRef Hu H, Sawhney M, Shi L, et al. A systematic review of the direct economic burden of type 2 diabetes in china. Diabetes Ther. 2015;6(1):7–16.CrossRef
3.
go back to reference Liu Z, Fu C, Wang W, Xu B. Prevalence of chronic complications of type 2 diabetes mellitus in outpatients: a cross-sectional hospital based survey in urban China. Health Qual Life Outcomes. 2010;8:62.CrossRef Liu Z, Fu C, Wang W, Xu B. Prevalence of chronic complications of type 2 diabetes mellitus in outpatients: a cross-sectional hospital based survey in urban China. Health Qual Life Outcomes. 2010;8:62.CrossRef
4.
go back to reference Mao W, Yip CW, Chen W. Complications of diabetes in China: health system and economic implications. BMC Public Health. 2019;19(1):269.CrossRef Mao W, Yip CW, Chen W. Complications of diabetes in China: health system and economic implications. BMC Public Health. 2019;19(1):269.CrossRef
6.
go back to reference Preo N, Capobianco E. Significant EHR feature-driven t2d inference: predictive machine learning and networks. Front Big Data. 2019;2:30.CrossRef Preo N, Capobianco E. Significant EHR feature-driven t2d inference: predictive machine learning and networks. Front Big Data. 2019;2:30.CrossRef
7.
go back to reference Lan K, Wang DT, Fong S, Liu LS, Wong K, Dey N. A survey of data mining and deep learning in bioinformatics. J Med Syst. 2018;42:139.CrossRef Lan K, Wang DT, Fong S, Liu LS, Wong K, Dey N. A survey of data mining and deep learning in bioinformatics. J Med Syst. 2018;42:139.CrossRef
8.
go back to reference Belur Nagaraj S, Pena MJ, Ju W, Heerspink HL. Machine-learning-based early prediction of end-stage renal disease in patients with diabetic kidney disease using clinical trials data. Diabetes Obes Metab. 2020;22(12):2479–86.CrossRef Belur Nagaraj S, Pena MJ, Ju W, Heerspink HL. Machine-learning-based early prediction of end-stage renal disease in patients with diabetic kidney disease using clinical trials data. Diabetes Obes Metab. 2020;22(12):2479–86.CrossRef
9.
go back to reference Makino M, Yoshimoto R, Ono M, et al. Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep. 2019;9(1):11862.CrossRef Makino M, Yoshimoto R, Ono M, et al. Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep. 2019;9(1):11862.CrossRef
10.
go back to reference Song X, Waitman LR, Yu AS, Robbins DC, Hu Y, Liu M. Longitudinal risk prediction of chronic kidney disease in diabetic patients using a temporal-enhanced gradient boosting machine: retrospective cohort study. JMIR Med Inform. 2020;8(1):e15510.CrossRef Song X, Waitman LR, Yu AS, Robbins DC, Hu Y, Liu M. Longitudinal risk prediction of chronic kidney disease in diabetic patients using a temporal-enhanced gradient boosting machine: retrospective cohort study. JMIR Med Inform. 2020;8(1):e15510.CrossRef
11.
go back to reference Jonnagaddala J, Liaw ST, Ray P, Kumar M, Dai HJ, Hsu CY. Identification and progression of heart disease risk factors in diabetic patients from longitudinal electronic health records. Biomed Res Int. 2015;2015:636371.CrossRef Jonnagaddala J, Liaw ST, Ray P, Kumar M, Dai HJ, Hsu CY. Identification and progression of heart disease risk factors in diabetic patients from longitudinal electronic health records. Biomed Res Int. 2015;2015:636371.CrossRef
12.
go back to reference Ogunyemi OI, Gandhi M, Tayek C. Predictive models for diabetic retinopathy from non-image teleretinal screening data. AMIA Jt Summits Transl Sci Proc. 2019;2019:472–7.PubMedPubMedCentral Ogunyemi OI, Gandhi M, Tayek C. Predictive models for diabetic retinopathy from non-image teleretinal screening data. AMIA Jt Summits Transl Sci Proc. 2019;2019:472–7.PubMedPubMedCentral
13.
go back to reference Dagliati A, Marini S, Sacchi L, et al. Machine learning methods to predict diabetes complications. J Diabetes Sci Technol. 2018;12(2):295–302.CrossRef Dagliati A, Marini S, Sacchi L, et al. Machine learning methods to predict diabetes complications. J Diabetes Sci Technol. 2018;12(2):295–302.CrossRef
14.
go back to reference Kim E, Pieczkiewicz DS, Castro MR, Caraballo PJ, Simon GJ. Multi-task learning to identify outcome-specific risk factors that distinguish individual micro and macrovascular complications of type 2 diabetes. AMIA Jt Summits Transl Sci Proc. 2018;2017:122–31.PubMed Kim E, Pieczkiewicz DS, Castro MR, Caraballo PJ, Simon GJ. Multi-task learning to identify outcome-specific risk factors that distinguish individual micro and macrovascular complications of type 2 diabetes. AMIA Jt Summits Transl Sci Proc. 2018;2017:122–31.PubMed
15.
go back to reference Zhang M, Zhou Z. A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng. 2014;26:1819.CrossRef Zhang M, Zhou Z. A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng. 2014;26:1819.CrossRef
16.
go back to reference Ganz T, Wainstein J, Gilad S, Limor R, Boaz M, Stern N. Serum asymmetric dimethylarginine and arginine levels predict microvascular and macrovascular complications in type 2 diabetes mellitus. Diabetes Metab Res Rev. 2017;33(2):2017.CrossRef Ganz T, Wainstein J, Gilad S, Limor R, Boaz M, Stern N. Serum asymmetric dimethylarginine and arginine levels predict microvascular and macrovascular complications in type 2 diabetes mellitus. Diabetes Metab Res Rev. 2017;33(2):2017.CrossRef
17.
go back to reference Zhao Y, Lin W, Li Z, et al. High expression of mannose-binding lectin and the risk of vascular complications of diabetes: evidence from a meta-analysis. Diabetes Technol Ther. 2015;17(7):490–7.CrossRef Zhao Y, Lin W, Li Z, et al. High expression of mannose-binding lectin and the risk of vascular complications of diabetes: evidence from a meta-analysis. Diabetes Technol Ther. 2015;17(7):490–7.CrossRef
18.
go back to reference Miller RG, Costacou T, Orchard TJ. Risk factor modeling for cardiovascular disease in type 1 diabetes in the pittsburgh epidemiology of diabetes complications (EDC) study: a comparison with the diabetes control and complications trial/epidemiology of diabetes interventions and complications study (DCCT/EDIC). Diabetes. 2019;68(2):409–19.CrossRef Miller RG, Costacou T, Orchard TJ. Risk factor modeling for cardiovascular disease in type 1 diabetes in the pittsburgh epidemiology of diabetes complications (EDC) study: a comparison with the diabetes control and complications trial/epidemiology of diabetes interventions and complications study (DCCT/EDIC). Diabetes. 2019;68(2):409–19.CrossRef
19.
go back to reference Basu S, Sussman JB, Berkowitz SA, Hayward RA, Yudkin JS. Development and validation of risk equations for complications of type 2 diabetes (RECODe) using individual participant data from randomised trials. Lancet Diabetes Endocrinol. 2017;5(10):788–98.CrossRef Basu S, Sussman JB, Berkowitz SA, Hayward RA, Yudkin JS. Development and validation of risk equations for complications of type 2 diabetes (RECODe) using individual participant data from randomised trials. Lancet Diabetes Endocrinol. 2017;5(10):788–98.CrossRef
20.
go back to reference Basu S, Sussman JB, Berkowitz SA, et al. Validation of risk equations for complications of type 2 diabetes (RECODe) using individual participant data from diverse longitudinal cohorts in the US. Diabetes Care. 2018;41(3):586–95.CrossRef Basu S, Sussman JB, Berkowitz SA, et al. Validation of risk equations for complications of type 2 diabetes (RECODe) using individual participant data from diverse longitudinal cohorts in the US. Diabetes Care. 2018;41(3):586–95.CrossRef
21.
go back to reference Gerstein HC, Miller ME, Byington RP, et al. Effects of intensive glucose lowering in type 2 diabetes. N Engl J Med. 2008;358(24):2545–59.CrossRef Gerstein HC, Miller ME, Byington RP, et al. Effects of intensive glucose lowering in type 2 diabetes. N Engl J Med. 2008;358(24):2545–59.CrossRef
22.
go back to reference Hayes AJ, Leal J, Gray AM, Holman RR, Clarke PM. UKPDS outcomes model 2: a new version of a model to simulate lifetime health outcomes of patients with type 2 diabetes mellitus using data from the 30 year United Kingdom Prospective Diabetes Study: UKPDS 82. Diabetologia. 2013;56(9):1925–33.CrossRef Hayes AJ, Leal J, Gray AM, Holman RR, Clarke PM. UKPDS outcomes model 2: a new version of a model to simulate lifetime health outcomes of patients with type 2 diabetes mellitus using data from the 30 year United Kingdom Prospective Diabetes Study: UKPDS 82. Diabetologia. 2013;56(9):1925–33.CrossRef
23.
go back to reference Maxwell A, Li R, Yang B, et al. Deep learning architectures for multi-label classification of intelligent health risk prediction. BMC Bioinform. 2017;18(Suppl 14):523.CrossRef Maxwell A, Li R, Yang B, et al. Deep learning architectures for multi-label classification of intelligent health risk prediction. BMC Bioinform. 2017;18(Suppl 14):523.CrossRef
24.
go back to reference Folorunso SO, Fashoto SG, Olaomi J, Fashoto OY. A multi-label learning model for psychotic diseases in Nigeria. Inform Med Unlocked. 2020;19:100326.CrossRef Folorunso SO, Fashoto SG, Olaomi J, Fashoto OY. A multi-label learning model for psychotic diseases in Nigeria. Inform Med Unlocked. 2020;19:100326.CrossRef
25.
go back to reference Omar M, Tahir M, Khelifi F. Multi-label learning model for improving retinal image classification in diabetic retinopathy. 2017. 0202. Omar M, Tahir M, Khelifi F. Multi-label learning model for improving retinal image classification in diabetic retinopathy. 2017. 0202.
26.
go back to reference Lagani V, Chiarugi F, Manousos D, et al. Realization of a service for the long-term risk assessment of diabetes-related complications. J Diabetes Compl. 2015;29(5):691–8.CrossRef Lagani V, Chiarugi F, Manousos D, et al. Realization of a service for the long-term risk assessment of diabetes-related complications. J Diabetes Compl. 2015;29(5):691–8.CrossRef
27.
go back to reference Flammer J, Konieczka K, Bruno RM, Virdis A, Flammer AJ, Taddei S. The eye and the heart. Eur Heart J. 2013;34(17):1270–8.CrossRef Flammer J, Konieczka K, Bruno RM, Virdis A, Flammer AJ, Taddei S. The eye and the heart. Eur Heart J. 2013;34(17):1270–8.CrossRef
28.
go back to reference Rim TH, Teo A, Yang H, Cheung CY, Wong TY. Retinal vascular signs and cerebrovascular diseases. J Neuroophthalmol. 2020;40:44–59.CrossRef Rim TH, Teo A, Yang H, Cheung CY, Wong TY. Retinal vascular signs and cerebrovascular diseases. J Neuroophthalmol. 2020;40:44–59.CrossRef
30.
go back to reference Nägele MP, Barthelmes J, Ludovici V, et al. Retinal microvascular dysfunction in heart failure. Eur Heart J. 2018;39(1):47–56.CrossRef Nägele MP, Barthelmes J, Ludovici V, et al. Retinal microvascular dysfunction in heart failure. Eur Heart J. 2018;39(1):47–56.CrossRef
32.
go back to reference Xu X, Sun F, Wang Q, et al. Comprehensive retinal vascular measurements: a novel association with renal function in type 2 diabetic patients in China. Sci Rep. 2020;10(1):13737.CrossRef Xu X, Sun F, Wang Q, et al. Comprehensive retinal vascular measurements: a novel association with renal function in type 2 diabetic patients in China. Sci Rep. 2020;10(1):13737.CrossRef
33.
go back to reference Bai BM, Mangathayaru N, Rani BP. Diabetes complications prediction using different multi-label classification algorithms-MEKA. ICICCT 2019: system reliability, quality control, safety, maintenance and management. 2020. Bai BM, Mangathayaru N, Rani BP. Diabetes complications prediction using different multi-label classification algorithms-MEKA. ICICCT 2019: system reliability, quality control, safety, maintenance and management. 2020.
34.
go back to reference Boutell M, Luo J, Shen X, Brown C. Learning multi-label scene classification. Pattern Recognit. 2004;37:1757.CrossRef Boutell M, Luo J, Shen X, Brown C. Learning multi-label scene classification. Pattern Recognit. 2004;37:1757.CrossRef
35.
go back to reference Read J, Pfahringer B, Holmes G, Frank E. Classifier Chains for Multi-label Classification. 2009. Read J, Pfahringer B, Holmes G, Frank E. Classifier Chains for Multi-label Classification. 2009.
36.
go back to reference Read J, Pfahringer B, Holmes G, Frank E. Classifier chains for multi-label classification. Mach Learn. 2011;85(3):333–59.CrossRef Read J, Pfahringer B, Holmes G, Frank E. Classifier chains for multi-label classification. Mach Learn. 2011;85(3):333–59.CrossRef
37.
go back to reference Fürnkranz J, Hüllermeier E, Loza Mencía E, Brinker K. Multilabel classification via calibrated label ranking. Mach Learn. 2008;73(2):133.CrossRef Fürnkranz J, Hüllermeier E, Loza Mencía E, Brinker K. Multilabel classification via calibrated label ranking. Mach Learn. 2008;73(2):133.CrossRef
38.
go back to reference Tsoumakas G, Vlahavas I. Random k-Labelsets: An Ensemble Method for Multilabel Classification. Berlin, Heidelberg,2007. Tsoumakas G, Vlahavas I. Random k-Labelsets: An Ensemble Method for Multilabel Classification. Berlin, Heidelberg,2007.
39.
go back to reference Zhang M, Zhou Z. ML-KNN: A lazy learning approach to multi-label leaming. Pattern Recognit. 2007;40:2038.CrossRef Zhang M, Zhou Z. ML-KNN: A lazy learning approach to multi-label leaming. Pattern Recognit. 2007;40:2038.CrossRef
40.
go back to reference Veloso A, Jr WM. Multi-Label Associative Classification. Springerbriefs in Computer Science. 2011: 53–59. Veloso A, Jr WM. Multi-Label Associative Classification. Springerbriefs in Computer Science. 2011: 53–59.
41.
go back to reference Elisseeff A, Weston J. A Kernel Method for Multi-Labelled Classification. 2002. Elisseeff A, Weston J. A Kernel Method for Multi-Labelled Classification. 2002.
42.
go back to reference Ghamrawi N, Mccallum A. Collective multi-label classification. 2005. 195. Ghamrawi N, Mccallum A. Collective multi-label classification. 2005. 195.
43.
go back to reference Elkafrawy P, Mausad A, Esmail H. Experimental comparison of methods for multi-label classification in different application domains. Int J Comput Appl. 2015;114:1. Elkafrawy P, Mausad A, Esmail H. Experimental comparison of methods for multi-label classification in different application domains. Int J Comput Appl. 2015;114:1.
44.
go back to reference Zhang J, Wang Y, Zhang R, et al. Serum fibrinogen predicts diabetic ESRD in patients with type 2 diabetes mellitus. Diabetes Res Clin Pract. 2018;141:1–9.CrossRef Zhang J, Wang Y, Zhang R, et al. Serum fibrinogen predicts diabetic ESRD in patients with type 2 diabetes mellitus. Diabetes Res Clin Pract. 2018;141:1–9.CrossRef
45.
go back to reference Zhang J, Zhang R, Wang Y, et al. The level of serum albumin is associated with renal prognosis in patients with diabetic nephropathy. J Diabetes Res. 2019;2019:7825804.PubMedPubMedCentral Zhang J, Zhang R, Wang Y, et al. The level of serum albumin is associated with renal prognosis in patients with diabetic nephropathy. J Diabetes Res. 2019;2019:7825804.PubMedPubMedCentral
46.
go back to reference Tessari P, Kiwanuka E, Barazzoni R, Vettore M, Zanetti M. Diabetic nephropathy is associated with increased albumin and fibrinogen production in patients with type 2 diabetes. Diabetologia. 2006;49(8):1955–61.CrossRef Tessari P, Kiwanuka E, Barazzoni R, Vettore M, Zanetti M. Diabetic nephropathy is associated with increased albumin and fibrinogen production in patients with type 2 diabetes. Diabetologia. 2006;49(8):1955–61.CrossRef
47.
go back to reference Robles NR, Ferreira F, Martinez-Gallardo R, et al. Hematocrit, urea and gender: the Hematocrit, Urea and GEnder formula for prognosing progressive renal failure in diabetic nephropathy. Eur J Intern Med. 2012;23(3):283–6.CrossRef Robles NR, Ferreira F, Martinez-Gallardo R, et al. Hematocrit, urea and gender: the Hematocrit, Urea and GEnder formula for prognosing progressive renal failure in diabetic nephropathy. Eur J Intern Med. 2012;23(3):283–6.CrossRef
48.
go back to reference Samra YA, Saleh HM, Hussein KA, et al. Adenosine deaminase-2-induced hyperpermeability in human retinal vascular endothelial cells is suppressed by MicroRNA-146b-3p. Invest Ophthalmol Vis Sci. 2017;58(2):933–43.CrossRef Samra YA, Saleh HM, Hussein KA, et al. Adenosine deaminase-2-induced hyperpermeability in human retinal vascular endothelial cells is suppressed by MicroRNA-146b-3p. Invest Ophthalmol Vis Sci. 2017;58(2):933–43.CrossRef
49.
go back to reference Issar T, Arnold R, Kwai N, et al. Relative contributions of diabetes and chronic kidney disease to neuropathy development in diabetic nephropathy patients. Clin Neurophysiol. 2019;130(11):2088–95.CrossRef Issar T, Arnold R, Kwai N, et al. Relative contributions of diabetes and chronic kidney disease to neuropathy development in diabetic nephropathy patients. Clin Neurophysiol. 2019;130(11):2088–95.CrossRef
Metadata
Title
Application of multi-label classification models for the diagnosis of diabetic complications
Authors
Liang Zhou
Xiaoyuan Zheng
Di Yang
Ying Wang
Xuesong Bai
Xinhua Ye
Publication date
01-12-2021
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2021
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-021-01525-7

Other articles of this Issue 1/2021

BMC Medical Informatics and Decision Making 1/2021 Go to the issue