Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2020

Open Access 01-12-2020 | Insulins | Research article

Development and validation of algorithms to classify type 1 and 2 diabetes according to age at diagnosis using electronic health records

Authors: Calvin Ke, Thérèse A. Stukel, Andrea Luk, Baiju R. Shah, Prabhat Jha, Eric Lau, Ronald C. W. Ma, Wing-Yee So, Alice P. Kong, Elaine Chow, Juliana C. N. Chan

Published in: BMC Medical Research Methodology | Issue 1/2020

Login to get access

Abstract

Background

Validated algorithms to classify type 1 and 2 diabetes (T1D, T2D) are mostly limited to white pediatric populations. We conducted a large study in Hong Kong among children and adults with diabetes to develop and validate algorithms using electronic health records (EHRs) to classify diabetes type against clinical assessment as the reference standard, and to evaluate performance by age at diagnosis.

Methods

We included all people with diabetes (age at diagnosis 1.5–100 years during 2002–15) in the Hong Kong Diabetes Register and randomized them to derivation and validation cohorts. We developed candidate algorithms to identify diabetes types using encounter codes, prescriptions, and combinations of these criteria (“combination algorithms”). We identified 3 algorithms with the highest sensitivity, positive predictive value (PPV), and kappa coefficient, and evaluated performance by age at diagnosis in the validation cohort.

Results

There were 10,196 (T1D n = 60, T2D n = 10,136) and 5101 (T1D n = 43, T2D n = 5058) people in the derivation and validation cohorts (mean age at diagnosis 22.7, 55.9 years; 53.3, 43.9% female; for T1D and T2D respectively). Algorithms using codes or prescriptions classified T1D well for age at diagnosis < 20 years, but sensitivity and PPV dropped for older ages at diagnosis. Combination algorithms maximized sensitivity or PPV, but not both. The “high sensitivity for type 1” algorithm (ratio of type 1 to type 2 codes ≥ 4, or at least 1 insulin prescription within 90 days) had a sensitivity of 95.3% (95% confidence interval 84.2–99.4%; PPV 12.8%, 9.3–16.9%), while the “high PPV for type 1” algorithm (ratio of type 1 to type 2 codes ≥ 4, and multiple daily injections with no other glucose-lowering medication prescription) had a PPV of 100.0% (79.4–100.0%; sensitivity 37.2%, 23.0–53.3%), and the “optimized” algorithm (ratio of type 1 to type 2 codes ≥ 4, and at least 1 insulin prescription within 90 days) had a sensitivity of 65.1% (49.1–79.0%) and PPV of 75.7% (58.8–88.2%) across all ages. Accuracy of T2D classification was high for all algorithms.

Conclusions

Our validated set of algorithms accurately classifies T1D and T2D using EHRs for Hong Kong residents enrolled in a diabetes register. The choice of algorithm should be tailored to the unique requirements of each study question.
Appendix
Available only for authorised users
Literature
1.
go back to reference Ke C, Sohal P, Qian H, Quan H, Khan NA. Diabetes in the young: a population-based study of south Asian, Chinese and White people. Diabet Med. 2015;32:487–96.CrossRef Ke C, Sohal P, Qian H, Quan H, Khan NA. Diabetes in the young: a population-based study of south Asian, Chinese and White people. Diabet Med. 2015;32:487–96.CrossRef
2.
go back to reference Hux JE, Ivis F, Flintoft V, Bica A. Diabetes in Ontario. Diabetes Care. 2002;25:512–6.CrossRef Hux JE, Ivis F, Flintoft V, Bica A. Diabetes in Ontario. Diabetes Care. 2002;25:512–6.CrossRef
3.
go back to reference Lipscombe LL, Hwee J, Webster L, Shah BR, Booth GL, Tu K. Identifying diabetes cases from administrative data: a population-based validation study. BMC Health Serv Res. 2018;18:316.CrossRef Lipscombe LL, Hwee J, Webster L, Shah BR, Booth GL, Tu K. Identifying diabetes cases from administrative data: a population-based validation study. BMC Health Serv Res. 2018;18:316.CrossRef
4.
go back to reference Guttmann A, Nakhla M, Henderson M, To T, Daneman D, Cauch-Dudek K, et al. Validation of a health administrative data algorithm for assessing the epidemiology of diabetes in Canadian children. Pediatr Diabetes. 2010;11:122–8.CrossRef Guttmann A, Nakhla M, Henderson M, To T, Daneman D, Cauch-Dudek K, et al. Validation of a health administrative data algorithm for assessing the epidemiology of diabetes in Canadian children. Pediatr Diabetes. 2010;11:122–8.CrossRef
5.
go back to reference Green A, Sortsø C, Jensen PB, Emneus M. Validation of the Danish National Diabetes Register. Clin Epidemiol. 2014;7:5–15.CrossRef Green A, Sortsø C, Jensen PB, Emneus M. Validation of the Danish National Diabetes Register. Clin Epidemiol. 2014;7:5–15.CrossRef
6.
go back to reference Holt RIG, Cockram C, Flyvbjerg A, Goldstein BJ. Textbook of diabetes. 5th ed. Chichester, West Sussex, UK ; Hoboken, NJ: Wiley-Blackwell; 2017.CrossRef Holt RIG, Cockram C, Flyvbjerg A, Goldstein BJ. Textbook of diabetes. 5th ed. Chichester, West Sussex, UK ; Hoboken, NJ: Wiley-Blackwell; 2017.CrossRef
7.
go back to reference Ke C, Morgan S, Smolina K, Gasevic D, Qian H, Khan N. Mortality and cardiovascular risk of sulfonylureas in south Asian, Chinese and other Canadians with diabetes. Can J Diabetes. 2017;41:150–5.CrossRef Ke C, Morgan S, Smolina K, Gasevic D, Qian H, Khan N. Mortality and cardiovascular risk of sulfonylureas in south Asian, Chinese and other Canadians with diabetes. Can J Diabetes. 2017;41:150–5.CrossRef
8.
go back to reference Ke CH, Morgan S, Smolina K, Gasevic D, Qian H, Khan NA. Is cardiovascular risk reduction therapy effective in south Asian, Chinese and other patients with diabetes? A population-based cohort study from Canada. BMJ Open. 2017;7:e013808.CrossRef Ke CH, Morgan S, Smolina K, Gasevic D, Qian H, Khan NA. Is cardiovascular risk reduction therapy effective in south Asian, Chinese and other patients with diabetes? A population-based cohort study from Canada. BMJ Open. 2017;7:e013808.CrossRef
9.
go back to reference Luk AOY, Lau ESH, So W-Y, Ma RCW, Kong APS, Ozaki R, et al. Prospective study on the incidences of cardiovascular-renal complications in Chinese patients with young-onset type 1 and type 2 diabetes. Diabetes Care. 2014;37:149–57.CrossRef Luk AOY, Lau ESH, So W-Y, Ma RCW, Kong APS, Ozaki R, et al. Prospective study on the incidences of cardiovascular-renal complications in Chinese patients with young-onset type 1 and type 2 diabetes. Diabetes Care. 2014;37:149–57.CrossRef
10.
go back to reference Saydah S, Imperatore G. Emerging approaches in surveillance of type 1 diabetes. Curr Diab Rep. 2018;18:61.CrossRef Saydah S, Imperatore G. Emerging approaches in surveillance of type 1 diabetes. Curr Diab Rep. 2018;18:61.CrossRef
11.
go back to reference Wang L, Gao P, Zhang M, Huang Z, Zhang D, Deng Q, et al. Prevalence and ethnic pattern of diabetes and Prediabetes in China in 2013. JAMA. 2017;317:2515–23.CrossRef Wang L, Gao P, Zhang M, Huang Z, Zhang D, Deng Q, et al. Prevalence and ethnic pattern of diabetes and Prediabetes in China in 2013. JAMA. 2017;317:2515–23.CrossRef
12.
go back to reference Klompas M, Eggleston E, McVetta J, Lazarus R, Li L, Platt R. Automated detection and classification of type 1 versus type 2 diabetes using electronic health record data. Diabetes Care. 2013;36:914–21.CrossRef Klompas M, Eggleston E, McVetta J, Lazarus R, Li L, Platt R. Automated detection and classification of type 1 versus type 2 diabetes using electronic health record data. Diabetes Care. 2013;36:914–21.CrossRef
13.
go back to reference Lawrence JM, Black MH, Zhang JL, Slezak JM, Takhar HS, Koebnick C, et al. Validation of pediatric diabetes case identification approaches for diagnosed cases by using information in the electronic health Records of a Large Integrated Managed Health Care Organization. Am J Epidemiol. 2014;179:27–38.CrossRef Lawrence JM, Black MH, Zhang JL, Slezak JM, Takhar HS, Koebnick C, et al. Validation of pediatric diabetes case identification approaches for diagnosed cases by using information in the electronic health Records of a Large Integrated Managed Health Care Organization. Am J Epidemiol. 2014;179:27–38.CrossRef
14.
go back to reference Vanderloo SE, Johnson JA, Reimer K, McCrea P, Nuernberger K, Krueger H, et al. Validation of classification algorithms for childhood diabetes identified from administrative data. Pediatr Diabetes. 2012;13:229–34.CrossRef Vanderloo SE, Johnson JA, Reimer K, McCrea P, Nuernberger K, Krueger H, et al. Validation of classification algorithms for childhood diabetes identified from administrative data. Pediatr Diabetes. 2012;13:229–34.CrossRef
15.
go back to reference Zhong VW, Pfaff ER, Beavers DP, Thomas J, Jaacks LM, Bowlby DA, et al. Use of administrative and electronic health record data for development of automated algorithms for childhood diabetes case ascertainment and type classification: the SEARCH for diabetes in youth study. Pediatr Diabetes. 2014;15:573–84.CrossRef Zhong VW, Pfaff ER, Beavers DP, Thomas J, Jaacks LM, Bowlby DA, et al. Use of administrative and electronic health record data for development of automated algorithms for childhood diabetes case ascertainment and type classification: the SEARCH for diabetes in youth study. Pediatr Diabetes. 2014;15:573–84.CrossRef
16.
go back to reference Zhong VW, Obeid JS, Craig JB, Pfaff ER, Thomas J, Jaacks LM, et al. An efficient approach for surveillance of childhood diabetes by type derived from electronic health record data: the SEARCH for diabetes in youth study. J Am Med Inform Assoc. 2016;23:1060–7.CrossRef Zhong VW, Obeid JS, Craig JB, Pfaff ER, Thomas J, Jaacks LM, et al. An efficient approach for surveillance of childhood diabetes by type derived from electronic health record data: the SEARCH for diabetes in youth study. J Am Med Inform Assoc. 2016;23:1060–7.CrossRef
17.
go back to reference Park Y. Why is type 1 diabetes uncommon in Asia? Ann N Y Acad Sci. 2006;1079:31–40.CrossRef Park Y. Why is type 1 diabetes uncommon in Asia? Ann N Y Acad Sci. 2006;1079:31–40.CrossRef
18.
go back to reference Quan J, Li TK, Pang H, Choi CH, Siu SC, Tang SY, et al. Diabetes incidence and prevalence in Hong Kong, China during 2006–2014. Diabet Med. 2017;34:902–8.CrossRef Quan J, Li TK, Pang H, Choi CH, Siu SC, Tang SY, et al. Diabetes incidence and prevalence in Hong Kong, China during 2006–2014. Diabet Med. 2017;34:902–8.CrossRef
19.
go back to reference Leung GM, Tin KY, Chan W-S. Hong Kong’s health spending projections through 2033. Health Policy. 2007;81:93–101.CrossRef Leung GM, Tin KY, Chan W-S. Hong Kong’s health spending projections through 2033. Health Policy. 2007;81:93–101.CrossRef
20.
go back to reference World Health Organization. Use of glycated haemoglobin (HbA1c) in the diagnosis of diabetes mellitus. Diabetes Res Clin Pract. 2011;93:299–309. World Health Organization. Use of glycated haemoglobin (HbA1c) in the diagnosis of diabetes mellitus. Diabetes Res Clin Pract. 2011;93:299–309.
22.
go back to reference International Association of Diabetes and Pregnancy Study Groups Consensus Panel. International Association of Diabetes and Pregnancy Study Groups Recommendations on the diagnosis and classification of hyperglycemia in pregnancy. Diabetes Care. 2010;33:676–82.CrossRef International Association of Diabetes and Pregnancy Study Groups Consensus Panel. International Association of Diabetes and Pregnancy Study Groups Recommendations on the diagnosis and classification of hyperglycemia in pregnancy. Diabetes Care. 2010;33:676–82.CrossRef
23.
go back to reference Piwernetz K, Home PD, Snorgaard O, Antsiferov M, Staehr-Johansen K, Krans M, et al. Monitoring the targets of the St Vincent declaration and the implementation of quality Management in Diabetes Care: the DiabCare initiative. Diabet Med. 1993;10:371–7.CrossRef Piwernetz K, Home PD, Snorgaard O, Antsiferov M, Staehr-Johansen K, Krans M, et al. Monitoring the targets of the St Vincent declaration and the implementation of quality Management in Diabetes Care: the DiabCare initiative. Diabet Med. 1993;10:371–7.CrossRef
24.
go back to reference Chan JCN, So W, Ma RCW, Tong PCY, Wong R, Yang X. The complexity of vascular and non-vascular complications of diabetes: the Hong Kong diabetes registry. Curr Cardiovasc Risk Rep. 2011;5:230–9.CrossRef Chan JCN, So W, Ma RCW, Tong PCY, Wong R, Yang X. The complexity of vascular and non-vascular complications of diabetes: the Hong Kong diabetes registry. Curr Cardiovasc Risk Rep. 2011;5:230–9.CrossRef
25.
go back to reference Jones AG, Hattersley AT. The clinical utility of C-peptide measurement in the care of patients with diabetes. Diabet Med. 2013;30:803–17.CrossRef Jones AG, Hattersley AT. The clinical utility of C-peptide measurement in the care of patients with diabetes. Diabet Med. 2013;30:803–17.CrossRef
26.
go back to reference Sharma M, Petersen I, Nazareth I, Coton SJ. An algorithm for identification and classification of individuals with type 1 and type 2 diabetes mellitus in a large primary care database. Clin Epidemiol. 2016;8:373–80.CrossRef Sharma M, Petersen I, Nazareth I, Coton SJ. An algorithm for identification and classification of individuals with type 1 and type 2 diabetes mellitus in a large primary care database. Clin Epidemiol. 2016;8:373–80.CrossRef
27.
go back to reference Schroeder EB, Donahoo WT, Goodrich GK, Raebel MA. Validation of an algorithm for identifying type 1 diabetes in adults based on electronic health record data. Pharmacoepidemiol Drug Saf. 2018;27:1053–9.CrossRef Schroeder EB, Donahoo WT, Goodrich GK, Raebel MA. Validation of an algorithm for identifying type 1 diabetes in adults based on electronic health record data. Pharmacoepidemiol Drug Saf. 2018;27:1053–9.CrossRef
28.
go back to reference German RR, Lee LM, Horan JM, Milstein RL, Pertowski CA, Waller MN, et al. Updated guidelines for evaluating public health surveillance systems: Recommendations from the Guidelines Working Group. Morb Mortal Wkly Rep. 2001;50(RR-13):1–35. German RR, Lee LM, Horan JM, Milstein RL, Pertowski CA, Waller MN, et al. Updated guidelines for evaluating public health surveillance systems: Recommendations from the Guidelines Working Group. Morb Mortal Wkly Rep. 2001;50(RR-13):1–35.
29.
go back to reference Marshall RJ. The predictive value of simple rules for combining two diagnostic tests. Biometrics. 1989;45:1213–22.CrossRef Marshall RJ. The predictive value of simple rules for combining two diagnostic tests. Biometrics. 1989;45:1213–22.CrossRef
30.
go back to reference Cebul RD, Hershey JC, Williams SV. Using multiple tests: series and parallel approaches. Clin Lab Med. 1982;2:871–90.CrossRef Cebul RD, Hershey JC, Williams SV. Using multiple tests: series and parallel approaches. Clin Lab Med. 1982;2:871–90.CrossRef
31.
go back to reference Feuerman M, Miller AR. The kappa statistic as a function of sensitivity and specificity. Int J Math Educ Sci Technol. 2005;36:517–27.CrossRef Feuerman M, Miller AR. The kappa statistic as a function of sensitivity and specificity. Int J Math Educ Sci Technol. 2005;36:517–27.CrossRef
33.
34.
go back to reference Kharrazi H, Chi W, Chang H-Y, Richards T, Gallagher J, Knudson S, et al. Comparing population-based risk-stratification model performance using demographic, diagnosis and medication data extracted from outpatient electronic health records versus administrative claims. Med Care. 2017;55:789–96.CrossRef Kharrazi H, Chi W, Chang H-Y, Richards T, Gallagher J, Knudson S, et al. Comparing population-based risk-stratification model performance using demographic, diagnosis and medication data extracted from outpatient electronic health records versus administrative claims. Med Care. 2017;55:789–96.CrossRef
Metadata
Title
Development and validation of algorithms to classify type 1 and 2 diabetes according to age at diagnosis using electronic health records
Authors
Calvin Ke
Thérèse A. Stukel
Andrea Luk
Baiju R. Shah
Prabhat Jha
Eric Lau
Ronald C. W. Ma
Wing-Yee So
Alice P. Kong
Elaine Chow
Juliana C. N. Chan
Publication date
01-12-2020
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2020
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-020-00921-3

Other articles of this Issue 1/2020

BMC Medical Research Methodology 1/2020 Go to the issue