Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2021

Open Access 01-12-2021 | Public Health | Research article

Public health utility of cause of death data: applying empirical algorithms to improve data quality

Authors: Sarah Charlotte Johnson, Matthew Cunningham, Ilse N. Dippenaar, Fablina Sharara, Eve E. Wool, Kareha M. Agesa, Chieh Han, Molly K. Miller-Petrie, Shadrach Wilson, John E. Fuller, Shelly Balassyano, Gregory J. Bertolacci, Nicole Davis Weaver, Alan D. Lopez, Christopher J. L. Murray, Mohsen Naghavi, GBD Cause of Death Collaborators

Published in: BMC Medical Informatics and Decision Making | Issue 1/2021

Login to get access

Abstract

Background

Accurate, comprehensive, cause-specific mortality estimates are crucial for informing public health decision making worldwide. Incorrectly or vaguely assigned deaths, defined as garbage-coded deaths, mask the true cause distribution. The Global Burden of Disease (GBD) study has developed methods to create comparable, timely, cause-specific mortality estimates; an impactful data processing method is the reallocation of garbage-coded deaths to a plausible underlying cause of death. We identify the pattern of garbage-coded deaths in the world and present the methods used to determine their redistribution to generate more plausible cause of death assignments.

Methods

We describe the methods developed for the GBD 2019 study and subsequent iterations to redistribute garbage-coded deaths in vital registration data to plausible underlying causes. These methods include analysis of multiple cause data, negative correlation, impairment, and proportional redistribution. We classify garbage codes into classes according to the level of specificity of the reported cause of death (CoD) and capture trends in the global pattern of proportion of garbage-coded deaths, disaggregated by these classes, and the relationship between this proportion and the Socio-Demographic Index. We examine the relative importance of the top four garbage codes by age and sex and demonstrate the impact of redistribution on the annual GBD CoD rankings.

Results

The proportion of least-specific (class 1 and 2) garbage-coded deaths ranged from 3.7% of all vital registration deaths to 67.3% in 2015, and the age-standardized proportion had an overall negative association with the Socio-Demographic Index. When broken down by age and sex, the category for unspecified lower respiratory infections was responsible for nearly 30% of garbage-coded deaths in those under 1 year of age for both sexes, representing the largest proportion of garbage codes for that age group. We show how the cause distribution by number of deaths changes before and after redistribution for four countries: Brazil, the United States, Japan, and France, highlighting the necessity of accounting for garbage-coded deaths in the GBD.

Conclusions

We provide a detailed description of redistribution methods developed for CoD data in the GBD; these methods represent an overall improvement in empiricism compared to past reliance on a priori knowledge.
Appendix
Available only for authorised users
Literature
1.
go back to reference Alter GC, Carmichael AG. Classifying the dead: toward a history of the registration of causes of death. J Hist Med Allied Sci. 1999;54(2):114–32.PubMedCrossRef Alter GC, Carmichael AG. Classifying the dead: toward a history of the registration of causes of death. J Hist Med Allied Sci. 1999;54(2):114–32.PubMedCrossRef
2.
go back to reference GBD 2019 Diseases, Injuries, and Impairments Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. in press GBD 2019 Diseases, Injuries, and Impairments Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. in press
4.
go back to reference Sibai AM. Mortality certification and cause-of-death reporting in developing countries. Bull World Health Organ. 2004;82(2):83.PubMedPubMedCentral Sibai AM. Mortality certification and cause-of-death reporting in developing countries. Bull World Health Organ. 2004;82(2):83.PubMedPubMedCentral
6.
go back to reference AbouZahr C, Boerma T. Health information systems: the foundations of public health. Bull World Health Organ. 2005;83(8):578–83.PubMedPubMedCentral AbouZahr C, Boerma T. Health information systems: the foundations of public health. Bull World Health Organ. 2005;83(8):578–83.PubMedPubMedCentral
7.
go back to reference Ruzicka LT, Lopez AD. The use of cause-of-death statistics for health situation assessment: national and international experiences. World Health Stat Q Rapp Trimest Stat Sanit Mond. 1990;43(4):249–58. Ruzicka LT, Lopez AD. The use of cause-of-death statistics for health situation assessment: national and international experiences. World Health Stat Q Rapp Trimest Stat Sanit Mond. 1990;43(4):249–58.
8.
go back to reference World Health Organization, editor. International statistical classification of diseases and related health problems. 10th revision, 2nd edition. Geneva: World Health Organization; 2004 World Health Organization, editor. International statistical classification of diseases and related health problems. 10th revision, 2nd edition. Geneva: World Health Organization; 2004
10.
go back to reference Campos-Outcalt D. Cause-of-death certification: not as easy as it seems. J Fam Pract. 2005;54(2):134–9.PubMed Campos-Outcalt D. Cause-of-death certification: not as easy as it seems. J Fam Pract. 2005;54(2):134–9.PubMed
11.
go back to reference Lakkireddy DR, Basarakodu KR, Vacek JL, Kondur AK, Ramachandruni SK, Esterbrooks DJ, et al. Improving death certificate completion: a trial of two training interventions. J Gen Intern Med. 2007;22(4):544–8.PubMedPubMedCentralCrossRef Lakkireddy DR, Basarakodu KR, Vacek JL, Kondur AK, Ramachandruni SK, Esterbrooks DJ, et al. Improving death certificate completion: a trial of two training interventions. J Gen Intern Med. 2007;22(4):544–8.PubMedPubMedCentralCrossRef
12.
go back to reference Naghavi M, Makela S, Foreman K, O’Brien J, Pourmalek F, Lozano R. Algorithms for enhancing public health utility of national causes-of-death data. Popul Health Metr. 2010;8(1):9.PubMedPubMedCentralCrossRef Naghavi M, Makela S, Foreman K, O’Brien J, Pourmalek F, Lozano R. Algorithms for enhancing public health utility of national causes-of-death data. Popul Health Metr. 2010;8(1):9.PubMedPubMedCentralCrossRef
13.
go back to reference Mathers CD, Fat DM, Inoue M, Rao C, Lopez AD. Counting the dead and what they died from: an assessment of the global status of cause of death data. Bull World Health Organ. 2005;83:171–7.PubMedPubMedCentral Mathers CD, Fat DM, Inoue M, Rao C, Lopez AD. Counting the dead and what they died from: an assessment of the global status of cause of death data. Bull World Health Organ. 2005;83:171–7.PubMedPubMedCentral
14.
go back to reference Rudd K, Johnson S, Agesa K, Shackelford K, Tsoi D, Kievlan D, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017. The Lancet. 2020;395(10219):200–11.CrossRef Rudd K, Johnson S, Agesa K, Shackelford K, Tsoi D, Kievlan D, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017. The Lancet. 2020;395(10219):200–11.CrossRef
15.
go back to reference Hernández B, Ramírez-Villalobos D, Romero M, Gómez S, Atkinson C, Lozano R. Assessing quality of medical death certification: Concordance between gold standard diagnosis and underlying cause of death in selected Mexican hospitals. Popul Health Metr. 2011;9(1):38.PubMedPubMedCentralCrossRef Hernández B, Ramírez-Villalobos D, Romero M, Gómez S, Atkinson C, Lozano R. Assessing quality of medical death certification: Concordance between gold standard diagnosis and underlying cause of death in selected Mexican hospitals. Popul Health Metr. 2011;9(1):38.PubMedPubMedCentralCrossRef
16.
go back to reference Rao C, Lopez AD, Yang G, Begg S, Ma J. Evaluating national cause-of-death statistics: principles and application to the case of China. Bull World Health Organ. 2005;83(8):618–25.PubMedPubMedCentral Rao C, Lopez AD, Yang G, Begg S, Ma J. Evaluating national cause-of-death statistics: principles and application to the case of China. Bull World Health Organ. 2005;83(8):618–25.PubMedPubMedCentral
17.
go back to reference Lu TH, Lee MC, Chou MC. Accuracy of cause-of-death coding in Taiwan: types of miscoding and effects on mortality statistics. Int J Epidemiol. 2000;29(2):336–43.PubMedCrossRef Lu TH, Lee MC, Chou MC. Accuracy of cause-of-death coding in Taiwan: types of miscoding and effects on mortality statistics. Int J Epidemiol. 2000;29(2):336–43.PubMedCrossRef
18.
go back to reference de Lima RB, Frederes A, Marinho MF, da Cunha CC, Adair T, França EB. Investigation of garbage code deaths to improve the quality of cause-of-death in Brazil: results from a pilot study. Rev Bras Epidemiol. 2019;22:e19004.supl.3.CrossRef de Lima RB, Frederes A, Marinho MF, da Cunha CC, Adair T, França EB. Investigation of garbage code deaths to improve the quality of cause-of-death in Brazil: results from a pilot study. Rev Bras Epidemiol. 2019;22:e19004.supl.3.CrossRef
19.
go back to reference Ellingsen CL, Ebbing M, Alfsen GC, Vollset SE. Injury death certificates without specification of the circumstances leading to the fatal injury—the Norwegian Cause of Death Registry 2005–2014. Popul Health Metr. 2018;16(1):20.PubMedPubMedCentralCrossRef Ellingsen CL, Ebbing M, Alfsen GC, Vollset SE. Injury death certificates without specification of the circumstances leading to the fatal injury—the Norwegian Cause of Death Registry 2005–2014. Popul Health Metr. 2018;16(1):20.PubMedPubMedCentralCrossRef
20.
go back to reference Metcalf P, Meyer M, Suchindran C, Heiss G. Assessment of a regression method to reclassify deaths attributable to heart failure. Glob J Health Sci. 2016;9(3):p13.CrossRef Metcalf P, Meyer M, Suchindran C, Heiss G. Assessment of a regression method to reclassify deaths attributable to heart failure. Glob J Health Sci. 2016;9(3):p13.CrossRef
21.
go back to reference Danilova I, Shkolnikov VM, Jdanov DA, Meslé F, Vallin J. Identifying potential differences in cause-of-death coding practices across Russian regions. Popul Health Metr. 2016;14(1):8.PubMedPubMedCentralCrossRef Danilova I, Shkolnikov VM, Jdanov DA, Meslé F, Vallin J. Identifying potential differences in cause-of-death coding practices across Russian regions. Popul Health Metr. 2016;14(1):8.PubMedPubMedCentralCrossRef
22.
go back to reference Qaddumi JAS, Nazzal Z, Yacoub A, Mansour M. Physicians’ knowledge and practice on death certification in the North West Bank, Palestine: across sectional study. BMC Health Serv Res. 2018;18:8.PubMedPubMedCentralCrossRef Qaddumi JAS, Nazzal Z, Yacoub A, Mansour M. Physicians’ knowledge and practice on death certification in the North West Bank, Palestine: across sectional study. BMC Health Serv Res. 2018;18:8.PubMedPubMedCentralCrossRef
23.
go back to reference Madadin M, Alhumam AS, Bushulaybi NA, Alotaibi AR, Aldakhil HA, Alghamdi AY, et al. Common errors in writing the cause of death certificate in the Middle East. J Forensic Leg Med. 2019;68:101864.PubMedCrossRef Madadin M, Alhumam AS, Bushulaybi NA, Alotaibi AR, Aldakhil HA, Alghamdi AY, et al. Common errors in writing the cause of death certificate in the Middle East. J Forensic Leg Med. 2019;68:101864.PubMedCrossRef
24.
go back to reference Teixeira RA, Naghavi M, Guimarães MDC, Ishitani LH, França EB, Teixeira RA, et al. Quality of cause-of-death data in Brazil: garbage codes among registered deaths in 2000 and 2015. Rev Bras Epidemiol. 2019;22Suppl:e19002.supl.3.CrossRef Teixeira RA, Naghavi M, Guimarães MDC, Ishitani LH, França EB, Teixeira RA, et al. Quality of cause-of-death data in Brazil: garbage codes among registered deaths in 2000 and 2015. Rev Bras Epidemiol. 2019;22Suppl:e19002.supl.3.CrossRef
25.
go back to reference GBD 2019 Risk Factors Collaborators. The unfulfilled promise of prevention: the global burden of 87 risk factors, 1990–2019; a systematic analysis for the Global Burden of Disease Study 2019. Lancet Press. GBD 2019 Risk Factors Collaborators. The unfulfilled promise of prevention: the global burden of 87 risk factors, 1990–2019; a systematic analysis for the Global Burden of Disease Study 2019. Lancet Press.
26.
go back to reference GBD 2019 Demographics Collaborators. Global, regional, and national age-sex-specific fertility, mortality, and population estimates, 1950–2019: a comprehensive demographic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;in press. GBD 2019 Demographics Collaborators. Global, regional, and national age-sex-specific fertility, mortality, and population estimates, 1950–2019: a comprehensive demographic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;in press.
27.
go back to reference Phillips DE, Lozano R, Naghavi M, Atkinson C, Gonzalez-Medina D, Mikkelsen L, et al. A composite metric for assessing data on mortality and causes of death: the vital statistics performance index. Popul Health Metr. 2014;12:14.PubMedPubMedCentralCrossRef Phillips DE, Lozano R, Naghavi M, Atkinson C, Gonzalez-Medina D, Mikkelsen L, et al. A composite metric for assessing data on mortality and causes of death: the vital statistics performance index. Popul Health Metr. 2014;12:14.PubMedPubMedCentralCrossRef
28.
go back to reference Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for accurate and transparent health estimates reporting: the GATHER statement. PLOS Med. 2016;13(6):e1002056.PubMedPubMedCentralCrossRef Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for accurate and transparent health estimates reporting: the GATHER statement. PLOS Med. 2016;13(6):e1002056.PubMedPubMedCentralCrossRef
32.
go back to reference Groenewald P, Nannan N, Bourne D, Laubscher R, Bradshaw D. Identifying deaths from AIDS in South Africa. AIDS. 2005;19(2):193–201.PubMedCrossRef Groenewald P, Nannan N, Bourne D, Laubscher R, Bradshaw D. Identifying deaths from AIDS in South Africa. AIDS. 2005;19(2):193–201.PubMedCrossRef
33.
go back to reference GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017—The Lancet. Lancet. 2018;392(10159):1736–88.CrossRef GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017—The Lancet. Lancet. 2018;392(10159):1736–88.CrossRef
34.
go back to reference Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, et al. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390(10100):1151–210.CrossRef Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, et al. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390(10100):1151–210.CrossRef
35.
go back to reference Naghavi M, Richards N, Chowdhury H, Eynstone-Hinkins J, Franca E, Hegnauer M, et al. Improving the quality of cause of death data for public health policy: are all ‘garbage’ codes equally problematic? BMC Med. 2020;18(1):55.PubMedPubMedCentralCrossRef Naghavi M, Richards N, Chowdhury H, Eynstone-Hinkins J, Franca E, Hegnauer M, et al. Improving the quality of cause of death data for public health policy: are all ‘garbage’ codes equally problematic? BMC Med. 2020;18(1):55.PubMedPubMedCentralCrossRef
36.
go back to reference Global, regional, and national age–sex specific all-cause and cause-specific mortality for 240 causes of death, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. The Lancet. 2015;385(9963):117–71. Global, regional, and national age–sex specific all-cause and cause-specific mortality for 240 causes of death, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. The Lancet. 2015;385(9963):117–71.
37.
go back to reference Kircher T, Anderson RE. Cause of death. Proper completion of the death certificate. JAMA. 1987;258(3):349–52.PubMedCrossRef Kircher T, Anderson RE. Cause of death. Proper completion of the death certificate. JAMA. 1987;258(3):349–52.PubMedCrossRef
38.
go back to reference Puffer RR. New approaches for epidemiologic studies of mortality statistics. Bull Pan Am Health Organ. 1989;23(4):365–83.PubMed Puffer RR. New approaches for epidemiologic studies of mortality statistics. Bull Pan Am Health Organ. 1989;23(4):365–83.PubMed
39.
go back to reference Foreman KJ, Naghavi M, Ezzati M. Improving the usefulness of US mortality data: new methods for reclassification of underlying cause of death. Popul Health Metr. 2016;14(1):14.PubMedPubMedCentralCrossRef Foreman KJ, Naghavi M, Ezzati M. Improving the usefulness of US mortality data: new methods for reclassification of underlying cause of death. Popul Health Metr. 2016;14(1):14.PubMedPubMedCentralCrossRef
40.
go back to reference Snyder ML, Love S-A, Sorlie PD, Rosamond WD, Antini C, Metcalf PA, et al. Redistribution of heart failure as the cause of death: the Atherosclerosis Risk in Communities Study. Popul Health Metr. 2014;12(1):10.PubMedPubMedCentralCrossRef Snyder ML, Love S-A, Sorlie PD, Rosamond WD, Antini C, Metcalf PA, et al. Redistribution of heart failure as the cause of death: the Atherosclerosis Risk in Communities Study. Popul Health Metr. 2014;12(1):10.PubMedPubMedCentralCrossRef
41.
go back to reference Stevens GA, King G, Shibuya K. Deaths from heart failure: using coarsened exact matching to correct cause-of-death statistics. Popul Health Metr. 2010;8(1):6.PubMedPubMedCentralCrossRef Stevens GA, King G, Shibuya K. Deaths from heart failure: using coarsened exact matching to correct cause-of-death statistics. Popul Health Metr. 2010;8(1):6.PubMedPubMedCentralCrossRef
42.
go back to reference Murray CJL, Dias RH, Kulkarni SC, Lozano R, Stevens GA, Ezzati M. Improving the comparability of diabetes mortality statistics in the U.S. and Mexico. Diabetes Care. 2008;31(3):451–8.PubMedCrossRef Murray CJL, Dias RH, Kulkarni SC, Lozano R, Stevens GA, Ezzati M. Improving the comparability of diabetes mortality statistics in the U.S. and Mexico. Diabetes Care. 2008;31(3):451–8.PubMedCrossRef
43.
go back to reference Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B Methodol. 1996;58(1):267–88. Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B Methodol. 1996;58(1):267–88.
44.
45.
go back to reference Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.CrossRef Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.CrossRef
46.
go back to reference Fullman N, Yearwood J, Abay SM, Abbafati C, Abd-Allah F, Abdela J, et al. Measuring performance on the Healthcare Access and Quality Index for 195 countries and territories and selected subnational locations: a systematic analysis from the Global Burden of Disease Study 2016. The Lancet. 2018;391(10136):2236–71.CrossRef Fullman N, Yearwood J, Abay SM, Abbafati C, Abd-Allah F, Abdela J, et al. Measuring performance on the Healthcare Access and Quality Index for 195 countries and territories and selected subnational locations: a systematic analysis from the Global Burden of Disease Study 2016. The Lancet. 2018;391(10136):2236–71.CrossRef
50.
go back to reference Ahern RM, Lozano R, Naghavi M, Foreman K, Gakidou E, Murray CJ. Improving the public health utility of global cardiovascular mortality data: the rise of ischemic heart disease. Popul Health Metr. 2011;9(1):8.PubMedPubMedCentralCrossRef Ahern RM, Lozano R, Naghavi M, Foreman K, Gakidou E, Murray CJ. Improving the public health utility of global cardiovascular mortality data: the rise of ischemic heart disease. Popul Health Metr. 2011;9(1):8.PubMedPubMedCentralCrossRef
51.
go back to reference Suthar AB, Khalifa A, Yin S, Wenz K, Fat DM, Mills SL, et al. Evaluation of approaches to strengthen civil registration and vital statistics systems: A systematic review and synthesis of policies in 25 countries. PLOS Med. 2019;16(9):e1002929.PubMedPubMedCentralCrossRef Suthar AB, Khalifa A, Yin S, Wenz K, Fat DM, Mills SL, et al. Evaluation of approaches to strengthen civil registration and vital statistics systems: A systematic review and synthesis of policies in 25 countries. PLOS Med. 2019;16(9):e1002929.PubMedPubMedCentralCrossRef
52.
go back to reference Hart JD, Sorchik R, Bo KS, Chowdhury HR, Gamage S, Joshi R, et al. Improving medical certification of cause of death: effective strategies and approaches based on experiences from the Data for Health Initiative. BMC Med. 2020;18(1):74.PubMedCrossRef Hart JD, Sorchik R, Bo KS, Chowdhury HR, Gamage S, Joshi R, et al. Improving medical certification of cause of death: effective strategies and approaches based on experiences from the Data for Health Initiative. BMC Med. 2020;18(1):74.PubMedCrossRef
Metadata
Title
Public health utility of cause of death data: applying empirical algorithms to improve data quality
Authors
Sarah Charlotte Johnson
Matthew Cunningham
Ilse N. Dippenaar
Fablina Sharara
Eve E. Wool
Kareha M. Agesa
Chieh Han
Molly K. Miller-Petrie
Shadrach Wilson
John E. Fuller
Shelly Balassyano
Gregory J. Bertolacci
Nicole Davis Weaver
Alan D. Lopez
Christopher J. L. Murray
Mohsen Naghavi
GBD Cause of Death Collaborators
Publication date
01-12-2021
Publisher
BioMed Central
Keyword
Public Health
Published in
BMC Medical Informatics and Decision Making / Issue 1/2021
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-021-01501-1

Other articles of this Issue 1/2021

BMC Medical Informatics and Decision Making 1/2021 Go to the issue