Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identifying cirrhosis, decompensated cirrhosis and hepatocellular carcinoma in health administrative data: A validation study

  • Lauren Lapointe-Shaw ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Writing – original draft, Writing – review & editing

    lauren.lapointe.shaw@mail.utoronto.ca

    Affiliations Department of Medicine, University of Toronto, Toronto, Canada, Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada, Institute for Clinical Evaluative Sciences, Toronto, Canada

  • Firass Georgie,

    Roles Data curation, Formal analysis, Investigation, Validation, Writing – review & editing

    Affiliation McGill University Medical School, Montreal, Canada

  • David Carlone,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation Queen’s University School of Medicine, Kingston, Canada

  • Orlando Cerocchi,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation University Health Network, Toronto, Canada

  • Hannah Chung,

    Roles Data curation, Formal analysis, Investigation, Writing – review & editing

    Affiliation Institute for Clinical Evaluative Sciences, Toronto, Canada

  • Yvonne Dewit,

    Roles Data curation, Formal analysis, Investigation, Writing – review & editing

    Affiliation Institute for Clinical Evaluative Sciences, Kingston, Canada

  • Jordan J. Feld,

    Roles Conceptualization, Funding acquisition, Methodology, Writing – review & editing

    Affiliations Department of Medicine, University of Toronto, Toronto, Canada, University Health Network, Toronto, Canada

  • Laura Holder,

    Roles Data curation, Formal analysis, Investigation, Writing – review & editing

    Affiliation Institute for Clinical Evaluative Sciences, Toronto, Canada

  • Jeffrey C. Kwong,

    Roles Methodology, Resources, Writing – review & editing

    Affiliations Institute for Clinical Evaluative Sciences, Toronto, Canada, Institute for Clinical Evaluative Sciences, Kingston, Canada, Public Health Ontario, Toronto, Canada, Dalla Lana School of Public Health, Toronto, Canada, Department of Family and Community Medicine, University of Toronto, Toronto, Canada

  • Beate Sander,

    Roles Methodology, Writing – review & editing

    Affiliations Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada, Institute for Clinical Evaluative Sciences, Toronto, Canada, University Health Network, Toronto, Canada, Public Health Ontario, Toronto, Canada, Toronto Health Economics and Technology Assessment Collaborative, Toronto, Canada

  • Jennifer A. Flemming

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Supervision, Writing – review & editing

    Affiliations Institute for Clinical Evaluative Sciences, Kingston, Canada, Department of Medicine and Department of Public Health Sciences, Queen`s University, Kingston, Canada

Abstract

Background

To evaluate screening and treatment strategies, large-scale real-world data on liver disease-related outcomes are needed. We sought to validate health administrative data for identification of cirrhosis, decompensated cirrhosis and hepatocellular carcinoma among patients with known liver disease.

Methods

Primary patient data were abstracted from patients of the Toronto Center for Liver Disease with viral hepatitis (2006–2014), and all patients with liver disease from the Kingston Health Sciences Centre Hepatology Clinic (2013). We linked clinical information to health administrative data and tested a range of coding algorithms against the clinical reference standard.

Results

A total of 6,714 patients had primary chart data abstracted. A single physician visit code for cirrhosis was sensitive (98–99%), and a single hospital diagnostic code for cirrhosis was specific (91–96%). The most sensitive algorithm for decompensated cirrhosis was one cirrhosis code with any of: a hospital diagnostic code, death code, or procedure code for decompensation (range 88–99% across groups). The most specific was one cirrhosis code and one hospital diagnostic code (range 89–98% across groups). Two physician visit codes or a single hospital diagnostic code, death code, or procedure code combined with a code for cirrhosis were sensitive and specific for hepatocellular carcinoma (sensitivity 94–96%, specificity 93–98%).

Conclusion

These sensitive and specific algorithms can be used to define patient cohorts or detect clinical outcomes using health administrative data. Our results will facilitate research into the adequacy of screening and treatment for patients with chronic viral hepatitis or other liver diseases.

Introduction

In 2016, 1.26 million people worldwide died of cirrhosis and chronic liver diseases, and their complications.[1] Hepatocellular carcinoma (HCC) mortality rates are rising faster than those from any other malignancy.[2] Globally, viral hepatitis secondary to hepatitis B and C virus infection underlies 55% of cirrhosis-related deaths and 61% of deaths from HCC.[1] Yet, many patients with liver disease remain undiagnosed, largely because they remain asymptomatic until a late stage.[3, 4] Of late, much progress has been made in the prevention and treatment of viral hepatitis. Many jurisdictions have advanced the timing of immunization against hepatitis B virus (HBV) from early adolescence to infancy.[5, 6] Further, new treatments for chronic hepatitis C virus (HCV) infection have enabled large numbers of patients to achieve sustained virologic response, a marker of long-term clinical cure.[710] Finally, our understanding of the epidemic of non-alcoholic fatty liver disease (NAFLD) in North America is just beginning and the natural history of this disease is still not completely defined.

To evaluate the epidemiology and the long-term clinical effectiveness of treatments and screening programs for chronic liver diseases, it is essential to accurately detect clinical outcomes such as cirrhosis, decompensated cirrhosis, and HCC. Although a registry of patients could be used to measure long-term treatment effects, it is ill-suited to study the overall burden or healthcare utilization related to liver disease since liver-related outcomes can take decades to occur. Longitudinal, systematically collected information from large cohorts of patients with chronic liver disease is needed.

Routinely collected health administrative data enable efficient research on real-world outcomes of patients with liver disease, while offering objective evidence of past healthcare utilization. Although several studies have validated data algorithms for identifying cirrhosis, decompensated cirrhosis, and HCC, these have been limited to International Classification of Diseases 9th revision (ICD-9) codes in the healthcare system of the United States.[1117] In Canada, all hospitalization data have been coded using the ICD-10 system since 2002 and as of as of October 2015, all hospital discharge information in the United States have also been coded using the ICD-10 system.[18, 19] While administrative data codes have been used in outcomes research[20, 21], there are no existing validation studies for liver-related outcomes in Canadian patients and no validation studies using ICD-10 codes in the United States.

The primary objective of this study was to measure the validity of combined ICD-9 and ICD-10 health administrative data codes for detecting cirrhosis, decompensated cirrhosis, and HCC in patients with known chronic liver disease. This will facilitate the study of the long-term effects of antiviral treatment or policy changes relating to this patient population. Furthermore, as the epidemiology of chronic liver diseases such as NAFLD and alcohol-related disease have a different natural history than viral hepatitis, we also aimed to assess their broader validity in a group of patients with liver diseases of all causes.

Methods

Setting

The validation cohorts consisted of patients from two different university-affiliated tertiary care hepatology clinics in Ontario, Canada: The Toronto Centre for Liver Disease (TCLD) at the University Health Network (UHN) located in Toronto and the Liver Disease Clinic at the Kingston Health Sciences Centre (KHSC) located in Kingston. Both clinics are staffed by subspecialty trained academic hepatologists and receive patient referrals for patients with acute or chronic liver diseases. At both sites, clinicians employ a standardized computerized form for clinical data entry. Patient status, including test results and treatments, are updated at every encounter. UHN and KHSC clinical records include information on patient demographics, most responsible diagnosis, laboratory data, imaging data, endoscopic reports, pathology data, non-invasive fibrosis assessment tests and results, and any hepatic decompensation events.

Administrative databases

The Institute for Clinical Evaluative Sciences holds health administrative data for all Ontario residents with provincial health insurance. Data on demographics, physician visits, emergency department visits, hospital admissions and procedures are linked using an encrypted patient identifier.[22]

The Ontario Health Insurance Program (OHIP) database contains all billing claims made by physicians.[23] The Canadian Institute for Health Information’s Discharge Abstract Database contains information for all admissions to acute care hospitals in Ontario[24], and the National Ambulatory Care Reporting System database contains information on emergency department and day surgery visits.[22] The Office of the Registrar General Death Database contains the cause of death for all deaths in the province.[25] The Ontario Cancer Registry includes detailed clinical information on malignancies such as anatomical site and tumour histology.[26]

Study population

We identified patients for inclusion in the chronic HBV and HCV cohorts using a two-step process. First, any patient followed at the TCLD with an HBV or HCV treatment status, positive HBV or HCV serology, positive HBV DNA, or positive HCV RNA, with a clinic visit between April 2006 and March 2014 were selected for further review. We then reviewed the charts to confirm chronic HBV or HCV status rather than resolved prior infection (patients with the latter were not included in the study cohorts). Patients who had evidence of ongoing infection with HBV or HCV at any time from their first clinic visit to March 31st 2014 were included in the study. The cohort of patients from KHSC comprised consecutive patients seen at the KHSC Liver Clinic from May through August 2013. Patients in the KHSC cohort had liver disease of viral and non-viral etiology. Across all groups, any patient that could not be linked to administrative data holdings was excluded from the study.

Reference standard: Clinical outcomes

Outcomes evaluated in this study included cirrhosis, decompensated cirrhosis, and hepatocellular carcinoma. Two trained medical graduates (FG at TCLD and DC at KHSC) reviewed the charts to identify clinical outcomes.

Decompensated cirrhosis was identified based on the presence of any of the following in the clinical record: ascites, bleeding varices, encephalopathy, use of spironolactone without alternative indication, or explicit mention of decompensated cirrhosis. Cirrhosis was identified based on any of the decompensated cirrhosis criteria, or explicit mention of cirrhosis, non-bleeding varices, or use of nadolol without alternative indication. In addition, cirrhotic appearance on ultrasound, a liver biopsy result of F4 fibrosis, or a non-invasive test result consistent with F4 fibrosis were also considered diagnostic of cirrhosis. Hepatocellular carcinoma was identified based on explicit mention anywhere in the clinical note. Uncertain cases were reviewed and classified by a hepatologist (JJF at UHN or JAF at KHSC).

A 5% random sample of charts was re-abstracted by a general internist at TCLD (LLS) and a hepatologist at KHSC (JAF). Agreement beyond chance on the outcome ascertainment by both abstracters was measured using Cohen’s kappa.

Administrative data outcomes

The primary outcomes of cirrhosis, decompensated cirrhosis and HCC were defined using relevant physician visit, emergency department visit, hospital diagnosis, procedure, death and pathology codes (Table 1). A secondary outcome of 2-year all-cause mortality following last clinic visit was reported as an overall measure of patient severity of illness.

thumbnail
Table 1. Administrative data codes used to identify cirrhosis, decompensated cirrhosis and hepatocellular carcinoma.

OHIP = Ontario Health Insurance Plan, ICD-9 = International Classification of Diseases, 9th Revision, ICD-10 = = International Classification of Diseases, 10th Revision, CCP = Canadian Classification of Diagnostic, Therapeutic, and Surgical Procedures, CCI = Canadian Classification of Health Interventions.

https://doi.org/10.1371/journal.pone.0201120.t001

We built a series of diagnostic algorithms for each outcome ranging from simple (e.g. a single physician visit code) to more complex. This was done to identify the most parsimonious algorithms that were also highly sensitive and/or specific. We aimed to utilize all available health administrative data, and include both hospital-based and outpatient physician visits in our algorithms. Data sources were searched for relevant codes ten years prior to two years following the date of last clinical assessment (up to March 31st, 2014 for TCLD and August 31st, 2013 for KHSC).

Cirrhosis algorithms ranged from cirrhosis codes only to combinations with codes for chronic liver disease or any complication (decompensation events, HCC or liver transplant). Algorithms were combined in such a way as to make them more sensitive (“or” combinations) or specific (“and” combinations). As physician visit codes were noted to be less specific, we aimed to increase specificity by combining two or more such codes with hospitalization codes.

Decompensation algorithms included hospital diagnostic and death codes for portal hypertension, hepatorenal syndrome, jaundice, hepatic coma, hepatic failure, bleeding esophageal varices, gastric varices (bleeding not specified), and ascites. There were no physician visit codes available for decompensation events. We included procedure codes for endoscopy or insertion of Sengstaken tube for upper gastrointestinal bleeding, transjugular intrahepatic portosystemic shunt, and paracentesis. Since several of these procedures could occur for reasons other than decompensated cirrhosis (such as bleeding from an ulcer, or ascites secondary to an extra-hepatic malignancy), we tested combinations of procedure codes with a cirrhosis code from a physician visit.

Hepatocellular carcinoma algorithms ranged from simple (a single physician visit code) to more complex. We tested several combinations in order to optimise both sensitivity and specificity. We combined physician visit codes, hospital diagnostic codes, and cause of death codes. Further, we included procedure codes for radiofrequency ablation. Since this procedure can also be used to ablate tumours outside the liver (e.g., renal tumours), we combined ablation codes with an outpatient code for cirrhosis. Finally, we tested our results with and without anatomical and pathology codes from the Ontario Cancer Registry.

Analysis

Characteristics of patients in each validation cohort (HBV and HCV patients from TCLD, patients from KHSC) and 2-year mortality were described using univariate statistics.

We tested the performance of administrative data algorithms for cirrhosis, decompensated cirrhosis, and HCC against the clinical reference standard. Each algorithm was evaluated for sensitivity, specificity, and overall accuracy. Performance measures were reported with their 95% confidence intervals. We did not report positive and negative predictive values as these parameters are highly dependent on prevalence in the reference population, making them poorly generalizable.

Measurement of algorithm performance was performed using SAS software, version 9.4 (SAS Institute Inc., Carey, NC). This project was approved by the Research Ethics Boards of UHN and KHSC.

Results

From April 2006 to March 2014, there were 3,502 patients with chronic HBV and 2,956 patients with chronic HCV seen at TCLD, of which 3,381 (97%) with HBV and 2,891 (98%) with HCV could be linked to administrative data. From May to August 2013, there were 444 patients seen at the KHSC Liver Clinic, of which 442 (99.5%) could be linked to administrative data. The most common causes of liver disease in KHSC patients were: HCV in 199 (45%), NAFLD in 49 (11%), alcohol-related liver disease in 37 (8%), autoimmune liver disease in 36 (8%) and HBV infection in 34 (8%).

The characteristics of TCLD patients with HBV or HCV and all KHSC patients are presented in Table 2. The patients in the validation cohorts were, on average, middle-aged (median age 48–57 years), more likely to be male (57–60%), frequently low-income (26–28%), and mostly urban (74–99%). Many TCLD patients were either still consuming alcohol or had done so regularly in the past (37% of HBV patients, 64% of HCV patients). Few patients had been hospitalized in the previous year (1–16%), however many KHSC patients had visited the emergency department (44%). Most patients had not undergone a liver biopsy, but had been assessed clinically, including using non-invasive fibrosis testing (44–75%).

thumbnail
Table 2. Characteristics of patients in the three validation cohorts, at the time of last clinical follow-up.

HBV = Hepatitis B Virus infection, HCV = Hepatitis C virus infection, TCLD = Toronto Centre for Liver Disease, KHSC = Kingston Health Sciences Centre, NA = not available.

https://doi.org/10.1371/journal.pone.0201120.t002

Of TCLD patients with HBV infection, 669 (19%) had cirrhosis, 99 (3%) had decompensated cirrhosis and 133 (4%) had hepatocellular carcinoma at any time during follow-up. Of the patients with HCV, 1,175 (40%), 335 (11%) and 167 (6%) had a clinical diagnosis of cirrhosis, decompensated cirrhosis, and HCC, respectively. For KHSC patients, this was 233 (53%), 93 (21%) and 25 (6%) for cirrhosis, decompensated cirrhosis, and HCC, respectively. By two years following their last clinic visit, 4% (n = 140) of TCLD patients with HBV, 8% (n = 243) of TCLD patients with HCV and 12% (n = 53) of KHSC patients had died.

At re-abstraction of charts belonging to HBV patients (n = 176) from TCLD, Cohen’s Kappa was 0.94 for cirrhosis, 1 for decompensated cirrhosis and 1 for hepatocellular carcinoma. For HCV patients (n = 148), Cohen’s kappa was 0.99 for cirrhosis, 0.92 for decompensated cirrhosis, and 1 for hepatocellular carcinoma. For KHSC patients (n = 20), there was complete agreement (kappa = 1) for the presence of cirrhosis, decompensated cirrhosis and hepatocellular carcinoma.

Algorithms for cirrhosis

A single physician visit code for cirrhosis was highly sensitive (98% for HBV, 99% for HCV and KHSC group, Table 3), while a single hospital diagnostic code for cirrhosis was specific (96% for HBV, 91% for HCV, 92% for KHSC group). Greatest specificity was achieved using a combination of a chronic liver disease code, a complication code and either 2+ physician visit codes or a single hospital diagnostic code (algorithm 13: 98% in HBV, 95% in HCV, 97% for KHSC cohort). A single hospital diagnosis code had the greatest overall accuracy in all three groups (HBV 88%, 95% CI 87–90%, HCV 84%, 95% CI 83–85%, KHSC 87%, 95% CI 84–90%).

thumbnail
Table 3. Administrative data algorithms used to identify patients with cirrhosis.

Number of patients in each group with cirrhosis (reference outcome) indicated (n) at top of column. HBV = Hepatitis B Virus infection, HCV = Hepatitis C virus infection, TCLD = Toronto Centre for Liver Disease, KHSC = Kingston Health Sciences Centre, CLD = Chronic Liver Disease, Sens = sensitivity, Spec = specificity, CI = confidence interval. 1+ = code occurs on at least one date; 2+ = code occurs on at least two separate dates.

https://doi.org/10.1371/journal.pone.0201120.t003

Algorithms for decompensated cirrhosis

The algorithm with the greatest sensitivity (88% in HBV, 92% in HCV, 99% in KHSC group), was one physician visit code for cirrhosis and any of the following: one hospital diagnostic code, death code, or procedure code (Table 4). For the HBV and HCV groups, this algorithm also had the highest overall accuracy (95%, CI 94–96% in HBV group; 88%, CI 87–90% in HCV group). The most specific algorithm was one physician visit code for cirrhosis and one hospital diagnostic code for decompensation (95% in HBV, 88% in HCV, 79% in the KHSC group); this was also the algorithm with the greatest overall accuracy in the KHSC group (83%, 95% CI 80–87%). After excluding a single hospital diagnostic code which accounted for many false positives (ICD-10 code for portal hypertension K766), the specificity of this algorithm was further improved to 98% in HBV, 95% in HCV, and 89% in KHSC groups.

thumbnail
Table 4. Administrative data algorithms used to identify patients with decompensated cirrhosis.

Number of patients in each group with decompensated cirrhosis (reference outcome) indicated (n) at top of column. HBV = Hepatitis B Virus infection, HCV = Hepatitis C virus infection, TCLD = Toronto Centre for Liver Disease, KHSC = Kingston Health Sciences Centre, CLD = Chronic Liver Disease, Sens = sensitivity, Spec = specificity, CI = confidence interval. 1+ = code occurs on at least one date; 2+ = code occurs on at least two separate dates.

https://doi.org/10.1371/journal.pone.0201120.t004

Algorithms for hepatocellular carcinoma

The most sensitive algorithm for HCC was any of: two or more physician visit codes, a hospital diagnostic code, a death code or a procedure code with a physician visit code for cirrhosis (algorthim 9: sensitivity 96% in HBV, 97% in HCV, 96% in KHSC group, Table 5). The most specific algorithm not employing pathological data was one physician visit cirrhosis code and one procedure code (specificity 100% in both HBV and HCV, 97% in KHSC group). This compared favourably to the specificity of using the Ontario Cancer Registry (which includes pathology data), which had a specificity of 99% in HBV and HCV groups, 97% in KHSC group. The algorithm with the highest overall accuracy in the HBV (99%, CI 98–99), HCV (98%, CI 98–99%) and all-cause liver disease (96%, CI 94–98%) groups was a diagnosis in the Ontario Cancer Registry or a hospital diagnostic or death code for HCC.

thumbnail
Table 5. Administrative data algorithms used to identify patients with hepatocellular carcinoma.

Number of patients in each group with hepatocellular carcinoma (reference outcome) indicated (n) at top of column. HBV = Hepatitis B Virus infection, HCV = Hepatitis C virus infection, TCLD = Toronto Centre for Liver Disease, KHSC = Kingston Health Sciences Centre, CLD = Chronic Liver Disease, Sens = sensitivity, Spec = specificity, CI = confidence interval. 1+ = code occurs on at least one date; 2+ = code occurs on at least two separate dates.

https://doi.org/10.1371/journal.pone.0201120.t005

Discussion

We identified sensitive and specific algorithms for the identification of cirrhosis, decompensated cirrhosis and HCC in patients with HBV or HCV infection, and confirmed these findings in patients with known liver disease of other causes. While identifying an algorithm that combines optimal sensitivity and specificity is desired, this is not always possible, and thus different criteria can be used depending on the study purpose. If the goal is to identify a cohort of individuals with a condition, highly specific criteria are preferable as this maximizes the likelihood that included individuals indeed have the target condition. In contrast, when the condition is the outcome variable in a study, a more sensitive definition can be used to prevent underestimation.

In our study, the tradeoff between sensitivity and specificity was most notable for cirrhosis. In particular, outpatient physician claims for cirrhosis were very sensitive but not specific. We believe this is because, until recently, physicians in Ontario could receive additional payment for a visit with a diagnosis of cirrhosis.[27, 28] On this basis, physicians may have been more likely to correctly list this diagnosis on their billing claim, or even to “up-code” patients with borderline clinical features consistent with cirrhosis. Although algorithms for cirrhosis and decompensated cirrhosis demonstrated inverse relationships between sensitivity and specificity, we were able to identify an algorithm for HCC that was both highly sensitive and specific.

Hospitalization diagnostic data are entered by trained chart abstractors and may be more reliable than outpatient physician billing claims. Hospitalization codes were more specific than physician visit codes for the target conditions in our study. The only exception was the in-hospital diagnostic code for portal hypertension, which falsely identified several patients as having decompensated cirrhosis. Non-cirrhotic portal hypertension is covered by this diagnostic code, as is portal hypertensive gastropathy, a condition diagnosed based on non-specific endoscopic findings, raising the possibility that it might be over-diagnosed. These conditions may explain this diagnostic code’s lack of specificity.

Overall, algorithms to identify complications in HBV-infected patients tended to be less sensitive but more specific than the same algorithms, when used in the HCV or KHSC groups. The difference was largest for algorithms identifying decompensated cirrhosis. We suspect that this can be explained by lower rates of hospital-based care for HBV-infected patients, who had fewer previous hospitalizations and emergency department visits than the other two groups. Substance use and mental health issues may contribute to greater healthcare usage by patients with HCV infection.[29] The range in results obtained in our study underscores the importance of testing administrative algorithms in several different patient populations. Although sensitivity, specificity and accuracy varied across patient groups, the most sensitive, specific and accurate algorithms in each group remained the same or very similar.

Previous studies have validated administrative data algorithms for cirrhosis, decompensation and HCC in U.S. populations. As we have done, others have also included codes for decompensation events as part of their administrative data definition of cirrhosis, or have combined HCC codes with cirrhosis codes to improve algorithm performance.[11, 12] In one validation study of cirrhosis definitions, sensitivity using multiple codes was achieved at the expense of low specificity, similar to what we observed.[15] While prior studies were limited to ICD-9 codes, ours is the first study to test the performance of ICD-10 codes for the detection of liver disease outcomes. U.S. hospital data have been coded using ICD-10 since 2015.[19] The algorithms and codes we provide can now be used to identify liver disease outcomes using Canadian, U.S. and European health administrative data, as well as any data coded in the widely-used ICD-9 or ICD-10 systems.

Strengths of our study are the inclusion of inpatient and outpatient data, as well as procedures, cause of death and pathology results, all tested against a physician-confirmed reference standard. Further, we included a large number of individuals in three patient cohorts from two institutions, with different etiologies of liver disease. The consistency of the relative ranking of algorithms across patient groups suggests that the most sensitive or specific algorithms can be applied broadly to all patients with known liver disease. Finally, our study results are comprehensive as they can be used to identify all three important clinical outcomes in patients with chronic liver disease.

Our study has several limitations. First, the existence of a premium payment for physician visits in Ontario may indicate that cirrhosis billing codes were claimed more often in our study setting than they would be elsewhere. Without a premium code, one might expect that a single physician visit code for cirrhosis would have greater specificity and lower sensitivity than measured in our study. Second, the study of hepatology clinic patients has advantages and disadvantages. One advantage is that specialist physicians with expertise in evaluating patients for liver outcomes can be expected to have greater diagnostic accuracy than generalist physicians. However, patients seen in a specialist clinic are likely to have more severe disease, which can lead to spectrum bias.[30] We would expect this to be most relevant for cirrhosis, where there is a clear spectrum of disease ranging from asymptomatic to severely symptomatic, decompensated cirrhosis.

If patients in a hepatology clinic are sicker than the general population, we would expect our measured sensitivity to be higher, and specificity to be lower, than they would be in the general population. Our results are valid for patients like those seen in a hepatology clinic: that is, patients with diagnosed liver diseases. Therefore, studies validating health administrative data codes against patient data abstracted from community hospitals, primary care clinics, or the general population would be an important contribution to the literature. In the general population, our specific algorithms can be used to define cohorts, however they may not be sensitive enough to identify all cases, and as such may underestimate outcome rates. An additional caveat is that some decompensation codes (e.g. ascites) can occur without liver disease. In order to avoid misclassification when applied to the general population, these codes should be combined with liver disease or cirrhosis codes.

Conclusions

Liver diseases such as chronic HBV and HCV infection are a major cause of mortality worldwide. Administrative data can be used to identify large groups of patients with complications of chronic liver disease. We have reported the operating characteristics of algorithms for cirrhosis, decompensated cirrhosis, and hepatocellular carcinoma, using ICD-9 and ICD-10 health administrative data. We have identified sensitive and specific algorithms for each of these clinical conditions, which can be used to define patient cohorts or detect clinical outcomes. Our results will facilitate research into the adequacy of screening and treatment outcomes for patients with chronic HBV, HCV, or other liver diseases. Future research should test the performance of health administrative data codes in the general population.

Acknowledgments

This work was supported by a grant from the Physician Services Incorporated (PSI) Foundation and by the Institute for Clinical Evaluative Sciences (ICES), which is funded by an annual grant from the Ontario Ministry of Health and Long-Term Care (MOHLTC). The study sponsors did not participate in the design and conduct of the study; collection, management, analysis and interpretation of the data; preparation, review or approval of the manuscript; or the decision to submit the manuscript for publication.

LLS is supported by a CIHR Fellowship Award. JAF is supported by a Southeastern Ontario New Clinician Scientist Award. JCK is supported by a CIHR New Investigator Award and a University of Toronto Department of Family & Community Medicine Investigator Award.

The opinions, results, and conclusions reported in this paper are those of the authors and are independent from the funding sources. Parts of this material are based on data and information compiled and provided by the Canadian Institute for Health Information (CIHI) and by Cancer Care Ontario (CCO). However, the analyses, conclusions, opinions, and statement expressed herein are those of the authors, and not necessarily those of CIHI or CCO. No endorsement by ICES, MOHLTC, CIHI, or CCO is intended or should be inferred.

References

  1. 1. Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, et al. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 390(10100):1151–210. pmid:28919116
  2. 2. Ryerson AB, Eheman CR, Altekruse SF, Ward JW, Jemal A, Sherman RL, et al. Annual Report to the Nation on the Status of Cancer, 1975–2012, featuring the increasing incidence of liver cancer. Cancer. 2016;122(9):1312–37. Epub 2016/03/10. pmid:26959385; PubMed Central PMCID: PMCPMC4840031.
  3. 3. Rotermann M, Langlois K, Andonov A, Trubnikov M. Seroprevalence of hepatitis B and C virus infections: Results from the 2007 to 2009 and 2009 to 2011 Canadian Health Measures Survey. 2013.
  4. 4. Denniston MM, Klevens RM, McQuillan GM, Jiles RB. Awareness of infection, knowledge of hepatitis C, and medical follow-up among individuals testing positive for hepatitis C: National Health and Nutrition Examination Survey 2001–2008. Hepatology. 2012;55(6):1652–61. Epub 2012/01/04. pmid:22213025; PubMed Central PMCID: PMCPMC4586034.
  5. 5. Centers for Disease Control and Prevention. Implementation of Newborn Hepatitis B Vaccination—Worldwide, 2006. Morbidity and Mortality Weekly Report. 2008;57(46):1249–52. pmid:19023261
  6. 6. Centers for Disease Control and Prevention. Newborn Hepatitis B Vaccination Coverage Among Children Born January 2003—June 2005—United States. Morbidity and Mortality Weekly Report. 2008;57(30):825–8. pmid:18668021
  7. 7. Veldt BJ, Heathcote EJ, Wedemeyer H, Reichen J, Hofmann WP, Zeuzem S, et al. Sustained virologic response and clinical outcomes in patients with chronic hepatitis C and advanced fibrosis. Ann Intern Med. 2007;147(10):677–84. Epub 2007/11/21. pmid:18025443.
  8. 8. Swain MG, Lai MY, Shiffman ML, Cooksley WG, Zeuzem S, Dieterich DT, et al. A sustained virologic response is durable in patients with chronic hepatitis C treated with peginterferon alfa-2a and ribavirin. Gastroenterology. 2010;139(5):1593–601. Epub 2010/07/20. pmid:20637202.
  9. 9. Asselah T, Marcellin P, Schinazi RF. Treatment of hepatitis C virus infection with direct-acting antiviral agents: 100% cure? Liver international: official journal of the International Association for the Study of the Liver. 2018;38 Suppl 1:7–13. Epub 2018/02/11. pmid:29427484.
  10. 10. Maylin S, Martinot-Peignoux M, Moucari R, Boyer N, Ripault MP, Cazals-Hatem D, et al. Eradication of hepatitis C virus in patients successfully treated for chronic hepatitis C. Gastroenterology. 2008;135(3):821–9. Epub 2008/07/03. pmid:18593587.
  11. 11. Goldberg D, Lewis J, Halpern S, Weiner M, Lo Re V 3rd. Validation of three coding algorithms to identify patients with end-stage liver disease in an administrative database. Pharmacoepidemiol Drug Saf. 2012;21(7):765–9. Epub 2012/06/08. pmid:22674685; PubMed Central PMCID: PMCPmc4267226.
  12. 12. Goldberg DS, Lewis JD, Halpern SD, Weiner MG, Lo Re V 3rd. Validation of a coding algorithm to identify patients with hepatocellular carcinoma in an administrative database. Pharmacoepidemiol Drug Saf. 2013;22(1):103–7. Epub 2012/11/06. pmid:23124932; PubMed Central PMCID: PMCPmc3540172.
  13. 13. Kramer JR, Davila JA, Miller ED, Richardson P, Giordano TP, El-Serag HB. The validity of viral hepatitis and chronic liver disease diagnoses in Veterans Affairs administrative databases. Aliment Pharmacol Ther. 2008;27(3):274–82. pmid:17996017.
  14. 14. Lo Re V 3rd, Lim JK, Goetz MB, Tate J, Bathulapalli H, Klein MB, et al. Validity of diagnostic codes and liver-related laboratory abnormalities to identify hepatic decompensation events in the Veterans Aging Cohort Study. Pharmacoepidemiol Drug Saf. 2011;20(7):689–99. Epub 2011/06/01. pmid:21626605; PubMed Central PMCID: PMCPmc3131229.
  15. 15. Nehra MS, Ma Y, Clark C, Amarasingham R, Rockey DC, Singal AG. Use of administrative claims data for identifying patients with cirrhosis. J Clin Gastroenterol. 2013;47(5):e50–4. Epub 2012/10/24. pmid:23090041; PubMed Central PMCID: PMCPmc3556340.
  16. 16. Niu B, Forde KA, Goldberg DS. Coding algorithms for identifying patients with cirrhosis and hepatitis B or C virus using administrative data. Pharmacoepidemiol Drug Saf. 2015;24(1):107–11. Epub 2014/10/23. pmid:25335773; PubMed Central PMCID: PMCPmc4293241.
  17. 17. Sada Y, Hou J, Richardson P, El-Serag H, Davila J. Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing. Med Care. 2016;54(2):e9–14. pmid:23929403; PubMed Central PMCID: PMCNIHMS512083 [Available on 02/01/17].
  18. 18. Canadian Institute for Health Information. Discharge Abstract Database Metadata 2017 [cited 2017 November 30]. Available from: https://www.cihi.ca/en/discharge-abstract-database-metadata.
  19. 19. Centers for Disease Control and Prevention. International Classification of Diseases, (ICD-10-CM/PCS) Transition—Background 2015 [cited 2018 January 26]. Available from: https://www.cdc.gov/nchs/icd/icd10cm_pcs_background.htm.
  20. 20. Thein HH, Campitelli MA, Yeung LT, Zaheen A, Yoshida EM, Earle CC. Improved Survival in Patients with Viral Hepatitis-Induced Hepatocellular Carcinoma Undergoing Recommended Abdominal Ultrasound Surveillance in Ontario: A Population-Based Retrospective Cohort Study. PLoS One. 2015;10(9):e0138907. Epub 2015/09/24. pmid:26398404; PubMed Central PMCID: PMCPmc4580446.
  21. 21. Tonelli M, Wiebe N, Fortin M, Guthrie B, Hemmelgarn BR, James MT, et al. Methods for identifying 30 chronic conditions: application to administrative data. BMC Med Inform Decis Mak. 2015;15:31. Epub 2015/04/18. pmid:25886580; PubMed Central PMCID: PMCPmc4415341.
  22. 22. Institute for Clinical Evaluative Sciences. Improving Healthcare Data in Ontario. Toronto: 2005.
  23. 23. Shah BR, Hux JE, Laupacis A, Zinman B, Cauch-Dudek K, Booth GL. Administrative data algorithms can describe ambulatory physician utilization. Health Serv Res. 2007;42(4):1783–96. Epub 2007/07/06. pmid:17610448; PubMed Central PMCID: PMCPmc1955277.
  24. 24. Juurlink DN, Preyra C, Croxford R, Chong A, Austin PC, Tu JV, et al. Canadian Institute for Health Information Discharge Abstract Database: A Validation Study. Toronto, Ontario: Institute for Clinical Evaluative Sciences, 2006.
  25. 25. Iron K, Zagorski BM, Sykora K, Manuel DG. Living and Dying in Ontario: An Opportunity for Improved Health Information. Toronto: Institute for Clinical Evaluative Sciences, 2008.
  26. 26. Hall S, Schulze K, Groome P, Mackillop W, Holowaty E. Using cancer registry data for survival studies: the example of the Ontario Cancer Registry. J Clin Epidemiol. 2006;59(1):67–76. Epub 2005/12/20. pmid:16360563.
  27. 27. Ministry of Health and Long Term Care. Schedule of Benefits. Toronto, Ontario: Province of Ontario; 2013.
  28. 28. Physician Services Committee. Chronic Disease Assessment Premium: Code E078 http://www.ontla.on.ca/library/repository/ser/274754/2007v05no02.pdf: Legislative Assembly of Ontario; 2007 [cited 2018 June 19].
  29. 29. Rifai MA, Gleason OC, Sabouni D. Psychiatric Care of the Patient With Hepatitis C: A Review of the Literature. Prim Care Companion J Clin Psychiatry. 2010;12(6):PCC.09r00877. PubMed PMID: PMC3067984. pmid:21494349
  30. 30. Montori VM, Wyer P, Newman TB, Keitz S, Guyatt G. Tips for learners of evidence-based medicine: 5. The effect of spectrum of disease on the performance of diagnostic tests. CMAJ. 2005;173(4):385–90. Epub 2005/08/17. pmid:16103513; PubMed Central PMCID: PMCPMC1188226.