Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2009

Open Access 01-12-2009 | Research article

Measuring agreement of administrative data with chart data using prevalence unadjusted and adjusted kappa

Authors: Guanmin Chen, Peter Faris, Brenda Hemmelgarn, Robin L Walker, Hude Quan

Published in: BMC Medical Research Methodology | Issue 1/2009

Login to get access

Abstract

Background

Kappa is commonly used when assessing the agreement of conditions with reference standard, but has been criticized for being highly dependent on the prevalence. To overcome this limitation, a prevalence-adjusted and bias-adjusted kappa (PABAK) has been developed. The purpose of this study is to demonstrate the performance of Kappa and PABAK, and assess the agreement between hospital discharge administrative data and chart review data conditions.

Methods

The agreement was compared for random sampling, restricted sampling by conditions, and case-control sampling from the four teaching hospitals in Alberta, Canada from ICD10 administrative data during January 1, 2003 and June 30, 2003. A total of 4,008 hospital discharge records and chart view, linked for personal unique identifier and admission date, for 32 conditions of random sampling were analyzed. The restricted sample for hypertension, myocardial infarction and congestive heart failure, and case-control sample for those three conditions were extracted from random sample. The prevalence, kappa, PABAK, positive agreement, negative agreement for the condition was compared for each of three samples.

Results

The prevalence of each condition was highly dependent on the sampling method, and this variation in prevalence had a significant effect on both kappa and PABAK. PABAK values were obviously high for certain conditions with low kappa values. The gap between these two statistical values for the same condition narrowed as the prevalence of the condition approached 50%.

Conclusion

Kappa values varied more widely than PABAK values across the 32 conditions. PABAK values should usually not be interpreted as measuring the same agreement as kappa in administrative data, particular for the condition with low prevalence. There is no single statistic measuring agreement that captures the desired information for validity of administrative data. Researchers should report kappa, the prevalence, positive agreement, negative agreement, and the relative frequency in each cell (i.e. a, b, c and d) to enable the reader to judge the validity of administrative data from multiple aspects.
Appendix
Available only for authorised users
Literature
1.
go back to reference Scott WA: Reliability of content analysis: The case of nominal scale coding. Public Opinion Quart. 1955, 19: 321-353. 10.1086/266577.CrossRef Scott WA: Reliability of content analysis: The case of nominal scale coding. Public Opinion Quart. 1955, 19: 321-353. 10.1086/266577.CrossRef
2.
go back to reference Cohen J: A coefficients of agreement for nominal scales. Edu and Psych Meas. 1960, 20: 37-46. 10.1177/001316446002000104.CrossRef Cohen J: A coefficients of agreement for nominal scales. Edu and Psych Meas. 1960, 20: 37-46. 10.1177/001316446002000104.CrossRef
3.
go back to reference Cicchetti DV, Feinstein AR: High agreement but low kappa: II. Resolving the paradoxes. J Clin Epidemiol. 1990, 43 (6): 551-558. 10.1016/0895-4356(90)90159-M.CrossRefPubMed Cicchetti DV, Feinstein AR: High agreement but low kappa: II. Resolving the paradoxes. J Clin Epidemiol. 1990, 43 (6): 551-558. 10.1016/0895-4356(90)90159-M.CrossRefPubMed
4.
go back to reference Feinstein AR, Cicchetti DV: High agreement but low kappa: I. The problems of two paradoxes. J Clin Epidemiol. 1990, 43 (6): 543-549. 10.1016/0895-4356(90)90158-L.CrossRefPubMed Feinstein AR, Cicchetti DV: High agreement but low kappa: I. The problems of two paradoxes. J Clin Epidemiol. 1990, 43 (6): 543-549. 10.1016/0895-4356(90)90158-L.CrossRefPubMed
5.
go back to reference Bloch DA, Kraemer HC: 2 × 2 kappa coefficients: measures of agreement or association. Biometrics. 1989, 45 (1): 269-287. 10.2307/2532052.CrossRefPubMed Bloch DA, Kraemer HC: 2 × 2 kappa coefficients: measures of agreement or association. Biometrics. 1989, 45 (1): 269-287. 10.2307/2532052.CrossRefPubMed
6.
go back to reference Hoehler FK: Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol. 2000, 53 (5): 499-503. 10.1016/S0895-4356(99)00174-2.CrossRefPubMed Hoehler FK: Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol. 2000, 53 (5): 499-503. 10.1016/S0895-4356(99)00174-2.CrossRefPubMed
7.
go back to reference Lantz CA, Nebenzahl E: Behavior and interpretation of the kappa statistic: resolution of the two paradoxes. J Clin Epidemiol. 1996, 49 (4): 431-434. 10.1016/0895-4356(95)00571-4.CrossRefPubMed Lantz CA, Nebenzahl E: Behavior and interpretation of the kappa statistic: resolution of the two paradoxes. J Clin Epidemiol. 1996, 49 (4): 431-434. 10.1016/0895-4356(95)00571-4.CrossRefPubMed
8.
go back to reference Soeken KL, Prescott PA: Issues in the use of kappa to estimate reliability. Med Care. 1986, 24 (8): 733-741. 10.1097/00005650-198608000-00008.CrossRefPubMed Soeken KL, Prescott PA: Issues in the use of kappa to estimate reliability. Med Care. 1986, 24 (8): 733-741. 10.1097/00005650-198608000-00008.CrossRefPubMed
9.
go back to reference Byrt T, Bishop J, Carlin JB: Bias, prevalence and kappa. J Clin Epidemiol. 1993, 46 (5): 423-429. 10.1016/0895-4356(93)90018-V.CrossRefPubMed Byrt T, Bishop J, Carlin JB: Bias, prevalence and kappa. J Clin Epidemiol. 1993, 46 (5): 423-429. 10.1016/0895-4356(93)90018-V.CrossRefPubMed
10.
go back to reference Mak HKF, Yau KKW, Chan BPL: Prevalence-adjusted Bias-adjusted {kappa} Values as Additional Indicators to Measure Observer Agreement [letter]. Radiology. 2004, 232 (1): 302-a-303. 10.1148/radiol.2321031974.CrossRef Mak HKF, Yau KKW, Chan BPL: Prevalence-adjusted Bias-adjusted {kappa} Values as Additional Indicators to Measure Observer Agreement [letter]. Radiology. 2004, 232 (1): 302-a-303. 10.1148/radiol.2321031974.CrossRef
11.
go back to reference Mak HKF, Yau KKW, Khong P-L, Ching ASC, Cheng P-W, Au-Yeung PKM, Pang PKM, Wong KCW, Chan BPL: Hypodensity of > 1/3 Middle Cerebral Artery Territory Versus Alberta Stroke Programme Early CT Score (ASPECTS): Comparison of Two Methods of Quantitative Evaluation of Early CT Changes in Hyperacute Ischemic Stroke in the Community Setting. Stroke. 2003, 34 (5): 1194-1196. 10.1161/01.STR.0000069162.64966.71.CrossRefPubMed Mak HKF, Yau KKW, Khong P-L, Ching ASC, Cheng P-W, Au-Yeung PKM, Pang PKM, Wong KCW, Chan BPL: Hypodensity of > 1/3 Middle Cerebral Artery Territory Versus Alberta Stroke Programme Early CT Score (ASPECTS): Comparison of Two Methods of Quantitative Evaluation of Early CT Changes in Hyperacute Ischemic Stroke in the Community Setting. Stroke. 2003, 34 (5): 1194-1196. 10.1161/01.STR.0000069162.64966.71.CrossRefPubMed
12.
go back to reference Wapenaar W, Barkema HW, VanLeeuwen JA, McClure JT, O'Handley RM, Kwok OCH, Thulliez P, Dubey JP, Jenkins MC: Comparison of serological methods for the diagnosis of Neospora caninum infection in cattle. Veterinary Parasitology. 2007, 143 (2): 166-173. 10.1016/j.vetpar.2006.08.007.CrossRefPubMed Wapenaar W, Barkema HW, VanLeeuwen JA, McClure JT, O'Handley RM, Kwok OCH, Thulliez P, Dubey JP, Jenkins MC: Comparison of serological methods for the diagnosis of Neospora caninum infection in cattle. Veterinary Parasitology. 2007, 143 (2): 166-173. 10.1016/j.vetpar.2006.08.007.CrossRefPubMed
13.
go back to reference Garbers S, Chiasson MA: Patterns of agreement on breast cancer screening knowledge and practices among women in Dominican and Mexican families in New York City. Med Sci Monit. 2004, 10 (11): CR628-634.PubMed Garbers S, Chiasson MA: Patterns of agreement on breast cancer screening knowledge and practices among women in Dominican and Mexican families in New York City. Med Sci Monit. 2004, 10 (11): CR628-634.PubMed
14.
go back to reference Thomsen PT, Baadsgaard NP: Intra- and inter-observer agreement of a protocol for clinical examination of dairy cows. Preventive Veterinary Medicine. 2006, 75 (1–2): 133-139. 10.1016/j.prevetmed.2006.02.004.CrossRefPubMed Thomsen PT, Baadsgaard NP: Intra- and inter-observer agreement of a protocol for clinical examination of dairy cows. Preventive Veterinary Medicine. 2006, 75 (1–2): 133-139. 10.1016/j.prevetmed.2006.02.004.CrossRefPubMed
15.
go back to reference Cibere J, Bellamy N, Thorne A, Esdaile JM, McGorm KJ, Chalmers A, Huang S, Peloso P, Shojania K, Singer J, et al: Reliability of the knee examination in osteoarthritis: Effect of standardization. Arthritis & Rheumatism. 2004, 50 (2): 458-468. 10.1002/art.20025.CrossRef Cibere J, Bellamy N, Thorne A, Esdaile JM, McGorm KJ, Chalmers A, Huang S, Peloso P, Shojania K, Singer J, et al: Reliability of the knee examination in osteoarthritis: Effect of standardization. Arthritis & Rheumatism. 2004, 50 (2): 458-468. 10.1002/art.20025.CrossRef
16.
go back to reference Wapenaar W, Barkema HW, Schares G, Rouvinen-Watt K, Zeijlemaker L, Poorter B, O'Handley RM, Kwok OCH, Dubey JP: Evaluation of four serological techniques to determine the seroprevalence of Neospora caninum in foxes (Vulpes vulpes) and coyotes (Canis latrans) on Prince Edward Island, Canada. Veterinary Parasitology. 2007, 145 (1–2): 51-58. 10.1016/j.vetpar.2006.12.002.CrossRefPubMed Wapenaar W, Barkema HW, Schares G, Rouvinen-Watt K, Zeijlemaker L, Poorter B, O'Handley RM, Kwok OCH, Dubey JP: Evaluation of four serological techniques to determine the seroprevalence of Neospora caninum in foxes (Vulpes vulpes) and coyotes (Canis latrans) on Prince Edward Island, Canada. Veterinary Parasitology. 2007, 145 (1–2): 51-58. 10.1016/j.vetpar.2006.12.002.CrossRefPubMed
17.
go back to reference Vania Reis Girianelli LCST: Evaluation of agreement between conventional and liquid-based cytology in cervical cancer early detection based on analysis of 2,091 smears: Experience at the Brazilian National Cancer Institute. Diagnostic Cytopathology. 2007, 35 (9): 545-549. 10.1002/dc.20699.CrossRefPubMed Vania Reis Girianelli LCST: Evaluation of agreement between conventional and liquid-based cytology in cervical cancer early detection based on analysis of 2,091 smears: Experience at the Brazilian National Cancer Institute. Diagnostic Cytopathology. 2007, 35 (9): 545-549. 10.1002/dc.20699.CrossRefPubMed
18.
go back to reference Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi JC, Saunders LD, Beck CA, Feasby TE, Ghali WA: Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005, 43 (11): 1130-1139. 10.1097/01.mlr.0000182534.19832.83.CrossRefPubMed Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi JC, Saunders LD, Beck CA, Feasby TE, Ghali WA: Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005, 43 (11): 1130-1139. 10.1097/01.mlr.0000182534.19832.83.CrossRefPubMed
19.
go back to reference Bernstein CN, Blanchard JF, Rawsthorne P, Wajda A: Epidemiology of Crohn's disease and ulcerative colitis in a central Canadian province: a population-based study. Am J Epidemiol. 1999, 149 (10): 916-924.CrossRefPubMed Bernstein CN, Blanchard JF, Rawsthorne P, Wajda A: Epidemiology of Crohn's disease and ulcerative colitis in a central Canadian province: a population-based study. Am J Epidemiol. 1999, 149 (10): 916-924.CrossRefPubMed
20.
go back to reference Humphries KH, Rankin JM, Carere RG, Buller CE, Kiely FM, Spinelli JJ: Co-morbidity data in outcomes research: are clinical data derived from administrative databases a reliable alternative to chart review?. J Clin Epidemiol. 2000, 53 (4): 343-349. 10.1016/S0895-4356(99)00188-2.CrossRefPubMed Humphries KH, Rankin JM, Carere RG, Buller CE, Kiely FM, Spinelli JJ: Co-morbidity data in outcomes research: are clinical data derived from administrative databases a reliable alternative to chart review?. J Clin Epidemiol. 2000, 53 (4): 343-349. 10.1016/S0895-4356(99)00188-2.CrossRefPubMed
21.
go back to reference Kokotailo RA, Hill MD: Coding of stroke and stroke risk factors using international classification of diseases, revisions 9 and 10. Stroke. 2005, 36 (8): 1776-1781. 10.1161/01.STR.0000174293.17959.a1.CrossRefPubMed Kokotailo RA, Hill MD: Coding of stroke and stroke risk factors using international classification of diseases, revisions 9 and 10. Stroke. 2005, 36 (8): 1776-1781. 10.1161/01.STR.0000174293.17959.a1.CrossRefPubMed
22.
go back to reference Vach W: The dependence of Cohen's kappa on the prevalence does not matter. J Clin Epidemiol. 2005, 58 (7): 655-661. 10.1016/j.jclinepi.2004.02.021.CrossRefPubMed Vach W: The dependence of Cohen's kappa on the prevalence does not matter. J Clin Epidemiol. 2005, 58 (7): 655-661. 10.1016/j.jclinepi.2004.02.021.CrossRefPubMed
23.
go back to reference Whiteburst JA: Interrater agreement for journal manuscript reviews. Am Psychologist. 1984, 39: 22-28. 10.1037/0003-066X.39.1.22.CrossRef Whiteburst JA: Interrater agreement for journal manuscript reviews. Am Psychologist. 1984, 39: 22-28. 10.1037/0003-066X.39.1.22.CrossRef
24.
go back to reference Agresti A, Ghosh A: Raking Kappa: Describing Potential Impact of Marginal Distributions on Measures of Agreement. Biometrical Journal. 1995, 37 (7): 811-820. 10.1002/bimj.4710370705.CrossRef Agresti A, Ghosh A: Raking Kappa: Describing Potential Impact of Marginal Distributions on Measures of Agreement. Biometrical Journal. 1995, 37 (7): 811-820. 10.1002/bimj.4710370705.CrossRef
25.
go back to reference CIHI: Canadian Coding Standards for ICD-10-CA and CCI for 2007. 2007, Ottawa: Canadian Institute of Health Information CIHI: Canadian Coding Standards for ICD-10-CA and CCI for 2007. 2007, Ottawa: Canadian Institute of Health Information
26.
go back to reference Sim J, Wright CC: The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005, 85 (3): 257-268.PubMed Sim J, Wright CC: The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005, 85 (3): 257-268.PubMed
27.
go back to reference Cohen J: Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psych Bull. 1968, 70: 213-220. 10.1037/h0026256.CrossRef Cohen J: Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psych Bull. 1968, 70: 213-220. 10.1037/h0026256.CrossRef
28.
go back to reference Fleiss JL, Cohen J, Davies M: Large sample standard errors of kappa and weighted kappa. Psych Bull. 1969, 72: 323-327. 10.1037/h0028106.CrossRef Fleiss JL, Cohen J, Davies M: Large sample standard errors of kappa and weighted kappa. Psych Bull. 1969, 72: 323-327. 10.1037/h0028106.CrossRef
29.
go back to reference Landis RJ, Koch GG: The measurement of observer agreement for categorical data. Biometrics. 1977, 33: 159-174. 10.2307/2529310.CrossRefPubMed Landis RJ, Koch GG: The measurement of observer agreement for categorical data. Biometrics. 1977, 33: 159-174. 10.2307/2529310.CrossRefPubMed
30.
go back to reference Fleiss JL: Measuring nominal scale agreement among many raters. Psych Bull. 1971, 76: 378-382. 10.1037/h0031619.CrossRef Fleiss JL: Measuring nominal scale agreement among many raters. Psych Bull. 1971, 76: 378-382. 10.1037/h0031619.CrossRef
31.
go back to reference Landis RJ, Koch GG: A one-way components of variance model for categorical data. Biometrics. 1977, 33: 671-679. 10.2307/2529465.CrossRef Landis RJ, Koch GG: A one-way components of variance model for categorical data. Biometrics. 1977, 33: 671-679. 10.2307/2529465.CrossRef
32.
go back to reference Barlow W: Measurement of interrater agreement with adjustment for covariates. Biometrics. 1996, 52 (2): 695-702. 10.2307/2532907.CrossRefPubMed Barlow W: Measurement of interrater agreement with adjustment for covariates. Biometrics. 1996, 52 (2): 695-702. 10.2307/2532907.CrossRefPubMed
33.
go back to reference Quan H, Parsons GA, Ghali WA: Validity of procedure codes in International Classification of Diseases, 9th revision, clinical modification administrative data. Med Care. 2004, 42 (8): 801-809. 10.1097/01.mlr.0000132391.59713.0d.CrossRefPubMed Quan H, Parsons GA, Ghali WA: Validity of procedure codes in International Classification of Diseases, 9th revision, clinical modification administrative data. Med Care. 2004, 42 (8): 801-809. 10.1097/01.mlr.0000132391.59713.0d.CrossRefPubMed
Metadata
Title
Measuring agreement of administrative data with chart data using prevalence unadjusted and adjusted kappa
Authors
Guanmin Chen
Peter Faris
Brenda Hemmelgarn
Robin L Walker
Hude Quan
Publication date
01-12-2009
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2009
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-9-5

Other articles of this Issue 1/2009

BMC Medical Research Methodology 1/2009 Go to the issue