Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2012

Open Access 01-12-2012 | Research article

Interpreting the concordance statistic of a logistic regression model: relation to the variance and odds ratio of a continuous explanatory variable

Authors: Peter C Austin, Ewout W Steyerberg

Published in: BMC Medical Research Methodology | Issue 1/2012

Login to get access

Abstract

Background

When outcomes are binary, the c-statistic (equivalent to the area under the Receiver Operating Characteristic curve) is a standard measure of the predictive accuracy of a logistic regression model.

Methods

An analytical expression was derived under the assumption that a continuous explanatory variable follows a normal distribution in those with and without the condition. We then conducted an extensive set of Monte Carlo simulations to examine whether the expressions derived under the assumption of binormality allowed for accurate prediction of the empirical c-statistic when the explanatory variable followed a normal distribution in the combined sample of those with and without the condition. We also examine the accuracy of the predicted c-statistic when the explanatory variable followed a gamma, log-normal or uniform distribution in combined sample of those with and without the condition.

Results

Under the assumption of binormality with equality of variances, the c-statistic follows a standard normal cumulative distribution function with dependence on the product of the standard deviation of the normal components (reflecting more heterogeneity) and the log-odds ratio (reflecting larger effects). Under the assumption of binormality with unequal variances, the c-statistic follows a standard normal cumulative distribution function with dependence on the standardized difference of the explanatory variable in those with and without the condition. In our Monte Carlo simulations, we found that these expressions allowed for reasonably accurate prediction of the empirical c-statistic when the distribution of the explanatory variable was normal, gamma, log-normal, and uniform in the entire sample of those with and without the condition.

Conclusions

The discriminative ability of a continuous explanatory variable cannot be judged by its odds ratio alone, but always needs to be considered in relation to the heterogeneity of the population.
Appendix
Available only for authorised users
Literature
1.
2.
go back to reference Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al: Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010, 21: 128-138. 10.1097/EDE.0b013e3181c30fb2.CrossRefPubMedPubMedCentral Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al: Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010, 21: 128-138. 10.1097/EDE.0b013e3181c30fb2.CrossRefPubMedPubMedCentral
3.
4.
go back to reference Hanley JA, McNeil BJ: The meaning and use of the area under a Receiver Operating Characteristic (ROC) curve. Radiology. 1982, 143: 29-36.CrossRefPubMed Hanley JA, McNeil BJ: The meaning and use of the area under a Receiver Operating Characteristic (ROC) curve. Radiology. 1982, 143: 29-36.CrossRefPubMed
5.
go back to reference Bamber D: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol. 1975, 12: 387-415. 10.1016/0022-2496(75)90001-2.CrossRef Bamber D: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol. 1975, 12: 387-415. 10.1016/0022-2496(75)90001-2.CrossRef
6.
go back to reference Demler OV, Pencina MJ, D’Agostino RB: Equivalence of improvement in area under ROC curve and linear discriminant analysis coefficient under assumption of normality. Statistics in Medicine. 2011, 30: 1410-1418.PubMed Demler OV, Pencina MJ, D’Agostino RB: Equivalence of improvement in area under ROC curve and linear discriminant analysis coefficient under assumption of normality. Statistics in Medicine. 2011, 30: 1410-1418.PubMed
7.
go back to reference Royston P, Altman DG: Visualizing and assessing discrimination in the logistic regression model. Statistics in Medicine. 2010, 29: 2508-2520. 10.1002/sim.3994.CrossRefPubMed Royston P, Altman DG: Visualizing and assessing discrimination in the logistic regression model. Statistics in Medicine. 2010, 29: 2508-2520. 10.1002/sim.3994.CrossRefPubMed
8.
go back to reference Royston P, Thompson SG: Model-based screening by risk with application to Down’s syndrome. Statistics in Medicine. 1992, 11: 257-268. 10.1002/sim.4780110211.CrossRefPubMed Royston P, Thompson SG: Model-based screening by risk with application to Down’s syndrome. Statistics in Medicine. 1992, 11: 257-268. 10.1002/sim.4780110211.CrossRefPubMed
9.
go back to reference Deeks JJ, Macaskill P, Irwig L: The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. Journal of Clinical Epidemiology. 2005, 58: 882-893. 10.1016/j.jclinepi.2005.01.016.CrossRefPubMed Deeks JJ, Macaskill P, Irwig L: The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. Journal of Clinical Epidemiology. 2005, 58: 882-893. 10.1016/j.jclinepi.2005.01.016.CrossRefPubMed
10.
go back to reference Zhou X, Obuchowski N, McClish D: Statistical Methods in diagnostic medicine. 2002, Wiley-Interscience, New YorkCrossRef Zhou X, Obuchowski N, McClish D: Statistical Methods in diagnostic medicine. 2002, Wiley-Interscience, New YorkCrossRef
11.
go back to reference Cohen J: Statistical Power Analysis for the Behavioural Sciences. 1988, Lawrence Erlbaum Associates, Hillsdale, NJ, 2 Cohen J: Statistical Power Analysis for the Behavioural Sciences. 1988, Lawrence Erlbaum Associates, Hillsdale, NJ, 2
12.
go back to reference Flury BK, Riedwyl H: Standard distance in univariate and multivariate analysis. Am Stat. 1986, 40: 249-251. Flury BK, Riedwyl H: Standard distance in univariate and multivariate analysis. Am Stat. 1986, 40: 249-251.
13.
go back to reference Austin PC: Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples. Statistics in Medicine. 2009, 28: 3083-3107. 10.1002/sim.3697.CrossRefPubMedPubMedCentral Austin PC: Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples. Statistics in Medicine. 2009, 28: 3083-3107. 10.1002/sim.3697.CrossRefPubMedPubMedCentral
14.
go back to reference Normand ST, Landrum MB, Guadagnoli E, Ayanian JZ, Ryan TJ, Cleary PD, et al: Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. Journal of Clinical Epidemiology. 2001, 54: 387-398. 10.1016/S0895-4356(00)00321-8.CrossRefPubMed Normand ST, Landrum MB, Guadagnoli E, Ayanian JZ, Ryan TJ, Cleary PD, et al: Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. Journal of Clinical Epidemiology. 2001, 54: 387-398. 10.1016/S0895-4356(00)00321-8.CrossRefPubMed
15.
go back to reference Hosmer DW, Lemeshow S: Applied Logistic Regression. 1989, John Wiley & Sons, New York, NY Hosmer DW, Lemeshow S: Applied Logistic Regression. 1989, John Wiley & Sons, New York, NY
16.
go back to reference R Core Development Team: R: a language and environment for statistical computing. 2005, R Foundation for Statistical Computing, Vienna R Core Development Team: R: a language and environment for statistical computing. 2005, R Foundation for Statistical Computing, Vienna
17.
go back to reference Tu JV, Donovan LR, Lee DS, Wang JT, Austin PC, Alter DA, et al: Effectiveness of public report cards for improving the quality of cardiac care: the EFFECT study: a randomized trial. JAMA. 2009, 302: 2330-2337. 10.1001/jama.2009.1731.CrossRefPubMed Tu JV, Donovan LR, Lee DS, Wang JT, Austin PC, Alter DA, et al: Effectiveness of public report cards for improving the quality of cardiac care: the EFFECT study: a randomized trial. JAMA. 2009, 302: 2330-2337. 10.1001/jama.2009.1731.CrossRefPubMed
18.
go back to reference Tu JV, Donovan LR, Lee DS, Austin PC, Ko DT, Wang JT, et al: Quality of Cardiac Care in Ontario. 2004, Institute for Clinical Evaluative Sciences, Toronto, Ontario Tu JV, Donovan LR, Lee DS, Austin PC, Ko DT, Wang JT, et al: Quality of Cardiac Care in Ontario. 2004, Institute for Clinical Evaluative Sciences, Toronto, Ontario
19.
go back to reference Janssens AC, Moonesinghe R, Yang Q, Steyerberg EW, van Duijn CM, Khoury MJ: The impact of genotype frequencies on the clinical validity of genomic profiling for predicting common chronic diseases. Genetics in Medicine. 2007, 9: 528-535. 10.1097/GIM.0b013e31812eece0.CrossRefPubMed Janssens AC, Moonesinghe R, Yang Q, Steyerberg EW, van Duijn CM, Khoury MJ: The impact of genotype frequencies on the clinical validity of genomic profiling for predicting common chronic diseases. Genetics in Medicine. 2007, 9: 528-535. 10.1097/GIM.0b013e31812eece0.CrossRefPubMed
20.
go back to reference Pepe MS, Janes H, Longton G, Leisenring W, Newcomb P: Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker. Am J Epidemiol. 2004, 159: 882-890. 10.1093/aje/kwh101.CrossRefPubMed Pepe MS, Janes H, Longton G, Leisenring W, Newcomb P: Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker. Am J Epidemiol. 2004, 159: 882-890. 10.1093/aje/kwh101.CrossRefPubMed
21.
go back to reference Vergouwe Y, Moons KG, Steyerberg EW: External validity of risk models: Use of benchmark values to disentangle a case-mix effect from incorrect coefficients. Am J Epidemiol. 2010, 172: 971-980. 10.1093/aje/kwq223.CrossRefPubMedPubMedCentral Vergouwe Y, Moons KG, Steyerberg EW: External validity of risk models: Use of benchmark values to disentangle a case-mix effect from incorrect coefficients. Am J Epidemiol. 2010, 172: 971-980. 10.1093/aje/kwq223.CrossRefPubMedPubMedCentral
Metadata
Title
Interpreting the concordance statistic of a logistic regression model: relation to the variance and odds ratio of a continuous explanatory variable
Authors
Peter C Austin
Ewout W Steyerberg
Publication date
01-12-2012
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2012
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-12-82

Other articles of this Issue 1/2012

BMC Medical Research Methodology 1/2012 Go to the issue