Skip to main content
Top
Published in: EJNMMI Research 1/2011

Open Access 01-12-2011 | Original research

Computer-aided diagnosis of renal obstruction: utility of log-linear modeling versus standard ROC and kappa analysis

Authors: Amita K Manatunga, José Nilo G Binongo, Andrew T Taylor

Published in: EJNMMI Research | Issue 1/2011

Login to get access

Abstract

Background

The accuracy of computer-aided diagnosis (CAD) software is best evaluated by comparison to a gold standard which represents the true status of disease. In many settings, however, knowledge of the true status of disease is not possible and accuracy is evaluated against the interpretations of an expert panel. Common statistical approaches to evaluate accuracy include receiver operating characteristic (ROC) and kappa analysis but both of these methods have significant limitations and cannot answer the question of equivalence: Is the CAD performance equivalent to that of an expert? The goal of this study is to show the strength of log-linear analysis over standard ROC and kappa statistics in evaluating the accuracy of computer-aided diagnosis of renal obstruction compared to the diagnosis provided by expert readers.

Methods

Log-linear modeling was utilized to analyze a previously published database that used ROC and kappa statistics to compare diuresis renography scan interpretations (non-obstructed, equivocal, or obstructed) generated by a renal expert system (RENEX) in 185 kidneys (95 patients) with the independent and consensus scan interpretations of three experts who were blinded to clinical information and prospectively and independently graded each kidney as obstructed, equivocal, or non-obstructed.

Results

Log-linear modeling showed that RENEX and the expert consensus had beyond-chance agreement in both non-obstructed and obstructed readings (both p < 0.0001). Moreover, pairwise agreement between experts and pairwise agreement between each expert and RENEX were not significantly different (p = 0.41, 0.95, 0.81 for the non-obstructed, equivocal, and obstructed categories, respectively). Similarly, the three-way agreement of the three experts and three-way agreement of two experts and RENEX was not significantly different for non-obstructed (p = 0.79) and obstructed (p = 0.49) categories.

Conclusion

Log-linear modeling showed that RENEX was equivalent to any expert in rating kidneys, particularly in the obstructed and non-obstructed categories. This conclusion, which could not be derived from the original ROC and kappa analysis, emphasizes and illustrates the role and importance of log-linear modeling in the absence of a gold standard. The log-linear analysis also provides additional evidence that RENEX has the potential to assist in the interpretation of diuresis renography studies.
Literature
1.
go back to reference Li F, Engleman R, Metz CE, Doi K, MacMahon H: Lung cancers missed on chest radiographs: Results obtained with a commercial computer-aided detection program. Radiology 2008, 246: 273–280.PubMedCrossRef Li F, Engleman R, Metz CE, Doi K, MacMahon H: Lung cancers missed on chest radiographs: Results obtained with a commercial computer-aided detection program. Radiology 2008, 246: 273–280.PubMedCrossRef
2.
go back to reference Taylor SA, Charmin SC, Lefere P, McFarland EG, Paulson EK, Yee J, Aslam R, Barlow JM, Gupta A, Kim DH, Miller CM, Halligan S: CT Colonography: Investigation of the optimum reader paradigm by using computer-aided detection software. Radiology 2008, 246: 463–471.PubMedCrossRef Taylor SA, Charmin SC, Lefere P, McFarland EG, Paulson EK, Yee J, Aslam R, Barlow JM, Gupta A, Kim DH, Miller CM, Halligan S: CT Colonography: Investigation of the optimum reader paradigm by using computer-aided detection software. Radiology 2008, 246: 463–471.PubMedCrossRef
3.
go back to reference Iglehart J: The new era of medical imaging-progress and pitfalls. N Eng J Med 2006, 354: 2822–2828. 10.1056/NEJMhpr061219CrossRef Iglehart J: The new era of medical imaging-progress and pitfalls. N Eng J Med 2006, 354: 2822–2828. 10.1056/NEJMhpr061219CrossRef
4.
go back to reference IMV Medical information division: 2003 nuclear medicine census market summary report. Volume IV. IMV Limited, Des Plaines, IL; 2003:7–11. IMV Medical information division: 2003 nuclear medicine census market summary report. Volume IV. IMV Limited, Des Plaines, IL; 2003:7–11.
5.
go back to reference Hunsche A: A value of quantitative data in the interpretation of diuresis renography for suspected urinary tract obstruction. In Ph D thesis. Federal University of Rio Grande o Sul, Porto Alegre, Rio Grande o Sul; 2006. Hunsche A: A value of quantitative data in the interpretation of diuresis renography for suspected urinary tract obstruction. In Ph D thesis. Federal University of Rio Grande o Sul, Porto Alegre, Rio Grande o Sul; 2006.
6.
go back to reference Kupinski MA, Hoppin JW, Clarkson E, Barrett HH, Kastis GA: Estimation in medical imaging without a gold standard. Academic Radiology 2002, 9: 290–297. 10.1016/S1076-6332(03)80372-0PubMedCentralPubMedCrossRef Kupinski MA, Hoppin JW, Clarkson E, Barrett HH, Kastis GA: Estimation in medical imaging without a gold standard. Academic Radiology 2002, 9: 290–297. 10.1016/S1076-6332(03)80372-0PubMedCentralPubMedCrossRef
7.
go back to reference Kundel HL, Polansky M: Mixture distribution and receiver operating characteristic analysis of bedside chest imaging with screen-film and computed radiology. Acad Radiol 1997, 4: 1–7. 10.1016/S1076-6332(97)80152-3PubMedCrossRef Kundel HL, Polansky M: Mixture distribution and receiver operating characteristic analysis of bedside chest imaging with screen-film and computed radiology. Acad Radiol 1997, 4: 1–7. 10.1016/S1076-6332(97)80152-3PubMedCrossRef
8.
go back to reference Kung JW, Matsumoto S, Hasegawa I, Nguyen B, Toto LC, Kundel H, Hatabu H: Mixture distribution analysis of a computer assisted diagnostic method for the evaluation of pulmonary nodules on computed tomography scan. Acad Radiol 2004, 11: 281–285. 10.1016/S1076-6332(03)00717-7PubMedCrossRef Kung JW, Matsumoto S, Hasegawa I, Nguyen B, Toto LC, Kundel H, Hatabu H: Mixture distribution analysis of a computer assisted diagnostic method for the evaluation of pulmonary nodules on computed tomography scan. Acad Radiol 2004, 11: 281–285. 10.1016/S1076-6332(03)00717-7PubMedCrossRef
9.
go back to reference Nelson JC and Pepe MS: Statistical description of inter-rater variability in ordinal ratings. Statistical Methods in Medical Research 2000,9(5):475–496. 10.1191/096228000701555262CrossRef Nelson JC and Pepe MS: Statistical description of inter-rater variability in ordinal ratings. Statistical Methods in Medical Research 2000,9(5):475–496. 10.1191/096228000701555262CrossRef
10.
go back to reference Taylor A Jr, Garcia EV, Binongo J, Manatunga A, Folks RD, Dubovsky E: Diagnostic performance of an expert system for the interpretation of Tc-99 m MAG3 scans to detect renal obstruction. J Nucl Med 2008, 49: 216–224. 10.2967/jnumed.107.045484PubMedCentralPubMedCrossRef Taylor A Jr, Garcia EV, Binongo J, Manatunga A, Folks RD, Dubovsky E: Diagnostic performance of an expert system for the interpretation of Tc-99 m MAG3 scans to detect renal obstruction. J Nucl Med 2008, 49: 216–224. 10.2967/jnumed.107.045484PubMedCentralPubMedCrossRef
11.
go back to reference Chan HP, Sahiner B, Helvie MA, Petrick N, Roubidoux MA, Wilson TE, Adler DD, Paramagul C, Newman JS, and Sanjay-Gopal S: Improvement of radiologists' characterization of mammographic masses by using computer-aided diagnosis: an ROC study. Radiology 1999, 212: 817.PubMedCrossRef Chan HP, Sahiner B, Helvie MA, Petrick N, Roubidoux MA, Wilson TE, Adler DD, Paramagul C, Newman JS, and Sanjay-Gopal S: Improvement of radiologists' characterization of mammographic masses by using computer-aided diagnosis: an ROC study. Radiology 1999, 212: 817.PubMedCrossRef
12.
go back to reference Chakraborty DP, Breatnach ES, Yester MV, Soto B, Barnes GT, and Fraser RG: Digital and conventional chest imaging: a modified ROC study of observer performance using simulated nodules. Radiology 1986, 158: 35–39.PubMedCrossRef Chakraborty DP, Breatnach ES, Yester MV, Soto B, Barnes GT, and Fraser RG: Digital and conventional chest imaging: a modified ROC study of observer performance using simulated nodules. Radiology 1986, 158: 35–39.PubMedCrossRef
13.
go back to reference Cohen J: A coefficient of agreement for nominal tables. Educational and Psychological measurement 1960, 20: 37–46. 10.1177/001316446002000104CrossRef Cohen J: A coefficient of agreement for nominal tables. Educational and Psychological measurement 1960, 20: 37–46. 10.1177/001316446002000104CrossRef
14.
go back to reference Agresti A: A model for agreement between ratings on an ordinal scale. Biometrics 1988, 44: 539–548. 10.2307/2531866CrossRef Agresti A: A model for agreement between ratings on an ordinal scale. Biometrics 1988, 44: 539–548. 10.2307/2531866CrossRef
15.
go back to reference Light RJ: Measures of response agreement for qualitative data: some generalizations and alternatives. Psychological Bulletin 1971, 5: 365–377.CrossRef Light RJ: Measures of response agreement for qualitative data: some generalizations and alternatives. Psychological Bulletin 1971, 5: 365–377.CrossRef
16.
go back to reference Tanner MA, Young MA: Modeling agreement among raters. JASA 1985, 80: 175–180.CrossRef Tanner MA, Young MA: Modeling agreement among raters. JASA 1985, 80: 175–180.CrossRef
17.
go back to reference Kraemer HC: Ramifications of a population model for kappa as a coefficient of reliability. Psychometrika 1979, 44: 461–472. 10.1007/BF02296208CrossRef Kraemer HC: Ramifications of a population model for kappa as a coefficient of reliability. Psychometrika 1979, 44: 461–472. 10.1007/BF02296208CrossRef
18.
go back to reference Garcia EV, Taylor A, Halkar R et al: RENEX: An expert system for the interpretation of Tc-99 m MAG3 scans to detect renal obstruction. J Nucl Med 2006, 47: 320–329.PubMed Garcia EV, Taylor A, Halkar R et al: RENEX: An expert system for the interpretation of Tc-99 m MAG3 scans to detect renal obstruction. J Nucl Med 2006, 47: 320–329.PubMed
19.
20.
go back to reference Taylor A Jr, Corrigan PL, Galt J, et al.: Measuring technetium-99 m-MAG3 clearance with an improved camera-based method. J Nucl Med 1995, 36: 1689–1695.PubMed Taylor A Jr, Corrigan PL, Galt J, et al.: Measuring technetium-99 m-MAG3 clearance with an improved camera-based method. J Nucl Med 1995, 36: 1689–1695.PubMed
21.
go back to reference Taylor A Jr, Manatunga A, Morton K, et al.: Multicenter trial validation of a camera-based method to measure Tc-99 m mercaptoacetyltriglycine, or Tc-99 m MAG3, clearance. Radiology 1997, 204: 47–54.PubMedCrossRef Taylor A Jr, Manatunga A, Morton K, et al.: Multicenter trial validation of a camera-based method to measure Tc-99 m mercaptoacetyltriglycine, or Tc-99 m MAG3, clearance. Radiology 1997, 204: 47–54.PubMedCrossRef
22.
go back to reference O'Reilly P, Aurell M, Britton K, et al.: Consensus on diuresis renography for investigating the dilated upper urinary tract. J Nucl Med 1996, 37: 1872–1876.PubMed O'Reilly P, Aurell M, Britton K, et al.: Consensus on diuresis renography for investigating the dilated upper urinary tract. J Nucl Med 1996, 37: 1872–1876.PubMed
23.
go back to reference SAS/STAT ® 9.2 User's Guide. Chapter 28: The CATMOD Procedure Cary, NC: SAS Institute 1998, 1092–1127. SAS/STAT ® 9.2 User's Guide. Chapter 28: The CATMOD Procedure Cary, NC: SAS Institute 1998, 1092–1127.
Metadata
Title
Computer-aided diagnosis of renal obstruction: utility of log-linear modeling versus standard ROC and kappa analysis
Authors
Amita K Manatunga
José Nilo G Binongo
Andrew T Taylor
Publication date
01-12-2011
Publisher
Springer Berlin Heidelberg
Published in
EJNMMI Research / Issue 1/2011
Electronic ISSN: 2191-219X
DOI
https://doi.org/10.1186/2191-219X-1-5

Other articles of this Issue 1/2011

EJNMMI Research 1/2011 Go to the issue