Published in: BMC Medical Research Methodology 1/2006

Open Access 01-12-2006 | Debate

A system for rating the stability and strength of medical evidence

Authors: Jonathan R Treadwell, Stephen J Tregear, James T Reston, Charles M Turkelson

Abstract

Background

Methods for describing one's confidence in the available evidence are useful for end-users of evidence reviews. Analysts inevitably make judgments about the quality, quantity, consistency, robustness, and magnitude of effects observed in the studies identified. The subjectivity of these judgments in several areas underscores the need for transparency in judgments.

Discussion

This paper introduces a new system for rating medical evidence. The system requires explicit judgments and provides explicit rules for balancing these judgments. Unlike other systems for rating the strength of evidence, our system draws a distinction between two types of conclusions: quantitative and qualitative. A quantitative conclusion addresses the question, "How well does it work?", whereas a qualitative conclusion addresses the question, "Does it work?" In our system, quantitative conclusions are tied to stability ratings, and qualitative conclusions are tied to strength ratings. Our system emphasizes extensive a priori criteria for judgments to reduce the potential for bias. Further, the system makes explicit the impact of heterogeneity testing, meta-analysis, and sensitivity analyses on evidence ratings. This article provides details of our system, including graphical depictions of how the numerous judgments that an analyst makes can be combined. We also describe two worked examples of how the system can be applied to both interventional and diagnostic technologies.

Summary

Although explicit judgments and formal combination rules are two important steps on the path to a comprehensive system for rating medical evidence, many additional steps must also be taken. Foremost among these are the distinction between quantitative and qualitative conclusions, an extensive set of a priori criteria for making judgments, and the direct impact of analytic results on evidence ratings. These attributes form the basis for a logically consistent system that can improve the usefulness of evidence reviews.
Metadata
Title
A system for rating the stability and strength of medical evidence
Authors
Jonathan R Treadwell
Stephen J Tregear
James T Reston
Charles M Turkelson
Publication date
01-12-2006
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2006
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-6-52
