Skip to main content
Top
Published in: BMC Medical Imaging 1/2013

Open Access 01-12-2013 | Research article

Are two readers more reliable than one? A study of upper neck ligament scoring on magnetic resonance images

Authors: Ansgar Espeland, Nils Vetti, Jostein Kråkenes

Published in: BMC Medical Imaging | Issue 1/2013

Login to get access

Abstract

Background

Magnetic resonance imaging (MRI) studies typically employ either a single expert or multiple readers in collaboration to evaluate (read) the image results. However, no study has examined whether evaluations from multiple readers provide more reliable results than a single reader. We examined whether consistency in image interpretation by a single expert might be equal to the consistency of combined readings, defined as independent interpretations by two readers, where cases of disagreement were reconciled by consensus.

Methods

One expert neuroradiologist and one trained radiology resident independently evaluated 102 MRIs of the upper neck. The signal intensities of the alar and transverse ligaments were scored 0, 1, 2, or 3. Disagreements were resolved by consensus. They repeated the grading process after 3–8 months (second evaluation). We used kappa statistics and intraclass correlation coefficients (ICCs) to assess agreement between the initial and second evaluations for each radiologist and for combined determinations. Disagreements on score prevalence were evaluated with McNemar’s test.

Results

Higher consistency between the initial and second evaluations was obtained with the combined readings than with individual readings for signal intensity scores of ligaments on both the right and left sides of the spine. The weighted kappa ranges were 0.65-0.71 vs. 0.48-0.62 for combined vs. individual scoring, respectively. The combined scores also showed better agreement between evaluations than individual scores for the presence of grade 2–3 signal intensities on any side in a given subject (unweighted kappa 0.69-0.74 vs. 0.52-0.63, respectively). Disagreement between the initial and second evaluations on the prevalence of grades 2–3 was less marked for combined scores than for individual scores (P ≥ 0.039 vs. P ≤ 0.004, respectively). ICCs indicated a more reliable sum score per patient for combined scores (0.74) and both readers’ average scores (0.78) than for individual scores (0.55-0.69).

Conclusions

This study was the first to provide empirical support for the principle that an additional reader can improve the reproducibility of MRI interpretations compared to one expert alone. Furthermore, even a moderately experienced second reader improved the reliability compared to a single expert reader. The implications of this for clinical work require further study.
Appendix
Available only for authorised users
Literature
1.
go back to reference Thornbury JR: Eugene W. Caldwell Lecture. Clinical efficacy of diagnostic imaging: love it or leave it. AJR Am J Roentgenol. 1994, 162: 1-8. 10.2214/ajr.162.1.8273645.CrossRefPubMed Thornbury JR: Eugene W. Caldwell Lecture. Clinical efficacy of diagnostic imaging: love it or leave it. AJR Am J Roentgenol. 1994, 162: 1-8. 10.2214/ajr.162.1.8273645.CrossRefPubMed
2.
go back to reference D'agostino MA, Aegerter P, Jousse-Joulin S, Chary-Valckenaere I, Lecoq B, Gaudin P, Brault I, Scmitz J, Dehaut F, Le Parc J, Breban M, Landais P: How to evaluate and improve the reliability of power Doppler ultrasonography for assessing enthesitis in spondylarthritis. Arthritis Rheum. 2009, 61: 61-69.CrossRefPubMed D'agostino MA, Aegerter P, Jousse-Joulin S, Chary-Valckenaere I, Lecoq B, Gaudin P, Brault I, Scmitz J, Dehaut F, Le Parc J, Breban M, Landais P: How to evaluate and improve the reliability of power Doppler ultrasonography for assessing enthesitis in spondylarthritis. Arthritis Rheum. 2009, 61: 61-69.CrossRefPubMed
3.
go back to reference Yoon LS, Haims AH, Brink JA, Rabinovici R, Forman HP: Evaluation of an emergency radiology quality assurance program at a level I trauma center: abdominal and pelvic CT studies. Radiology. 2002, 224: 42-46. 10.1148/radiol.2241011470.CrossRefPubMed Yoon LS, Haims AH, Brink JA, Rabinovici R, Forman HP: Evaluation of an emergency radiology quality assurance program at a level I trauma center: abdominal and pelvic CT studies. Radiology. 2002, 224: 42-46. 10.1148/radiol.2241011470.CrossRefPubMed
4.
go back to reference Canon CL, Smith JK, Morgan DE, Jones BC, Fell SC, Kenney PJ, Ferrante D, Lockhart ME, Westfall AO, Koehler RE: Double reading of barium enemas: is it necessary?. AJR Am J Roentgenol. 2003, 181: 1607-1610. 10.2214/ajr.181.6.1811607.CrossRefPubMed Canon CL, Smith JK, Morgan DE, Jones BC, Fell SC, Kenney PJ, Ferrante D, Lockhart ME, Westfall AO, Koehler RE: Double reading of barium enemas: is it necessary?. AJR Am J Roentgenol. 2003, 181: 1607-1610. 10.2214/ajr.181.6.1811607.CrossRefPubMed
5.
go back to reference Cheung KM, Samartzis D, Karppinen J, Mok FP, Ho DW, Fong DY, Luk KD: Intervertebral disc degeneration: new insights based on "skipped" level disc pathology. Arthritis Rheum. 2010, 62: 2392-2400. 10.1002/art.27523.CrossRefPubMed Cheung KM, Samartzis D, Karppinen J, Mok FP, Ho DW, Fong DY, Luk KD: Intervertebral disc degeneration: new insights based on "skipped" level disc pathology. Arthritis Rheum. 2010, 62: 2392-2400. 10.1002/art.27523.CrossRefPubMed
6.
go back to reference Kjaer P, Leboeuf-Yde C, Korsholm L, Sorensen JS, Bendix T: Magnetic resonance imaging and low back pain in adults: a diagnostic imaging study of 40-year-old men and women. Spine (Phila Pa 1976). 2005, 30: 1173-1180. 10.1097/01.brs.0000162396.97739.76.CrossRef Kjaer P, Leboeuf-Yde C, Korsholm L, Sorensen JS, Bendix T: Magnetic resonance imaging and low back pain in adults: a diagnostic imaging study of 40-year-old men and women. Spine (Phila Pa 1976). 2005, 30: 1173-1180. 10.1097/01.brs.0000162396.97739.76.CrossRef
7.
go back to reference Esposito L, Saam T, Heider P, Bockelbrink A, Pelisek J, Sepp D, Feurer R, Winkler C, Liebig T, Holzer K, Pauly O, Sadikovic S, Hemmer B, Poppert H: MRI plaque imaging reveals high-risk carotid plaques especially in diabetic patients irrespective of the degree of stenosis. BMC Med Imaging. 2010, 10: 27-10.1186/1471-2342-10-27.CrossRefPubMedPubMedCentral Esposito L, Saam T, Heider P, Bockelbrink A, Pelisek J, Sepp D, Feurer R, Winkler C, Liebig T, Holzer K, Pauly O, Sadikovic S, Hemmer B, Poppert H: MRI plaque imaging reveals high-risk carotid plaques especially in diabetic patients irrespective of the degree of stenosis. BMC Med Imaging. 2010, 10: 27-10.1186/1471-2342-10-27.CrossRefPubMedPubMedCentral
8.
go back to reference Jagadeesan BD, Gado Almandoz JE, Moran CJ, Benzinger TL: Accuracy of susceptibility-weighted imaging for the detection of arteriovenous shunting in vascular malformations of the brain. Stroke. 2011, 42: 87-92. 10.1161/STROKEAHA.110.584862.CrossRefPubMed Jagadeesan BD, Gado Almandoz JE, Moran CJ, Benzinger TL: Accuracy of susceptibility-weighted imaging for the detection of arteriovenous shunting in vascular malformations of the brain. Stroke. 2011, 42: 87-92. 10.1161/STROKEAHA.110.584862.CrossRefPubMed
9.
go back to reference Jensch S, de Vries AH, Peringa J, Bipat S, Dekker E, Baak LC, Bartelsman JF, Heutinck A, Montauban van Swijndregt AD, Stoker J: CT colonography with limited bowel preparation: performance characteristics in an increased-risk population. Radiology. 2008, 247: 122-132. 10.1148/radiol.2471070439.CrossRefPubMed Jensch S, de Vries AH, Peringa J, Bipat S, Dekker E, Baak LC, Bartelsman JF, Heutinck A, Montauban van Swijndregt AD, Stoker J: CT colonography with limited bowel preparation: performance characteristics in an increased-risk population. Radiology. 2008, 247: 122-132. 10.1148/radiol.2471070439.CrossRefPubMed
10.
go back to reference Peterson CK, Saupe N, Buck F, Pfirrmann CW, Zanetti M, Hodler J: CT-guided sternoclavicular joint injections: description of the procedure, reliability of imaging diagnosis, and short-term patient responses. AJR Am J Roentgenol. 2010, 195: W435-W439. 10.2214/AJR.10.4501.CrossRefPubMed Peterson CK, Saupe N, Buck F, Pfirrmann CW, Zanetti M, Hodler J: CT-guided sternoclavicular joint injections: description of the procedure, reliability of imaging diagnosis, and short-term patient responses. AJR Am J Roentgenol. 2010, 195: W435-W439. 10.2214/AJR.10.4501.CrossRefPubMed
11.
go back to reference Vetti N, Krakenes J, Damsgaard E, Rorvik J, Gilhus NE, Espeland A: MRI of the alar and transverse ligaments in acute whiplash-associated disorders 1–2 - a cross-sectional controlled study. Spine (Phila Pa 1976). 2011, 36: E434-E440. 10.1097/BRS.0b013e3181da21a9.CrossRef Vetti N, Krakenes J, Damsgaard E, Rorvik J, Gilhus NE, Espeland A: MRI of the alar and transverse ligaments in acute whiplash-associated disorders 1–2 - a cross-sectional controlled study. Spine (Phila Pa 1976). 2011, 36: E434-E440. 10.1097/BRS.0b013e3181da21a9.CrossRef
12.
go back to reference Vetti N, Alsing R, Krakenes J, Rorvik J, Gilhus NE, Brun JG, Espeland A: MRI of the transverse and alar ligaments in rheumatoid arthritis: feasibility and relations to atlantoaxial subluxation and disease activity. Neuroradiology. 2010, 52: 215-223. 10.1007/s00234-009-0650-4.CrossRefPubMed Vetti N, Alsing R, Krakenes J, Rorvik J, Gilhus NE, Brun JG, Espeland A: MRI of the transverse and alar ligaments in rheumatoid arthritis: feasibility and relations to atlantoaxial subluxation and disease activity. Neuroradiology. 2010, 52: 215-223. 10.1007/s00234-009-0650-4.CrossRefPubMed
13.
go back to reference Myran R, Kvistad KA, Nygaard OP, Andresen H, Folvik M, Zwart JA: Magnetic resonance imaging assessment of the alar ligaments in whiplash injuries: a case–control study. Spine (Phila Pa 1976). 2008, 33: 2012-2016. 10.1097/BRS.0b013e31817bb0bd.CrossRef Myran R, Kvistad KA, Nygaard OP, Andresen H, Folvik M, Zwart JA: Magnetic resonance imaging assessment of the alar ligaments in whiplash injuries: a case–control study. Spine (Phila Pa 1976). 2008, 33: 2012-2016. 10.1097/BRS.0b013e31817bb0bd.CrossRef
14.
go back to reference Vetti N, Krakenes J, Eide GE, Rorvik J, Gilhus NE, Espeland A: MRI of the alar and transverse ligaments in whiplash-associated disorders (WAD) grades 1–2: high-signal changes by age, gender, event and time since trauma. Neuroradiology. 2009, 51: 227-235. 10.1007/s00234-008-0482-7.CrossRefPubMed Vetti N, Krakenes J, Eide GE, Rorvik J, Gilhus NE, Espeland A: MRI of the alar and transverse ligaments in whiplash-associated disorders (WAD) grades 1–2: high-signal changes by age, gender, event and time since trauma. Neuroradiology. 2009, 51: 227-235. 10.1007/s00234-008-0482-7.CrossRefPubMed
15.
go back to reference Krakenes J, Kaale BR: Magnetic resonance imaging assessment of craniovertebral ligaments and membranes after whiplash trauma. Spine (Phila Pa 1976). 2006, 31: 2820-2826. 10.1097/01.brs.0000245871.15696.1f.CrossRef Krakenes J, Kaale BR: Magnetic resonance imaging assessment of craniovertebral ligaments and membranes after whiplash trauma. Spine (Phila Pa 1976). 2006, 31: 2820-2826. 10.1097/01.brs.0000245871.15696.1f.CrossRef
16.
go back to reference Krakenes J, Kaale BR, Moen G, Nordli H, Gilhus NE, Rorvik J: MRI assessment of the alar ligaments in the late stage of whiplash injury–a study of structural abnormalities and observer agreement. Neuroradiology. 2002, 44: 617-624. 10.1007/s00234-002-0799-6.CrossRefPubMed Krakenes J, Kaale BR, Moen G, Nordli H, Gilhus NE, Rorvik J: MRI assessment of the alar ligaments in the late stage of whiplash injury–a study of structural abnormalities and observer agreement. Neuroradiology. 2002, 44: 617-624. 10.1007/s00234-002-0799-6.CrossRefPubMed
17.
go back to reference Altman DG: Practical statistics for medical research. 1991, London: Chapman & Hall, 1 Altman DG: Practical statistics for medical research. 1991, London: Chapman & Hall, 1
18.
go back to reference Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, Bouter LM, de Vet HCW: Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007, 60: 34-42. 10.1016/j.jclinepi.2006.03.012.CrossRefPubMed Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, Bouter LM, de Vet HCW: Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007, 60: 34-42. 10.1016/j.jclinepi.2006.03.012.CrossRefPubMed
19.
go back to reference Sim J, Wright CC: The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005, 85: 257-268.PubMed Sim J, Wright CC: The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005, 85: 257-268.PubMed
20.
go back to reference Bankier AA, Levine D, Halpern EF, Kressel HY: Consensus interpretation in imaging research: is there a better way?. Radiology. 2010, 257: 14-17. 10.1148/radiol.10100252.CrossRefPubMed Bankier AA, Levine D, Halpern EF, Kressel HY: Consensus interpretation in imaging research: is there a better way?. Radiology. 2010, 257: 14-17. 10.1148/radiol.10100252.CrossRefPubMed
21.
go back to reference Umans H, Wimpfheimer O, Haramati N, Applbaum YH, Adler M, Bosco J: Diagnosis of partial tears of the anterior cruciate ligament of the knee: value of MR imaging. AJR Am J Roentgenol. 1995, 165: 893-897. 10.2214/ajr.165.4.7676988.CrossRefPubMed Umans H, Wimpfheimer O, Haramati N, Applbaum YH, Adler M, Bosco J: Diagnosis of partial tears of the anterior cruciate ligament of the knee: value of MR imaging. AJR Am J Roentgenol. 1995, 165: 893-897. 10.2214/ajr.165.4.7676988.CrossRefPubMed
22.
go back to reference van Rijn JC, Klemetso N, Reitsma JB, Majoie CB, Hulsmans FJ, Peul WC, Stam J, Bossuyt PM, den Heeten GJ: Observer variation in MRI evaluation of patients suspected of lumbar disk herniation. AJR Am J Roentgenol. 2005, 184: 299-303. 10.2214/ajr.184.1.01840299.CrossRefPubMed van Rijn JC, Klemetso N, Reitsma JB, Majoie CB, Hulsmans FJ, Peul WC, Stam J, Bossuyt PM, den Heeten GJ: Observer variation in MRI evaluation of patients suspected of lumbar disk herniation. AJR Am J Roentgenol. 2005, 184: 299-303. 10.2214/ajr.184.1.01840299.CrossRefPubMed
23.
go back to reference Johnson J, Kline JA: Intraobserver and interobserver agreement of the interpretation of pediatric chest radiographs. Emerg Radiol. 2010, 17: 285-290. 10.1007/s10140-009-0854-2.CrossRefPubMed Johnson J, Kline JA: Intraobserver and interobserver agreement of the interpretation of pediatric chest radiographs. Emerg Radiol. 2010, 17: 285-290. 10.1007/s10140-009-0854-2.CrossRefPubMed
24.
go back to reference Pirisi M, Leutner M, Pinato DJ, Avellini C, Carsana L, Toniutto P, Fabris C, Boldorini R: Reliability and reproducibility of the edmondson grading of hepatocellular carcinoma using paired core biopsy and surgical resection specimens. Arch Pathol Lab Med. 2010, 134: 1818-1822.PubMed Pirisi M, Leutner M, Pinato DJ, Avellini C, Carsana L, Toniutto P, Fabris C, Boldorini R: Reliability and reproducibility of the edmondson grading of hepatocellular carcinoma using paired core biopsy and surgical resection specimens. Arch Pathol Lab Med. 2010, 134: 1818-1822.PubMed
25.
go back to reference Ker M: Issues in the use of kappa. Invest Radiol. 1991, 26: 78-83. 10.1097/00004424-199101000-00015.CrossRefPubMed Ker M: Issues in the use of kappa. Invest Radiol. 1991, 26: 78-83. 10.1097/00004424-199101000-00015.CrossRefPubMed
26.
go back to reference Kaale BR, Krakenes J, Albrektsen G, Wester K: Whiplash-associated disorders impairment rating: neck disability index score according to severity of MRI findings of ligaments and membranes in the upper cervical spine. J Neurotrauma. 2005, 22: 466-475. 10.1089/neu.2005.22.466.CrossRefPubMed Kaale BR, Krakenes J, Albrektsen G, Wester K: Whiplash-associated disorders impairment rating: neck disability index score according to severity of MRI findings of ligaments and membranes in the upper cervical spine. J Neurotrauma. 2005, 22: 466-475. 10.1089/neu.2005.22.466.CrossRefPubMed
27.
go back to reference Dullerud R, Gjertsen O, Server A: Magnetic resonance imaging of ligaments and membranes in the craniocervical junction in whiplash-associated injury and in healthy control subjects. Acta Radiol. 2010, 51: 207-212. 10.3109/02841850903321617.CrossRefPubMed Dullerud R, Gjertsen O, Server A: Magnetic resonance imaging of ligaments and membranes in the craniocervical junction in whiplash-associated injury and in healthy control subjects. Acta Radiol. 2010, 51: 207-212. 10.3109/02841850903321617.CrossRefPubMed
28.
go back to reference Kim HJ, Jun BY, Kim WH, Cho YK, Lim MK, Suh CH: MR imaging of the alar ligament: morphologic changes during axial rotation of the head in asymptomatic young adults. Skeletal Radiol. 2002, 31: 637-642. 10.1007/s00256-002-0572-2.CrossRefPubMed Kim HJ, Jun BY, Kim WH, Cho YK, Lim MK, Suh CH: MR imaging of the alar ligament: morphologic changes during axial rotation of the head in asymptomatic young adults. Skeletal Radiol. 2002, 31: 637-642. 10.1007/s00256-002-0572-2.CrossRefPubMed
29.
go back to reference Roy S, Hol PK, Laerum LT, Tillung T: Pitfalls of magnetic resonance imaging of alar ligament. Neuroradiology. 2004, 46: 392-398. 10.1007/s00234-004-1193-3.CrossRefPubMed Roy S, Hol PK, Laerum LT, Tillung T: Pitfalls of magnetic resonance imaging of alar ligament. Neuroradiology. 2004, 46: 392-398. 10.1007/s00234-004-1193-3.CrossRefPubMed
30.
go back to reference Pfirrmann CW, Binkert CA, Zanetti M, Boos N, Hodler J: MR morphology of alar ligaments and occipitoatlantoaxial joints: study in 50 asymptomatic subjects. Radiology. 2001, 218: 133-137.CrossRefPubMed Pfirrmann CW, Binkert CA, Zanetti M, Boos N, Hodler J: MR morphology of alar ligaments and occipitoatlantoaxial joints: study in 50 asymptomatic subjects. Radiology. 2001, 218: 133-137.CrossRefPubMed
31.
go back to reference Vetti N, Krakenes J, Eide GE, Rorvik J, Gilhus NE, Espeland A: Are MRI high-signal changes of alar and transverse ligaments in acute whiplash injury related to outcome?. BMC Musculoskelet Disord. 2010, 11: 260-10.1186/1471-2474-11-260.CrossRefPubMedPubMedCentral Vetti N, Krakenes J, Eide GE, Rorvik J, Gilhus NE, Espeland A: Are MRI high-signal changes of alar and transverse ligaments in acute whiplash injury related to outcome?. BMC Musculoskelet Disord. 2010, 11: 260-10.1186/1471-2474-11-260.CrossRefPubMedPubMedCentral
32.
go back to reference Vetti N, Krakenes J, Ask T, Erdal KA, Torkildsen MD, Rorvik J, Gilhus NE, Espeland A: Follow-Up MR Imaging of the Alar and Transverse Ligaments after Whiplash Injury: A Prospective Controlled Study. AJNR Am J Neuroradiol. in press Vetti N, Krakenes J, Ask T, Erdal KA, Torkildsen MD, Rorvik J, Gilhus NE, Espeland A: Follow-Up MR Imaging of the Alar and Transverse Ligaments after Whiplash Injury: A Prospective Controlled Study. AJNR Am J Neuroradiol. in press
33.
go back to reference Jarvik JG, Deyo RA: Moderate versus mediocre: the reliability of spine MR data interpretations. Radiology. 2009, 250: 15-17. 10.1148/radiol.2493081458.CrossRefPubMed Jarvik JG, Deyo RA: Moderate versus mediocre: the reliability of spine MR data interpretations. Radiology. 2009, 250: 15-17. 10.1148/radiol.2493081458.CrossRefPubMed
34.
go back to reference Feinstein AR: An additional basic science for clinical medicine: IV. The development of clinimetrics. Ann Intern Med. 1983, 99: 843-848. 10.7326/0003-4819-99-6-843.CrossRefPubMed Feinstein AR: An additional basic science for clinical medicine: IV. The development of clinimetrics. Ann Intern Med. 1983, 99: 843-848. 10.7326/0003-4819-99-6-843.CrossRefPubMed
35.
go back to reference Lummel N, Zeif C, Kloetzer A, Linn J, Bruckmann H, Bitterling H: Variability of morphology and signal intensity of alar ligaments in healthy volunteers using MR imaging. AJNR Am J Neuroradiol. 2011, 32: 125-130. 10.3174/ajnr.A2629.CrossRefPubMed Lummel N, Zeif C, Kloetzer A, Linn J, Bruckmann H, Bitterling H: Variability of morphology and signal intensity of alar ligaments in healthy volunteers using MR imaging. AJNR Am J Neuroradiol. 2011, 32: 125-130. 10.3174/ajnr.A2629.CrossRefPubMed
36.
go back to reference Myran R, Zwart JA, Kvistad KA, Folvik M, Lydersen S, Ro M, Woodhouse A, Nygaard OP: Clinical characteristics, pain, and disability in relation to alar ligament MRI findings. Spine (Phila Pa 1976). 2011, 36: E862-E867. 10.1097/BRS.0b013e3181ff1dde.CrossRef Myran R, Zwart JA, Kvistad KA, Folvik M, Lydersen S, Ro M, Woodhouse A, Nygaard OP: Clinical characteristics, pain, and disability in relation to alar ligament MRI findings. Spine (Phila Pa 1976). 2011, 36: E862-E867. 10.1097/BRS.0b013e3181ff1dde.CrossRef
Metadata
Title
Are two readers more reliable than one? A study of upper neck ligament scoring on magnetic resonance images
Authors
Ansgar Espeland
Nils Vetti
Jostein Kråkenes
Publication date
01-12-2013
Publisher
BioMed Central
Published in
BMC Medical Imaging / Issue 1/2013
Electronic ISSN: 1471-2342
DOI
https://doi.org/10.1186/1471-2342-13-4

Other articles of this Issue 1/2013

BMC Medical Imaging 1/2013 Go to the issue