Skip to main content
Top
Published in: Sleep and Breathing 2/2019

01-06-2019 | Polysomnography | Methods • Original Article

Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard

Authors: Shujian Deng, Xin Zhang, Ying Zhang, He Gao, Eric I-Chao Chang, Yubo Fan, Yan Xu

Published in: Sleep and Breathing | Issue 2/2019

Login to get access

Abstract

Objectives

To determine inter-lab reliability in sleep stage scoring using the 2014 American Academy of Sleep Medicine (AASM) manual. To understand in-depth reasons for disagreement and provide suggestions for improvement.

Methods

This study consisted of 40 all-night polysomnographys (PSGs) from different samples. PSGs were segmented into 37,642 30-s epochs. Five doctors from China and two doctors from America scored the epochs following the 2014 AASM standard. Scoring disagreement between two centers was evaluated using Cohen’s kappa (κ). After visual inspection of PSGs of deviating scorings, potential disagreement reasons were analyzed.

Results

Inter-lab reliability yielded a substantial degree (κ = 0.75 ± 0.01). Scoring for stage W (κ = 0.89) and R (κ = 0.87) achieved the highest agreement, while stage N1 (κ = 0.45) reflected the lowest. Considering the relative disagreement ratio, N2-N3 (22.09%), W-N1 (19.68%), and N1-N2 (18.75%) were the most frequent combinations of discrepancy. American and Chinese doctors showed certain characteristics in the scoring of discrepancy combination W-N1, N1-N2, and N2-N3. There are seven reasons for disagreement, namely “on-threshold characteristic” (29.21%), “context influence” (18.06%), “characteristic identification difficulty” (8.81%), “arousal-wake confusion” (7.57%), “derivation inconsistence” (2.15%), “on-borderline characteristic” (0.92%), and “misrecognition” (33.27%).

Conclusions

This study demonstrated the sleep stage scoring agreement of the 2014 AASM manual and explored potential sources of labeling ambiguity. Improvement measures were suggested accordingly to help remove ambiguity for scorers and improve scoring reliability at the international level.
Appendix
Available only for authorised users
Literature
1.
go back to reference Iber C, Ancoli-Israel S, Chesson A, Quan S (2007) The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications, vol 4849. American Academy of Sleep Medecine, Westchester Iber C, Ancoli-Israel S, Chesson A, Quan S (2007) The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications, vol 4849. American Academy of Sleep Medecine, Westchester
2.
3.
go back to reference Hobson JA (1969) A manual of standardized terminology, techniques and scoring system for sleep stages of human subjects: a Rechtschaffen and a Kales. Electroencephalogr Clin Neurophysiol 26(6):644CrossRef Hobson JA (1969) A manual of standardized terminology, techniques and scoring system for sleep stages of human subjects: a Rechtschaffen and a Kales. Electroencephalogr Clin Neurophysiol 26(6):644CrossRef
4.
go back to reference Basner M, Griefahn B, Penzel T (2008) Inter-rater agreement in sleep stage classification between centers with different backgrounds. Somnologie-Schlafforschung und Schlafmedizin 12(1):75–84CrossRef Basner M, Griefahn B, Penzel T (2008) Inter-rater agreement in sleep stage classification between centers with different backgrounds. Somnologie-Schlafforschung und Schlafmedizin 12(1):75–84CrossRef
5.
go back to reference Danker-Hopfe H, Kunz D, Gruber G, Klösch G, Lorenzo JL, Himanen SL, Kemp B, Penzel T, Röschke J, Dorn H et al (2004) Interrater reliability between scorers from eight European sleep laboratories in subjects with different sleep disorders. J Sleep Res 13(1):63–69CrossRefPubMed Danker-Hopfe H, Kunz D, Gruber G, Klösch G, Lorenzo JL, Himanen SL, Kemp B, Penzel T, Röschke J, Dorn H et al (2004) Interrater reliability between scorers from eight European sleep laboratories in subjects with different sleep disorders. J Sleep Res 13(1):63–69CrossRefPubMed
6.
go back to reference Penzel T, Zhang X, Fietze I (2013) Inter-scorer reliability between sleep centers can teach us what to improve in the scoring rules. J Clin Sleep Med 9(01):89–91PubMedPubMedCentral Penzel T, Zhang X, Fietze I (2013) Inter-scorer reliability between sleep centers can teach us what to improve in the scoring rules. J Clin Sleep Med 9(01):89–91PubMedPubMedCentral
7.
go back to reference Silber MH, Ancoli-Israel S, Bonnet MH, Chokroverty S, Grigg-Damberger MM, Hirshkowitz M, Kapen S, Keenan SA, Kryger MH, Penzel T, Pressman MR, Iber C (2007) The visual scoring of sleep in adults. J Clin Sleep Med 3:121–131PubMed Silber MH, Ancoli-Israel S, Bonnet MH, Chokroverty S, Grigg-Damberger MM, Hirshkowitz M, Kapen S, Keenan SA, Kryger MH, Penzel T, Pressman MR, Iber C (2007) The visual scoring of sleep in adults. J Clin Sleep Med 3:121–131PubMed
8.
go back to reference Suzuki M, Saigusa H, Chiba S, Yagi T, Shibasaki K, Hayashi M, Suzuki M, Moriyama K, Kodera K (2005) Discrepancy in polysomnography scoring for a patient with obstructive sleep apnea hypopnea syndrome. Tohoku J Exp Med 206(4):353–360CrossRefPubMed Suzuki M, Saigusa H, Chiba S, Yagi T, Shibasaki K, Hayashi M, Suzuki M, Moriyama K, Kodera K (2005) Discrepancy in polysomnography scoring for a patient with obstructive sleep apnea hypopnea syndrome. Tohoku J Exp Med 206(4):353–360CrossRefPubMed
9.
go back to reference Whitney CW, Gottlieb DJ, Redline S, Norman RG, Dodge RR, Shahar E, Surovec S, Nieto FJ (1998) Reliability of scoring respiratory disturbance indices and sleep staging. Sleep 21(7):749–757CrossRefPubMed Whitney CW, Gottlieb DJ, Redline S, Norman RG, Dodge RR, Shahar E, Surovec S, Nieto FJ (1998) Reliability of scoring respiratory disturbance indices and sleep staging. Sleep 21(7):749–757CrossRefPubMed
10.
go back to reference Danker-Hopfe H, Anderer P, Zeitlhofer J, Boeck M, Dorn H, Gruber G, Heller E, Loretz E, Moser D, Parapatics S et al (2009) Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard. J Sleep Res 18(1):74–84CrossRefPubMed Danker-Hopfe H, Anderer P, Zeitlhofer J, Boeck M, Dorn H, Gruber G, Heller E, Loretz E, Moser D, Parapatics S et al (2009) Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard. J Sleep Res 18(1):74–84CrossRefPubMed
11.
go back to reference Magalang UJ, Chen NH, Cistulli PA, Fedson AC, Gíslason T, Hillman D, Penzel T, Tamisier R, Tufik S, Phillips G et al (2013) Agreement in the scoring of respiratory events and sleep among international sleep centers. Sleep 36(4):591–596CrossRefPubMedPubMedCentral Magalang UJ, Chen NH, Cistulli PA, Fedson AC, Gíslason T, Hillman D, Penzel T, Tamisier R, Tufik S, Phillips G et al (2013) Agreement in the scoring of respiratory events and sleep among international sleep centers. Sleep 36(4):591–596CrossRefPubMedPubMedCentral
12.
go back to reference Zhang X, Dong X, Kantelhardt JW, Li J, Zhao L, Garcia C, Glos M, Penzel T, Han F (2015) Process and outcome for international reliability in sleep scoring. Sleep Breath 19(1):191–195CrossRefPubMed Zhang X, Dong X, Kantelhardt JW, Li J, Zhao L, Garcia C, Glos M, Penzel T, Han F (2015) Process and outcome for international reliability in sleep scoring. Sleep Breath 19(1):191–195CrossRefPubMed
13.
go back to reference Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174CrossRefPubMed Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174CrossRefPubMed
14.
go back to reference Rosenberg RS, Van Hout S (2013) The American academy of sleep medicine inter-scorer reliability program: sleep stage scoring. J Clin Sleep Med 9(01):81–87PubMedPubMedCentral Rosenberg RS, Van Hout S (2013) The American academy of sleep medicine inter-scorer reliability program: sleep stage scoring. J Clin Sleep Med 9(01):81–87PubMedPubMedCentral
15.
go back to reference Ruehland WR, O’Donoghue FJ, Pierce RJ, Thornton AT, Singh P, Copland JM, Stevens B, Rochford PD (2011) The 2007 AASM recommendations for EEG electrode placement in polysomnography: impact on sleep and cortical arousal scoring. Sleep 34(1):73–81CrossRefPubMedPubMedCentral Ruehland WR, O’Donoghue FJ, Pierce RJ, Thornton AT, Singh P, Copland JM, Stevens B, Rochford PD (2011) The 2007 AASM recommendations for EEG electrode placement in polysomnography: impact on sleep and cortical arousal scoring. Sleep 34(1):73–81CrossRefPubMedPubMedCentral
16.
go back to reference Hare AP. Consensus versus majority vote: A laboratory experiment. Small Group Behavior, 1980; 11(2):131-143. Hare AP. Consensus versus majority vote: A laboratory experiment. Small Group Behavior, 1980; 11(2):131-143.
17.
go back to reference Mitterling T, Högl B, Schönwald SV, Hackner H, Gabelia D, Biermayr M, Frauscher B (2015) Sleep and respiration in 100 healthy Caucasian sleepers—a polysomnographic study according to American Academy of Sleep Medicine standards. Sleep 38(6):867–875PubMedPubMedCentral Mitterling T, Högl B, Schönwald SV, Hackner H, Gabelia D, Biermayr M, Frauscher B (2015) Sleep and respiration in 100 healthy Caucasian sleepers—a polysomnographic study according to American Academy of Sleep Medicine standards. Sleep 38(6):867–875PubMedPubMedCentral
18.
go back to reference Carskadon MA, Dement WC et al (2005) Normal human sleep: an overview. Principles and Practice of Sleep Medicine 4:13–23CrossRef Carskadon MA, Dement WC et al (2005) Normal human sleep: an overview. Principles and Practice of Sleep Medicine 4:13–23CrossRef
19.
go back to reference Parrino L, Ferri R, Zucconi M, Fanfulla F (2009) Commentary from the Italian association of sleep medicine on the AASM manual for the scoring of sleep and associated events: for debate and discussion. Sleep Med 10(7):799–808CrossRefPubMed Parrino L, Ferri R, Zucconi M, Fanfulla F (2009) Commentary from the Italian association of sleep medicine on the AASM manual for the scoring of sleep and associated events: for debate and discussion. Sleep Med 10(7):799–808CrossRefPubMed
20.
go back to reference Berry RB, Brooks R, Gamaldo CE et al (2014) The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications, version 2.1. American Academy of Sleep Medicine, Darien Berry RB, Brooks R, Gamaldo CE et al (2014) The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications, version 2.1. American Academy of Sleep Medicine, Darien
Metadata
Title
Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard
Authors
Shujian Deng
Xin Zhang
Ying Zhang
He Gao
Eric I-Chao Chang
Yubo Fan
Yan Xu
Publication date
01-06-2019
Publisher
Springer International Publishing
Keyword
Polysomnography
Published in
Sleep and Breathing / Issue 2/2019
Print ISSN: 1520-9512
Electronic ISSN: 1522-1709
DOI
https://doi.org/10.1007/s11325-019-01801-x

Other articles of this Issue 2/2019

Sleep and Breathing 2/2019 Go to the issue