Skip to main content
Top
Published in: International Journal of Computer Assisted Radiology and Surgery 6/2014

01-11-2014 | Original Article

Vision-based online recognition of surgical activities

Authors: Michael Unger, Claire Chalopin, Thomas Neumuth

Published in: International Journal of Computer Assisted Radiology and Surgery | Issue 6/2014

Login to get access

Abstract

Purpose

Surgical processes are complex entities characterized by expressive models and data. Recognizable activities define each surgical process. The principal limitation of current vision-based recognition methods is inefficiency due to the large amount of information captured during a surgical procedure. To overcome this technical challenge, we introduce a surgical gesture recognition system using temperature-based recognition.

Methods

An infrared thermal camera was combined with a hierarchical temporal memory and was used during surgical procedures. The recordings were analyzed for recognition of surgical activities. The image sequence information acquired included hand temperatures. This datum was analyzed to perform gesture extraction and recognition based on heat differences between the surgeon’s warm hands and the colder background of the environment.

Results

The system was validated by simulating a functional endoscopic sinus surgery, a common type of otolaryngologic surgery. The thermal camera was directed toward the hands of the surgeon while handling different instruments. The system achieved an online recognition accuracy of 96 % with high precision and recall rates of approximately 60 %.

Conclusion

Vision-based recognition methods are the current best practice approaches for monitoring surgical processes. Problems of information overflow and extended recognition times in vision-based approaches were overcome by changing the spectral range to infrared. This change enables the real-time recognition of surgical activities and provides online monitoring information to surgical assistance systems and workflow management systems.
Literature
2.
go back to reference Sutherland J, van den Heuvel W-J (2006) Towards an intelligent hospital environment: adaptive workflow in the OR of the future. Proceedings of the 39th annual Hawaii international conference on system sciences, volume 05. IEEE Computer Society, Washington, DC, USA, p 100b Sutherland J, van den Heuvel W-J (2006) Towards an intelligent hospital environment: adaptive workflow in the OR of the future. Proceedings of the 39th annual Hawaii international conference on system sciences, volume 05. IEEE Computer Society, Washington, DC, USA, p 100b
5.
go back to reference Neumuth T (2013) Surgical process modeling: theory, methods, and applications Neumuth T (2013) Surgical process modeling: theory, methods, and applications
6.
go back to reference Neumuth T, Liebmann P, Wiedemann P, Meixensberger J (2012) Surgical workflow management schemata for cataract procedures. Process model-based design and validation of workflow schemata. Methods Inf Med 51:371–382. doi:10.3414/ME11-01-0093 PubMedCrossRef Neumuth T, Liebmann P, Wiedemann P, Meixensberger J (2012) Surgical workflow management schemata for cataract procedures. Process model-based design and validation of workflow schemata. Methods Inf Med 51:371–382. doi:10.​3414/​ME11-01-0093 PubMedCrossRef
8.
go back to reference Tiwari V, Dexter F, Rothman BS et al (2013) Explanation for the near-constant mean time remaining in surgical cases exceeding their estimated duration, necessary for appropriate display on electronic white boards. Anesth Analg 117:487–493. doi:10.1213/ANE.0b013e31829772e9 Tiwari V, Dexter F, Rothman BS et al (2013) Explanation for the near-constant mean time remaining in surgical cases exceeding their estimated duration, necessary for appropriate display on electronic white boards. Anesth Analg 117:487–493. doi:10.​1213/​ANE.​0b013e31829772e9​
9.
go back to reference Epstein RH, Dexter F (2012) Mediated interruptions of anaesthesia providers using predictions of workload from anaesthesia information management system data. Anaesth Intensive Care 40:803–812PubMed Epstein RH, Dexter F (2012) Mediated interruptions of anaesthesia providers using predictions of workload from anaesthesia information management system data. Anaesth Intensive Care 40:803–812PubMed
12.
go back to reference Reiley CE, Lin HC, Varadarajan B et al (2008) Automatic recognition of surgical motions using statistical modeling for capturing variability. Stud Health Technol Inform 132:396PubMed Reiley CE, Lin HC, Varadarajan B et al (2008) Automatic recognition of surgical motions using statistical modeling for capturing variability. Stud Health Technol Inform 132:396PubMed
13.
go back to reference Vankipuram M, Kahol K, Cohen T, Patel VL (2009) Visualization and analysis of activities in critical care environments. AMIA Annu Symp Proc 2009:662–666PubMedPubMedCentral Vankipuram M, Kahol K, Cohen T, Patel VL (2009) Visualization and analysis of activities in critical care environments. AMIA Annu Symp Proc 2009:662–666PubMedPubMedCentral
14.
go back to reference Bouarfa L, Jonker PP, Dankelman J (2009) Surgical context discovery by monitoring low-level activities in the OR Bouarfa L, Jonker PP, Dankelman J (2009) Surgical context discovery by monitoring low-level activities in the OR
15.
17.
go back to reference Kranzfelder M, Zywitza D, Jell T et al (2012) Real-time monitoring for detection of retained surgical sponges and team motion in the surgical operation room using radio-frequency-identification (RFID) technology: a preclinical evaluation. J Surg Res 175:191–198. doi:10.1016/j.jss.2011.03.029 PubMedCrossRef Kranzfelder M, Zywitza D, Jell T et al (2012) Real-time monitoring for detection of retained surgical sponges and team motion in the surgical operation room using radio-frequency-identification (RFID) technology: a preclinical evaluation. J Surg Res 175:191–198. doi:10.​1016/​j.​jss.​2011.​03.​029 PubMedCrossRef
18.
go back to reference Ahmadi S-A, Padoy N, Heining SM et al (2008) Introducing wearable accelerometers in the surgery room for activity detection. 7. Jahrestagung der Deutschen Gesellschaft fuer Computer-und Roboter-Assistierte Chirurgie (CURAC 2008) Ahmadi S-A, Padoy N, Heining SM et al (2008) Introducing wearable accelerometers in the surgery room for activity detection. 7. Jahrestagung der Deutschen Gesellschaft fuer Computer-und Roboter-Assistierte Chirurgie (CURAC 2008)
19.
go back to reference Lester J, Choudhury T, Kern N et al (2005) A hybrid discriminative/generative approach for modeling human activities. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), pp 766–772 Lester J, Choudhury T, Kern N et al (2005) A hybrid discriminative/generative approach for modeling human activities. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), pp 766–772
21.
go back to reference Blum T, Feußner H, Navab N (2010) Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang T, Navab N, Pluim J, Viergever M (eds) Medical image computing and computer-assisted intervention—MICCAI 2010. Springer, Berlin, pp 400–407CrossRef Blum T, Feußner H, Navab N (2010) Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang T, Navab N, Pluim J, Viergever M (eds) Medical image computing and computer-assisted intervention—MICCAI 2010. Springer, Berlin, pp 400–407CrossRef
22.
go back to reference Lalys F, Riffaud L, Bouget D, Jannin P (2011) An application-dependent framework for the recognition of high-level surgical tasks in the OR. In: Fichtinger G, Martel A, Peters T (eds) Medical image computing and computer-assisted intervention—MICCAI 2011. Springer, Berlin, pp 331–338CrossRef Lalys F, Riffaud L, Bouget D, Jannin P (2011) An application-dependent framework for the recognition of high-level surgical tasks in the OR. In: Fichtinger G, Martel A, Peters T (eds) Medical image computing and computer-assisted intervention—MICCAI 2011. Springer, Berlin, pp 331–338CrossRef
24.
go back to reference Haro BB, Zappella L, Vidal R (2012) Surgical gesture classification from video data. In: Ayache N, Delingette H, Golland P, Mori K (eds) Medical image computing and computer-assisted intervention—MICCAI 2012. Springer, Berlin, pp 34–41CrossRef Haro BB, Zappella L, Vidal R (2012) Surgical gesture classification from video data. In: Ayache N, Delingette H, Golland P, Mori K (eds) Medical image computing and computer-assisted intervention—MICCAI 2012. Springer, Berlin, pp 34–41CrossRef
25.
go back to reference Appenrodt J, Al-Hamadi A, Michaelis B (2010) Data gathering for gesture recognition systems based on single color-, stereo color-and thermal cameras. Int J Signal Process Image Process Pattern Recognit 3:37–50 Appenrodt J, Al-Hamadi A, Michaelis B (2010) Data gathering for gesture recognition systems based on single color-, stereo color-and thermal cameras. Int J Signal Process Image Process Pattern Recognit 3:37–50
26.
go back to reference Hawkins J, George D (2006) Hierarchical temporal memory: concepts, theory, and terminology Hawkins J, George D (2006) Hierarchical temporal memory: concepts, theory, and terminology
27.
go back to reference Hawkins J, Blakeslee S (2007) On intelligence. Macmillan, New York Hawkins J, Blakeslee S (2007) On intelligence. Macmillan, New York
28.
go back to reference Kapuscinski T (2010) Using hierarchical temporal memory for vision-based hand shape recognition under large variations in hand’s rotation. In: Rutkowski L, Scherer R, Tadeusiewicz R et al (eds) Artificial intelligence and soft computing. Springer, Berlin, pp 272–279 Kapuscinski T (2010) Using hierarchical temporal memory for vision-based hand shape recognition under large variations in hand’s rotation. In: Rutkowski L, Scherer R, Tadeusiewicz R et al (eds) Artificial intelligence and soft computing. Springer, Berlin, pp 272–279
29.
go back to reference Kapuscinski T, Wysocki M (2009) Using hierarchical temporal memory for recognition of signed polish words. In: Kurzynski M, Wozniak M (eds) Computer recognition systems 3. Springer, Berlin, pp 355–362CrossRef Kapuscinski T, Wysocki M (2009) Using hierarchical temporal memory for recognition of signed polish words. In: Kurzynski M, Wozniak M (eds) Computer recognition systems 3. Springer, Berlin, pp 355–362CrossRef
31.
go back to reference Bouvrie J, Rosasco L, Poggio T (2009) On invariance in hierarchical models. Adv Neural Inf Process Syst 22:162–170 Bouvrie J, Rosasco L, Poggio T (2009) On invariance in hierarchical models. Adv Neural Inf Process Syst 22:162–170
Metadata
Title
Vision-based online recognition of surgical activities
Authors
Michael Unger
Claire Chalopin
Thomas Neumuth
Publication date
01-11-2014
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Computer Assisted Radiology and Surgery / Issue 6/2014
Print ISSN: 1861-6410
Electronic ISSN: 1861-6429
DOI
https://doi.org/10.1007/s11548-014-0994-z

Other articles of this Issue 6/2014

International Journal of Computer Assisted Radiology and Surgery 6/2014 Go to the issue