Abstract
Minimally invasive surgery is an important breakthrough in the domain of medicine. Not only does it improve the quality of surgery, but the underlying digitization also provides invaluable information that opens up many possibilities for teaching, assistance during difficult cases, and quality evaluation. For instance, with a well-organized database, professors are one click away from showing and comparing various surgical procedures in their classes; surgeons can also retrieve and observe a video segment of a specific surgical task performed by another surgeon in varying conditions. However, to the best of our knowledge, database organization is done manually by experts. Considering the large number of surgical videos recorded, manual annotation is a tedious task. In this paper, we take the first step towards automatic surgical database organization by introducing the laparoscopic video classification problem, which consists of automatically identifying the type of abdominal surgery performed in a video. In spite of the visual challenges of such videos, such as blank frames, rapid movement, and sometimes incomplete recording, we show that we can rely on visual features alone to classify the videos with high accuracy. We use kernel Support Vector Machines (SVMs) for this classification task and compare their performance on different types of visual features. We also show that the result can be improved by combining the visual features using Multiple Kernel Learning approach. The classification pipeline demonstrates a classification accuracy of 91.39% on a database of 151 abdominal videos totaling over 200 hours of 8 different kinds of surgeries performed by 10 surgeons.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
WebSurg: the e-surgical reference, http://www.websurg.com/ (last access: November 21, 2013)
Atasoy, S., Mateus, D., Meining, A., Yang, G.Z., Navab, N.: Endoscopic video manifolds for targeted optical biopsy. IEEE Trans. Med. Imaging 31(3), 637–653 (2012)
Munzer, B., Schoeffmann, K., Boszormenyi, L.: Relevance segmentation of laparoscopic videos. In: IEEE International Symposium on Multimedia, pp. 84–91 (2013)
Zappella, L., Bejar, B., Hager, G., Vidal, R.: Surgical gesture classification from video and kinematic data. Medical Image Analysis 17(7), 732–745 (2013)
Padoy, N., Blum, T., Ahmadi, S.A., Feussner, H., Berger, M.O., Navab, N.: Statistical modeling and recognition of surgical workflow. Medical Image Analysis 16(3), 632–641 (2012)
Blum, T., Feußner, H., Navab, N.: Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part III. LNCS, vol. 6363, pp. 400–407. Springer, Heidelberg (2010)
Lalys, F., Riffaud, L., Bouget, D., Jannin, P.: A framework for the recognition of high-level surgical tasks from video images for cataract surgeries. IEEE Trans. Biomed. Engineering 59(4), 966–976 (2012)
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition, pp. 886–893 (2005)
Aharon, M., Elad, M., Bruckstein, A.: K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation. IEEE Transactions on Signal Processing 54(11), 4311–4322 (2006)
Gonen, M., Alpaydin, E.: Multiple kernel learning algorithms. Journal of Machine Learning Research 12, 2211–2268 (2011)
Vedaldi, A., Fulkerson, B.: Vlfeat: An open and portable library of computer vision algorithms. In: Proceedings of the International Conference on Multimedia, pp. 1469–1472. ACM (2010)
Varma, M., Babu, R.B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, pp. 1065–1072. ACM (2009)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: International Conference on Computer Vision & Pattern Recognition, pp. 1794–1801. IEEE (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Twinanda, A.P., Marescaux, J., De Mathelin, M., Padoy, N. (2014). Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification. In: Stoyanov, D., Collins, D.L., Sakuma, I., Abolmaesumi, P., Jannin, P. (eds) Information Processing in Computer-Assisted Interventions. IPCAI 2014. Lecture Notes in Computer Science, vol 8498. Springer, Cham. https://doi.org/10.1007/978-3-319-07521-1_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-07521-1_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07520-4
Online ISBN: 978-3-319-07521-1
eBook Packages: Computer ScienceComputer Science (R0)