Skip to main content
Top
Published in: Journal of Medical Systems 12/2017

01-12-2017 | Image & Signal Processing

Endoscopic Image Classification and Retrieval using Clustered Convolutional Features

Authors: Jamil Ahmad, Khan Muhammad, Mi Young Lee, Sung Wook Baik

Published in: Journal of Medical Systems | Issue 12/2017

Login to get access

Abstract

With the growing use of minimally invasive surgical procedures, endoscopic video archives are growing at a rapid pace. Efficient access to relevant content in such huge multimedia archives require compact and discriminative visual features for indexing and matching. In this paper, we present an effective method to represent images using salient convolutional features. Convolutional kernels from the first layer of a pre-trained convolutional neural network (CNN) are analyzed and clustered into multiple distinct groups, based on their sensitivity to colors and textures. Dominant features detected by each cluster are collected into a single, layout-preserving feature map using a spatial maximal activator pooling (SMAP) approach. A moving window based structured pooling method then captures spatial layout features and global shape information from the aggregated feature map to populate feature histograms. Finally, individual histograms for each cluster are combined into a single comprehensive feature histogram. Clustering convolutional feature space allow extraction of color and texture features of varying strengths. Further, the SMAP approach enable us to select dominant discriminative features. The proposed features are compact and capable of conveniently outperforming several existing features extraction approaches in retrieval and classification tasks on endoscopy images dataset.
Literature
1.
go back to reference Sainju, S., Bui, F.M., and Wahid, K.A., Automated bleeding detection in capsule endoscopy videos using statistical features and region growing. J. Med. Syst. 38:25, 2014.CrossRefPubMed Sainju, S., Bui, F.M., and Wahid, K.A., Automated bleeding detection in capsule endoscopy videos using statistical features and region growing. J. Med. Syst. 38:25, 2014.CrossRefPubMed
2.
go back to reference Ahmad, J., Sajjad, M., Mehmood, I., Rho, S., and Baik, S.W., Saliency-weighted graphs for efficient visual content description and their applications in real-time image retrieval systems. J. Real-Time Image Proc. 1–17, 2016. Ahmad, J., Sajjad, M., Mehmood, I., Rho, S., and Baik, S.W., Saliency-weighted graphs for efficient visual content description and their applications in real-time image retrieval systems. J. Real-Time Image Proc. 1–17, 2016.
3.
go back to reference Murala, S., Maheshwari, R., and Balasubramanian, R., Directional binary wavelet patterns for biomedical image indexing and retrieval. J. Med. Syst. 36:2865–2879, 2012.CrossRefPubMed Murala, S., Maheshwari, R., and Balasubramanian, R., Directional binary wavelet patterns for biomedical image indexing and retrieval. J. Med. Syst. 36:2865–2879, 2012.CrossRefPubMed
4.
go back to reference Smeulders, A.W., Worring, M., Santini, S., Gupta, A., and Jain, R., Content-based image retrieval at the end of the early years. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 22:1349–1380, 2000.CrossRef Smeulders, A.W., Worring, M., Santini, S., Gupta, A., and Jain, R., Content-based image retrieval at the end of the early years. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 22:1349–1380, 2000.CrossRef
5.
go back to reference Nowaková, J., Prílepok, M., and Snášel, V., Medical image retrieval using vector quantization and fuzzy S-tree. J. Med. Syst. 41:18, 2017.CrossRefPubMed Nowaková, J., Prílepok, M., and Snášel, V., Medical image retrieval using vector quantization and fuzzy S-tree. J. Med. Syst. 41:18, 2017.CrossRefPubMed
6.
go back to reference Messing, D. S., Van Beek, P., and Errico, J. H., The mpeg-7 colour structure descriptor: Image description using colour and local spatial information. In: IEEE International Conference on Image Processing (ICIP), Thessaloniki, Greece, pp. 670–673, 2001. http://dx.doi.org/10.1109/ICIP.2001.959134. Messing, D. S., Van Beek, P., and Errico, J. H., The mpeg-7 colour structure descriptor: Image description using colour and local spatial information. In:  IEEE International Conference on Image Processing (ICIP), Thessaloniki, Greece, pp. 670–673, 2001. http://​dx.​doi.​org/​10.​1109/​ICIP.​2001.​959134.
7.
go back to reference Liu, G.-H., and Yang, J.-Y., Content-based image retrieval using color difference histogram. Pattern Recogn. 46:188–198, 2013.CrossRef Liu, G.-H., and Yang, J.-Y., Content-based image retrieval using color difference histogram. Pattern Recogn. 46:188–198, 2013.CrossRef
8.
go back to reference Liu, G.-H., Zhang, L., Hou, Y.-K., Li, Z.-Y., and Yang, J.-Y., Image retrieval based on multi-texton histogram. Pattern Recogn. 43:2380–2389, 2010.CrossRef Liu, G.-H., Zhang, L., Hou, Y.-K., Li, Z.-Y., and Yang, J.-Y., Image retrieval based on multi-texton histogram. Pattern Recogn. 43:2380–2389, 2010.CrossRef
9.
go back to reference Liu, G.-H., Li, Z.-Y., Zhang, L., and Xu, Y., Image retrieval based on micro-structure descriptor. Pattern Recogn. 44:2123–2133, 2011.CrossRef Liu, G.-H., Li, Z.-Y., Zhang, L., and Xu, Y., Image retrieval based on micro-structure descriptor. Pattern Recogn. 44:2123–2133, 2011.CrossRef
10.
go back to reference Wang, X., and Wang, Z., A novel method for image retrieval based on structure elements’ descriptor. J. Vis. Commun. Image Represent. 24:63–74, 2013.CrossRef Wang, X., and Wang, Z., A novel method for image retrieval based on structure elements’ descriptor. J. Vis. Commun. Image Represent. 24:63–74, 2013.CrossRef
11.
go back to reference Ahmad, J., Sajjad, M., Rho, S., and Baik, S.W., Multi-scale local structure patterns histogram for describing visual contents in social image retrieval systems. Multimed. Tools Appl. 75:12669–12692, 2016.CrossRef Ahmad, J., Sajjad, M., Rho, S., and Baik, S.W., Multi-scale local structure patterns histogram for describing visual contents in social image retrieval systems. Multimed. Tools Appl. 75:12669–12692, 2016.CrossRef
12.
go back to reference Lowe, D.G., Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60:91–110, 2004.CrossRef Lowe, D.G., Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60:91–110, 2004.CrossRef
14.
go back to reference Li, T., Mei, T., Kweon, I.-S., and Hua, X.-S., Contextual bag-of-words for visual categorization. IEEE Trans. Circ. Syst. Video Technol. 21:381–392, 2011.CrossRef Li, T., Mei, T., Kweon, I.-S., and Hua, X.-S., Contextual bag-of-words for visual categorization. IEEE Trans. Circ. Syst. Video Technol. 21:381–392, 2011.CrossRef
15.
go back to reference Haas, S., Donner, R., Burner, A., Holzer, M., and Langs, G., Superpixel-based interest points for effective bags of visual words medical image retrieval. In: MICCAI International Workshop on Medical Content-Based Retrieval for Clinical Decision Support, pp. 58–68. Berlin, Heidelberg: Springer, 2011. Haas, S., Donner, R., Burner, A., Holzer, M., and Langs, G., Superpixel-based interest points for effective bags of visual words medical image retrieval. In: MICCAI International Workshop on Medical Content-Based Retrieval for Clinical Decision Support, pp. 58–68. Berlin, Heidelberg: Springer, 2011.
16.
go back to reference Yang, J., Jiang, Y.-G., Hauptmann, A. G., and Ngo, C.-W., Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the international workshop on multimedia information retrieval, Augsburg, Bavaria, Germany, pp. 197–206, 2007. Yang, J., Jiang, Y.-G., Hauptmann, A. G., and Ngo, C.-W., Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the international workshop on multimedia information retrieval, Augsburg, Bavaria, Germany, pp. 197–206, 2007.
17.
go back to reference Wang, S., Lu, S., Dong, Z., Yang, J., Yang, M., and Zhang, Y., Dual-tree complex wavelet transform and twin support vector machine for pathological brain detection. Appl. Sci. 6:169, 2016.CrossRef Wang, S., Lu, S., Dong, Z., Yang, J., Yang, M., and Zhang, Y., Dual-tree complex wavelet transform and twin support vector machine for pathological brain detection. Appl. Sci. 6:169, 2016.CrossRef
19.
go back to reference Wang, P., Krishnan, S. M., Kugean, C., and Tjoa, M., Classification of endoscopic images based on texture and neural network. In: Engineering in Medicine and Biology Society, 2001. Proceedings of the 23rd Annual International Conference of the IEEE, Istanbul, Turkey, pp. 3691–3695, 2001. Wang, P., Krishnan, S. M., Kugean, C., and Tjoa, M., Classification of endoscopic images based on texture and neural network. In: Engineering in Medicine and Biology Society, 2001. Proceedings of the 23rd Annual International Conference of the IEEE, Istanbul, Turkey, pp. 3691–3695, 2001.
20.
go back to reference Wang, S.-H., Du, S., Zhang, Y., Phillips, P., Wu, L.-N., Chen, X.-Q., et al., Alzheimer’s disease detection by Pseudo Zernike moment and linear regression classification. CNS Neurol. Disord. Drug Targets (Formerly Current Drug Targets-CNS & Neurological Disorders). 16:11–15, 2017. Wang, S.-H., Du, S., Zhang, Y., Phillips, P., Wu, L.-N., Chen, X.-Q., et al., Alzheimer’s disease detection by Pseudo Zernike moment and linear regression classification. CNS Neurol. Disord. Drug Targets (Formerly Current Drug Targets-CNS & Neurological Disorders). 16:11–15, 2017.
21.
go back to reference Krizhevsky, A., Sutskever, I., and Hinton, G. E., Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, Lake Tahoe, Nevada, pp. 1097–1105. Curran Associates, Inc., USA, 2012. Krizhevsky, A., Sutskever, I., and Hinton, G. E., Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, Lake Tahoe, Nevada, pp. 1097–1105. Curran Associates, Inc., USA, 2012.
22.
go back to reference Ahmad, J., Sajjad, M., Mehmood, I., and Baik, S.W., SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs. PLoS One. 12:e0181707, 2017.CrossRefPubMedPubMedCentral Ahmad, J., Sajjad, M., Mehmood, I., and Baik, S.W., SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs. PLoS One. 12:e0181707, 2017.CrossRefPubMedPubMedCentral
23.
go back to reference Krizhevsky, A., and Hinton, G. E., Using very deep autoencoders for content-based image retrieval. In: Proceedings of the 19th European Symposium on Artificial Neural Networks, Bruges, Belgium, pp. 489–494, 2011. Krizhevsky, A., and Hinton, G. E., Using very deep autoencoders for content-based image retrieval. In: Proceedings of the 19th European Symposium on Artificial Neural Networks, Bruges, Belgium, pp. 489–494, 2011.
25.
go back to reference Qi, Y., Song, Y.-Z., Zhang, H., and Liu, J., Sketch-based image retrieval via Siamese convolutional neural network. In: IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, pp. 2460–2464, 2016. Qi, Y., Song, Y.-Z., Zhang, H., and Liu, J., Sketch-based image retrieval via Siamese convolutional neural network. In: IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, pp. 2460–2464, 2016.
26.
go back to reference Vishnuvarthanan, A., Rajasekaran, M.P., Govindaraj, V., Zhang, Y., and Thiyagarajan, A., An automated hybrid approach using clustering and nature inspired optimization technique for improved tumor and tissue segmentation in magnetic resonance brain images. Appl. Soft Comput. 57:399–426, 2017.CrossRef Vishnuvarthanan, A., Rajasekaran, M.P., Govindaraj, V., Zhang, Y., and Thiyagarajan, A., An automated hybrid approach using clustering and nature inspired optimization technique for improved tumor and tissue segmentation in magnetic resonance brain images. Appl. Soft Comput. 57:399–426, 2017.CrossRef
27.
go back to reference Lu, S., Wang, S., and Zhang, Y., A note on the marker-based watershed method for X-ray image segmentation. Comput. Methods Prog. Biomed. 141:1–2, 2017.CrossRef Lu, S., Wang, S., and Zhang, Y., A note on the marker-based watershed method for X-ray image segmentation. Comput. Methods Prog. Biomed. 141:1–2, 2017.CrossRef
28.
go back to reference Pons, J., and Serra, X., Designing efficient architectures for modeling temporal features with convolutional neural networks. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, USA, pp. 2472–2476, 2017. Pons, J., and Serra, X., Designing efficient architectures for modeling temporal features with convolutional neural networks. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, USA, pp. 2472–2476, 2017.
29.
go back to reference Zeiler, M. D., and Fergus, R., Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (Eds), Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part I, pp. 818–833. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-10590-1_53. Zeiler, M. D., and Fergus, R., Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (Eds), Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part I, pp. 818–833. Cham: Springer International Publishing, 2014. http://​dx.​doi.​org/​10.​1007/​978-3-319-10590-1_​53.
30.
go back to reference Babenko, A., Slesarev, A., Chigorin, A., and Lempitsky, V., Neural codes for image retrieval. In: Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (Eds.), Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part I, pp. 584–599. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-10590-1_38. Babenko, A., Slesarev, A., Chigorin, A., and Lempitsky, V., Neural codes for image retrieval. In: Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (Eds.), Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part I, pp. 584–599. Cham: Springer International Publishing, 2014. http://​dx.​doi.​org/​10.​1007/​978-3-319-10590-1_​38.
31.
go back to reference Razavian, A. S., Azizpour, H., Sullivan, J., and Carlsson, S., CNN features off-the-shelf: an astounding baseline for recognition. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, 23–28 June, Columbus, OH, USA, pp. 512–519, 2014. http://dx.doi.org/10.1109/CVPRW.2014.131. Razavian, A. S., Azizpour, H., Sullivan, J., and Carlsson, S., CNN features off-the-shelf: an astounding baseline for recognition. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, 23–28 June, Columbus, OH, USA, pp. 512–519, 2014. http://​dx.​doi.​org/​10.​1109/​CVPRW.​2014.​131.
32.
go back to reference Ahmad, J., Mehmood, I., Rho, S., Chilamkurti, N., and Baik, S.W., Embedded deep vision in smart cameras for multi-view objects representation and retrieval. Comput. Electr. Eng. 61C:297–311, 2017.CrossRef Ahmad, J., Mehmood, I., Rho, S., Chilamkurti, N., and Baik, S.W., Embedded deep vision in smart cameras for multi-view objects representation and retrieval. Comput. Electr. Eng. 61C:297–311, 2017.CrossRef
33.
go back to reference Ahmad, J., Mehmood, I., and Baik, S.W., Efficient object-based surveillance image search using spatial pooling of convolutional features. J. Vis. Commun. Image Represent. 45:62–76, 2017.CrossRef Ahmad, J., Mehmood, I., and Baik, S.W., Efficient object-based surveillance image search using spatial pooling of convolutional features. J. Vis. Commun. Image Represent. 45:62–76, 2017.CrossRef
34.
go back to reference Li, C., Huang, Y., and Zhu, L., Color texture image retrieval based on Gaussian copula models of Gabor wavelets. Pattern Recogn. 64:118–129, 2017.CrossRef Li, C., Huang, Y., and Zhu, L., Color texture image retrieval based on Gaussian copula models of Gabor wavelets. Pattern Recogn. 64:118–129, 2017.CrossRef
35.
go back to reference Pogorelov, K., Randel, K. R., Griwodz, C., Eskeland, S. L., de Lange, T., Johansen, D., et al., Kvasir: a multi-class image dataset for computer aided gastrointestinal disease detection. In: Proceedings of the 8th ACM on Multimedia Systems Conference, Taipei, Taiwan, pp. 164–169, 2017. Pogorelov, K., Randel, K. R., Griwodz, C., Eskeland, S. L., de Lange, T., Johansen, D., et al., Kvasir: a multi-class image dataset for computer aided gastrointestinal disease detection. In: Proceedings of the 8th ACM on Multimedia Systems Conference, Taipei, Taiwan, pp. 164–169, 2017.
36.
go back to reference Wang, S., Chen, M., Li, Y., Shao, Y., Zhang, Y., Du, S., et al., Morphological analysis of dendrites and spines by hybridization of ridge detection with twin support vector machine. PeerJ. 4:e2207, 2016.CrossRefPubMedPubMedCentral Wang, S., Chen, M., Li, Y., Shao, Y., Zhang, Y., Du, S., et al., Morphological analysis of dendrites and spines by hybridization of ridge detection with twin support vector machine. PeerJ. 4:e2207, 2016.CrossRefPubMedPubMedCentral
37.
go back to reference Yu, L., Feng, L., Chen, C., Qiu, T., Li, L., and Wu, J., A Novel Multi-Feature Representation of Images for Heterogeneous IoTs. IEEE Access. 4:6204–6215, 2016.CrossRef Yu, L., Feng, L., Chen, C., Qiu, T., Li, L., and Wu, J., A Novel Multi-Feature Representation of Images for Heterogeneous IoTs. IEEE Access. 4:6204–6215, 2016.CrossRef
Metadata
Title
Endoscopic Image Classification and Retrieval using Clustered Convolutional Features
Authors
Jamil Ahmad
Khan Muhammad
Mi Young Lee
Sung Wook Baik
Publication date
01-12-2017
Publisher
Springer US
Published in
Journal of Medical Systems / Issue 12/2017
Print ISSN: 0148-5598
Electronic ISSN: 1573-689X
DOI
https://doi.org/10.1007/s10916-017-0836-y

Other articles of this Issue 12/2017

Journal of Medical Systems 12/2017 Go to the issue