A Novel Method for Scene Categorization Using an Improved Visual Vocabulary Approach

Elguebaly, Tarek; Bouguila, Nizar

doi:10.1007/978-3-319-11298-5_4


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8779)

Included in the following conference series:

International Conference on Adaptive and Intelligent Systems


Abstract

The performance of any scene categorization system depends on the scene representation algorithm it uses. Lately, the Bag of Visual Words (BoVW) approach has become the method of choice for this crucial task. Nevertheless, the BoVW approach has several flaws. First, the K-means clustering algorithm used to build the visual dictionary is based solely on the Euclidean distance. Second, the size of the visual vocabulary is a user-supplied parameter, which is impractical because the final categorization depends critically on the chosen number of visual words. Finally, assigning each descriptor to only one visual word is unrealistic because it ignores the uncertainty present at the image descriptor level. Therefore, in this paper, we propose a simple solution to these problems. Our algorithm uses the Asymmetric Generalized Gaussian mixture (AGGM) to model the distribution of the visual words. This choice is motivated by the fact that the Asymmetric Generalized Gaussian distribution (AGGD) can fit different shapes of observed non-Gaussian and asymmetric data. To automatically determine the number of visual words, which in our case is the number of mixture components, we employ the Minimum Message Length (MML) criterion. We propose a soft assignment that exploits the probability of each descriptor belonging to each visual word, thereby accounting for the uncertainty present at the image descriptor level. The efficacy of the proposed algorithm is validated by applying it to scene categorization.
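
The soft assignment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the AGGD parameterization (location `mu`, left and right scales `sigma_l` and `sigma_r`, shape `beta`), the assumption that descriptor dimensions are independent within a component, and the function names `aggd_log_pdf` and `soft_histogram` are all assumptions made for illustration. Mixture parameters are taken as given here; fitting them and selecting the number of components with MML are not shown.

```python
import math


def aggd_log_pdf(x, mu, sigma_l, sigma_r, beta):
    """Log-density of a 1-D asymmetric generalized Gaussian.

    Assumed parameterization: the scale is sigma_l to the left of mu and
    sigma_r to the right; beta controls the tail shape (beta = 2 with
    sigma_l == sigma_r recovers the ordinary Gaussian).
    """
    lg1 = math.lgamma(1.0 / beta)
    lg3 = math.lgamma(3.0 / beta)
    # A(beta) = [Gamma(3/beta) / Gamma(1/beta)]^(beta/2)
    a = math.exp(0.5 * beta * (lg3 - lg1))
    # Normalizing constant so the density integrates to one.
    log_norm = (math.log(beta) + 0.5 * (lg3 - lg1)
                - math.log(sigma_l + sigma_r) - lg1)
    scale = sigma_l if x < mu else sigma_r
    return log_norm - a * (abs(x - mu) / scale) ** beta


def soft_histogram(descriptors, weights, components):
    """Soft-assignment visual-word histogram for one image.

    descriptors: list of D-dimensional descriptors (lists of floats).
    weights:     mixing weight of each visual word (mixture component).
    components:  for each visual word, a list of D parameter tuples
                 (mu, sigma_l, sigma_r, beta), one per dimension.
    Each descriptor contributes its posterior probability to every word
    instead of a single hard vote.
    """
    hist = [0.0] * len(weights)
    for x in descriptors:
        logs = [
            math.log(w) + sum(aggd_log_pdf(xd, *p) for xd, p in zip(x, comp))
            for w, comp in zip(weights, components)
        ]
        # Log-sum-exp trick for numerically stable posteriors.
        m = max(logs)
        exps = [math.exp(v - m) for v in logs]
        total = sum(exps)
        for j, e in enumerate(exps):
            hist[j] += e / total
    return [h / len(descriptors) for h in hist]


# Toy usage: two 2-D visual words and three descriptors.
words = [[(0.0, 1.0, 2.0, 1.5)] * 2, [(5.0, 1.0, 1.0, 2.0)] * 2]
image = [[0.2, -0.1], [4.8, 5.3], [2.5, 2.5]]
print(soft_histogram(image, [0.5, 0.5], words))
```

A descriptor lying between two visual words, such as the third one in the toy usage, splits its vote between them, whereas a hard K-means assignment would give its whole vote to a single word.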

The authors would like to thank the Natural Sciences and Engineering Research Council of Canada (NSERC).



Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Faculty of Engineering and Computer Science, Concordia University, Canada
Tarek Elguebaly
Concordia Institute for Information Systems Engineering, Faculty of Engineering and Computer Science, Concordia University, Canada
Nizar Bouguila


Editor information

Editors and Affiliations

Faculty of Science and Technology, Bournemouth University, Fern Barrow, BH12 5BB, Poole, Dorset, UK
Abdelhamid Bouchachia


About this paper

Cite this paper

Elguebaly, T., Bouguila, N. (2014). A Novel Method for Scene Categorization Using an Improved Visual Vocabulary Approach. In: Bouchachia, A. (eds) Adaptive and Intelligent Systems. ICAIS 2014. Lecture Notes in Computer Science, vol 8779. Springer, Cham. https://doi.org/10.1007/978-3-319-11298-5_4


DOI: https://doi.org/10.1007/978-3-319-11298-5_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11297-8
Online ISBN: 978-3-319-11298-5
eBook Packages: Computer Science, Computer Science (R0)
