Abstract
The Visual Place Categorization (VPC) problem is the task of predicting the semantic category of a place using only visual information collected by an autonomous robot. Previous work on this problem has used only global configuration observations, such as the Bag-of-Words model and spatial pyramid matching. In this paper, we present a novel system that addresses the problem using both global configuration observations and local object information. Specifically, we propose a local object classifier that automatically and effectively selects the key local objects of a semantic category from randomly sampled patches using the structural similarity support vector machine, and then classifies test frames with the Local Naive Bayes Nearest Neighbor algorithm. We also improve the global configuration observation with a histogram intersection codebook and a mechanism for removing noisy codewords. Temporal smoothness of the classification results is enforced with a Bayesian filtering framework. Empirically, our system outperforms state-of-the-art methods on two large-scale, difficult datasets.
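The temporal smoothing step can be illustrated with a generic discrete Bayes filter over place categories. The sketch below is not the paper's implementation: the function name `bayes_filter_smooth`, the `stay_prob` parameter, and the transition model (stay in the same category with a fixed probability, otherwise move uniformly to another) are assumptions made for illustration, and the per-frame likelihoods are assumed to be strictly positive classifier scores.

```python
import numpy as np


def bayes_filter_smooth(frame_likelihoods, stay_prob=0.9):
    """Smooth per-frame category likelihoods with a discrete Bayes filter.

    frame_likelihoods: array of shape (n_frames, n_classes), strictly positive.
    stay_prob: assumed probability that the robot stays in the same category
               between consecutive frames (hypothetical parameter).
    Returns an array of the same shape holding the filtered posteriors.
    """
    likelihoods = np.asarray(frame_likelihoods, dtype=float)
    n_frames, n_classes = likelihoods.shape
    if n_classes < 2:
        raise ValueError("need at least two categories")

    # Assumed transition model: stay with stay_prob, otherwise switch
    # uniformly to one of the other categories.
    switch_prob = (1.0 - stay_prob) / (n_classes - 1)
    transition = np.full((n_classes, n_classes), switch_prob)
    np.fill_diagonal(transition, stay_prob)

    belief = np.full(n_classes, 1.0 / n_classes)  # uniform prior
    posteriors = np.empty_like(likelihoods)
    for t in range(n_frames):
        predicted = transition.T @ belief      # prediction step
        belief = predicted * likelihoods[t]    # measurement update
        belief /= belief.sum()                 # normalize to a distribution
        posteriors[t] = belief
    return posteriors
```

Taking the arg max of each returned row gives a smoothed label sequence; a single frame whose raw likelihood briefly favors another category tends to be overruled by the belief accumulated from its neighbors.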
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, H., Wu, J. (2013). Object Templates for Visual Place Categorization. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7727. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37447-0_36
DOI: https://doi.org/10.1007/978-3-642-37447-0_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37446-3
Online ISBN: 978-3-642-37447-0
eBook Packages: Computer Science, Computer Science (R0)