Rapid Detection of Many Object Instances

Tongphu, Suwan; Thongsak, Naddao; Dailey, Matthew N.

doi:10.1007/978-3-642-04697-1_40

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5807)

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

Abstract

We describe an algorithm capable of detecting multiple object instances within a scene in the presence of changes in object viewpoint. Our approach first represents each image patch by calculating, within a sliding window, frequency vectors for discrete feature vector clusters (visual words). We then classify each patch using an AdaBoost classifier whose weak classifiers each apply a threshold to one visual word’s frequency within the patch. Compared to previous work, our algorithm is simpler yet performs remarkably well on scenes containing many object instances. The method requires relatively few training examples and takes 2.2 seconds on commodity hardware to process a 640×480 image. In a test on a challenging car detection problem using a relatively small training set, our implementation dramatically outperforms a standard AdaBoost cascade using Haar-like features.
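
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes keypoints have already been extracted and quantized into visual words, so each keypoint is a hypothetical `(x, y, word_id)` tuple. The function names, the exhaustive threshold search, and the window and stride parameters are all illustrative choices that the abstract does not specify.

```python
import math

def window_histogram(keypoints, x0, y0, width, height, vocab_size):
    """Normalized visual-word frequency vector for keypoints inside one window."""
    counts = [0] * vocab_size
    for x, y, word in keypoints:
        if x0 <= x < x0 + width and y0 <= y < y0 + height:
            counts[word] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * vocab_size

def stump_predict(stump, hist):
    """Weak classifier: threshold the frequency of a single visual word."""
    word, threshold, polarity = stump
    return polarity if hist[word] > threshold else -polarity

def train_adaboost(hists, labels, rounds):
    """Discrete AdaBoost over single-word threshold stumps (labels are +1/-1)."""
    n = len(hists)
    weights = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        best = None
        # Assumed search strategy: try every word, observed threshold, and polarity.
        for word in range(len(hists[0])):
            for threshold in sorted({h[word] for h in hists}):
                for polarity in (1, -1):
                    stump = (word, threshold, polarity)
                    err = sum(w for w, h, y in zip(weights, hists, labels)
                              if stump_predict(stump, h) != y)
                    if best is None or err < best[0]:
                        best = (err, stump)
        err, stump = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the logarithm finite
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        # Re-weight so that misclassified examples count more in the next round.
        weights = [w * math.exp(-alpha * y * stump_predict(stump, h))
                   for w, h, y in zip(weights, hists, labels)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble

def classify(ensemble, hist):
    """Sign of the weighted vote of all weak classifiers."""
    score = sum(alpha * stump_predict(stump, hist) for alpha, stump in ensemble)
    return 1 if score > 0 else -1

def detect(keypoints, image_w, image_h, win_w, win_h, stride, vocab_size, ensemble):
    """Slide a window over the image and return the corners of positive windows."""
    hits = []
    for y0 in range(0, image_h - win_h + 1, stride):
        for x0 in range(0, image_w - win_w + 1, stride):
            hist = window_histogram(keypoints, x0, y0, win_w, win_h, vocab_size)
            if classify(ensemble, hist) == 1:
                hits.append((x0, y0))
    return hits
```

In this sketch, `train_adaboost` would be called on histograms computed from labeled training patches, and `detect` would then be run on the quantized keypoints of a test image. A practical detector would also need to merge overlapping positive windows, which is omitted here.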

Author information

Authors and Affiliations

Computer Science and Information Management, Asian Institute of Technology, Thailand
Suwan Tongphu, Naddao Thongsak & Matthew N. Dailey

Editor information

Editors and Affiliations

DGA/D4S/MRIS, CEP/GIP, 16 bis avenue Prieur de la Côte d’Or, 94114, Arcueil, France
Jacques Blanc-Talon
Department of Telecommunication and Information Processing, Ghent University, St.-Pietersnieuwstraat 41, 9000, Gent, Belgium
Wilfried Philips
CSIRO ICT Centre, Epping, PO Box 76, 1710, Sydney, NSW, Australia
Dan Popescu
University of Antwerp, Universiteitsplein 1; Building N., 2610, Wilrijk, Belgium
Paul Scheunders

About this paper

Cite this paper

Tongphu, S., Thongsak, N., Dailey, M.N. (2009). Rapid Detection of Many Object Instances. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2009. Lecture Notes in Computer Science, vol 5807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04697-1_40

DOI: https://doi.org/10.1007/978-3-642-04697-1_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04696-4
Online ISBN: 978-3-642-04697-1
eBook Packages: Computer Science, Computer Science (R0)
