Skip to main content

Pixel-Level Hand Detection with Shape-Aware Structured Forests

  • Conference paper
  • First Online:
Computer Vision -- ACCV 2014 (ACCV 2014)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9006))

Included in the following conference series:

Abstract

Hand detection has many important applications in HCI, yet it is a challenging problem because the appearance of hands can vary greatly in images. In this paper, we propose a novel method for efficient pixel-level hand detection. Unlike previous method which assigns a binary label to every pixel independently, our method estimates a probability shape mask for a pixel using structured forests. This approach can better exploit hand shape information in the training data, and enforce shape constraints in the estimation. Aggregation of multiple predictions generated from neighboring pixels further improves the robustness of our method. We evaluate our method on both ego-centric videos and unconstrained still images. Experiment results show that our method can detect hands efficiently and outperform other state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.cs.cmu.edu/~kkitani/perpix/.

References

  1. Viola, P., Jones, M.: Robust Real-time Object Detection. Int. J. Comput. Vis. 57, 137–154 (2004)

    Article  Google Scholar 

  2. Dollár, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian Detection: An Evaluation of the State of the Art. IEEE Trans. Pattern Anal. Mach. Intell. 34, 743–761 (2012)

    Article  Google Scholar 

  3. Dalal, N., Triggs, B., Europe, D.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. vol. 1, pp. 886–893. IEEE (2005)

    Google Scholar 

  4. Jones, M., Rehg, J.: Statistical color models with application to skin detection. Int. J. Comput. Vis. 46, 81–96 (2002)

    Article  MATH  Google Scholar 

  5. Li, C., Kitani, K.: Pixel-level hand detection in ego-centric videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2013)

    Google Scholar 

  6. Sigal, L., Sclaroff, S., Athitsos, V.: Skin Color-Based Video Segmentation under Time-Varying Illumination. IEEE Trans. Pattern Anal. Mach. Intell. 26, 862–877 (2004)

    Article  Google Scholar 

  7. Kolsch, M., Turk, M.: Hand tracking with flocks of features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2005)

    Google Scholar 

  8. Pérez, P., Hue, C., Vermaak, J., Gangnet, M.: Color-based probabilistic tracking. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part I. LNCS, vol. 2350, pp. 661–675. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  9. Thayananthan, A., Torr, P.H.S., Member, S., Stenger, B., Cipolla, R.: Model-based hand tracking using a hierarchical Bayesian filter. IEEE Trans. on Pattern Anal. Mach. Intell. 28, 1372–1384 (2006)

    Article  Google Scholar 

  10. Oikonomidis, I., Kyriazis, N., Argyros, A.A.: Markerless and efficient 26-DOF hand pose recovery. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 744–757. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  11. Mittal, A., Zisserman, A., Torr, P.: Hand detection using multiple proposals. In: Proceedings of the British Machine Vision Conference, pp.75.1–75.11. British Machine Vision Association (2011)

    Google Scholar 

  12. Buehler, P., Everingham, M., Huttenlocher, D.P., Zisserman, A.: Upper Body Detection and Tracking in Extended Signing Sequences. Int. J. Comput. Vis. 95, 180–197 (2011)

    Article  Google Scholar 

  13. Sheikh, Y., Javed, O., Kanade, T.: Background subtraction for freely moving cameras. In: Proceedings of the IEEE 12th International Conference on Computer Vision, pp. 1219–1225. IEEE (2009)

    Google Scholar 

  14. Fathi, A., Ren, X., Rehg, J.M.: Learning to recognize objects in egocentric activities. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3281–3288. IEEE (2011)

    Google Scholar 

  15. Trinh, H., Fan, Q., Gabbur, P., Pankanti, S.: Hand tracking by binary quadratic programming and its application to retail activity recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2012)

    Google Scholar 

  16. Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: Proceedings of the British Machine Vision Conference, pp. 1–11 (2009)

    Google Scholar 

  17. Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. Chapman and Hall/CRC, New York (1984)

    MATH  Google Scholar 

  18. Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1627–1645 (2010)

    Article  Google Scholar 

  19. Sagonas, C., Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: 300 faces in-the-wild challenge: the first facial landmark localization challenge. In: IEEE International Conference on Computer Vision Workshops, pp. 397–403. IEEE (2013)

    Google Scholar 

  20. Bagon, S., Brostovski, O., Galun, M., Irani, M.: Detecting and sketching the common. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 33–40. IEEE (2010)

    Google Scholar 

  21. Rother, C., Kolmogorov, V., Blake, A.: Grabcut: Interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23, 309–314 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaolong Zhu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhu, X., Jia, X., Wong, KY.K. (2015). Pixel-Level Hand Detection with Shape-Aware Structured Forests. In: Cremers, D., Reid, I., Saito, H., Yang, MH. (eds) Computer Vision -- ACCV 2014. ACCV 2014. Lecture Notes in Computer Science(), vol 9006. Springer, Cham. https://doi.org/10.1007/978-3-319-16817-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16817-3_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16816-6

  • Online ISBN: 978-3-319-16817-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics