Skip to main content

A Fast 3D Object Recognition Pipeline in Cluttered and Occluded Scenes

  • Conference paper
  • First Online:
Intelligent Robotics and Applications (ICIRA 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10463))

Included in the following conference series:

Abstract

In this paper we propose a framework for instance recognition and object localization in cluttered and occluded household environment for robot grasping task. The whole system bases on a coarse to fine pipeline in combination with the state-of-the-art methods of RGBD-based object detection. We build a sparse feature model by extracting structure key points incorporating texture cues in the train procedure. After that, the paper demonstrates how the algorithm decreases the time complexity and simultaneously guarantees the accuracy of the recognition and pose estimation. Quantitative experimental evaluations are presented using both acknowledged ground truth dataset and real-world robot perception system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Collet, A., Berenson, D., Srinivasa, S.S., Ferguson, D.: Object recognition and full pose registration from a single image for robotic manipulation. In: 2009 IEEE International Conference on Robotics and Automation, ICRA 2009, pp. 48–55. IEEE (2009)

    Google Scholar 

  2. Martinez, M., Collet, A., Srinivasa, S.S.: Moped: a scalable and low latency object recognition and pose estimation system. In: 2010 IEEE International Conference on Robotics and Automation (ICRA), pp. 2043–2049. IEEE (2010)

    Google Scholar 

  3. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)

    Google Scholar 

  4. Tang, J., Miller, S., Singh, A., Abbeel, P.: A textured object recognition pipeline for color and depth image data. In: 2012 IEEE International Conference on Robotics and Automation (ICRA), pp. 3467–3474. IEEE (2012)

    Google Scholar 

  5. Janoch, A., Karayev, S., Jia, Y., Barron, J.T., Fritz, M., Saenko, K., Darrell, T.: A category-level 3D object dataset: putting the kinect to work. In: Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K. (eds.) Consumer Depth Cameras for Computer Vision. Advances in Computer Vision and Pattern Recognition, pp. 141–165. Springer, London (2013). doi:10.1007/978-1-4471-4640-7_8

    Chapter  Google Scholar 

  6. Lai, K., Bo, L., Ren, X., Fox, D.: A large-scale hierarchical multi-view RGB-D object dataset. In: 2011 IEEE International Conference on Robotics and Automation (ICRA), pp. 1817–1824. IEEE (2011)

    Google Scholar 

  7. Mian, A., Bennamoun, M., Owens, R.: On the repeatability and quality of keypoints for local feature-based 3D object retrieval from cluttered scenes. Int. J. Comput. Vis. 89(2–3), 348–361 (2010)

    Article  Google Scholar 

  8. Papazov, C., Burschka, D.: An efficient RANSAC for 3D object recognition in noisy and occluded scenes. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010. LNCS, vol. 6492, pp. 135–148. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19315-6_11

    Chapter  Google Scholar 

  9. Petrelli, A., Di Stefano, L.: On the repeatability of the local reference frame for partial shape matching. In: 2011 International Conference on Computer Vision, pp. 2244–2251. IEEE (2011)

    Google Scholar 

  10. Aldoma, A., Tombari, F., Rusu, R.B., Vincze, M.: OUR-CVFH – oriented, unique and repeatable clustered viewpoint feature histogram for object recognition and 6DOF pose estimation. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds.) DAGM/OAGM 2012. LNCS, vol. 7476, pp. 113–122. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32717-9_12

    Chapter  Google Scholar 

  11. Wohlkinger, W., Vincze, M.: Ensemble of shape functions for 3D object classification. In: 2011 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 2987–2992. IEEE (2011)

    Google Scholar 

  12. Jiang, D., Wang, H., Chen, W., Wu, R.: A novel occlusion-free active recognition algorithm for objects in clutter. In: 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 1389–1394. IEEE (2016)

    Google Scholar 

  13. Aldoma, A., Tombari, F., Stefano, L., Vincze, M.: A global hypotheses verification method for 3D object recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 511–524. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_37

    Chapter  Google Scholar 

  14. Aldoma, A., Tombari, F., Prankl, J., Richtsfeld, A., Di Stefano, L., Vincze, M.: Multimodal cue integration through hypotheses verification for RGB-D object recognition and 6DoF pose estimation. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 2104–2111. IEEE (2013)

    Google Scholar 

  15. Lutz, M., Stampfer, D., Schlegel, C.: Probabilistic object recognition and pose estimation by fusing multiple algorithms. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 4244–4249. IEEE (2013)

    Google Scholar 

  16. Tombari, F., Salti, S., Stefano, L.: Unique signatures of histograms for local surface description. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 356–369. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15558-1_26

    Chapter  Google Scholar 

  17. Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGB-D mapping: using kinect-style depth cameras for dense 3D modeling of indoor environments. Int. J. Robot. Res. 31(5), 647–663 (2012)

    Article  Google Scholar 

  18. Herbst, E., Henry, P., Fox, D.: Toward online 3-D object segmentation and mapping. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 3193–3200. IEEE (2014)

    Google Scholar 

  19. Zhao, L., Huang, S., Sun, Y., Yan, L., Dissanayake, G.: Parallaxba: bundle adjustment using parallax angle feature parametrization. Int. J. Robot. Res. 34(4–5), 493–516 (2015)

    Article  Google Scholar 

  20. Richtsfeld, A., Mörwald, T., Prankl, J., Zillich, M., Vincze, M.: Segmentation of unknown objects in indoor environments. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4791–4796. IEEE (2012)

    Google Scholar 

  21. Papon, J., Abramov, A., Schoeler, M., Worgotter, F.: Voxel cloud connectivity segmentation-supervoxels for point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2027–2034. IEEE (2013)

    Google Scholar 

  22. Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The NBNN kernel. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1824–1831. IEEE (2011)

    Google Scholar 

  23. Dogar, M., Srinivasa, S.: A framework for push-grasping in clutter. Robot.: Sci. Syst. VII 1 (2011)

    Google Scholar 

Download references

Acknowledgement

This work was supported in part by the Natural Science Foundation of China under Grant U1613218, 61473191 and 61503245, in part by the Science and Technology Commission of Shanghai Municipality under Grant 15111104802, in part by Shanghai Sailing Program under Grant 15YF1406300, in part by State Key Laboratory of Robotics and System (HIT).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hesheng Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zheng, L., Wang, H., Chen, W. (2017). A Fast 3D Object Recognition Pipeline in Cluttered and Occluded Scenes. In: Huang, Y., Wu, H., Liu, H., Yin, Z. (eds) Intelligent Robotics and Applications. ICIRA 2017. Lecture Notes in Computer Science(), vol 10463. Springer, Cham. https://doi.org/10.1007/978-3-319-65292-4_51

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-65292-4_51

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-65291-7

  • Online ISBN: 978-3-319-65292-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics