A Fast 3D Object Recognition Pipeline in Cluttered and Occluded Scenes

Zheng, Liupo; Wang, Hesheng; Chen, Weidong

doi:10.1007/978-3-319-65292-4_51

Liupo Zheng¹⁷,
Hesheng Wang¹⁷ &
Weidong Chen¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10463))

Included in the following conference series:

International Conference on Intelligent Robotics and Applications

4129 Accesses
4 Citations

Abstract

In this paper we propose a framework for instance recognition and object localization in cluttered and occluded household environment for robot grasping task. The whole system bases on a coarse to fine pipeline in combination with the state-of-the-art methods of RGBD-based object detection. We build a sparse feature model by extracting structure key points incorporating texture cues in the train procedure. After that, the paper demonstrates how the algorithm decreases the time complexity and simultaneously guarantees the accuracy of the recognition and pose estimation. Quantitative experimental evaluations are presented using both acknowledged ground truth dataset and real-world robot perception system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Collet, A., Berenson, D., Srinivasa, S.S., Ferguson, D.: Object recognition and full pose registration from a single image for robotic manipulation. In: 2009 IEEE International Conference on Robotics and Automation, ICRA 2009, pp. 48–55. IEEE (2009)
Google Scholar
Martinez, M., Collet, A., Srinivasa, S.S.: Moped: a scalable and low latency object recognition and pose estimation system. In: 2010 IEEE International Conference on Robotics and Automation (ICRA), pp. 2043–2049. IEEE (2010)
Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)
Google Scholar
Tang, J., Miller, S., Singh, A., Abbeel, P.: A textured object recognition pipeline for color and depth image data. In: 2012 IEEE International Conference on Robotics and Automation (ICRA), pp. 3467–3474. IEEE (2012)
Google Scholar
Janoch, A., Karayev, S., Jia, Y., Barron, J.T., Fritz, M., Saenko, K., Darrell, T.: A category-level 3D object dataset: putting the kinect to work. In: Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K. (eds.) Consumer Depth Cameras for Computer Vision. Advances in Computer Vision and Pattern Recognition, pp. 141–165. Springer, London (2013). doi:10.1007/978-1-4471-4640-7_8
Chapter Google Scholar
Lai, K., Bo, L., Ren, X., Fox, D.: A large-scale hierarchical multi-view RGB-D object dataset. In: 2011 IEEE International Conference on Robotics and Automation (ICRA), pp. 1817–1824. IEEE (2011)
Google Scholar
Mian, A., Bennamoun, M., Owens, R.: On the repeatability and quality of keypoints for local feature-based 3D object retrieval from cluttered scenes. Int. J. Comput. Vis. 89(2–3), 348–361 (2010)
Article Google Scholar
Papazov, C., Burschka, D.: An efficient RANSAC for 3D object recognition in noisy and occluded scenes. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010. LNCS, vol. 6492, pp. 135–148. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19315-6_11
Chapter Google Scholar
Petrelli, A., Di Stefano, L.: On the repeatability of the local reference frame for partial shape matching. In: 2011 International Conference on Computer Vision, pp. 2244–2251. IEEE (2011)
Google Scholar
Aldoma, A., Tombari, F., Rusu, R.B., Vincze, M.: OUR-CVFH – oriented, unique and repeatable clustered viewpoint feature histogram for object recognition and 6DOF pose estimation. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds.) DAGM/OAGM 2012. LNCS, vol. 7476, pp. 113–122. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32717-9_12
Chapter Google Scholar
Wohlkinger, W., Vincze, M.: Ensemble of shape functions for 3D object classification. In: 2011 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 2987–2992. IEEE (2011)
Google Scholar
Jiang, D., Wang, H., Chen, W., Wu, R.: A novel occlusion-free active recognition algorithm for objects in clutter. In: 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 1389–1394. IEEE (2016)
Google Scholar
Aldoma, A., Tombari, F., Stefano, L., Vincze, M.: A global hypotheses verification method for 3D object recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 511–524. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_37
Chapter Google Scholar
Aldoma, A., Tombari, F., Prankl, J., Richtsfeld, A., Di Stefano, L., Vincze, M.: Multimodal cue integration through hypotheses verification for RGB-D object recognition and 6DoF pose estimation. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 2104–2111. IEEE (2013)
Google Scholar
Lutz, M., Stampfer, D., Schlegel, C.: Probabilistic object recognition and pose estimation by fusing multiple algorithms. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 4244–4249. IEEE (2013)
Google Scholar
Tombari, F., Salti, S., Stefano, L.: Unique signatures of histograms for local surface description. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 356–369. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15558-1_26
Chapter Google Scholar
Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGB-D mapping: using kinect-style depth cameras for dense 3D modeling of indoor environments. Int. J. Robot. Res. 31(5), 647–663 (2012)
Article Google Scholar
Herbst, E., Henry, P., Fox, D.: Toward online 3-D object segmentation and mapping. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 3193–3200. IEEE (2014)
Google Scholar
Zhao, L., Huang, S., Sun, Y., Yan, L., Dissanayake, G.: Parallaxba: bundle adjustment using parallax angle feature parametrization. Int. J. Robot. Res. 34(4–5), 493–516 (2015)
Article Google Scholar
Richtsfeld, A., Mörwald, T., Prankl, J., Zillich, M., Vincze, M.: Segmentation of unknown objects in indoor environments. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4791–4796. IEEE (2012)
Google Scholar
Papon, J., Abramov, A., Schoeler, M., Worgotter, F.: Voxel cloud connectivity segmentation-supervoxels for point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2027–2034. IEEE (2013)
Google Scholar
Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The NBNN kernel. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1824–1831. IEEE (2011)
Google Scholar
Dogar, M., Srinivasa, S.: A framework for push-grasping in clutter. Robot.: Sci. Syst. VII 1 (2011)
Google Scholar

Download references

Acknowledgement

This work was supported in part by the Natural Science Foundation of China under Grant U1613218, 61473191 and 61503245, in part by the Science and Technology Commission of Shanghai Municipality under Grant 15111104802, in part by Shanghai Sailing Program under Grant 15YF1406300, in part by State Key Laboratory of Robotics and System (HIT).

Author information

Authors and Affiliations

Shanghai Jiao Tong University, Shanghai, 200240, China
Liupo Zheng, Hesheng Wang & Weidong Chen

Authors

Liupo Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Hesheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Weidong Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hesheng Wang .

Editor information

Editors and Affiliations

School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
YongAn Huang
School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
Hao Wu
Institute of Industrial Research, University of Portsmouth, Portsmouth, United Kingdom
Honghai Liu
School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
Zhouping Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zheng, L., Wang, H., Chen, W. (2017). A Fast 3D Object Recognition Pipeline in Cluttered and Occluded Scenes. In: Huang, Y., Wu, H., Liu, H., Yin, Z. (eds) Intelligent Robotics and Applications. ICIRA 2017. Lecture Notes in Computer Science(), vol 10463. Springer, Cham. https://doi.org/10.1007/978-3-319-65292-4_51

Download citation

DOI: https://doi.org/10.1007/978-3-319-65292-4_51
Published: 06 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65291-7
Online ISBN: 978-3-319-65292-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics