Skip to main content

Optimizing Image Registration for Interactive Applications

  • Conference paper
  • First Online:
Augmented Reality, Virtual Reality, and Computer Graphics (AVR 2016)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9768))

  • 3867 Accesses

Abstract

With the spread of wearable and mobile devices, the request for interactive augmented reality applications is in constant growth. Among the different possibilities, we focus on the cultural heritage domain where a key step in the development applications for augmented cultural experiences is to obtain a precise localization of the user, i.e. the 6 degree-of-freedom of the camera acquiring the images used by the application. Current state of the art perform this task by extracting local descriptors from a query and exhaustively matching them to a sparse 3D model of the environment. While this procedure obtains good localization performance, due to the vast search space involved in the retrieval of 2D-3D correspondences this is often not feasible in real-time and interactive environments. In this paper we hence propose to perform descriptor quantization to reduce the search space and employ multiple KD-Trees combined with a principal component analysis dimensionality reduction to enable an efficient search. We experimentally show that our solution can halve the computational requirements of the correspondence search with regard to the state of the art while maintaining similar accuracy levels.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alletto, S., Abati, D., Serra, G., Cucchiara, R.: Exploring architectural details through awearable egocentric vision device. Sensors 16(2) (2016)

    Google Scholar 

  2. Arth, C., Wagner, D., Klopschitz, M., Irschara, A., Schmalstieg, D.: Wide area localization on mobile phones. In: Proceedings of IEEE International Symposium on Mixed and Augmented Reality (2009)

    Google Scholar 

  3. Bentley, J.L.: Multidimensional binary search trees used for associative searching. Commun. ACM 18(9), 509–517 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  4. Brown, M., Lowe, D.G.: Unsupervised 3D object recognition and reconstruction in unordered datasets. In: Proceedings of International Conference on 3-D Digital Imaging and Modeling (2005)

    Google Scholar 

  5. Castle, R., Klein, G., Murray, D.W.: Video-rate localization in multiple maps for wearable augmented reality. In: Proceedings of IEEE International Symposium on Wearable Computers (2008)

    Google Scholar 

  6. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004). ISBN: 0521540518

    Google Scholar 

  7. Hauagge, D., Wehrwein, S., Upchurch, P., Bala, K., Snavely, N.: Reasoning about photo collections using models of outdoor illumination. In: Proceedings of British Machine Vision Conference (2014)

    Google Scholar 

  8. Hays, J., Efros, A.: IM2GPS: estimating geographic information from a single image. In: Proceedings of CVPR (2008)

    Google Scholar 

  9. Irschara, A., Zach, C., Frahm, J., Bischof, H.: From structure-from-motion point clouds to fast location recognition. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

  10. Jolliffe, I.: Principal Component Analysis. Wiley Online Library (2002)

    Google Scholar 

  11. Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (2004)

    Google Scholar 

  12. Kroeger, T., Van Gool, L.: Video registration to SfM models. In: Proceedings of IEEE European Conference on Computer Vision (2014)

    Google Scholar 

  13. Li, Y., Snavely, N., Huttenlocher, D., Fua, P.: Worldwide pose estimation using 3D point clouds. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 15–29. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  14. Li, Y., Snavely, N., Huttenlocher, D., Fua, P.: Worldwide pose estimation using 3D point clouds. In: Proceedings of IEEE European Conference on Computer Vision (2012)

    Google Scholar 

  15. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  16. Perronnin, F., Sánchez, J., Mensink, T.: Improving the fisher kernel for large-scale image classification. In: Proceedings of IEEE European Conference on Computer Vision (2010)

    Google Scholar 

  17. Pollefeys, M., Van Gool, L., Vergauwen, M., Verbiest, F., Cornelis, K., Tops, J., Koch, R.: Visual modeling with a hand-held camera. Int. J. Comput. Vis. 59(3), 207–232 (2004)

    Article  Google Scholar 

  18. Sattler, T., Leibe, B., Kobbelt, L.: Fast image-based localization using direct 2D-to-3D matching. In: Proceedings of IEEE International Conference on Computer Vision (2011)

    Google Scholar 

  19. Sattler, T., Weyand, T., Leibe, B., Kobbelt, L.: Image retrieval for image-based localization revisited. In: Proceedings of British Machine Vision Conference (2012)

    Google Scholar 

  20. Schindler, G., Brown, M., Szeliski, R.: City-scale location recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  21. Schops, T., Engel, J., Cremers, D.: Semi-dense visual odometry for AR on a smartphone. In: Proceedings of IEEE International Symposium on Mixed and Augmented Reality (2014)

    Google Scholar 

  22. Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: exploring photo collections in 3D. ACM Trans. Graph. 25, 835–846 (2006). ACM

    Article  Google Scholar 

  23. Snavely, N., Seitz, S.M., Szeliski, R.: Modeling the world from internet photo collections. Int. J. Comput. Vis. 80(2), 189–210 (2008)

    Article  Google Scholar 

  24. Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M., Pajdla, T.: 24/7 place recognition by view synthesis. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2015)

    Google Scholar 

  25. Wu, C., Agarwal, S., Curless, B., Seitz, S.: Multicore bundle adjustment. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2011)

    Google Scholar 

Download references

Acknowledgments

This work was partially supported by the Fondazione Cassa di Risparmio di Modena project: “Vision for Augmented Experience” and the PON R&C project DICET-INMOTO (Cod. PON04a2 D).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giuseppe Serra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Gasparini, R., Alletto, S., Serra, G., Cucchiara, R. (2016). Optimizing Image Registration for Interactive Applications. In: De Paolis, L., Mongelli, A. (eds) Augmented Reality, Virtual Reality, and Computer Graphics. AVR 2016. Lecture Notes in Computer Science(), vol 9768. Springer, Cham. https://doi.org/10.1007/978-3-319-40621-3_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-40621-3_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40620-6

  • Online ISBN: 978-3-319-40621-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics