
From Virtual to Real World Visual Perception Using Domain Adaptation—The DPM as Example

Chapter in: Domain Adaptation in Computer Vision Applications

Abstract

Supervised learning tends to produce more accurate classifiers than unsupervised learning in general. This implies that annotated training data is preferred. When addressing visual perception challenges, such as localizing certain object classes within an image, the learning of the involved classifiers turns out to be a practical bottleneck. The reason is that, at the very least, we have to frame object examples with bounding boxes in thousands of images. A priori, the more complex the model is in terms of its number of parameters, the more annotated examples are required. This annotation task is performed by human oracles, which introduces inaccuracies and errors into the annotations (a.k.a. the ground truth), since the task is inherently cumbersome and sometimes ambiguous. As an alternative, we have pioneered the use of virtual worlds for collecting such annotations automatically and with high precision. However, since the models learned with virtual data must operate in the real world, we still need to perform domain adaptation (DA). In this chapter, we revisit the DA of a Deformable Part-Based Model (DPM) as an exemplifying case of virtual-to-real-world DA. As a use case, we address the challenge of vehicle detection for driver assistance, using different publicly available virtual-world datasets. While doing so, we investigate questions such as how the domain gap between virtual and real data behaves with respect to the dominant object appearance per domain, as well as the role of photo-realism in the virtual world.
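The chapter's actual adaptation method is not reproduced on this page; purely as an illustration of the kind of supervised virtual-to-real DA baseline the abstract alludes to, the minimal sketch below trains a detector-style classifier on abundant virtual-world samples mixed with a small, up-weighted set of annotated real-world samples. It is a hypothetical example under explicit assumptions: a linear SVM stands in for the DPM's latent-SVM training, the feature vectors are random placeholders for HOG-like window descriptors, and all names and numbers are illustrative only.

    # Hypothetical sketch of a mixed virtual+real training baseline for
    # virtual-to-real domain adaptation. Placeholders only; not the chapter's method.
    import numpy as np
    from sklearn.svm import LinearSVC

    rng = np.random.default_rng(0)

    # Many labeled virtual-world windows (cheap, automatically annotated)...
    X_virtual = rng.normal(loc=0.0, scale=1.0, size=(2000, 128))
    y_virtual = rng.integers(0, 2, size=2000)      # 1 = vehicle, 0 = background

    # ...and only a few labeled real-world windows (the costly human annotations).
    # The shifted mean crudely mimics the virtual-vs-real domain gap.
    X_real = rng.normal(loc=0.3, scale=1.0, size=(100, 128))
    y_real = rng.integers(0, 2, size=100)

    # Source-only baseline: train on virtual data alone.
    src_only = LinearSVC(C=0.01, max_iter=10000).fit(X_virtual, y_virtual)

    # Adapted model: pool both domains and up-weight the scarce real samples
    # so they are not drowned out by the virtual majority.
    X_mix = np.vstack([X_virtual, X_real])
    y_mix = np.concatenate([y_virtual, y_real])
    weights = np.concatenate([np.ones(len(y_virtual)), 10.0 * np.ones(len(y_real))])
    adapted = LinearSVC(C=0.01, max_iter=10000).fit(X_mix, y_mix, sample_weight=weights)

In a real pipeline, the two models would be compared on held-out real-world test images; the gap between them is one simple way to quantify how much the domain shift hurts and how much a handful of real annotations recovers.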


Notes

  1. With this technique we won the first pedestrian detection challenge of the KITTI benchmark suite, part of the Recognition Meets Reconstruction Challenge held at ICCV’13.

  2. Dataset available at http://www.xrce.xerox.com/Research-Development/Computer-Vision/Proxy-Virtual-Worlds.

  3. Dataset available at http://synthia-dataset.net.

  4. See Fig. 13.2 for a pictorial intuition.

  5. The reader is referred to [544] for the mathematical and technical details.

  6. See http://unity3d.com.

  7. The number of vehicles mentioned in Table 13.1 refers to the moderate cases.

  8. It is a fallacy to believe that, because good datasets are big, big datasets must therefore be good [34].

Acknowledgements

The authors want to thank the following funding bodies: the Spanish MEC Project TRA2014-57088-C2-1-R; the People Programme (Marie Curie Actions) FP7/2007-2013 REA grant agreement no. 600388; the Agency of Competitiveness for Companies of the Government of Catalonia (ACCIO); the Generalitat de Catalunya Project 2014-SGR-1506; and the NVIDIA Corporation for its generous support in the form of different GPU hardware units.

Author information


Correspondence to Antonio M. López.



Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

López, A.M., Xu, J., Gómez, J.L., Vázquez, D., Ros, G. (2017). From Virtual to Real World Visual Perception Using Domain Adaptation—The DPM as Example. In: Csurka, G. (ed.) Domain Adaptation in Computer Vision Applications. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-58347-1_13

  • DOI: https://doi.org/10.1007/978-3-319-58347-1_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58346-4

  • Online ISBN: 978-3-319-58347-1

  • eBook Packages: Computer Science, Computer Science (R0)
