Gaze Location Prediction with Depth Features as Auxiliary Information

Mohammed, Redwan Abdo A.; Schwabe, Lars; Staadt, Oliver

doi:10.1007/978-3-319-07230-2_28


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8511)

Included in the following conference series: International Conference on Human-Computer Interaction


Abstract

We present the results of a first experimental study aimed at improving the computation of saliency maps by using features from luminance and depth images. More specifically, we recorded the center of gaze of users while they viewed natural scenes. We used machine learning techniques to train a bottom-up, top-down model of saliency based on 2D and depth features/cues. We found that models trained on Itti & Koch features combined with depth features outperform models trained on individual features (i.e. only Gabor filter responses or only depth features) and models trained on other combinations of these features. Consequently, depth features combined with Itti & Koch features improve the prediction of gaze locations. This first characterization of the joint use of luminance and depth features is an important step towards developing models of eye movements that operate well under natural conditions such as those encountered in HCI settings.
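The comparison of feature sets described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not name the classifier or the evaluation metric, so logistic regression and ROC AUC are stand-ins, and the two features are synthetic random numbers, not the Itti & Koch, Gabor, or depth features from the study. The output therefore says nothing about the paper's actual results.

```python
import math
import random

def make_samples(n, rng):
    """Synthetic stand-in data (hypothetical): ((2D feature, depth feature), label).
    label 1 = fixated location, 0 = control location."""
    samples = []
    for _ in range(n):
        label = rng.randint(0, 1)
        # By construction, both features are weakly informative about the label.
        f2d = rng.gauss(0.8 * label, 1.0)
        depth = rng.gauss(0.8 * label, 1.0)
        samples.append(((f2d, depth), label))
    return samples

def train_logistic(samples, cols, epochs=200, lr=0.1):
    """Fit logistic regression on the chosen feature columns by batch gradient descent."""
    w = [0.0] * len(cols)
    b = 0.0
    n = len(samples)
    for _ in range(epochs):
        gw = [0.0] * len(cols)
        gb = 0.0
        for x, y in samples:
            z = b + sum(wi * x[c] for wi, c in zip(w, cols))
            err = 1.0 / (1.0 + math.exp(-z)) - y
            for i, c in enumerate(cols):
                gw[i] += err * x[c]
            gb += err
        w = [wi - lr * g / n for wi, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b

def auc(scores, labels):
    """ROC AUC: probability that a fixated sample outscores a control sample."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def evaluate(cols, train, test):
    """Train on one feature subset and report AUC on held-out samples."""
    w, b = train_logistic(train, cols)
    scores = [b + sum(wi * x[c] for wi, c in zip(w, cols)) for x, _ in test]
    return auc(scores, [y for _, y in test])

rng = random.Random(0)
train = make_samples(400, rng)
test = make_samples(400, rng)
auc_2d = evaluate([0], train, test)       # 2D feature only
auc_depth = evaluate([1], train, test)    # depth feature only
auc_both = evaluate([0, 1], train, test)  # both features combined
print(f"2D only: {auc_2d:.3f}, depth only: {auc_depth:.3f}, combined: {auc_both:.3f}")
```

With real data, each sample would hold feature values extracted at a recorded gaze location or at a control location, and the same held-out comparison would be run for every feature subset of interest.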


References

Hoover, A., Jean-Baptiste, G., Jiang, X.: An experimental comparison of range image segmentation algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 18, 673–689 (1996)
Borji, A., Itti, L.: State-of-the-art in visual attention modeling. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 185–207 (2013)
Gao, D., Vasconcelos, N.: Discriminant saliency for visual recognition from cluttered scenes. In: NIPS (2004)
Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems 19, pp. 545–552. MIT Press (2007)
Horvitz, E., Kadie, C., Paek, T., Hovel, D.: Models of attention in computing and communication: From principles to applications (2003)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(11), 1254–1259 (1998)
Lewis, J.P.: Fast normalized cross-correlation (1995)
Mahadevan, V., Vasconcelos, N.: Spatiotemporal saliency in dynamic scenes. IEEE Trans. Pattern Anal. Mach. Intell. 32(1), 171–177 (2010)
Mohammed, R.A.A., Schwabe, L.: Scene-dependence of saliency maps of natural luminance and depth images. In: Fifth Baltic Conference “Human - Computer Interaction” (2011) (to appear)
Mohammed, R.A.A., Schwabe, L.: A brain informatics approach to explain the oblique effect via depth statistics. In: Zanzotto, F.M., Tsumoto, S., Taatgen, N., Yao, Y. (eds.) BI 2012. LNCS, vol. 7670, pp. 97–106. Springer, Heidelberg (2012)
Mohammed, R.A.A., Mohammed, S.A., Schwabe, L.: Batgaze: A new tool to measure depth features at the center of gaze during free viewing. In: Zanzotto, F.M., Tsumoto, S., Taatgen, N., Yao, Y. (eds.) BI 2012. LNCS, vol. 7670, pp. 85–96. Springer, Heidelberg (2012)
Potetz, B., Lee, T.S.: Statistical correlations between 2d images and 3d structures in natural scenes. Journal of the Optical Society of America A 20(7), 1292–1303 (2003)
Reinagel, P., Zador, A.M.: Natural scene statistics at the centre of gaze. Network 10(4), 341–350 (1999)
Roda, C.: Human Attention in Digital Environments. Cambridge University Press, Cambridge (2011)
Saxena, A., Chung, S.H., Ng, A.Y.: Learning depth from single monocular images. In: NIPS 18. MIT Press (2005)
Saxena, A., Sun, M., Ng, A.Y.: Make3d: Learning 3d scene structure from a single still image. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 824–840 (2009)
Durand, F., Judd, T., Ehinger, K., Torralba, A.: Learning to predict where humans look. In: ICCV (2009)
Yang, Z., Purves, D.: Image source statistics of surfaces in natural scenes. Network: Computation in Neural Systems 14(3), 371–390 (2003)
Yokoya, N., Levine, M.D.: Range image segmentation based on differential geometry: A hybrid approach. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(6), 643–649 (1989)


Author information

Authors and Affiliations

Institute of Computer Science, University of Rostock, Rostock, Germany
Redwan Abdo A. Mohammed, Lars Schwabe & Oliver Staadt


Editor information

Editors and Affiliations

The Open University of Japan, 2-11 Wakaba, 261-8586, Mihama-ku, Chiba-shi, Japan
Masaaki Kurosu


About this paper

Cite this paper

Mohammed, R.A.A., Schwabe, L., Staadt, O. (2014). Gaze Location Prediction with Depth Features as Auxiliary Information. In: Kurosu, M. (eds) Human-Computer Interaction. Advanced Interaction Modalities and Techniques. HCI 2014. Lecture Notes in Computer Science, vol 8511. Springer, Cham. https://doi.org/10.1007/978-3-319-07230-2_28


DOI: https://doi.org/10.1007/978-3-319-07230-2_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07229-6
Online ISBN: 978-3-319-07230-2
eBook Packages: Computer Science, Computer Science (R0)
