Dialog-Based 3D-Image Recognition Using a Domain Ontology

  • Conference paper
Spatial Cognition V Reasoning, Action, Interaction (Spatial Cognition 2006)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4387)

Abstract

The combination of vision and speech, together with the resulting need for formal representations, forms a central component of an autonomous system. A robot that is supposed to navigate autonomously through space must be able to perceive its environment as automatically as possible. But every recognition system has inherent limits. In particular, a robot whose task is to navigate through unknown terrain has to deal with unidentified or even unknown objects, which compounds the recognition problem further. The system described in this paper takes this into account by trying to identify objects based on their functionality where possible. To handle cases where recognition is insufficient, we examine two further strategies: on the one hand, linguistic reference to and labeling of the unidentified objects and, on the other hand, ontological deduction. This approach connects the probabilistic area of object recognition with the logical area of formal reasoning. To support formal reasoning, the recognition system has to supply additional relational scene information. Moreover, a sound ontological basis for these reasoning tasks requires a domain ontology that represents real-world objects and their corresponding spatial relations in both linguistic and physical respects. Physical spatial relations and objects are measured by the visual system, whereas linguistic spatial relations and objects are required for interactions with a user.
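
The interplay of the three strategies named in the abstract can be illustrated with a minimal sketch. It is not the system described in the paper: the confidence threshold, the toy deduction rules, the order in which the fallbacks are tried, and every name (`SceneObject`, `classify`, `DEDUCTION_RULES`) are assumptions made purely for illustration.

```python
from dataclasses import dataclass, field

# Hypothetical threshold below which a recognition result counts as insufficient.
CONFIDENCE_THRESHOLD = 0.7

# Hypothetical stand-in for ontological deduction:
# (spatial relation, label of the related object) -> deduced label.
DEDUCTION_RULES = {
    ("on", "desk"): "desk_item",
    ("in_front_of", "desk"): "chair",
}


@dataclass
class SceneObject:
    name: str
    scores: dict  # label -> probability reported by the recognizer
    relations: list = field(default_factory=list)  # (relation, other object name)


def classify(obj, known_labels, ask_user=None):
    """Return (label, strategy) for one scene object.

    known_labels maps names of already identified objects to their labels.
    ask_user is an optional callback standing in for the dialog with a user.
    """
    # 1. Probabilistic recognition: accept the best label if confident enough.
    if obj.scores:
        label, probability = max(obj.scores.items(), key=lambda kv: kv[1])
        if probability >= CONFIDENCE_THRESHOLD:
            return label, "recognition"

    # 2. Deduction from relational scene information.
    for relation, other in obj.relations:
        deduced = DEDUCTION_RULES.get((relation, known_labels.get(other)))
        if deduced is not None:
            return deduced, "deduction"

    # 3. Linguistic labeling: let the user name the object.
    if ask_user is not None:
        return ask_user(obj), "dialog"

    return None, "unidentified"
```

In this sketch a confidently recognized object never reaches the fallbacks, while an uncertain object that stands in front of an already identified desk is labeled by the toy rule table. A real system would replace the rule table with reasoning over a domain ontology.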

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hois, J., Wünstel, M., Bateman, J.A., Röfer, T. (2007). Dialog-Based 3D-Image Recognition Using a Domain Ontology. In: Barkowsky, T., Knauff, M., Ligozat, G., Montello, D.R. (eds) Spatial Cognition V Reasoning, Action, Interaction. Spatial Cognition 2006. Lecture Notes in Computer Science, vol 4387. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75666-8_7

  • DOI: https://doi.org/10.1007/978-3-540-75666-8_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75665-1

  • Online ISBN: 978-3-540-75666-8

  • eBook Packages: Computer Science, Computer Science (R0)
