Geometric and Photometric Analysis for Interactively Recognizing Multicolor or Partially Occluded Objects

Hossain, Md. Altab; Kurnia, Rahmadi; Kuno, Yoshinori

doi:10.1007/11595755_17

Md. Altab Hossain²⁰,
Rahmadi Kurnia²⁰ &
Yoshinori Kuno²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3804))

Included in the following conference series:

International Symposium on Visual Computing

1736 Accesses

Abstract

An effective human-robot interaction is essential for wide penetration of service robots into the market. Such robots need vision systems to recognize objects. It is, however, difficult to realize vision systems that can work in various conditions. More robust techniques of object recognition and image segmentation are essential. Thus, we have proposed to use the human user’s assistance for objects recognition through speech. Our previous system assumes that it can segment images without failure. However, if there are occluded objects and/or objects composed of multicolor parts, segmentation failures cannot be avoided. This paper presents an extended system that can recognize objects in occlusion and/or multicolor cases using geometric and photometric analysis of images. If the robot is not sure about the segmentation results, it asks questions of the user by appropriate expressions depending on the certainty to remove the ambiguity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ehrenmann, M., Zollner, R., Rogalla, O., Dillmann, R.: Programming service tasks in household environments by human demonstration. In: ROMAN 2002, pp. 460–467 (2002)
Google Scholar
Hans, M., Graf, B., Schraft, R.D.: Robotics home assistant care-o-bot: past-present-future. In: ROMAN 2002, pp. 380–385 (2002)
Google Scholar
Berry, G.A., Pavlovic, V., Huang, T.S.: BattleView: a multimodal HCI research application. In: Workshop on Perceptual User Interfaces, pp. 67–70 (1998)
Google Scholar
Raisamo, R.: A multimodal user interface for public information kiosks. In: Workshop on Perceptual User Interfaces, pp. 7–12 (1998)
Google Scholar
Takahashi, T., Nakanishi, S., Kuno, Y., Shirai, Y.: Human-robot interface by verbal and nonverbal communication. In: IROS 1998, pp. 924–929 (1998)
Google Scholar
Yoshizaki, M., Kuno, Y., Nakamura, A.: Mutual assistance between speech and vision for human-robot interface. In: IROS 2002, pp. 1308–1313 (2002)
Google Scholar
Kurnia, R., Hossain, M.A., Nakamura, A., Kuno, Y.: Object recognition through human-robot interaction by speech. In: ROMAN 2004, pp. 619–624 (2004)
Google Scholar
Takizawa, M., Makihara, Y., Shimada, N., Miura, J., Shirai, Y.: A service robot with interactive vision- objects recognition using dialog with user. In: First International Workshop on Language Understanding and Agents for Real World Interaction, Hokkaido (2003)
Google Scholar
Inamura, T., Inaba, M., Inoue, H.: Dialogue control for task achievement based on evaluation of situational vagueness and stochastic representation of experiences. In: International Conference on Intelligent Robots and Systems, Sendai, pp. 2861–2866 (2004)
Google Scholar
Cremers, A.: Object reference in task-oriented keyboard dialogues, multimodal human-computer communication: system, techniques and experiments, pp. 279–293. Springer, Heidelberg (1998)
Google Scholar
Winograd, T.: Understanding natural language. Academic Press, New York (1972)
Google Scholar
Roy, D., Schiele, B., Pentland, A.: Learning audio-visual associations using mutual information. In: ICCV, Workshop on Integrating Speech and Image Understanding, Greece (1999)
Google Scholar
Hossain, M.A., Kurnia, R., Nakamura, A., Kuno, Y.: Color objects segmentation for helper robot. In: ICECE 2004, pp. 206–209 (2004)
Google Scholar
Nayar, S.K., Bolle, R.M.: Reflectance based object recognition. Inter. Journal of Computer Vision 17(3), 219–240 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Computer Sciences, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama, 338-8570, Japan
Md. Altab Hossain, Rahmadi Kurnia & Yoshinori Kuno

Authors

Md. Altab Hossain
View author publications
You can also search for this author in PubMed Google Scholar
Rahmadi Kurnia
View author publications
You can also search for this author in PubMed Google Scholar
Yoshinori Kuno
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada, Reno, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Desert Research Institute, Reno, NV, USA
Darko Koracin
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hossain, M.A., Kurnia, R., Kuno, Y. (2005). Geometric and Photometric Analysis for Interactively Recognizing Multicolor or Partially Occluded Objects. In: Bebis, G., Boyle, R., Koracin, D., Parvin, B. (eds) Advances in Visual Computing. ISVC 2005. Lecture Notes in Computer Science, vol 3804. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11595755_17

Download citation

DOI: https://doi.org/10.1007/11595755_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30750-1
Online ISBN: 978-3-540-32284-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics