Interactive Interface to Optimize Sound Source Localization with HARK

  • Conference paper
Current Approaches in Applied Artificial Intelligence (IEA/AIE 2015)

Abstract

In this study, we designed and developed an interactive interface for optimizing sound source localization with HARK, a multi-channel robot audition software package. The interface lightens the burden of parameter tuning and helps users handle parameter optimization for sound source localization easily. To handle multi-channel sound properly, it is better to specify the parameter dynamically from both temporal and spatial perspectives, yet almost all existing software can specify only a static threshold. With our interactive interface, the user can create or delete sound sources on the MUSIC spectrum and set appropriate parameters for the environment. We also evaluated the software and found that the proposed interface was superior to the current HARK interface in terms of intuitiveness and visibility.

Acknowledgments

This work was supported by KAKENHI-No. 24220006.

Author information

Corresponding author

Correspondence to Osamu Sugiyama.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Sugiyama, O., Kojima, R., Nakadai, K. (2015). Interactive Interface to Optimize Sound Source Localization with HARK. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science, vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_26

  • DOI: https://doi.org/10.1007/978-3-319-19066-2_26

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19065-5

  • Online ISBN: 978-3-319-19066-2

  • eBook Packages: Computer Science, Computer Science (R0)
