Skip to main content

Novel Scheme of Real-Time Direction Finding and Tracking of Multiple Speakers by Robot-Embedded Microphone Array

  • Chapter
Robot Intelligence Technology and Applications 2012

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 208))

  • 218 Accesses

Abstract

Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot’s master, and processing speech sources to extract useful information such as master’s uttering commands. This paper reports a novel proposed method of a speaker’s direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kwon, B., Park, Y., Park, Y.-S.: Sound Source Localization for Robot Auditory System Using the Summed GCC Method. In: International Conference on Control, Automation and Systems 2008, COEX, Seoul, Korea, October 14-17 (2008)

    Google Scholar 

  2. Nakadai, K., Nakajima, H., Murase, M., Okuno, H.G., Hasegawa, Y., Tsujino, H.: Real-Time Tracking of Multiple Sound Source by Integration of In-Room and Robot-Embedded Microphone Arrays. In: International Conference on Intelligent Robots and System, Beijing, China, October 9-15 (2006)

    Google Scholar 

  3. Huang, J., Ohnishi, N., Sugie, N.: Building ears for robots: sound localization and separation. Artif. Life Robotics 1, 157–163 (1997)

    Article  Google Scholar 

  4. Valin, J.-M., Michaud, F., Rouat, J.: Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering. Robotics and Autonomous System 55, 216–228 (2007)

    Article  Google Scholar 

  5. Hosoya, K., Ogawa, T., Kobayashi, T.: Robot auditory system using head-mounted square microphone array. In: The 2009 IEEE/RSJ International Conference on Intelligent Robotics and Systems, St. Louis, USA, October 11-15 (2009)

    Google Scholar 

  6. Asoh, H., Hara, I., Asano, F., Yamamoto, K.: Tracking human speech events using a particle filter. IEEE Trans., 1153–1156 (2005)

    Google Scholar 

  7. Arberet, S., Gribonval, R., Bimbot, F.: A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture. IEEE Trans. SP 58(1), 121–133 (2010)

    Article  MathSciNet  Google Scholar 

  8. Ding, N., Hamada, N.: DOA Estimation of Multiple Speech Source from a Stereophonic Mixture in Underdetermined Case. IEICE Trans. Fundamentals E95-A(4), 735–744 (2012)

    Article  Google Scholar 

  9. Fujimoto, Ding, N., Hamada, N.: Multiple Sources’ Direction Finding by using Reliable Component on Phase Difference Manifold and Kernel Density Estimator. In: IEEE ICASSP 2012, pp. 2601–2604 (March 2012)

    Google Scholar 

  10. Cheng, Y.: Mean Shift, Mode Seeking, and Clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 17(8), 790–799 (1995)

    Article  Google Scholar 

  11. http://www.ros.org/wiki/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daobilige Su .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Su, D., Sekikawa, M., Nakazawa, K., Hamada, N. (2013). Novel Scheme of Real-Time Direction Finding and Tracking of Multiple Speakers by Robot-Embedded Microphone Array. In: Kim, JH., Matson, E., Myung, H., Xu, P. (eds) Robot Intelligence Technology and Applications 2012. Advances in Intelligent Systems and Computing, vol 208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37374-9_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37374-9_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37373-2

  • Online ISBN: 978-3-642-37374-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics