Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers

  • Conference paper
Advances in Neural Networks – ISNN 2011 (ISNN 2011)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6676)

Abstract

Blind source separation and speech dereverberation are two important and common problems in audio processing, especially in the context of real meetings. This paper takes as its starting point a real-time framework that implements a sequential source separation and speech dereverberation algorithm based on blind channel identification (BCI). The major drawback of this approach is that the BCI stage cannot estimate the room impulse responses when two or more sources are concurrently active. To overcome this limitation, a speaker diarization system has been successfully inserted into the reference framework to pilot the BCI stage. In this way the identification task can be accomplished directly on the microphone mixture, which makes the overall structure well suited to real-time applications. The proposed solution works in the frequency domain, and the NU-Tech software platform has been used for the real-time simulations.
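
The idea of letting a diarization system pilot the identification stage can be illustrated with a minimal sketch. The abstract does not specify the diarization output format or the BCI update rule, so everything below is an assumption made for illustration: the class name `GatedIdentifier`, the `update` callback standing in for a per-speaker channel-estimate update, and the rule that an update runs only on frames where exactly one speaker is active. It is not the authors' implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Sequence, Set

# One block of samples per microphone (hypothetical frame layout).
Frame = Sequence[Sequence[float]]


@dataclass
class GatedIdentifier:
    """Runs a per-speaker channel update only on single-talker frames.

    `update` is a hypothetical stand-in for one adaptation step of a
    blind channel identification algorithm for the given speaker.
    """
    update: Callable[[int, Frame], None]
    updates_run: Dict[int, int] = field(default_factory=dict)
    skipped: int = 0

    def process(self, frame: Frame, active: Set[int]) -> bool:
        """Return True if an identification update was run on this frame."""
        # Silence or overlapping speech: leave all channel estimates frozen.
        if len(active) != 1:
            self.skipped += 1
            return False
        (speaker,) = active
        self.update(speaker, frame)
        self.updates_run[speaker] = self.updates_run.get(speaker, 0) + 1
        return True


# Usage: diarization labels per frame (sets of active speaker indices).
log: List[int] = []
gate = GatedIdentifier(update=lambda spk, frame: log.append(spk))
labels = [{0}, {0, 1}, set(), {1}, {1}]
for active in labels:
    gate.process([[0.0], [0.0]], active)  # dummy two-microphone frame

print(log)               # [0, 1, 1]
print(gate.updates_run)  # {0: 1, 1: 2}
print(gate.skipped)      # 2
```

In this sketch the overlapping frame and the silent frame are both skipped, so each speaker's estimate is adapted only on microphone frames attributed to that speaker alone. How a real system treats overlap or silence is a design choice that the abstract does not settle.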




Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rotili, R., Principi, E., Squartini, S., Piazza, F. (2011). Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_52

  • DOI: https://doi.org/10.1007/978-3-642-21090-7_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21089-1

  • Online ISBN: 978-3-642-21090-7

  • eBook Packages: Computer Science, Computer Science (R0)