Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers

Rotili, Rudy; Principi, Emanuele; Squartini, Stefano; Piazza, Francesco

doi:10.1007/978-3-642-21090-7_52

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6676)

Included in the following conference series:

International Symposium on Neural Networks

Abstract

Blind source separation and speech dereverberation are two important and common problems in audio processing, especially in the context of real meetings. This paper takes as its starting point a real-time framework that implements a sequential source separation and speech dereverberation algorithm based on blind channel identification (BCI). The major drawback of this approach is the inability of the BCI stage to estimate the room impulse responses when two or more sources are concurrently active. To overcome this disadvantage, a speaker diarization system has been successfully inserted into the reference framework to pilot the BCI stage. In this way the identification task can be accomplished directly from the microphone mixture, making the overall structure well suited to real-time applications. The proposed solution works in the frequency domain, and the NU-Tech software platform has been used for the real-time simulations.
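The abstract does not spell out how the diarization output pilots the BCI stage, so the following is only a minimal sketch of the gating idea, under stated assumptions: the diarizer is assumed to emit, for each frame, the set of currently active speaker labels, and channel identification for a speaker is assumed to be allowed only on frames where that speaker is the sole active source. The function name `single_talk_frames` and the input layout are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def single_talk_frames(activity):
    """Map each speaker to the frame indices where that speaker alone is active.

    `activity` is a list with one entry per frame; each entry is a collection
    of active speaker labels (a hypothetical diarization output format).
    """
    frames = defaultdict(list)
    for idx, active in enumerate(activity):
        speakers = set(active)
        if len(speakers) == 1:
            # Exactly one source is active: the frame may drive identification
            # of that speaker's channels.
            frames[next(iter(speakers))].append(idx)
        # Silence or overlapping speech: the frame is skipped, so the
        # corresponding channel estimates would be left unchanged.
    return dict(frames)


# Five frames: A alone, A and B overlapping, B alone, silence, A alone.
activity = [{"A"}, {"A", "B"}, {"B"}, set(), {"A"}]
gated = single_talk_frames(activity)
print(gated)  # {'A': [0, 4], 'B': [2]}
```

In this sketch the overlapped frame (index 1) and the silent frame (index 3) are simply withheld from identification, which mirrors the stated motivation: the BCI stage cannot estimate the room impulse responses while two or more sources are concurrently active.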

Author information

Authors and Affiliations

A3LAB, Department of Biomedics, Electronics and Telecommunications, Università Politecnica delle Marche, Via Brecce Bianche 1, 60131, Ancona, Italy
Rudy Rotili, Emanuele Principi, Stefano Squartini & Francesco Piazza

Editor information

Editors and Affiliations

Institute of Automation, Key Laboratory of Complex Systems and Intelligence Science, Chinese Academy of Sciences, 100190, Beijing, China
Derong Liu
College of Information Science and Engineering, Northeastern University, 110004, Shenyang, Liaoning, China
Huaguang Zhang
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Avenue, 1678, Nicosia, Cyprus
Marios Polycarpou
Dipartimento di Elettronica, Politecnico di Milano, Piazza L. da Vinci 32, 20133, Milano, Italy
Cesare Alippi
Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, 02881, Kingston, RI, USA
Haibo He

About this paper

Cite this paper

Rotili, R., Principi, E., Squartini, S., Piazza, F. (2011). Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_52

DOI: https://doi.org/10.1007/978-3-642-21090-7_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21089-1
Online ISBN: 978-3-642-21090-7
eBook Packages: Computer Science, Computer Science (R0)
