Abstract
This paper presents a speech analysis/synthesis model based on periodic-aperiodic decomposition. In the presented approach, the decomposition is performed over the whole speech band, without identifying voiced/unvoiced regions. Another important feature is the pitch-tracking ability of the decomposition algorithm. For this purpose, a new pitch-tracking transformation called the Time-Varying Discrete Fourier Transform (TVDFT) is employed. The periodic component is modelled as a sum of pitch harmonics whose amplitudes and phases are estimated with the TVDFT. The aperiodic component is defined as the difference between the original speech signal and the synthesised periodic component. Because the TVDFT needs an accurate estimate of the fundamental (pitch) frequency, the paper also presents a robust pitch estimation method. Experimental results showing the advantages of the proposed model are also given.
This work was supported by Bialystok Technical University under grant W/WI/2/04.
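The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' TVDFT, whose exact definition is not given here. It assumes a simplified harmonic projection: the pitch track `f0_track` is already known for every sample, the phase is the running sum of the instantaneous pitch, and each harmonic's amplitude and phase come from projecting the frame onto a complex exponential that follows that phase. The names `decompose`, `f0_track`, `fs` and `num_harmonics` are hypothetical and chosen only for illustration.

```python
import cmath
import math


def decompose(x, f0_track, fs, num_harmonics):
    """Split frame x into periodic and aperiodic parts (illustrative sketch).

    x             -- list of speech samples
    f0_track      -- assumed-known pitch in Hz for every sample of x
    fs            -- sampling frequency in Hz
    num_harmonics -- number of pitch harmonics to model
    """
    n = len(x)

    # Pitch-tracking phase: running sum of the instantaneous pitch frequency.
    phase = []
    acc = 0.0
    for f0 in f0_track:
        phase.append(acc)
        acc += 2.0 * math.pi * f0 / fs

    periodic = [0.0] * n
    for k in range(1, num_harmonics + 1):
        # Complex amplitude of harmonic k, estimated by projection
        # onto a kernel that follows the pitch track.
        c = sum(x[i] * cmath.exp(-1j * k * phase[i]) for i in range(n)) * 2.0 / n
        # Add the synthesised harmonic to the periodic component.
        for i in range(n):
            periodic[i] += (c * cmath.exp(1j * k * phase[i])).real

    # Aperiodic component: original signal minus synthesised periodic part.
    aperiodic = [x[i] - periodic[i] for i in range(n)]
    return periodic, aperiodic


if __name__ == "__main__":
    fs, f0, n = 8000, 100.0, 800  # 10 full pitch periods in the frame
    frame = [
        math.cos(2 * math.pi * f0 * i / fs + 0.3)
        + 0.5 * math.cos(2 * 2 * math.pi * f0 * i / fs - 1.0)
        for i in range(n)
    ]
    periodic, aperiodic = decompose(frame, [f0] * n, fs, 3)
    print(max(abs(a) for a in aperiodic))  # residual of a purely harmonic frame
```

The projection recovers the harmonics cleanly only when the frame spans a whole number of pitch periods; for other frame lengths the harmonics leak into each other, so a practical analyser would need windowing or a least-squares fit.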
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this paper
Cite this paper
Zubrycki, P., Petrovsky, A.A. (2005). Analysis/Synthesis Speech Model Based on the Pitch-Tracking Periodic-Aperiodic Decomposition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_4
DOI: https://doi.org/10.1007/0-387-26325-X_4
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25091-5
Online ISBN: 978-0-387-26325-0
eBook Packages: Computer Science (R0)