Abstract
Temporal envelopes are frequency-dependent signatures of a sequence that capture the temporal dynamics of a signal. This chapter formulates the modulation index, which characterizes the normalized spectral magnitude of the envelope at each envelope frequency. Speech intelligibility can be estimated from the modulation index of the narrow-band envelopes. An intriguing question is whether the magnitude spectrum or the phase spectrum is dominant in synthesizing intelligible speech. The answer depends on the frame length: the phase is dominant when the frame is very long or extremely short. Time-reversed speech is a good example of the effect of phase on intelligibility. The envelopes also provide estimates of the fundamental period of a periodic sequence. This chapter introduces period estimation for a periodic sequence whose fundamental component is missing.
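As a rough illustration of the two quantities named in the abstract, the following Python sketch computes a modulation index for a synthetic envelope and estimates the period of a tone whose fundamental is absent. Everything in it is an assumption made for illustration: the chapter's own definitions are not reproduced on this page, so the normalization used in `modulation_index` (twice the envelope-spectrum magnitude divided by the envelope sum) is one common convention rather than the author's formula, and `estimate_period` uses a plain waveform autocorrelation as a stand-in for the envelope-based method the chapter develops. The function names, sampling rates, and lag range are hypothetical.

```python
import cmath
import math


def modulation_index(envelope, mod_freq, fs):
    """Envelope spectrum magnitude at mod_freq, normalized by the envelope sum.

    Assumed definition (not taken from the chapter):
        m(F) = 2 * |sum_n e[n] * exp(-j*2*pi*F*n/fs)| / sum_n e[n]
    """
    acc = sum(e * cmath.exp(-2j * math.pi * mod_freq * n / fs)
              for n, e in enumerate(envelope))
    return 2.0 * abs(acc) / sum(envelope)


def estimate_period(signal, fs, min_lag, max_lag):
    """Lag in seconds of the largest autocorrelation value in [min_lag, max_lag]."""
    def autocorr(lag):
        return sum(signal[n] * signal[n + lag]
                   for n in range(len(signal) - lag))
    best_lag = max(range(min_lag, max_lag + 1), key=autocorr)
    return best_lag / fs


# Envelope with a 4 Hz modulation of depth 0.5, sampled at 1 kHz for 1 s.
fs_env = 1000
envelope = [1.0 + 0.5 * math.cos(2 * math.pi * 4 * n / fs_env)
            for n in range(fs_env)]
m = modulation_index(envelope, 4.0, fs_env)

# Harmonics 3, 4 and 5 of 100 Hz; the 100 Hz fundamental itself is absent.
fs = 8000
tone = [sum(math.cos(2 * math.pi * f * n / fs) for f in (300, 400, 500))
        for n in range(1600)]
period = estimate_period(tone, fs, min_lag=20, max_lag=120)

print(f"modulation index at 4 Hz: {m:.3f}")
print(f"estimated period: {period * 1000:.2f} ms")
```

With these inputs the sketch should report a modulation index of about 0.5 and a period of about 10 ms, the period of the missing 100 Hz fundamental. The lag range of 20 to 120 samples (2.5 to 15 ms) is chosen here only to exclude lag zero and the second period; a practical estimator would need a more careful search and an envelope extracted from a real narrow-band signal.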
Copyright information
© 2015 Springer Japan
About this chapter
Cite this chapter
Tohyama, M. (2015). Modulation and Periodic Properties of Temporal Envelope. In: Waveform Analysis of Sound. Mathematics for Industry, vol 3. Springer, Tokyo. https://doi.org/10.1007/978-4-431-54424-1_5
DOI: https://doi.org/10.1007/978-4-431-54424-1_5
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-54423-4
Online ISBN: 978-4-431-54424-1
eBook Packages: Engineering, Engineering (R0)