Synonyms
Observations from speech; Speaker parameters
Definition
Speaker features are measurements extracted from the speech signal with the objective of determining the identity of a given speaker. In voice biometrics, speaker features whose source is known are typically used to build speaker models. Then, speaker features of unknown source are compared with the enrolled models in order to obtain measures of similarity. The identity of the speaker influences the speech production process in many different ways, due to vocal tract configuration, language spoken, social context, education, etc. Thus, several levels of identity can be identified in the speech signal, e.g., spectral, phonetic, prosodic, etc. Speaker features can be extracted at any of this identity levels, and therefore the speaker recognition process follows in essence a multilevel approach.
Identity Information in the Speech Signal
The identity levels in the speech signal are configured by the speech production process,...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
X. Huang, A. Acero, H.W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm and System Development (Prentice Hall PTR, Upper Saddle River, 2001)
L.R. Rabiner, R.W. Schafer, Digital Processing of Speech Signals (Prentice Hall, Englewood Cliffs, 1978)
J.R. Deller, J.H.L. Hansen, J.L. Proakis, Discrete-Time Processing of Speech Signals, 2nd edn. (Wiley, New York, 1999)
J. Gonzalez-Rodriguez, D.T. Toledano, J. Ortega-Garcia, Voice biometrics, in Handbook of Biometrics, ed. by A.K. Jain, P. Flynn, A.A. Ross (Springer, Berlin, 2007)
D.A. Reynolds, The SuperSID project: exploiting high-level information for high-accuracy speaker recognition, in Proceedings of ICASSP, Hong Kong, 2003
G. Doddington, Speaker recognition based on idiolectal differences between speakers, in Proceedings of Eurospeech, Aalborg, 2001, pp. 2517–2520
J. Makhoul, Spectral analysis of speech by linear prediction. IEEE Trans. Audio Electroacoust. 21, 140–148 (1973)
S. Furui, Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech, Signal Process. 29, 254–272 (1981)
J.S. Bridle, M.D. Brown, An experimental automatic word recognition system, Technical report 1003, Joint Speech Research Unit, Ruislip, 1974
H. Hermansky, B. Hanson, H. Wakita, Perceptually based linear predictive analysis of speech, in Proceedings of ICASSP, Tampa, vol. 10, 1985, pp. 509–512
L.R. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
D.T. Toledano, L. Hernandez-Gomez, L. Villarrubia-Grande, Automatic phonetic segmentation. IEEE Trans. Speech Audio Process. 11, 617–625 (2003)
S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, A. Stolcke, Modeling NERFs for speaker recognition, in Proceedings of Odyssey, Toledo, 2004, pp. 51–56
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media New York
About this entry
Cite this entry
Ramos, D., Gonzalez-Dominguez, J., Toledano, D.T., González-Rodríguez, J. (2015). Speaker Features. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_203
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7488-4_203
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7487-7
Online ISBN: 978-1-4899-7488-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering