Speaker Features

Ramos, Daniel; Gonzalez-Dominguez, Javier; Toledano, Doroteo T.; González-Rodríguez, Joaquín

doi:10.1007/978-1-4899-7488-4_203

Daniel Ramos³,
Javier Gonzalez-Dominguez³,
Doroteo T. Toledano³ &
…
Joaquín González-Rodríguez³

Synonyms

Observations from speech; Speaker parameters

Definition

Speaker features are measurements extracted from the speech signal with the objective of determining the identity of a given speaker. In voice biometrics, speaker features whose source is known are typically used to build speaker models. Then, speaker features of unknown source are compared with the enrolled models in order to obtain measures of similarity. The identity of the speaker influences the speech production process in many different ways, due to vocal tract configuration, language spoken, social context, education, etc. Thus, several levels of identity can be identified in the speech signal, e.g., spectral, phonetic, prosodic, etc. Speaker features can be extracted at any of this identity levels, and therefore the speaker recognition process follows in essence a multilevel approach.

Identity Information in the Speech Signal

The identity levels in the speech signal are configured by the speech production process,...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 899.99; Price excludes VAT (USA)

Hardcover Book: USD 549.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

X. Huang, A. Acero, H.W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm and System Development (Prentice Hall PTR, Upper Saddle River, 2001)
Google Scholar
L.R. Rabiner, R.W. Schafer, Digital Processing of Speech Signals (Prentice Hall, Englewood Cliffs, 1978)
Google Scholar
J.R. Deller, J.H.L. Hansen, J.L. Proakis, Discrete-Time Processing of Speech Signals, 2nd edn. (Wiley, New York, 1999)
Google Scholar
J. Gonzalez-Rodriguez, D.T. Toledano, J. Ortega-Garcia, Voice biometrics, in Handbook of Biometrics, ed. by A.K. Jain, P. Flynn, A.A. Ross (Springer, Berlin, 2007)
Google Scholar
D.A. Reynolds, The SuperSID project: exploiting high-level information for high-accuracy speaker recognition, in Proceedings of ICASSP, Hong Kong, 2003
Google Scholar
G. Doddington, Speaker recognition based on idiolectal differences between speakers, in Proceedings of Eurospeech, Aalborg, 2001, pp. 2517–2520
Google Scholar
J. Makhoul, Spectral analysis of speech by linear prediction. IEEE Trans. Audio Electroacoust. 21, 140–148 (1973)
Google Scholar
S. Furui, Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech, Signal Process. 29, 254–272 (1981)
Google Scholar
J.S. Bridle, M.D. Brown, An experimental automatic word recognition system, Technical report 1003, Joint Speech Research Unit, Ruislip, 1974
Google Scholar
H. Hermansky, B. Hanson, H. Wakita, Perceptually based linear predictive analysis of speech, in Proceedings of ICASSP, Tampa, vol. 10, 1985, pp. 509–512
Google Scholar
L.R. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
Google Scholar
D.T. Toledano, L. Hernandez-Gomez, L. Villarrubia-Grande, Automatic phonetic segmentation. IEEE Trans. Speech Audio Process. 11, 617–625 (2003)
Google Scholar
S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, A. Stolcke, Modeling NERFs for speaker recognition, in Proceedings of Odyssey, Toledo, 2004, pp. 51–56
Google Scholar

Download references

Author information

Authors and Affiliations

ATVS – Biometric Recognition Group, Escuela Politecnica Superior, Universidad Autonoma de Madrid, Madrid, Spain
Daniel Ramos, Javier Gonzalez-Dominguez, Doroteo T. Toledano & Joaquín González-Rodríguez

Authors

Daniel Ramos
View author publications
You can also search for this author in PubMed Google Scholar
Javier Gonzalez-Dominguez
View author publications
You can also search for this author in PubMed Google Scholar
Doroteo T. Toledano
View author publications
You can also search for this author in PubMed Google Scholar
Joaquín González-Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Biometrics and Security, Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Stan Z. Li
Departments of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
Anil K. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Ramos, D., Gonzalez-Dominguez, J., Toledano, D.T., González-Rodríguez, J. (2015). Speaker Features. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_203

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7488-4_203
Published: 03 July 2015
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7487-7
Online ISBN: 978-1-4899-7488-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics