Skip to main content

Speaker Recognition, Standardization

  • Reference work entry
  • First Online:
Encyclopedia of Biometrics
  • 81 Accesses

Synonyms

Speaker authentication; Speaker biometrics; Speaker identification and verification, SIV;Voice authentication; Voice recognition

Definition

The term “speaker recognition” (SR) refers to a group of technologies that use information extracted from a person’s speech to perform biometric operations such as speaker identification and verification (SIV). Standards for SR are designed to support the development of applications that can work with technology from different vendors (application programming interface standards), the sharing of SR data (data interchange standards), the transmission of data in real time (distributed speaker recognition standards), and the management of data resources in distributed environments (process-control protocol standards).

Introduction

SR technologies stand at the juncture between speech processing and biometrics. They belong to speech processing, because they extract and analyze data from the stream of speech. They belong to biometrics, because...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 899.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. J. Markowitz, The speaker verification application programmers interface standard (SVAPI), in BiometriCon’97: Conference Proceeding, Arlington, ed. by D. Harper (Diane Publishing Company, Darby, 1997)

    Google Scholar 

  2. Novell Corporation, SRAPI and SVAPI Source Code (2006). http://developer.novell.com/wiki/index.php/SRAPI_and_SVAPI_Source_Code

  3. J. Colombi, Interface Specification: Human Authentication – Application Program Interface (HA-API) Ver. 2.0 (United States Biometrics Consortium, Fort Meade, 1998)

    Google Scholar 

  4. Enterprise Computer Technology Forum, S.100 Media Services Volume 6: Media Resources and Services, Revision 2.0 (1998). http://www.comptia.org/sections/ectf/Documents/s100r2v6.pdf

  5. J. Markowitz, K. Rehor, Standards for speaker recognition, in Proceedings of Biometric Consortium’06, Baltimore, 2006

    Google Scholar 

  6. V. Skerpec (ed.), Speaker Identification and Verification (SIV) Glossary. VoiceXML Forum (2007). http://www.voicexml.org/biometrics/

  7. Daboul, C., Eckert, M. (eds.), Speaker Identification and Verification Applications. VoiceXML Forum (2006). http://www.voicexml.org/biometrics/

  8. C. Daboul, P. Shinde (eds.), Speaker Identification and Verification (SIV) Requirements for VoiceXML Applications Ver. 2.0. VoiceXML Forum (2007). http://www.voicexml.org/biometrics/

  9. INCITS 456, Speaker Recognition Format for Raw Data Interchange (SIVR) (2008). http://www.techstreet.com/incitsgate.tmpl

  10. ISO, ISO 8601 2004(E) Data Elements and Interchange Formats – Interchange Formats – Representation of Dates and Times (International Standards Organization, Geneva, 2004)

    Google Scholar 

  11. European Telecommunications Standards Institute, Distributed speech recognition; front-end feature extraction algorithm; compression algorithms. ETSI document ES 201 108 V1.1.2 2000-04 (2000)

    Google Scholar 

  12. C.C. Broun, W.M. Campbell, D. Pearce, H. Kelleher, Distributed speaker recognition using the ETSI distributed speech recognition standard. Proc. Int. Conf. Artif. Intell. 1, 244–248 (2001). http://nsodl.org/resource/2200/2006H

  13. D. Oran, Requirements for Distributed Control of Automatic Speech Recognition (ASR), Speaker Identification/Speaker Verification (SI/SV), and Text-to-Speech (TTS) Resources. Internet Informational RFC 4313 (2005). http://www3.tools.ietf.org/html/rfc4313

  14. S. Shanmugham, P. Monaco, B. Eberman, A Media Resource Control Protocol (MRCP). Internet Informational RFC 4463 (2006). http://www.ietf.org/rfc/rfc4463.txt

  15. S. Shanmugham, D. Burnett, Media Resource Control Protocol Version 2 (MRCPv2) (2007). NOTE: This is draft 17. As of December, 2008 it was the current draft. Upon final approval a stable IETF Internet Informational RFC reference number will be assigned. http://tools.ietf.orglid

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer Science+Business Media New York

About this entry

Cite this entry

Markowitz, J. (2015). Speaker Recognition, Standardization. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_240

Download citation

Publish with us

Policies and ethics