Speaker Recognition, Standardization

Markowitz, Judith

doi:10.1007/978-1-4899-7488-4_240

Judith Markowitz³

81 Accesses

Synonyms

Speaker authentication; Speaker biometrics; Speaker identification and verification, SIV;Voice authentication; Voice recognition

Definition

The term “speaker recognition” (SR) refers to a group of technologies that use information extracted from a person’s speech to perform biometric operations such as speaker identification and verification (SIV). Standards for SR are designed to support the development of applications that can work with technology from different vendors (application programming interface standards), the sharing of SR data (data interchange standards), the transmission of data in real time (distributed speaker recognition standards), and the management of data resources in distributed environments (process-control protocol standards).

Introduction

SR technologies stand at the juncture between speech processing and biometrics. They belong to speech processing, because they extract and analyze data from the stream of speech. They belong to biometrics, because...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 899.99; Price excludes VAT (USA)

Hardcover Book: USD 549.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

J. Markowitz, The speaker verification application programmers interface standard (SVAPI), in BiometriCon’97: Conference Proceeding, Arlington, ed. by D. Harper (Diane Publishing Company, Darby, 1997)
Google Scholar
Novell Corporation, SRAPI and SVAPI Source Code (2006). http://developer.novell.com/wiki/index.php/SRAPI_and_SVAPI_Source_Code
J. Colombi, Interface Specification: Human Authentication – Application Program Interface (HA-API) Ver. 2.0 (United States Biometrics Consortium, Fort Meade, 1998)
Google Scholar
Enterprise Computer Technology Forum, S.100 Media Services Volume 6: Media Resources and Services, Revision 2.0 (1998). http://www.comptia.org/sections/ectf/Documents/s100r2v6.pdf
J. Markowitz, K. Rehor, Standards for speaker recognition, in Proceedings of Biometric Consortium’06, Baltimore, 2006
Google Scholar
V. Skerpec (ed.), Speaker Identification and Verification (SIV) Glossary. VoiceXML Forum (2007). http://www.voicexml.org/biometrics/
Daboul, C., Eckert, M. (eds.), Speaker Identification and Verification Applications. VoiceXML Forum (2006). http://www.voicexml.org/biometrics/
C. Daboul, P. Shinde (eds.), Speaker Identification and Verification (SIV) Requirements for VoiceXML Applications Ver. 2.0. VoiceXML Forum (2007). http://www.voicexml.org/biometrics/
INCITS 456, Speaker Recognition Format for Raw Data Interchange (SIVR) (2008). http://www.techstreet.com/incitsgate.tmpl
ISO, ISO 8601 2004(E) Data Elements and Interchange Formats – Interchange Formats – Representation of Dates and Times (International Standards Organization, Geneva, 2004)
Google Scholar
European Telecommunications Standards Institute, Distributed speech recognition; front-end feature extraction algorithm; compression algorithms. ETSI document ES 201 108 V1.1.2 2000-04 (2000)
Google Scholar
C.C. Broun, W.M. Campbell, D. Pearce, H. Kelleher, Distributed speaker recognition using the ETSI distributed speech recognition standard. Proc. Int. Conf. Artif. Intell. 1, 244–248 (2001). http://nsodl.org/resource/2200/2006H
D. Oran, Requirements for Distributed Control of Automatic Speech Recognition (ASR), Speaker Identification/Speaker Verification (SI/SV), and Text-to-Speech (TTS) Resources. Internet Informational RFC 4313 (2005). http://www3.tools.ietf.org/html/rfc4313
S. Shanmugham, P. Monaco, B. Eberman, A Media Resource Control Protocol (MRCP). Internet Informational RFC 4463 (2006). http://www.ietf.org/rfc/rfc4463.txt
S. Shanmugham, D. Burnett, Media Resource Control Protocol Version 2 (MRCPv2) (2007). NOTE: This is draft 17. As of December, 2008 it was the current draft. Upon final approval a stable IETF Internet Informational RFC reference number will be assigned. http://tools.ietf.orglid

Download references

Author information

Authors and Affiliations

J. Markowitz, Consultants, Chicago, IL, USA
Judith Markowitz

Authors

Judith Markowitz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Biometrics and Security, Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Stan Z. Li
Departments of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
Anil K. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Markowitz, J. (2015). Speaker Recognition, Standardization. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_240

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7488-4_240
Published: 03 July 2015
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7487-7
Online ISBN: 978-1-4899-7488-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics