ISCSLP SR Evaluation, UVA–CS_es System Description. A System Based on ANNs

Vivaracho, Carlos E.

doi:10.1007/11939993_55

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4274)

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

Abstract

This paper describes the system used in the text-independent, cross-channel speaker verification task of the ISCSLP06 Speaker Recognition Evaluation. It is a discriminative system based on Artificial Neural Networks (ANNs) that uses the Non-Target Incremental Learning method to select world representatives. Two training strategies were followed: (i) using world representative samples with the same channel type as the true model, and (ii) selecting the world representatives from a pool of samples without channel type identification. The first alternative achieved the best results, but it introduces the additional problem of recognizing the channel type of the true model. The system used for this channel type recognition task is also described.
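
As an illustration only, the selection of world representatives can be sketched as an incremental loop. The abstract does not give the details of Non-Target Incremental Learning, so the loop below (train, score the remaining pool, then add the non-target samples that the current model confuses most with the target) is an assumption about its general shape, and a small logistic-regression scorer stands in for the ANN. The function names `candidate_pool`, `train_scorer` and `select_world_representatives`, and all their parameters, are hypothetical.

```python
import math
import random


def candidate_pool(samples, target_channel=None):
    """Build the pool of world (non-target) candidates.

    samples: list of (feature_vector, channel_label) pairs.
    Strategy (i): pass target_channel to keep only same-channel samples.
    Strategy (ii): pass None to use the whole pool, ignoring channel labels.
    """
    return [f for f, ch in samples
            if target_channel is None or ch == target_channel]


def _sigmoid(z):
    # Clamp z so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def train_scorer(targets, non_targets, epochs=200, lr=0.5):
    """Stand-in for the ANN (assumption): logistic regression trained by
    stochastic gradient descent. Returns a function that maps a feature
    vector to a score in (0, 1); higher means more target-like."""
    w = [0.0] * len(targets[0])
    b = 0.0
    data = [(x, 1.0) for x in targets] + [(x, 0.0) for x in non_targets]
    for _ in range(epochs):
        for x, y in data:
            err = _sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err

    def score(x):
        return _sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

    return score


def select_world_representatives(targets, pool, n_init=2, n_add=2,
                                 n_rounds=3, seed=0):
    """Return indices into pool chosen as world representatives.

    Assumed procedure: start from a small random subset, then in each
    round retrain and add the n_add remaining candidates with the
    highest target score (the ones most easily confused with the target).
    """
    rng = random.Random(seed)
    remaining = list(range(len(pool)))
    rng.shuffle(remaining)
    chosen, remaining = remaining[:n_init], remaining[n_init:]
    for _ in range(n_rounds):
        if not remaining:
            break
        score = train_scorer(targets, [pool[i] for i in chosen])
        remaining.sort(key=lambda i: score(pool[i]), reverse=True)
        chosen += remaining[:n_add]
        remaining = remaining[n_add:]
    return chosen
```

Under these assumptions, strategy (i) corresponds to calling `candidate_pool(samples, target_channel)` before the selection, which requires knowing or estimating the channel of the true model, while strategy (ii) corresponds to calling `candidate_pool(samples)` with no channel label.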

This work has been supported by the Ministerio de Ciencia y Tecnología, Spain, under Project TIC2003-08382-C05-03.

Author information

Authors and Affiliations

Dep. Informática, U. de Valladolid,
Carlos E. Vivaracho

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

About this paper

Cite this paper

Vivaracho, C.E. (2006). ISCSLP SR Evaluation, UVA–CS_es System Description. A System Based on ANNs. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science, vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_55

DOI: https://doi.org/10.1007/11939993_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer Science, Computer Science (R0)
