Abstract
This paper proposes the Linear Prediction (LP) residual of the speech signal for characterizing basic emotions. The LP residual is obtained by inverse filtering the speech signal with the filter estimated through LP analysis, and it mainly retains the higher-order relations among speech samples. The instant of glottal closure in a speech signal is known as an epoch, and the significant excitation of the vocal tract usually takes place at this instant. For analysing speech emotions, LP residual samples chosen around the glottal closure instants are used. A semi-natural database, GEU-SNESC (Graphic Era University Semi Natural Emotion Speech Corpus), collected by recording dialogues of film actors from Hindi movies, is used for modeling the emotions. Four emotions, namely anger, happiness, neutral and sadness, are considered in this study. Auto-associative neural network models are used for characterizing the basic emotions present in the speech. Average emotion recognition rates of 66% and 59% are observed for the epoch-based and the entire LP residual samples, respectively.
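The inverse-filtering step described in the abstract can be sketched as follows. This is a minimal NumPy sketch using the autocorrelation (Yule-Walker) method; the function name `lp_residual`, the predictor order, and the regularization term are illustrative assumptions and not the paper's actual implementation:

```python
import numpy as np

def lp_residual(signal, order=10):
    """Estimate the LP residual of one speech frame by inverse filtering.

    Hypothetical helper (autocorrelation method); the paper does not
    specify implementation details such as the predictor order.
    """
    n = len(signal)
    # Autocorrelation sequence of the frame
    r = np.correlate(signal, signal, mode="full")[n - 1:]
    # Yule-Walker equations: R a = -r[1..p], with R Toeplitz
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Small diagonal loading keeps R well-conditioned for near-periodic frames
    R += 1e-6 * r[0] * np.eye(order)
    a = np.linalg.solve(R, -r[1:order + 1])
    # Inverse filter A(z) = 1 + a1 z^-1 + ... + ap z^-p applied to the frame
    coeffs = np.concatenate(([1.0], a))
    return np.convolve(signal, coeffs, mode="full")[:n]
```

For a highly predictable frame (e.g. a voiced, nearly periodic segment), the residual energy is much smaller than the signal energy, which is what makes the residual a useful carrier of excitation-source information.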
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
Cite this paper
Koolagudi, S.G., Devliyal, S., Barthwal, A., Sreenivasa Rao, K. (2012). Emotion Recognition from Semi Natural Speech Using Artificial Neural Networks and Excitation Source Features. In: Parashar, M., Kaushik, D., Rana, O.F., Samtaney, R., Yang, Y., Zomaya, A. (eds) Contemporary Computing. IC3 2012. Communications in Computer and Information Science, vol 306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32129-0_30
DOI: https://doi.org/10.1007/978-3-642-32129-0_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32128-3
Online ISBN: 978-3-642-32129-0