Skip to main content

Acoustic Scene Classification Using Convolutional Neural Network

  • Conference paper
  • First Online:
Soft Computing and Signal Processing (ICSCSP 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1118))

Included in the following conference series:

  • 572 Accesses

Abstract

This proposed research work presents acoustic scene classification (ASC) which is an errand to relate a semantic name to a sound stream that distinguishes the environment in which it has been delivered. ASC can be applied in many areas including mobile robot navigation systems and context-aware devices, such as an automatically mode-switching smart phones according to the current acoustic environment. Proposing a strong ASC system is difficult because the sound from natural setting compromises numerous audio sources and also the microphones do not seem to be organized in a very controlled condition. Furthermore, not all sounds from long-duration audio data are relevant for identifying scene label. The dataset for this assignment is that the DCASE 2018 dataset collected from Tampere University of Technology, comprising of sound recordings from different scenes like airport, metro station, shopping mall, etc. For each location, there are 5–6 min of audio files. We propose to implement the ASC task using convolutional neural network (CNN) that performs the task of classification. The audio files are converted to log mel-spectrograms which are provided as input to CNN. Upon training the CNN model by varying the number of layers and the hyperparameters, it is observed that significant accuracy of 78.4 and 73.84% has been achieved for the inputs RGB scale spectrograms and grayscale spectrograms, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dang, A., . Vu, T.-H., Wang, J.-C.: Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2017)

    Google Scholar 

  2. Mesaros, A., Heittola, T., Benetos, E., Foster, P., Lagrange, M., Virtane, T., Plumbley, M.D: Detection and classification of acoustic scenes and events: outcome of the DCASE 2016 challenge. In: IEEE/ACM Trans. Audio Speech Lang. Process. (2016)

    Google Scholar 

  3. Mesaros, A., Heittola, T., Virtanen, T.: Tut database for acoustic scene classification and sound event detection. In: Proceedings of Signal Processing Conference (EUSIPCO) (2016)

    Google Scholar 

  4. Li, D., Tam, J., Toub, D.: Auditory scene classification using machine learning techniques. In: Proceedings of IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (2013)

    Google Scholar 

  5. Barchiesi, D., Giannoulis, D., Stowell, D., Plumbley, M.D.: Acoustic scene classification. In: Proceedings of Detection and Classification of Acoustic Scenes and Events, pp. 4–12 (2014)

    Google Scholar 

  6. Takahashi, G., Yamada, T., Ono, N., Makino, S: Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features. In: Proceedings of APSIPA Annual Summit and Conference (2017)

    Google Scholar 

  7. Jiang, H., Bai, J., Zhang, S., Xu, B: SVM-based audio scene classification. In: Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering (2005)

    Google Scholar 

  8. Battaglino, D., Lepauloux, L., Evans, N.: Acoustic scene classification using convolutional neural networks. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2016)

    Google Scholar 

  9. D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange, and M. D. Plumbley: Detection and classification of acoustic scenes and events: an IEEE AASP challenge. In: Proceedings of IEEE Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–4 (2013)

    Google Scholar 

  10. Alexander, G., Alexander, L: Acoustic scene classification using convolutional neural networks and different channels representations and its fusion. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2018)

    Google Scholar 

  11. Hussaina, K., Hussainb, M., Khanc, M.G.: An improved acoustic scene classification method using convolutional neural networks (CNNs). Am. Sci. Res. J. Eng. Technol. Sci. (2018)

    Google Scholar 

  12. http://dcase.community/challenge2018/task-acoustic-scene-classification

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Akshara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Akshara, S., Hemapriyalakshmi, R., Keerthana, S., Bharathi, B., Kavitha, S. (2020). Acoustic Scene Classification Using Convolutional Neural Network. In: Reddy, V., Prasad, V., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2019. Advances in Intelligent Systems and Computing, vol 1118. Springer, Singapore. https://doi.org/10.1007/978-981-15-2475-2_28

Download citation

Publish with us

Policies and ethics