Synonyms
Audio information retrieval; Semantic inference in audio
Definition
An audio signal is a signal that carries information in the audible frequency range. Audio content analysis refers to a set of theories, algorithms and systems that aim to extract descriptors or metadata related to audio content and to enable search, retrieval and other user actions on audio signals.
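To make "extracting descriptors" concrete, the following is a minimal sketch, not the method of this entry. It assumes two common low-level, frame-based descriptors, short-time energy and zero-crossing rate, which the definition above does not name. The function names, frame sizes and synthetic test signal are all hypothetical choices made for illustration.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a sample sequence into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def describe(samples, frame_len=400, hop=200):
    """Return one (energy, zero-crossing rate) pair per frame."""
    return [(short_time_energy(f), zero_crossing_rate(f))
            for f in frame_signal(samples, frame_len, hop)]

# Hypothetical test signal: 0.5 s of silence, then 0.5 s of a 440 Hz tone,
# sampled at an assumed rate of 8 kHz.
SAMPLE_RATE = 8000
silence = [0.0] * (SAMPLE_RATE // 2)
tone = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
        for n in range(SAMPLE_RATE // 2)]
descriptors = describe(silence + tone)
```

Each frame is reduced to a small vector of numbers. In this sketch the silent frames have zero energy, while the tone frames have clearly non-zero energy and a zero-crossing rate that grows with the tone's frequency. Descriptors of this kind could then serve as input to classification, segmentation or retrieval.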
Historical Background
Multimedia content analysis has been one of the fastest-growing research directions in recent years. The research initiative, born around the end of the 1980s, aims to provide fast, natural, intuitive and personalized content-based access to vast multimedia data collections. It builds on the synergy of many scientific disciplines, such as signal processing, pattern recognition, machine learning, information retrieval, information theory, natural language processing and psychology, and it has succeeded in inspiring and mobilizing an enormous number of researchers worldwide....
Recommended Reading
Cai R, Lu L, Hanjalic A. Unsupervised content discovery in composite audio. In: Proceedings of the IEEE International Conference on Multimedia and Expo; 2005. p. 628–37.
Cai R, Lu L, Hanjalic A, Zhang H-J, Cai L-H. A flexible framework for key audio effects detection and auditory context inference. IEEE Trans Audio Speech Lang Process. 2006;14(3):1026–39.
Casey M, et al. Content-based music information retrieval: current directions and future challenges. In: Proceedings of the IEEE, Special Issue on Advances in Multimedia Information Retrieval. 2008;96(4):668–96.
Cheng W-H, Chu W-T, Wu J-L. Semantic context detection based on hierarchical audio models. In: Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2003. p. 109–15.
Hanjalic A. Content-based analysis of digital video. Norwell: Kluwer; 2004.
Huang X, Acero A, Hon HW. Spoken language processing: a guide to theory, algorithm, and system development. Upper Saddle River: Prentice Hall; 2001.
Lu L, Cai R, Hanjalic A. Audio elements based auditory scene segmentation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing; 2006. p. 17–20.
Lu L, Zhang H-J, Jiang H. Content analysis for audio classification and segmentation. IEEE Trans Speech Audio Process. 2002;10(7):504–16.
Radhakrishnan R, Divakaran A, Xiong Z. A time series clustering based framework for multimedia mining and summarization using audio features. In: Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2004. p. 157–64.
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
Cite this entry
Lu, L., Hanjalic, A. (2018). Audio Content Analysis. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_1528
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1528
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering