Abstract
Singer identification is the process of identifying or recognizing the singers based on the uniqueness in their singing voice. It is a challenging task in music information retrieval because of the combined instrumental music with the singing voice. The work presented in this paper recognizes a singer using Mel Frequency Cepstral Coefficient (MFCC) features and Chroma-Reduced Pitch (CRP) features with Support Vector Machines (SVM). The proposed technique for singer identification has two phases: feature extraction and identification. During the feature extraction phase, MFCC and CRP features are extracted from the songs in a database of popular music. In the second phase, the extracted features are trained with the SVM classifier. To evaluate our work, a dataset of 50 music clips was tested against the trained models of various singers. An equal error rate of 8% and 56% is achieved with SVM using MFCC and CRP features, respectively. By combining MFCC and CRP features at score level, an EER of 6.0% is obtained which indicates a significant increase in identification rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
India Brand Equity Foundation. http://www.ibef.orglast. Accessed 25 Dec 2017
Ratanpara, T.: Singer identification using MFCC and LPCC from Indian video songs. In: Emerging the ICT for Bringing the Future, pp. 275–282 (2015)
Tsai, W.H., Lee, H.C.: Singer Identification based on spoken data in voice characterization. IEEE Trans. Acoust. Speech Signal Process. 20(8), 2291–2300 (2012)
Regnier, L., Peters, G.: Singer verification: Singer model Vs. Song model. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 437–440. Japan (2012)
Patil, A, H., Radadia, G, P., Basu, T, K.: Combining evidences from Mel cepstral features and cepstral Mean Subtracted features for singer identification. In: International Conference on Asian Language Processing, pp. 145–148. Vietnam (2012)
Dharini, D., Revathy, A.: Singer identification using clustering algorithm. In: International Conference on Communication and Signal Processing, pp. 1927–1931. Bangkok (2014)
Sarkar, R., Saha, S.K.: Singer based classification of songs dataset using vocal signature inherent in signal. In: Fifth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics. Patna (2015)
Seetharaman, P., Rafii, Z.: Cover song identification with 2D Fourier transform sequences, In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans (2017)
Davis, S.B., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28, 357–366 (2009)
Youngmoo, E, Kim., Erik, M, Schmidt., Raymond, Migneco., Brandon, G, Morton.: Music emotion recognition: a state of the art review. In: 1st International Society for Music Information Retrieval Conference, pp. 255–266. Netherlands (2010)
Nalini, N.J., Palanivel, S., Balasubramanian, M.: Speech emotion recognition using residual phase and MFCC feature. Int. J. Eng. Technol. 5(6), 4515–4527 (2014)
Dan, jurafsky.: Speech Recognition and Synthesis-Feature Extraction and Acoustic Modeling lecture notes (2007)
O’Shaughnessy, O.: Interacting with computers by voice: automatic speech recognition and synthesis. Proc. IEEE 91(9), 1272–1305 (2003)
Peinado, A., Segura, J.: Speech Recognition over Digital Channels: Robustness and Standards, John Wiley and Sons (2006)
Vapnik, V.: Statistical learning theory. John Wiley and Sons, New York (1998)
Meinard, M., Sebastian, E., Sebastian, K.: Making chroma features more robust to timbre changes. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1869–1872, Taiwan (2009)
Müller, M., Kurth, F., Clausen, M.: Audio matchingvia chroma-based statistical features, In: International Music for Information Retrieval Conference, London (2005)
Ken, O’H., Sebastian, E., Johan, P., Mark, B., S.: Improved template based chord recognition using the CRP feature, In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans(2017)
O’Hanlon, K., Sandler, M.: A compositional approach to chroma estimation, In: Proceedings of the European Signal Processing Conference (EUSIPCO), Hungary (2016)
M¨uller, M., Ewert, S.: Towards timbre-invariant audiofeatures for harmony-based music. IEEE Trans. Audio Speech Lang Process. 18(3), 649–662 (2010)
Dhanalakshm, P., Palanivel, S., Ramalingam,V.: Classification of audio signals using SVM andRBFNN, Expert systems with applications, 36(3.part 2), pp. 6069–6075 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sangeetha, R., Nalini, N.J. (2020). Singer Identification Using MFCC and CRP Features with Support Vector Machines. In: Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore. https://doi.org/10.1007/978-981-13-9042-5_25
Download citation
DOI: https://doi.org/10.1007/978-981-13-9042-5_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9041-8
Online ISBN: 978-981-13-9042-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)