Skip to main content

Visualizing Similarity among Estimated Melody Sequences from Musical Audio

  • Conference paper
The Grammar of Technology Development
  • 487 Accesses

Abstract

We have developed a music retrieval system that receives a humming query and finds similar audio intervals (segments) in a musical audio database. This system enables a user to retrieve a segment of a desired musical audio signal just by singing its melody. In this paper, we propose a method to summarize the music database through similarity analysis to thereby reduce the retrieval time. The distance of chroma vectors is used as a similarity measure. The key technique for summarization includes, mainly, a statistical smoothing method and a method of discriminant analysis. Practical experiments were conducted using 115 musical audio selections in the RWC popular music database. We report the summarization ratio as about 45%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bartsch, M. and Wakefield, G.H. (2001) To Catch a Chorus: Using Chroma-Based Representations For Audio Thumbnailing. in Proceedings, of the Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE.

    Google Scholar 

  2. Cooper, M. and Foote, J. (2002) Automatic music summarization via similarity analysis, Proc. ISMIR, 81–85.

    Google Scholar 

  3. Dannenberg, R.B. and Hu, N. (2002) Pattern Discovery Techniques for Music Audio, Proc. ISMIR 2002, 63–70.

    Google Scholar 

  4. Goto, M. (2001) A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models, Proc. ICASSP 2001, V-3365–3368.

    Google Scholar 

  5. Goto, M., Hashiguchi, H., Nishimura, T. and Oka, R. (2002) RWC Music Database: Popular, Classical, and Jazz Music Databases, Proc. ISMIR 2002, 287–288.

    Google Scholar 

  6. Goto, M. (2002) A Real-time Music Scene Description System: A Chorus-Section Detecting Method, 2002-MUS-47-6, 2002 (100), 27–34 (in Japanese).

    MathSciNet  Google Scholar 

  7. Goto, M. (2003) A Chorus-Section Detecting Method for Musical Audio Signals, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), pp. V-437–440.

    Google Scholar 

  8. Goto, M. (2003) SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Proceedings of 16th Annual ACM symposium on User Interface Software and Technology (UIST 2003), pp. 31–40.

    Google Scholar 

  9. Hashiguchi, H., Nishimura, T., Takita, T., Zhang, J.X. and Oka, R. (2001) Music Signal Spotting Retrieval by a Humming Query, Proceedings of Fifth World Multi-Conference on Systemics, Cybernetics and Informatics, VII 280–284.

    Google Scholar 

  10. Nishimura, T., Hashiguchi, H., Takita, J., Zhang, J.X. and Oka, R. (2001) Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming, Proc. ISMIR 2001, 211–218.

    Google Scholar 

  11. Ohtsu. N. (1979) A threshold selection method from gray-level histograms, IEEE Trans. SMC, SMC-9(1), 62–66.

    Google Scholar 

  12. Peeters, G., Burthe, A.L. and Rodet, X. (2002) Toward automatic music audio summary generation from signal analysis, Proc. ISMIR 2002, 94–100.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer

About this paper

Cite this paper

Hashiguchi, H. (2008). Visualizing Similarity among Estimated Melody Sequences from Musical Audio. In: Tsubaki, H., Yamada, S., Nishina, K. (eds) The Grammar of Technology Development. Springer, Tokyo. https://doi.org/10.1007/978-4-431-75232-5_15

Download citation

  • DOI: https://doi.org/10.1007/978-4-431-75232-5_15

  • Received:

  • Accepted:

  • Publisher Name: Springer, Tokyo

  • Print ISBN: 978-4-431-75231-8

  • Online ISBN: 978-4-431-75232-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics