skip to main content
10.1145/1027527.1027549acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Content-based music structure analysis with applications to music semantics understanding

Authors Info & Claims
Published:10 October 2004Publication History

ABSTRACT

In this paper, we present a novel approach for music structure analysis. A new segmentation method, beat space segmentation, is proposed and used for music chord detection and vocal/instrumental boundary detection. The wrongly detected chords in the chord pattern sequence and the misclassified vocal/instrumental frames are corrected using heuristics derived from the domain knowledge of music composition. Melody-based similarity regions are detected by matching sub-chord patterns using dynamic programming. The vocal content of the melody-based similarity regions is further analyzed to detect the content-based similarity regions. Based on melody-based and content-based similarity regions, the music structure is identified. Experimental results are encouraging and indicate that the performance of the proposed approach is superior to that of the existing methods. We believe that music structure analysis can greatly help music semantics understanding which can aid music transcription, summarization, retrieval and streaming.

References

  1. Bartsch, M. A., and Wakefield, G. H. To Catch a Chorus: Using Chroma-based Representations for Audio Thumbnailing. In Proc. WASPA. 2001.Google ScholarGoogle ScholarCross RefCross Ref
  2. Berenzweig, A. L., and Ellis, D.P.W. Location singing voice segments within music signals. In Proc. IEEE WASPAA. 2001.Google ScholarGoogle Scholar
  3. Chai, W., and Vercoe, B. Music Thumbnailing via Structural Analysis. In Proc. ACM Multimedia. 2003, 223--226. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cooper, M., and Foote, J. Automatic Music Summarization via Similarity Analysis. In Proc. ISMIR. 2002.Google ScholarGoogle Scholar
  5. Collobert <http://www.idiap.ch/~collober>, R., and Bengio <http://www.idiap.ch/~bengio>, S. SVMTorch: Support Vector Machines for Large-Scale Regression Problems <http://www.ai.mit.edu/projects/jmlr/papers/volume1/collobert01a/collobert01a.ps.gz>. Journal of Machine Learning Research. 2001, Vol 1, 143--160. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Duxburg. C, Sandler. M., and Davies. M. A Hybrid Approach to Musical Note Onset Detection. In Proc. International Conference on DAFx. 2002.Google ScholarGoogle Scholar
  7. Foote, J., Cooper, M., and Girgensohn, A. Creating Music Video using Automatic Media Analysis. In Proc. ACM Multimedia. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Fujinaga, I. Machine Recognition of Timbre Using Steady-state Tone of Acoustic Musical Instruments. In Proc. ICMC. 1998, 207--210.Google ScholarGoogle Scholar
  9. Ghias, A., Logan, J., Chamberlin, D., and Smith, B. C. Query By Humming: Musical Information Retrieval in an Audio Database. In Proc. ACM Multimedia. 1995, 231--236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Goto, M. A Chorus-Section Detecting Method for Musical Audio Signals. In Proc. IEEE ICASSP. 2003.Google ScholarGoogle Scholar
  11. Goto, M. An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds. Journal of new Music Research. June. 2001, Vol.30, 159--171.Google ScholarGoogle Scholar
  12. Deller, J. R., Hansen, J.H.L., and Proakis, H. J. G. Discrete-Time Processing of Speech Signals, IEEE Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Kim, Y.K., and Brian, Y. Singer Identification in Popular Music Recordings Using Voice Coding Features. In Proc. ISMIR 2002.Google ScholarGoogle Scholar
  14. Logan, B., and Chu, S. Music Summarization Using Key Phrases. In Proc. IEEE ICASSP. 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lu, L., and Zhang, H. Automated Extraction of Music Snippets. In Proc. ACM Multimedia. 2003, 140--147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Navarro, G. A guided tour to approximate string matching, ACM Computing Surveys, March 2001, Vol.33, No 1, 31--88. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Rossing, T.D., Moore, F. R., and Wheeler, P. A. Science of Sound. Addison Wesley, 3rd edition 2001.Google ScholarGoogle Scholar
  18. Rudiments and Theory of Music. The associated board of the royal schools of music, 14 Bedford Square, London, WC1B 3JG, 1949.Google ScholarGoogle Scholar
  19. Scheirer, E. D. Tempo and Beat Analysis of Acoustic Musical Signals. Journal of the Acoustical Society of America. January 1998, Vol 103, No 1, 588 -- 601.Google ScholarGoogle Scholar
  20. Sheh, A., and Ellis, D.P.W. Chord Segmentation and Recognition using EM-Trained Hidden Markov Models. In Proc. ISMIR 2003.Google ScholarGoogle Scholar
  21. Shenoy, A., Mohapatra, R., and Wang, Y. Key Detection of Acoustic Musical Signals, In Proc, ICME 2004.Google ScholarGoogle Scholar
  22. Ten Minute Master No 18: Song Structure. MUSIC TECH magazine. www.musictechmag.co.uk <http://www.musictechmag.co.uk> (Oct. 2003), 62 -- 63.Google ScholarGoogle Scholar
  23. Wang, Y. et al. Content-Based UEP: A New Scheme for Packet Loss Recovery in Music Streaming. In Proc. ACM Multimedia. 2003. 412 -- 421. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Xu, C.S., Maddage, N.C., and Shao, X. Automatic Music Classification and Summarization. In IEEE Transaction on Speech and Audio Processing (accepted).Google ScholarGoogle Scholar
  25. Young, S. et al. The HTK Book. Dept of Engineering, University of Cambridge, Version 3.2, 2002.Google ScholarGoogle Scholar

Index Terms

  1. Content-based music structure analysis with applications to music semantics understanding

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia
          October 2004
          1028 pages
          ISBN:1581138938
          DOI:10.1145/1027527

          Copyright © 2004 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 10 October 2004

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate995of4,171submissions,24%

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader