skip to main content
10.1145/1816041.1816074acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
poster

Music video affective understanding using feature importance analysis

Authors Info & Claims
Published:05 July 2010Publication History

ABSTRACT

Music video is a popular type of entertainment by viewers. Currently, the novel indexing and retrieval approach based on the affective cues contained in music videos becomes more and more attractive to users. Music video affective analysis and understanding is one of the most popular topics in current multimedia community. In this paper, we propose a novel feature importance analysis approach to select most representative arousal and valence features for arousal and valence modeling. Compared with state-of-the-art work by Zhang on music video affective analysis, our main contributions are in the following aspects: (1) Another 3 affect-related features are extracted to enrich the feature set and exploit their correlation with arousal and valence. (2) All extracted features are ordered via feature importance analysis, and then optimal feature subset is selected after ordering. (3) Different regression methods are compared for arousal and valence modeling in order to find the fittest estimation function. Our method achieves 33.39% and 42.17% deduction in terms of mean absolute error compared with Zhang's method. Experimental results demonstrate our proposed method has a considerable improvement on music video affective understanding.

References

  1. S. Arifin and P. Y. Cheung. User attention based arousal content modeling. In Proceedings of IEEE International Conference on Image Processing (ICIP), pages 433--436, March 2006.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Arifin and P. Y. Cheung. Affective level video segmentation by utilizing the pleasure-arousal-dominance information. IEEE Transactions on Multimedia, 10(7):1325--1341, November 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley Interscience, New York, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. R. Gunn. Support vector machines for classification and regression. Technical report, Image Speech and Intelligent Systems Research Group, University of Southampton, U.K., 1998.Google ScholarGoogle Scholar
  5. A. Hanjalic and L.-Q. Xu. Affective video content representation and modeling. IEEE Transactions on Multimedia, 7(1):143--154, February 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. I. Jolliffe. Principle Component Analysis. Springer-Verlag, New York, 1986.Google ScholarGoogle ScholarCross RefCross Ref
  7. D. Li, I. K. Sethi, N. Dimitrova, and T. McGee. Classification of general audio data for content-based retrieval. Pattern Recognition Letters, 22:533--544, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L. Lu, D. Liu, and H.-J. Zhang. Automatic mood detection and tracking of music audio signals. IEEE Transactions on Audio, Speech, and Language Processing, 14(1):5--18, January 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. M. Ruxanda, B. Y. Chua, Alexandros, and C. S. Jensen. Emotion-based music retrieval on a well-reduced audio feature space. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 181--184, April 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Soleymani, G. Chanel, J. J. Kierkels, and T. Pun. Affective characterization of movie scenes based on multimedia content analysis and user's physiological emotional responses. In Proceedings of the Tenth IEEE International Symposium on Multimedia, pages 228--235, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. Sun, J. Yu, Y. Huang, and X. Hu. An improved valence-arousal emotion space for video affective content representation and recognition. In Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pages 566--569, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. Valdez and A. Mehrabian. Effects of color on emotions. Journal of Experimental Psychology, 123:394--409, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  13. V. N. Vapnik. Statistical Learning Theory. John Wiley and Sons, Inc., New York, 1998.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. H. L. Wang and L.-F. Cheong. Affective understanding in film. IEEE Transactions on Circuits and Systems for Video Technology, 16(6):689--704, June 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Weisberg. Applied Linear Regression. Wiley/Interscience, New York, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Xu, J. S. Jin, S. Luo, and L. Duan. Hierarchical movie affective content analysis based on arousal and valence features. In Proceedings of ACM Multimedia, pages 677--680, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Zhang, Q. Huang, Q. Tian, S. Jiang, and W. Gao. i. mtv - an integrated system for mtv affective analysis. In Proceedings of ACM Multimedia (demenstration), pages 985--986, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Zhang, Q. Huang, Q. Tian, S. Jiang, and W. Gao. Personalized mtv affective analysis using user profile. In Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing, pages 327--337, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Zhang, Q. Tian, Q. Huang, W. Gao, and S. Li. Utilizing affective analysis for effective movie browsing. In Proceedings of IEEE International Conference on Image Processing (ICIP), pages 677--680, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Zhang, Q. Tian, S. Jiang, Q. Huang, and W. Gao. Affective mtv analysis based on arousal and valence features. In Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pages 1369--1372, 2008.Google ScholarGoogle Scholar
  21. T. Zhang and C.-C. J. Kuo. Audio content analysis for online audiovisual data segmentation and classification. IEEE Transactions on Speech and Audio Processing, 9(4):441--457, May 2001.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Music video affective understanding using feature importance analysis

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval
        July 2010
        492 pages
        ISBN:9781450301176
        DOI:10.1145/1816041

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 5 July 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • poster

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader