DOI: 10.1145/1291233.1291251

A computation method for video segmentation utilizing the pleasure-arousal-dominance emotional information

Published: 29 September 2007

ABSTRACT

Extracting video structures is important for video indexing and navigation in large digital video archives. It is usually achieved by video segmentation algorithms. Little research effort has been invested in segmentation solutions that utilize the video's emotional content. Such solutions not only have the potential to outperform existing segmentation methods, but can also provide a more natural video segmentation with which viewers can associate. The development of an affect-based segmentation solution faces many challenges, such as the dynamic and time-evolving nature of a video's emotional content. This paper introduces a novel computation method for affect-based video segmentation. It is designed based on the Pleasure-Arousal-Dominance (P-A-D) emotion model [18], which in principle can represent a large number of emotions. The method consists of a P-A-D estimation stage and a segmentation stage. A P-A-D estimator based on Dynamic Bayesian Networks (DBNs) is proposed for the first stage. A clustering-based algorithm that utilizes the video's P-A-D information is proposed for the second stage. Experimental results demonstrate the feasibility of the method.
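The abstract does not specify the clustering algorithm used in the segmentation stage, so the following is only a minimal sketch of how a clustering-style pass over P-A-D values might group consecutive shots. It assumes that one (pleasure, arousal, dominance) triple per shot is already available from some upstream estimator. The function name `segment_by_pad`, the Euclidean distance measure, the running-centroid rule, and the `threshold` value are all hypothetical choices made for illustration, not the authors' method.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def segment_by_pad(pad_values, threshold=0.5):
    """Group consecutive shots into segments by P-A-D similarity.

    pad_values: list of (pleasure, arousal, dominance) tuples, one per shot,
                assumed to come from an upstream estimator (not shown here).
    threshold:  hypothetical distance above which a new segment is started.
    Returns a list of (start_index, end_index) pairs, end index inclusive.
    """
    if not pad_values:
        return []

    segments = []
    start = 0
    centroid = list(pad_values[0])  # running mean of the current segment
    count = 1

    for i in range(1, len(pad_values)):
        point = pad_values[i]
        if dist(point, centroid) > threshold:
            # Emotional content moved far from the current segment: close it.
            segments.append((start, i - 1))
            start, centroid, count = i, list(point), 1
        else:
            # Absorb the shot and update the running mean incrementally.
            count += 1
            centroid = [c + (p - c) / count for c, p in zip(centroid, point)]

    segments.append((start, len(pad_values) - 1))
    return segments


# Illustrative values only: two calm, pleasant shots, then two tense ones.
shots = [(0.8, 0.2, 0.5), (0.7, 0.3, 0.5), (-0.6, 0.9, 0.1), (-0.5, 0.8, 0.2)]
print(segment_by_pad(shots))  # [(0, 1), (2, 3)]
```

Restricting each cluster to consecutive shots keeps every segment contiguous in time, which is one simple way to account for the time-evolving nature of emotional content mentioned above.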

References

  1. M. Bradley. Emotional memory: a dimensional analysis. Hillsdale, NJ: Lawrence Erlbaum, 1994.
  2. Celoxica. RC200/203 Manual, 2005.
  3. B. Y. Chua and G. Lu. Improved perceptual tempo detection of music. In Intl. Conf. of Multimedia Model., pages 316--321, Jan. 2005.
  4. A. del Bimbo. Visual Information Retrieval. New York: Morgan Kaufmann, 1999.
  5. P. Ekman. Facial expression and emotion. American Psycho., 48(4):384--392, Apr. 1993.
  6. J. J. Gross and R. W. Levenson. Emotion elicitation using films. Cog. and Emot., 9(1):87--108, Jan. 1995.
  7. A. Hanjalic and R. Lagendijk. Automated high level segmentation for advanced video retrieval systems. IEEE Trans. Circuits Syst. Video Technol., 9(4):580--588, June 1999.
  8. A. Hanjalic and L. Q. Xu. Affective content representation and modeling. IEEE Trans. Multimedia, 7(1):143--154, Feb. 2005.
  9. L. Itti, C. Koch, and E. Niebur. A model for saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell., 20(11):1254--1259, Nov. 1998.
  10. A. Jaimes, T. Nagamine, J. Liu, K. Omura, and N. Sebe. Affective meeting video analysis. In IEEE Intl. Conf. on Multimedia and Expo, pages 1412--1415, July 2005.
  11. H. B. Kang. Analysis of scene context related with emotional events. In ACM Intl. Conf. on Multimedia.
  12. H. B. Kang. Affective content detection using hidden markov models. In ACM Intl. Conf. on Multimedia, pages 259--262, Nov. 2003.
  13. J. Kender and B. L. Yeo. Video scene segmentation via continuous video coherence. In IEEE Conference on Comput. Vis. and Pattern Recog.
  14. P. Lang. Perspectives on Anger and Emotion, pages 109--134. Lawrence Erlbaum Associates, 1993.
  15. J. Laroche. Estimating tempo, swing and beat locations in audio recordings. In IEEE App. of Signal Process. to Audio and Acoust., pages 135--138, Oct. 2001.
  16. M. Lew. Principles of Visual Information Retrieval. Springer-Verlag, Berlin, Germany, 2001.
  17. R. Lienhart, S. Pfeiffer, and W. Effelsberg. Scene determination based on video and audio features. In Intl. Conf. of Multimedia Syst.
  18. A. Mehrabian. Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament. Current Psycho., 14(4):261--292, Dec. 1996.
  19. S. Moncrieff, C. Dorai, and S. Venkatesh. Affect computing in film through sound energy dynamics. In ACM Intl. Conf. on Multimedia, volume 9.
  20. K. Murphy. Dynamic bayesian networks, Nov. 2002.
  21. C. E. Osgood, G. J. Suci, and P. H. Tannenbaum. The Measurement of Meaning. University of Illinois Press, 1967.
  22. M. Pantic and L. J. M. Rothkrantz. Toward an affect-sensitive multimodal human-computer interaction. Proc. IEEE, 91(9):1370--1390, Sept. 2003.
  23. R. Picard. Affective Computing. The MIT Press, Cambridge, MA, 1997.
  24. T. Pohle. Extraction of audio descriptors and their evaluation in music classification tasks. Diploma thesis, University of Kaiserslautern, Kaiserslautern, Germany, Jan. 2005.
  25. Y. Rui, T. S. Huang, and S. Mehrotra. Constructing table-of-content for videos. Multimedia Systems, 7(5).
  26. P. Shaver, J. Schwartz, D. Kirson, and G. O'Connor. Emotions in Social Psychology: Key Readings in Social Psychology, pages 26--56. Psychology Press, 2001.
  27. P. Valdez and A. Mehrabian. Effects of color on emotions. J. of Exp. Psycho., 124(4):394--409, Dec. 1994.
  28. J. Vendrig and M. Worring. Systematic evaluation of logical story unit segmentation. IEEE Trans. Multimedia, 4(4):492--499, Dec. 2002.
  29. M. Xu, L. T. Chia, and J. Jin. Affective content analysis in comedy and horror videos by audio emotional event detection. In IEEE Intl. Conf. on Multimedia and Expo, July 2005.

Published in

MM '07: Proceedings of the 15th ACM international conference on Multimedia
September 2007
1115 pages
ISBN: 9781595937025
DOI: 10.1145/1291233

Copyright © 2007 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 995 of 4,171 submissions, 24%
