ABSTRACT
In this extended abstract, we present a novel story-driven approach to soundtrack retrieval for user-generated videos. Cinematic knowledge on cross-modal associations is exploited through folksonomic story text retrieval from collaborative online metadata resources. Subsequently, audiovisual synchronization is applied based on high-level features described by users. The approach is demonstrated in the MuseSync prototype system.
Supplemental Material
- A. J. Cohen. How music influences the interpretation of film and video: Approaches from experimental psychology. In R. A. Kendall and R. W. Savage, editors, Selected Reports in Ethnomusicology: Perspectives in Systematic Musicology, volume 12, pages 15--36. 2005.Google Scholar
- N. Cook. Analysing musical multimedia. Oxford University Press, New York, USA, 1998.Google Scholar
- J. Feng, B. Ni, and S. Yan. Auto-generation of professional background music for home-made videos. In Proc. ICIMCS, 2010. Google ScholarDigital Library
- J. Foote, M. Cooper, and A. Girgensohn. Creating music videos using automatic media analysis. Proc. ACM MM, 2002. Google ScholarDigital Library
- X. S. Hua and L. Lu. Optimization-based automated home video editing system. IEEE Trans. CSVT, 14(5):572--583, 2004. Google ScholarDigital Library
- S. Jeannin and A. Divakaran. MPEG-7 visual motion descriptors. IEEE Trans. CSVT, 11(6):720--724, 2001. Google ScholarDigital Library
- F.-F. Kuo, M.-F. Chiang, M.-K. Shan, and S.-Y. Lee. Emotion-based music recommendation by association discovery from film music. In Proc. ACM MM, 2005. Google ScholarDigital Library
- A. Stupar and S. Michel. PICASSO -- To Sing you must Close Your Eyes and Draw. In Proc. ACM SIGIR, 2011. Google ScholarDigital Library
- P. Tagg and B. Clarida. Ten Little Title Tunes -- Towards a Musicology of the Mass Media. The Mass Media Scholar's Press, New York, USA and Montreal, Canada, 2003.Google Scholar
Index Terms
- MuseSync: standing on the shoulders of Hollywood
Recommendations
A data-driven approach for tag refinement and localization in web videos
Our approach locates the temporal positions of tags in videos at the keyframe level.We deal with a scenario in which there is no pre-defined set of tags.We report experiments about the use of different web sources (Flickr, Google, Bing).We show state-of-...
Automatic tag expansion using visual similarity for photo sharing websites
In this paper we present an automatic photo tag expansion method designed for photo sharing websites. The purpose of the method is to suggest tags that are relevant to the visual content of a given photo at upload time. Both textual and visual cues are ...
Audio tag annotation and retrieval using tag count information
MMM'11: Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part IAudio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. With the explosive growth of digital music available on the Web, automatic audio tagging, which can be used to ...
Comments