skip to main content
10.1145/2461466.2461511acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
poster

Multimedia information seeking through search and hyperlinking

Authors Info & Claims
Published:16 April 2013Publication History

ABSTRACT

Searching for relevant webpages and following hyperlinks to related content is a widely accepted and effective approach to information seeking on the textual web. Existing work on multimedia information retrieval has focused on search for individual relevant items or on content linking without specific attention to search results. We describe our research exploring integrated multimodal search and hyperlinking for multimedia data. Our investigation is based on the MediaEval 2012 Search and Hyperlinking task. This includes a known-item search task using the Blip10000 internet video collection, where automatically created hyperlinks link each relevant item to related items within the collection. The search test queries and link assessment for this task was generated using the Amazon Mechanical Turk crowdsourcing platform. Our investigation examines a range of alternative methods which seek to address the challenges of search and hyperlinking using multimodal approaches. The results of our experiments are used to propose a research agenda for developing effective techniques for search and hyperlinking of multimedia content.

References

  1. M Bron, B Huurnink, and M de Rijke. Linking archives using document enrichment and term selection. In Proceedings of TPDL 2011, pages 2357--2360, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman. The devil is in the details: an evaluation of recent feature encoding method. In Proceedings of BMVC 2011, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  3. R.G. Cinbis, Jakob Verbeek, and Cordelia Schmid. Unsupervised Metric Learning for Face Identification in TV Video. In Proceedings of ICCV 2011, Barcelona, Spain, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Eskevich, G.J. F. Jones, S. Chen, R. Aly, R.J.F. Ordelman, and M. Larson. Search and Hyperlinking Task at Mediaeval 2012. In MediaEval, volume 927 of CEUR Workshop Proceedings. CEUR-WS.org, 2012.Google ScholarGoogle Scholar
  5. M. Eskevich, G.J.F. Jones, M. Larson, and R.J.F. Ordelman. Creating a data collection for evaluating rich speech retrieval. In Proceedings of LREC 2012, Istanbul, Turkey, 2012.Google ScholarGoogle Scholar
  6. M. Eskevich, G.J.F. Jones, M. Larson, C. Wartena, R. Aly, T. Verschoor, and R.J.F. Ordelman. Comparing retrieval effectiveness of alternative content segmentation methods for internet video search. In Proeedings of CBMI 2012, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  7. M. Eskevich, W. Magdy, and G.J.F. Jones. New metrics for meaningful evaluation of informally structured speech retrieval. In Proceedings of ECIR 2012, pages 170--181, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J.S. Garofolo, C.G.P. Auzanne, and E.M. Voorhees. The TREC spoken document retrieval track: A success story. In Proceedings of RIAO 2000, pages 1--8, 2000.Google ScholarGoogle Scholar
  9. A. Girgensohn, L. Wilcox, F. Shipman, and S. Bly. Designing affordances for the navigation of detail-on-demand hypervideo. In Proceedings of AVI 2004, pages 290--297. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. L. Hardman. Modelling and authoring hypermedia documents. PhD thesis, Universiteit Amsterdam, 1998.Google ScholarGoogle Scholar
  11. D. Hiemstra. Using language models for information retrieval. PhD thesis, University of Twente, 2001.Google ScholarGoogle Scholar
  12. P. Hoffmann, T. Kochems, and M. Herczeg. HyLive: Hypervideo-Authoring for Live Television. In Changing Television Environments, pages 51--60. Springer, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Kaneko, T. Takigami, and T. Akiba. STD based on hough transform and SDR using STD results: Experiments at NTCIR-9 SpokenDoc. In Proceedings of Ninth NTCIR Workshop Meeting, 2011.Google ScholarGoogle Scholar
  14. P. Kelm, S. Schmiedeke, and T. Sikora. Feature-based Video Key Frame Extraction for low Quality Video Sequences. In Proceedings of WIAMIS 2009.Google ScholarGoogle ScholarCross RefCross Ref
  15. Lori Lamel and Jean-Luc Gauvain. Speech processing for audio indexing. In Advances in Natural Language Processing, volume 5221 of LNCS, pages 4--15. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Larson, M. Eskevich, R. Ordelman, C. Kofler, S. Schmiedeke, and G. J. F. Jones. Overview of MediaEval 2011 Rich Speech Retrieval Task and Genre Tagging Task. In MediaEval 2011 Workshop, Pisa, Italy, 2011.Google ScholarGoogle Scholar
  17. M. Larson, C. Kofler, and A. Hanjalic. Reading between the tags to predict real-world size-class for visually depicted objects in images. In Proceedings of ACM MM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Larson, M. Soleymani, M. Eskevich, P. Serdyukov, R.J.F. Ordelman, and G. J. F. Jones. The community and the crowd: Multimedia benchmark dataset development. IEEE MultiMedia, 19(3):15, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M.A. Larson, S. Schmiedeke, P. Kelm, A. Rae, V. Mezaris, T. Piatrik, M. Soleymani, F. Metze, and G.J.F. Jones, editors. Working Notes Proceedings of the MediaEval 2012 Workshop, Santa Croce in Fossabanda, Pisa, Italy, October 4--5, 2012, volume 927 of CEUR Workshop Proceedings. CEUR-WS.org, 2012.Google ScholarGoogle Scholar
  20. B. Meixner, K. Matusik, C. Grill, and H. Kosch. Towards an easy to use authoring tool for interactive non-linear video. Multimedia Tools and Applications, pages 1--26, 2012.Google ScholarGoogle Scholar
  21. P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia spotlight: Shedding light on the web of documents. In Proceedings of the 7th International Conference on Semantic Systems (I-Semantics), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Milne and I.H. Witten. Learning to link with wikipedia. In Proceeding of CIKM 2008, pages 509--518. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Morang, R.J.F. Ordelman, F.M.G. de Jong, and A.J. van Hessen. InfoLink: analysis of Dutch broadcast news and cross-media browsing. In Proceedings of ICME 2005, Los Alamitos, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  24. P. Pecina, P. Hoffmannova, G. J. F. Jones, Y. Zhang, and D. W. Oard. Overview of the CLEF 2007 cross-language speech retrieval track. In Proceedings of CLEF 2007, pages 674--686, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Robertson, H. Zaragoza, and M. Taylor. Simple BM25 extension to multiple weighted fields. In Proceedings of ACM CIKM 2004, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Rousseau, F. Bougares, P. Deléglise, H. Schwenk, and Y. Estèv. Lium's systems for the iwslt 2011 speech translation tasks. In Proceedings of IWSLT 2011, 2011.Google ScholarGoogle Scholar
  27. I. Sawhney, N. and Balcom, D. and Smith. Authoring and navigating video in space and time. MultiMedia, IEEE, 4(4):30--39, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. F. Shipman, A. Girgensohn, and L. Wilcox. Authoring, viewing, and generating hypervideo: An overview of Hyper-Hitchcock. ACM Trans. Multimedia Comput. Commun. Appl., (2):15:1---15:19, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Sivic and A. Zisserman. Video google: a text retrieval approach to object matching in videos. In Proceedings of ICCV 2003, pages 1470 --1477 vol.2, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In Proceedings of MIR 2006, Santa Barbara, California, USA, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell., 22(12):1349--1380, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. M. Utiyama and H. Isahara. A statistical model for domain-independent text segmentation. In Proceedings of ACL 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. E. Voorhees, D.K. Harman, National Institute of Standards, and Technology (US). TREC: Experiment and evaluation in information retrieval. MIT press USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. E.M. Voorhees. The TREC-8 Question Answering Track Report. In Proceedings of TREC-8, pages 77--82, 1999.Google ScholarGoogle Scholar
  35. R. Yan. Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval. PhD thesis, Carnegie Mellon University, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multimedia information seeking through search and hyperlinking

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
        April 2013
        362 pages
        ISBN:9781450320337
        DOI:10.1145/2461466

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 16 April 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • poster

        Acceptance Rates

        ICMR '13 Paper Acceptance Rate38of96submissions,40%Overall Acceptance Rate254of830submissions,31%

        Upcoming Conference

        ICMR '24
        International Conference on Multimedia Retrieval
        June 10 - 14, 2024
        Phuket , Thailand

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader