Probabilistic models for combining diverse knowledge sources in multimedia retrieval
Publisher:
  • Carnegie Mellon University
  • Schenley Park Pittsburgh, PA
  • United States
ISBN:978-0-549-29638-6
Order Number:AAI3286687
Pages:219
Abstract

In recent years, the multimedia retrieval community has gradually shifted its emphasis from analyzing one media source at a time to exploring the opportunities of combining diverse knowledge sources from correlated media types and context. To combine multimedia knowledge sources, two basic issues must be addressed: what to combine and how to combine. While considerable effort has been expended to generate a wide range of ranking features from knowledge sources, relatively little attention has been given to the problem of finding a suitable strategy to combine them. It has always been a significant challenge to develop principled combination approaches and to capture useful factors such as query information and context information in the retrieval process.

This thesis presents a conditional probabilistic retrieval model as a principled framework to combine diverse knowledge sources. This model can integrate multiple forms of ranking features (query-dependent and query-independent features) as well as query information and context information in a unified framework with a solid probabilistic foundation. Under this retrieval framework, we survey and develop a number of state-of-the-art approaches for extracting ranking features from multimedia knowledge sources. To deal with heterogeneous features, a discriminative learning approach is suggested for estimating the combination parameters. Moreover, an efficient rank learning approach has been developed to explicitly model the ranking relations in the learning process with much less training time.
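
As an illustration only, the idea of discriminatively estimating combination parameters can be sketched with a plain logistic model, where the probability of relevance is a logistic function of a weighted sum of ranking features. The abstract does not give the thesis's exact model form or training procedure, so the logistic form, the gradient-ascent fit, the function names, and the toy feature values below are all assumptions made for this sketch.

```python
import math


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def relevance_probability(weights, features):
    """P(relevant | document, query) as a logistic function of a weighted feature sum."""
    return sigmoid(sum(w * f for w, f in zip(weights, features)))


def fit_weights(examples, labels, lr=0.5, epochs=500):
    """Estimate combination weights by gradient ascent on the conditional log-likelihood."""
    n_features = len(examples[0])
    weights = [0.0] * n_features
    for _ in range(epochs):
        grad = [0.0] * n_features
        for x, y in zip(examples, labels):
            err = y - relevance_probability(weights, x)
            for i, xi in enumerate(x):
                grad[i] += err * xi
        weights = [w + lr * g / len(examples) for w, g in zip(weights, grad)]
    return weights


# Toy data (made up): each row is [bias, text_score, visual_score].
examples = [
    [1.0, 0.9, 0.8],
    [1.0, 0.8, 0.3],
    [1.0, 0.2, 0.7],
    [1.0, 0.1, 0.2],
]
labels = [1, 1, 0, 0]  # 1 = relevant, 0 = not relevant

weights = fit_weights(examples, labels)

# Rank documents by their estimated probability of relevance.
ranked = sorted(
    range(len(examples)),
    key=lambda i: relevance_probability(weights, examples[i]),
    reverse=True,
)
```

Here the learned weights play the role of combination parameters: features that separate relevant from non-relevant documents in the training data receive larger weights, regardless of which knowledge source produced them.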

To incorporate query information in the combination model, this thesis develops a number of query analysis models that can automatically discover the mixing structure of the query space based on previous retrieval results, and predict combination parameters for unseen queries. In more detail, we propose the query-class based analysis model, which requires the query classes to be defined manually, and a series of probabilistic latent query analysis (pLQA) models, which can automatically discover latent query classes from the development data by unifying the combination weight optimization and query class categorization into a discriminative learning framework. To adapt the combination function on a per-query basis, this thesis also presents a probabilistic local context analysis (pLCA) model to automatically leverage additional retrieval sources to improve initial retrieval outputs. A pLCA variant is proposed to utilize human feedback to adjust combination parameters.
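
One plausible reading of query-class-dependent combination is a mixture: each query class has its own combination weights, and a query's relevance estimate mixes the per-class models by the query's class membership. The sketch below shows only that scoring step under this assumption. The class memberships and weights are hand-set hypothetical values (nothing is learned here), and the feature names are invented for illustration.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def mixture_relevance(class_probs, class_weights, features):
    """Mix per-class logistic combination models by the query's class membership."""
    total = 0.0
    for p_z, w_z in zip(class_probs, class_weights):
        score = sum(w * f for w, f in zip(w_z, features))
        total += p_z * sigmoid(score)
    return total


# Hypothetical weights over [text_score, face_score] for two query classes.
class_weights = [
    [4.0, 0.0],  # class 0 leans on text evidence
    [0.5, 4.0],  # class 1 leans on face-detection evidence
]

# One document with a weak text match and a strong face match.
doc_features = [0.2, 0.9]

# The same document scored for a mostly class-0 query and a mostly class-1 query.
p_text_query = mixture_relevance([0.9, 0.1], class_weights, doc_features)
p_person_query = mixture_relevance([0.1, 0.9], class_weights, doc_features)
```

With these made-up numbers, the document scores higher for the query that belongs mostly to the face-oriented class, which is the intended effect of letting combination weights depend on the query.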

All the proposed approaches are evaluated on multimedia retrieval tasks with large-scale video collections. Beyond multimedia collections, we also evaluate our approaches on meta-search tasks with large-scale text collections. Experimental evaluations demonstrate the promising performance of the probabilistic retrieval framework with query analysis and context analysis in the task of knowledge source combination. The applicability of the proposed methods can be extended to many other areas, such as question answering, web IR, cross-lingual IR, multi-sensor fusion, human tracking, and so forth.

Cited By

  1. Eskevich M, Jones G, Aly R, Ordelman R, Chen S, Nadeem D, Guinaudeau C, Gravier G, Sébillot P, de Nies T, Debevere P, Van de Walle R, Galuscakova P, Pecina P and Larson M Multimedia information seeking through search and hyperlinking Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, (287-294)
  2. Aly R and Demeester T Towards a better understanding of the relationship between probabilistic models in IR Proceedings of the Third international conference on Advances in information retrieval theory, (164-175)
  3. Aly R and Hiemstra D Concept detectors Proceedings of the 17th ACM international conference on Multimedia, (233-242)
  4. Aly R, Hiemstra D, de Vries A and de Jong F A probabilistic ranking framework using unobservable binary events for video search Proceedings of the 2008 international conference on Content-based image and video retrieval, (349-358)
  5. Kennedy L and Chang S A reranking approach for context-based concept fusion in video indexing and retrieval Proceedings of the 6th ACM international conference on Image and video retrieval, (333-340)
  6. Hauptmann A, Yan R and Lin W How many high-level concepts will fill the semantic gap in news video retrieval? Proceedings of the 6th ACM international conference on Image and video retrieval, (627-634)
  7. Natsev A, Haubold A, Tešić J, Xie L and Yan R Semantic concept-based query expansion and re-ranking for multimedia retrieval Proceedings of the 15th ACM international conference on Multimedia, (991-1000)
  8. Wang F, Xu D, Lu W and Wu W Automatic video annotation and retrieval based on bayesian inference Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I, (279-288)
  9. Yan R and Hauptmann A (2007). A review of text and image retrieval approaches for broadcast news video, Information Retrieval, 10:4-5, (445-484), Online publication date: 1-Oct-2007.
  10. Wang F, Xu D, Lu W and Xu H Automatic annotation and retrieval for videos Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology, (1030-1040)
Contributors
  • Carnegie Mellon University
  • Facebook, Inc.
