ABSTRACT
As an important procedure in image retrieval, off-line indexing focuses on organizing relevant images together and making them easy to access. However, most of existing indexing strategies view database images individually and only consider partial relevance, i.e., either visual or semantic relevance among them. To overcome these issues and design better indexing strategy, we propose to package semantically relevant images into superimages, and then index superimages instead of single images. Superimage effectively packages multiple images into one new unit, hence significantly decreases the number of images to be indexed. This naturally saves the memory cost and retrieval time. To make the final index file discriminative to both visual and semantic relevances, we extract local descriptors from superimages and index them with inverted file. During online retrieval, we only need to extract local descriptors from queries, but could get semantic-aware retrieval results. This is because during our off-line indexing stage, both the semantically and visually relevant images are organized together. Therefore, our approach is superior to many online retrieval fusion algorithms. Experimental results on UKbench, Holidays, and one large-scale dataset all manifest the promising performance of our approach, i.e., competitive precision, better efficiency, and only about 1/2 memory consumption compared with state-of-the-arts.
- Large scale visual recognition challenge. http://www.image-net.org/challenges/LSVRC/2010, 2010. 2, 3.1Google Scholar
- J. Deng, A. C. Berg, and L. Fei-Fei. Hierarchical semantic indexing for large scale image retrieval. In CVPR, 2011. 1, 2Google ScholarDigital Library
- M. Douze, A. Ramisa, and C. Schmid. Combining attributes and fisher vectors for effcient image retrieval. In CVPR, 2011. 1, 1, 2, 2 Google ScholarDigital Library
- C. Fellbaum. Wordnet: an electronic lexical database. Bradford Books, 1998. 2Google ScholarCross Ref
- B. J. Frey and D. Dueck. Clustering by passing messages between data points. Science, 315(5814):972--976, 2007. 3.2Google ScholarCross Ref
- A. Gionis, P. Indyky, and R. Motwaniz. Similarity search in high dimensions via hashing. In International Conference on Very Large Data Bases (VLDB), pages 518--529, 1999. 1 Google ScholarDigital Library
- M. J. Huiskes and M. S. Lew. The mir flickr retrieval evaluation. In MIR '08: Proceedings of the 2008 ACM ICMIR, New York, NY, USA, 2008. ACM. 5.1 Google ScholarDigital Library
- H. Jégou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In ECCV, 2008. 2, 5.1, 2Google ScholarDigital Library
- H. Jégou, M. Douze, and C. Schmid. Improving bag-of-feature for large scale image search. IJCV, 87(3):316--336, 2010. 2 Google ScholarDigital Library
- H. Jégou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. TPAMI, 33(1):117--128, 2011. 3.2 Google ScholarDigital Library
- R. M. Karp. Reducibility among combinatorial problems. Springer, 1972. 3.2Google ScholarCross Ref
- A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012. 2Google ScholarDigital Library
- Z. Liu, H. Li, W. Zhou, and Q. Tian. Embedding spatial context into inverted file for large-scale image search. In ACM Multimedia, 2012. 1, 2, 2 Google ScholarDigital Library
- D. G. Lowe. Distinctive image features from scale invariant keypoints. IJCV, 60(2):91--110, 2004. 1, 2 Google ScholarDigital Library
- K. Makino and T. Uno. New algorithms for enumerating all maximal cliques. In Algorithm Theory-SWAT 2004, pages 260--272. Springer, 2004. 1, 3.2Google ScholarCross Ref
- T. Mei, Y. Rui, S. Li, and Q. Tian. Multimedia search reranking: A literature survey. ACM Computing Survey, 2013. 1 Google ScholarDigital Library
- A. Y. Ng, M. I. Jordan, Y. Weiss, et al. On spectral clustering: Analysis and an algorithm. Advances in neural information processing systems, 2:849--856, 2002. 3.2Google ScholarDigital Library
- D. Nistér and H. Stewénius. Scalable recognition with a vocabulary tree. In CVPR, 2006. 1, 1, 1, 2, 4.1, 5.1, 5.2, 3 Google ScholarDigital Library
- F. Perronnin, J. Sánchez, and T. Mensink. Improving the fisher kernel for large-scale image classification. In ECCV, volume 4, pages 143--156, 2010. 1, 2 Google ScholarDigital Library
- J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR, 2007. 2, 2Google ScholarCross Ref
- J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In ICCV, 2003. 1, 2 Google ScholarDigital Library
- E. Tomita, A. Tanaka, and H. Takahashi. The worst-case time complexity for generating all maximal cliques and computational experiments. Theoretical Computer Science, 363(1):28--42, 2006. 1, 3.2 Google ScholarDigital Library
- A. Torralba, R. Fergus, and Y. Weiss. Small codes and large image databases for recognition. In CVPR, 2008. 1Google ScholarCross Ref
- L. Torresani, M. Szummer, and A. Fitzgibbon. Efficient object category recognition using classemes. In ECCV, pages 776--789, 2010. 1, 2, 3.1, 5.3 Google ScholarDigital Library
- Z. Wu, Q. Ke, M. Isard, and J. Sun. Bundling feature for large scale partial-duplicated web image search. In CVPR, 2009. 1, 2Google Scholar
- G. Ye, D. Liu, I.-H. Jhuo, and S.-F. Chang. Robust late fusion with rank minimization. In CVPR, 2012. 1, 1, 2Google Scholar
- S. Zhang, Q. Huang, G. Hua, S. Jiang, W. Gao, and Q. Tian. Building contextual visual vocabulary for large-scale image applications. In ACM Multimedia, 2010. 1, 1 Google ScholarDigital Library
- S. Zhang, Q. Tian, G. Hua, Q. Huang, and W. Gao. Descriptive visual words and visual phrases for image applications. In ACM Multimedia, 2009. 1, 2 Google ScholarDigital Library
- S. Zhang, Q. Tian, K. Lu, Q. Huang, and W. Gao. Edge-sift: Discriminative binary descriptor for scalable partial-duplicate mobile search. TIP, 2013. 1Google Scholar
- S. Zhang, M. Yang, T. Cour, K. Yu, and D. N. Metaxas. Query specific fusion for image retrieval. In ECCV, volume 2, pages 660--673, 2012. 1, 1, 2 Google ScholarDigital Library
- S. Zhang, M. Yang, X. Wang, Y. Lin, and Q. Tian. Sematnic-aware co-indexing for image retrieval. In ICCV, 2013. 2, 3.1 Google ScholarDigital Library
- Y. Zhang, Z. Jia, and T. Chen. Image retrieval with geometry-preserving visual phrases. In CVPR, 2011. 1, 2, 2Google ScholarDigital Library
Index Terms
- Superimage: Packing Semantic-Relevant Images for Indexing and Retrieval
Recommendations
Spatial similarity-based retrievals and image indexing by hierarchical decomposition
IDEAS '97: Proceedings of the 1997 International Symposium on Database Engineering & ApplicationsFor efficient search and spatial similarity based retrieval of image contents, the paper introduces a new symbolic image representation and indexing technique. In this technique, an image is recursively decomposed into a spatial arrangement of feature ...
Efficient indexing for large-scale image search
ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and ServiceIn content-based image retrieval, inverted indexes allow fast access to database images and summarize all knowledge about the database. Indexing multiple clues of image contents allows retrieval algorithms search for relevant images from different ...
Intelligent Indexing of Crime Scene Photographs
Current approaches in automatic text-based image indexing fail to capture semantic information that is expressed in image captions, but such indexing is important for high precision and recall in subsequent image retrieval. The indexing prototype ...
Comments