skip to main content
10.1145/3123266.3123301acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search

Authors Info & Claims
Published:19 October 2017Publication History

ABSTRACT

Content-based visual landmark search (CBVLS) enjoys great importance in many practical applications. In this paper, we propose a novel discrete hashing with pair-exemplar (DHPE) to support scalable and efficient large-scale CBVLS. Our approach mainly solves two essential problems in scalable landmark hashing: 1) Intra-landmark visual diversity, and 2) Discrete optimization of hashing codes. Motivated by the characteristic of landmark, we explore the consistent preferences of tourists on landmark as pair-exemplars for scalable discrete hashing learning. In this paper, a pair-exemplar is comprised of a canonical view and the corresponding representative tags. Canonical view captures the key visual component of landmarks, and representative tags potentially involve landmark-specific semantics that can cope with the visual variations of intra-landmark. Based on pair-exemplars, a unified hashing learning framework is formulated to combine visual preserving with exemplar graph and the semantic guidance from representative tags. Further, to guarantee direct semantic transfer for hashing codes and remove information redundancy, we design a novel optimization method based on augmented Lagrange multiplier to explicitly deal with the discrete constraint, the bit-uncorrelated constraint and balance constraint. The whole learning process has linear computation complexity and enjoys desirable scalability. Experiments demonstrate the superior performance of DHPE compared with state-of-the-art methods.

References

  1. Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. 2011. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers. Found. Trends Mach. Learn. Vol. 3, 1 (2011), 1--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Rick Chartrand. 2012. Nonconvex Splitting for Regularized Low-Rank Sparse Decomposition. IEEE Trans. Signal Process. Vol. 60, 11 (2012), 5810--5819. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Tao Chen, Kim-Hui Yap, and Dajiang Zhang. 2014. Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition. IEEE Trans. Multimedia Vol. 16, 3 (2014), 612--622. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Zhiyong Cheng and Jialie Shen. 2016. On very large scale test collection for landmark image search benchmarking. Signal Process. Vol. 124 (2016), 13 -- 26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. David Crandall, Yunpeng Li, Stefan Lee, and Daniel Huttenlocher. 2016. Recognizing landmarks in large-scale social image collections Visual Analysis and Geolocalization of Large Scale Imagery, Asaad Hakeem, Richard Szeliski, Mubarak Shah, Luc Van Gool, and Amir Zamir (Eds.). Springer.Google ScholarGoogle Scholar
  6. G. Ding, Y. Guo, J. Zhou, and Y. Gao. 2016. Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing. IEEE Trans. Image Process. Vol. 25, 11 (2016), 5427--5440. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Ling-Yu Duan, Jie Chen, Rongrong Ji, Tiejun Huang, and Wen Gao. 2013. Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search. AI Magazine, Vol. 34, 2 (2013), 67--85.Google ScholarGoogle ScholarCross RefCross Ref
  8. Y. Gong, S. Kumar, H. A. Rowley, and S. Lazebnik. 2013 a. Learning Binary Codes for High-Dimensional Data Using Bilinear Projections CVPR. 484--491. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yunchao Gong, S. Lazebnik, A. Gordo, and F. Perronnin. 2013 b. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 35, 12 (2013), 2916--2929. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Rongrong Ji, Ling-Yu Duan, Jie Chen, Hongxun Yao, Junsong Yuan, Yong Rui, and Wen Gao. 2012. Location Discriminative Vocabulary Coding for Mobile Landmark Search. Int. J. Comput. Vision Vol. 96, 3 (2012), 290--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable Graph Hashing with Feature Transformation. IJCAI. 2248--2254. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Z. Jin, C. Li, Y. Lin, and D. Cai. 2014. Density Sensitive Hashing. IEEE Trans. Cybern. Vol. 44, 8 (2014), 1362--1371.Google ScholarGoogle ScholarCross RefCross Ref
  13. Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column Sampling Based Discrete Supervised Hashing. AAAI. 1230--1236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Shaishav Kumar and Raghavendra Udupa. 2011. Learning Hash Functions for Cross-View Similarity Search. IJCAI. 1360--1365. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jingjing Li, Yue Wu, Jidong Zhao, and Ke Lu. 2016 a. Low-rank discriminant embedding for multiview learning. IEEE Trans. Cybern. (2016).Google ScholarGoogle Scholar
  16. Jingjing Li, Jidong Zhao, and Ke Lu. 2016 b. Joint Feature Selection and Structure Preservation for Domain Adaptation. IJCAI. 1697--1703. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Wei Liu, Junfeng He, and Shih-Fu Chang. 2010. Large Graph Construction for Scalable Semi-Supervised Learning ICML. 679--686. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Wei Liu, Wang Jun, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with Graphs. ICML. 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete Graph Hashing. In NIPS. 3419--3427. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yadan Luo, Yang Yang, Fumin Shen, Zi Huang, Pan Zhou, and Heng Tao Shen. 2017. Robust discrete code modeling for supervised hashing. Pattern Recognit. (2017).Google ScholarGoogle Scholar
  21. Yadong Mu, Wei Liu, Cheng Deng, Zongting Lv, and Xinbo Gao. 2016. Coordinate Discrete Optimization for Efficient Cross-View Image Retrieval IJCAI. 1860--1866. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Maxim Raginsky and Svetlana Lazebnik. 2009. Locality-sensitive binary codes from shift-invariant kernels NIPS. 1509--1517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Fumin Shen, Wei Liu, Shaoting Zhang, Yang Yang, and Heng Tao Shen. 2015 a. Learning Binary Codes for Maximum Inner Product Search ICCV. 4148--4156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015 b. Supervised Discrete Hashing. In CVPR. 37--45.Google ScholarGoogle Scholar
  25. F. Shen, X. Zhou, Y. Yang, J. Song, H. T. Shen, and D. Tao. 2016. A Fast Optimization Method for General Binary Code Learning. IEEE Trans. Image Process. Vol. 25, 12 (2016), 5610--5621. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Xiaoshuang Shi, Fuyong Xing, Jinzheng Cai, Zizhao Zhang, Yuanpu Xie, and Lin Yang. 2016. Kernel-Based Supervised Discrete Hashing for Image Retrieval ECCV. 419--433.Google ScholarGoogle Scholar
  27. Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR Vol. abs/1409.1556 (2014).Google ScholarGoogle Scholar
  28. Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media Hashing for Large-scale Retrieval from Heterogeneous Data Sources SIGMOD. 785--796. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Wang, T. Zhang, j. song, N. Sebe, and H. T. Shen. 2017. A Survey on Learning to Hash. IEEE Trans. Pattern Anal. Mach. Intell. (2017).Google ScholarGoogle Scholar
  30. Yair Weiss, Antonio Torralba, and Robert Fergus. 2008. Spectral Hashing NIPS. 1753--1760. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Tobias Weyand and Bastian Leibe. 2013. Discovering Details and Scene Structure with Hierarchical Iconoid Shift ICCV. 3479--3486. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Tobias Weyand and Bastian Leibe. 2015. Visual landmark recognition from Internet photo collections: A large-scale evaluation. Comput. Vis. Image Underst. Vol. 135 (2015), 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Yan Xia, K. He, P. Kohli, and J. Sun. 2015. Sparse projections for high-dimensional binary codes CVPR. 3332--3339.Google ScholarGoogle Scholar
  34. Liang Xie, Jialie Shen, and Lei Zhu. 2016 a. Online Cross-Modal Hashing for Web Image Retrieval AAAI. 294--300. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Liang Xie, Lei Zhu, and Guoqi Chen. 2016 b. Unsupervised multi-graph cross-modal hashing for large-scale multimedia retrieval. Multimedia Tools Appl. Vol. 75, 15 (2016), 9185--9204. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Liang Xie, Lei Zhu, Peng Pan, and Yansheng Lu. 2016 c. Cross-Modal Self-Taught Hashing for large-scale image retrieval. Signal Process. Vol. 124 (2016), 81--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Yang Yang, Yadan Luo, Weilun Chen, Fumin Shen, Jie Shao, and Heng Tao Shen. 2016 a. Zero-Shot Hashing via Transferring Supervised Knowledge MM. 1286--1295. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Yang Yang, Fumin Shen, Zi Huang, and Heng Tao Shen. 2016 b. A Unified Framework for Discrete Spectral Clustering IJCAI. 2273--2279. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Felix X. Yu, Sanjiv Kumar, Yunchao Gong, and Shih-Fu Chang. 2014. Circulant Binary Embedding. In ICML. 946--954. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014 a. Latent Semantic Sparse Hashing for Cross-modal Similarity Search SIGIR. 415--424. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Wengang Zhou, Ming Yang, Houqiang Li, Xiaoyu Wang, Yuanqing Lin, and Qi Tian. 2014 b. Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search. IEEE Trans. Multimedia Vol. 16, 3 (2014), 601--611. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Lei Zhu, Jialie She, Xiaobai Liu, Liang Xie, and Liqiang Nie. 2016. Learning Compact Visual Representation with Canonical Views for Robust Mobile Landmark Search. In IJCAI. 3959--3965. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Lei Zhu, Jialie Shen, Hai Jin, Liang Xie, and Ran Zheng. 2015 a. Landmark Classification With Hierarchical Multi-Modal Exemplar Feature. IEEE Trans. Multimedia Vol. 17, 7 (2015), 981--993.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Lei Zhu, Jialie Shen, Hai Jin, Ran Zheng, and Liang Xie. 2015 b. Content-Based Visual Landmark Search via Multimodal Hypergraph Learning. IEEE Trans. Cybern. Vol. 45, 12 (2015), 2756--2769.Google ScholarGoogle ScholarCross RefCross Ref
  45. L. Zhu, J. Shen, L. Xie, and Z. Cheng. 2016. Unsupervised Topic Hypergraph Hashing for Efficient Mobile Image Retrieval. IEEE Trans. Cybern. (2016).Google ScholarGoogle Scholar
  46. Lei Zhu, Jialie Shen, Liang Xie, and Zhiyong Cheng 2017. Unsupervised Visual Hashing with Semantic Assistant for Content-Based Image Retrieval. IEEE Trans. on Knowl. and Data Eng. Vol. 29, 2 (2017), 472--486. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Xiaofeng Zhu, Zi Huang, Heng Tao Shen, and Xin Zhao. 2013. Linear Cross-modal Hashing for Efficient Multimedia Search MM. 143--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. X. Zhu, L. Zhang, and Z. Huang. 2014. A Sparse Embedding and Least Variance Encoding Approach to Hashing. IEEE Trans. Image Process. Vol. 23, 9 (2014), 3737--3750.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MM '17: Proceedings of the 25th ACM international conference on Multimedia
        October 2017
        2028 pages
        ISBN:9781450349062
        DOI:10.1145/3123266

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 19 October 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MM '17 Paper Acceptance Rate189of684submissions,28%Overall Acceptance Rate995of4,171submissions,24%

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader