research-article

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search

Authors:
Lei Zhu

University of Queensland, Brisbane, QLD, Australia

University of Queensland, Brisbane, QLD, Australia
View Profile

,
Zi Huang

University of Queensland, Brisbane, QLD, Australia

University of Queensland, Brisbane, QLD, Australia
View Profile

,
Xiaojun Chang

Carnegie Mellon University, Pittssburgh, PA, USA

Carnegie Mellon University, Pittssburgh, PA, USA
View Profile

,
Jingkuan Song

University of Electronic Science and Technology of China, Chengdu, China

University of Electronic Science and Technology of China, Chengdu, China
View Profile

,
Heng Tao Shen

University of Electronic Science and Technology of China, Chengdu, China

University of Electronic Science and Technology of China, Chengdu, China
View Profile

MM '17: Proceedings of the 25th ACM international conference on MultimediaOctober 2017Pages 726–734https://doi.org/10.1145/3123266.3123301

Published:19 October 2017Publication History

MM '17: Proceedings of the 25th ACM international conference on Multimedia

Pages 726–734

ABSTRACT

Content-based visual landmark search (CBVLS) enjoys great importance in many practical applications. In this paper, we propose a novel discrete hashing with pair-exemplar (DHPE) to support scalable and efficient large-scale CBVLS. Our approach mainly solves two essential problems in scalable landmark hashing: 1) Intra-landmark visual diversity, and 2) Discrete optimization of hashing codes. Motivated by the characteristic of landmark, we explore the consistent preferences of tourists on landmark as pair-exemplars for scalable discrete hashing learning. In this paper, a pair-exemplar is comprised of a canonical view and the corresponding representative tags. Canonical view captures the key visual component of landmarks, and representative tags potentially involve landmark-specific semantics that can cope with the visual variations of intra-landmark. Based on pair-exemplars, a unified hashing learning framework is formulated to combine visual preserving with exemplar graph and the semantic guidance from representative tags. Further, to guarantee direct semantic transfer for hashing codes and remove information redundancy, we design a novel optimization method based on augmented Lagrange multiplier to explicitly deal with the discrete constraint, the bit-uncorrelated constraint and balance constraint. The whole learning process has linear computation complexity and enjoys desirable scalability. Experiments demonstrate the superior performance of DHPE compared with state-of-the-art methods.

References

Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. 2011. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers. Found. Trends Mach. Learn. Vol. 3, 1 (2011), 1--122. Google ScholarDigital Library
Rick Chartrand. 2012. Nonconvex Splitting for Regularized Low-Rank Sparse Decomposition. IEEE Trans. Signal Process. Vol. 60, 11 (2012), 5810--5819. Google ScholarDigital Library
Tao Chen, Kim-Hui Yap, and Dajiang Zhang. 2014. Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition. IEEE Trans. Multimedia Vol. 16, 3 (2014), 612--622. Google ScholarDigital Library
Zhiyong Cheng and Jialie Shen. 2016. On very large scale test collection for landmark image search benchmarking. Signal Process. Vol. 124 (2016), 13 -- 26. Google ScholarDigital Library
David Crandall, Yunpeng Li, Stefan Lee, and Daniel Huttenlocher. 2016. Recognizing landmarks in large-scale social image collections Visual Analysis and Geolocalization of Large Scale Imagery, Asaad Hakeem, Richard Szeliski, Mubarak Shah, Luc Van Gool, and Amir Zamir (Eds.). Springer.Google Scholar
G. Ding, Y. Guo, J. Zhou, and Y. Gao. 2016. Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing. IEEE Trans. Image Process. Vol. 25, 11 (2016), 5427--5440. Google ScholarDigital Library
Ling-Yu Duan, Jie Chen, Rongrong Ji, Tiejun Huang, and Wen Gao. 2013. Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search. AI Magazine, Vol. 34, 2 (2013), 67--85.Google ScholarCross Ref
Y. Gong, S. Kumar, H. A. Rowley, and S. Lazebnik. 2013 a. Learning Binary Codes for High-Dimensional Data Using Bilinear Projections CVPR. 484--491. Google ScholarDigital Library
Yunchao Gong, S. Lazebnik, A. Gordo, and F. Perronnin. 2013 b. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 35, 12 (2013), 2916--2929. Google ScholarDigital Library
Rongrong Ji, Ling-Yu Duan, Jie Chen, Hongxun Yao, Junsong Yuan, Yong Rui, and Wen Gao. 2012. Location Discriminative Vocabulary Coding for Mobile Landmark Search. Int. J. Comput. Vision Vol. 96, 3 (2012), 290--314. Google ScholarDigital Library
Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable Graph Hashing with Feature Transformation. IJCAI. 2248--2254. Google ScholarDigital Library
Z. Jin, C. Li, Y. Lin, and D. Cai. 2014. Density Sensitive Hashing. IEEE Trans. Cybern. Vol. 44, 8 (2014), 1362--1371.Google ScholarCross Ref
Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column Sampling Based Discrete Supervised Hashing. AAAI. 1230--1236. Google ScholarDigital Library
Shaishav Kumar and Raghavendra Udupa. 2011. Learning Hash Functions for Cross-View Similarity Search. IJCAI. 1360--1365. Google ScholarDigital Library
Jingjing Li, Yue Wu, Jidong Zhao, and Ke Lu. 2016 a. Low-rank discriminant embedding for multiview learning. IEEE Trans. Cybern. (2016).Google Scholar
Jingjing Li, Jidong Zhao, and Ke Lu. 2016 b. Joint Feature Selection and Structure Preservation for Domain Adaptation. IJCAI. 1697--1703. Google ScholarDigital Library
Wei Liu, Junfeng He, and Shih-Fu Chang. 2010. Large Graph Construction for Scalable Semi-Supervised Learning ICML. 679--686. Google ScholarDigital Library
Wei Liu, Wang Jun, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with Graphs. ICML. 1--8. Google ScholarDigital Library
Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete Graph Hashing. In NIPS. 3419--3427. Google ScholarDigital Library
Yadan Luo, Yang Yang, Fumin Shen, Zi Huang, Pan Zhou, and Heng Tao Shen. 2017. Robust discrete code modeling for supervised hashing. Pattern Recognit. (2017).Google Scholar
Yadong Mu, Wei Liu, Cheng Deng, Zongting Lv, and Xinbo Gao. 2016. Coordinate Discrete Optimization for Efficient Cross-View Image Retrieval IJCAI. 1860--1866. Google ScholarDigital Library
Maxim Raginsky and Svetlana Lazebnik. 2009. Locality-sensitive binary codes from shift-invariant kernels NIPS. 1509--1517. Google ScholarDigital Library
Fumin Shen, Wei Liu, Shaoting Zhang, Yang Yang, and Heng Tao Shen. 2015 a. Learning Binary Codes for Maximum Inner Product Search ICCV. 4148--4156. Google ScholarDigital Library
Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015 b. Supervised Discrete Hashing. In CVPR. 37--45.Google Scholar
F. Shen, X. Zhou, Y. Yang, J. Song, H. T. Shen, and D. Tao. 2016. A Fast Optimization Method for General Binary Code Learning. IEEE Trans. Image Process. Vol. 25, 12 (2016), 5610--5621. Google ScholarDigital Library
Xiaoshuang Shi, Fuyong Xing, Jinzheng Cai, Zizhao Zhang, Yuanpu Xie, and Lin Yang. 2016. Kernel-Based Supervised Discrete Hashing for Image Retrieval ECCV. 419--433.Google Scholar
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR Vol. abs/1409.1556 (2014).Google Scholar
Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media Hashing for Large-scale Retrieval from Heterogeneous Data Sources SIGMOD. 785--796. Google ScholarDigital Library
J. Wang, T. Zhang, j. song, N. Sebe, and H. T. Shen. 2017. A Survey on Learning to Hash. IEEE Trans. Pattern Anal. Mach. Intell. (2017).Google Scholar
Yair Weiss, Antonio Torralba, and Robert Fergus. 2008. Spectral Hashing NIPS. 1753--1760. Google ScholarDigital Library
Tobias Weyand and Bastian Leibe. 2013. Discovering Details and Scene Structure with Hierarchical Iconoid Shift ICCV. 3479--3486. Google ScholarDigital Library
Tobias Weyand and Bastian Leibe. 2015. Visual landmark recognition from Internet photo collections: A large-scale evaluation. Comput. Vis. Image Underst. Vol. 135 (2015), 1--15. Google ScholarDigital Library
Yan Xia, K. He, P. Kohli, and J. Sun. 2015. Sparse projections for high-dimensional binary codes CVPR. 3332--3339.Google Scholar
Liang Xie, Jialie Shen, and Lei Zhu. 2016 a. Online Cross-Modal Hashing for Web Image Retrieval AAAI. 294--300. Google ScholarDigital Library
Liang Xie, Lei Zhu, and Guoqi Chen. 2016 b. Unsupervised multi-graph cross-modal hashing for large-scale multimedia retrieval. Multimedia Tools Appl. Vol. 75, 15 (2016), 9185--9204. Google ScholarDigital Library
Liang Xie, Lei Zhu, Peng Pan, and Yansheng Lu. 2016 c. Cross-Modal Self-Taught Hashing for large-scale image retrieval. Signal Process. Vol. 124 (2016), 81--92. Google ScholarDigital Library
Yang Yang, Yadan Luo, Weilun Chen, Fumin Shen, Jie Shao, and Heng Tao Shen. 2016 a. Zero-Shot Hashing via Transferring Supervised Knowledge MM. 1286--1295. Google ScholarDigital Library
Yang Yang, Fumin Shen, Zi Huang, and Heng Tao Shen. 2016 b. A Unified Framework for Discrete Spectral Clustering IJCAI. 2273--2279. Google ScholarDigital Library
Felix X. Yu, Sanjiv Kumar, Yunchao Gong, and Shih-Fu Chang. 2014. Circulant Binary Embedding. In ICML. 946--954. Google ScholarDigital Library
Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014 a. Latent Semantic Sparse Hashing for Cross-modal Similarity Search SIGIR. 415--424. Google ScholarDigital Library
Wengang Zhou, Ming Yang, Houqiang Li, Xiaoyu Wang, Yuanqing Lin, and Qi Tian. 2014 b. Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search. IEEE Trans. Multimedia Vol. 16, 3 (2014), 601--611. Google ScholarDigital Library
Lei Zhu, Jialie She, Xiaobai Liu, Liang Xie, and Liqiang Nie. 2016. Learning Compact Visual Representation with Canonical Views for Robust Mobile Landmark Search. In IJCAI. 3959--3965. Google ScholarDigital Library
Lei Zhu, Jialie Shen, Hai Jin, Liang Xie, and Ran Zheng. 2015 a. Landmark Classification With Hierarchical Multi-Modal Exemplar Feature. IEEE Trans. Multimedia Vol. 17, 7 (2015), 981--993.Google ScholarDigital Library
Lei Zhu, Jialie Shen, Hai Jin, Ran Zheng, and Liang Xie. 2015 b. Content-Based Visual Landmark Search via Multimodal Hypergraph Learning. IEEE Trans. Cybern. Vol. 45, 12 (2015), 2756--2769.Google ScholarCross Ref
L. Zhu, J. Shen, L. Xie, and Z. Cheng. 2016. Unsupervised Topic Hypergraph Hashing for Efficient Mobile Image Retrieval. IEEE Trans. Cybern. (2016).Google Scholar
Lei Zhu, Jialie Shen, Liang Xie, and Zhiyong Cheng 2017. Unsupervised Visual Hashing with Semantic Assistant for Content-Based Image Retrieval. IEEE Trans. on Knowl. and Data Eng. Vol. 29, 2 (2017), 472--486. Google ScholarDigital Library
Xiaofeng Zhu, Zi Huang, Heng Tao Shen, and Xin Zhao. 2013. Linear Cross-modal Hashing for Efficient Multimedia Search MM. 143--152. Google ScholarDigital Library
X. Zhu, L. Zhang, and Z. Huang. 2014. A Sparse Embedding and Least Variance Encoding Approach to Hashing. IEEE Trans. Image Process. Vol. 23, 9 (2014), 3737--3750.Google ScholarCross Ref

Index Terms

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Visual content-based indexing and retrieval
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Image search

Recommendations

Searching for diversified landmarks by photo
MM '12: Proceedings of the 20th ACM international conference on Multimedia

This demo focuses on the problem of searching for diversified landmarks with photos as input. More particularly, we propose a system called DLMSearch that allows a user to upload a photo as a query and searches for a diverse set of relevant landmarks in ...
Read More
Supervised discrete hashing through similarity learning
Abstract
Supervised hashing has achieved better accuracy than unsupervised hashing in many practical applications owing to its use of semantic label information. However, the mutual relationship between semantic labels is always ignored when leveraging ...
Read More
Supervised Discrete Hashing With Mutual Linear Regression
MM '19: Proceedings of the 27th ACM International Conference on Multimedia

Supervised linear hashing can compress high-dimensional data into compact binary codes owing to its efficiency. Generally, the relation between label and hash codes is widely used in the existing hashing methods because of its effectiveness of improving ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
General Chairs:
Qiong Liu
FXPAL, USA
,
Rainer Lienhart
Universität Augsburg, Germany
,
Haohong Wang
TCL America, USA
,
Program Chairs:
Sheng-Wei "Kuan-Ta" Chen
Academia Sinica, Taiwan
,
Susanne Boll
University of Oldenburg, Germany
,
Phoebe Chen
La Trobe University, Australia
,
Gerald Friedland
Lawrence Livermore National Lab, USA
,
Jia Li
Google, USA
,
Shuicheng Yan
Qihoo 360, China
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 October 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
discrete hashing
landmark search
pair-exemplar
Qualifiers
- research-article
Conference

Acceptance Rates
MM '17 Paper Acceptance Rate189of684submissions,28%Overall Acceptance Rate995of4,171submissions,24%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 33
  Total Citations
  View Citations
- 290
  Total Downloads
- Downloads (Last 12 months)7
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search

MM '17: Proceedings of the 25th ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Searching for diversified landmarks by photo

Supervised discrete hashing through similarity learning

Supervised Discrete Hashing With Mutual Linear Regression