DOI: 10.1145/2461466.2461473
research-article

Learning attribute-aware dictionary for image classification and search

Published: 16 April 2013

ABSTRACT

The bag-of-visual-words (BoW) model has recently been widely advocated for image classification and search. However, one critical limitation of existing BoW models is their lack of semantic information. To alleviate this issue, it is imperative to construct a semantic-aware visual dictionary. In this paper, we propose a novel approach for learning a visual word dictionary that embeds intermediate-level semantics. Specifically, we first introduce an Attribute-aware Dictionary Learning (AttrDL) scheme to learn multiple sub-dictionaries with specific semantic meanings. We divide the training images into different sets, each representing a specific attribute. For each image set, an attribute-aware sub-vocabulary is learned. Hence, the resulting sub-vocabularies are more discriminative for semantics than traditional vocabularies. Second, to obtain a semantic-aware and discriminative BoW representation with the learned sub-vocabularies, we adopt the idea of L2,1-norm regularized sparse coding and recode the resulting sparse representation of each image. Experimental results show that the proposed scheme outperforms state-of-the-art algorithms in both image classification and search tasks.
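The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how each sub-vocabulary is learned or how the L2,1-regularized problem is solved, so the sketch assumes plain k-means per attribute set and a proximal-gradient (ISTA-style) solver. All function names, the attribute names in the usage note, and the parameters `words_per_attribute`, `lam`, and `iters` are hypothetical.

```python
import numpy as np


def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns a (k, d) array of centroids."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every descriptor to every centroid.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    return centers


def learn_sub_vocabularies(descriptors_by_attribute, words_per_attribute):
    """Learn one sub-vocabulary per attribute set and stack them.

    Assumption: k-means stands in for whatever learner the paper uses.
    Returns the stacked dictionary (K, d), the attribute index of each
    visual word, and the attribute names in stacking order.
    """
    names = sorted(descriptors_by_attribute)
    blocks = [kmeans(descriptors_by_attribute[a], words_per_attribute)
              for a in names]
    groups = np.repeat(np.arange(len(names)), words_per_attribute)
    return np.vstack(blocks), groups, names


def l21_norm(A):
    """L2,1 norm: sum of the Euclidean norms of the rows of A."""
    return np.linalg.norm(A, axis=1).sum()


def prox_l21(A, t):
    """Row-wise soft-thresholding, the proximal operator of t * ||A||_{2,1}."""
    norms = np.linalg.norm(A, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - t / np.maximum(norms, 1e-12))
    return A * scale


def l21_sparse_code(X, D, lam=0.1, iters=200):
    """Minimise 0.5 * ||X^T - D^T A||_F^2 + lam * ||A||_{2,1} over A.

    X: (n, d) local descriptors of one image; D: (K, d) dictionary.
    A: (K, n) codes; each row collects one visual word's responses, so
    the penalty switches off whole words for the image.
    """
    A = np.zeros((D.shape[0], X.shape[0]))
    lipschitz = max(np.linalg.norm(D, 2) ** 2, 1e-12)
    step = 1.0 / lipschitz
    for _ in range(iters):
        grad = D @ (D.T @ A - X.T)
        A = prox_l21(A - step * grad, step * lam)
    return A
```

As a usage example, passing a mapping such as `{"furry": F, "metallic": M}` (each value an array of local descriptors, both names invented for illustration) to `learn_sub_vocabularies` yields a stacked dictionary `D`; `l21_sparse_code(X, D)` then returns codes whose rows can be pooled, for instance with `np.abs(A).max(axis=1)`, into one BoW vector per image. The pooling choice is likewise an assumption.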

    • Published in

      ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
      April 2013
      362 pages
      ISBN: 9781450320337
      DOI: 10.1145/2461466

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      ICMR '13 Paper Acceptance Rate: 38 of 96 submissions, 40%
      Overall Acceptance Rate: 254 of 830 submissions, 31%
