skip to main content
10.1145/2733373.2806309acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model

Authors Info & Claims
Published:13 October 2015Publication History

ABSTRACT

Online heterogenous data is springing up while the data has the rich auxiliary information (e.g. pictures and videos) around the text. However, traditional topic models are suffering from the limitations to discover the topics effectively from the cross-media data. Incorporating with the convolutional neural network (CNN) feature, we propose a novel image dominant topic model, which projects both the text modality and the visual modality into a semantic simplex. Further, an improved CNN feature is introduced to capture more visual details by fusing the convolutional layer and fully-connected layer. Experimental comparisons with state-of-the-art methods in the cross-media topic detection task show the effectiveness of our model.

References

  1. D. M. Blei and M. I. Jordan. Modeling annotated data. In ACM SIGIR, pages 127--134, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Cao, Y. D. Zhang, Y. C. Song, Z. N. Chen, X. Zhang, and J. T. Li. Mcg-webv: A benchmark dataset for web video analysis. Beijing: Institute of Computing Technology, 10:324--334, 2009.Google ScholarGoogle Scholar
  4. J. Chang and D. M. Blei. Relational topic models for document networks. In AISTATS, pages 81--88, 2009.Google ScholarGoogle Scholar
  5. T. L. Griffiths and M. Steyvers. Finding scientific topics. NAS, 101(suppl 1):5228--5235, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. Hofmann. Probabilistic latent semantic indexing. In ACM SIGIR, pages 50--57, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In ACM Multimedia, pages 675--678, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, pages 1097--1105, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. P. Minka. Expectation propagation for approximate bayesian inference. In UAI, pages 362--369, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Z. X. Niu, G. Hua, X. B. Gao, and Q. Tian. Semi-supervised relational topic model for weakly annotated image recognition in social media. In IEEE CVPR, pages 4233--4240, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In UAI, pages 487--494, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Wang, D. M. Blei, and F. F. Li. Simultaneous image classification and annotation. In IEEE CVPR, pages 1903--1910, 2009.Google ScholarGoogle Scholar
  13. Y. Wang, J. Liu, J. S. Qu, Y. L. Huang, J. M. Chen, and X. Feng. Hashtag graph based topic model for tweet mining. In IEEE ICDM, pages 1025--1030, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. D. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In ECCV, pages 818--833. 2014.Google ScholarGoogle ScholarCross RefCross Ref
  15. Y. Zheng, Y. J. Zhang, and L. Hugo. A deep and autoregressive approach for topic modeling of multimodal data. arXiv preprint:1409.3970, 2014.Google ScholarGoogle Scholar

Index Terms

  1. Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '15: Proceedings of the 23rd ACM international conference on Multimedia
      October 2015
      1402 pages
      ISBN:9781450334594
      DOI:10.1145/2733373

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 October 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      MM '15 Paper Acceptance Rate56of252submissions,22%Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader