short-paper

Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model

Authors:
Zhiyi Wang, University of Chinese Academy of Sciences, Beijing, China
Liang Li, University of Chinese Academy of Sciences, Beijing, China
Qingming Huang, University of Chinese Academy of Sciences, Beijing, China

MM '15: Proceedings of the 23rd ACM international conference on Multimedia
October 2015, Pages 1171–1174
https://doi.org/10.1145/2733373.2806309

Published: 13 October 2015

ABSTRACT

Online heterogeneous data is growing rapidly, and text is increasingly surrounded by rich auxiliary information (e.g., pictures and videos). However, traditional topic models struggle to discover topics effectively from such cross-media data. By incorporating convolutional neural network (CNN) features, we propose a novel image-dominant topic model that projects both the text modality and the visual modality into a semantic simplex. Further, an improved CNN feature is introduced to capture more visual details by fusing the convolutional layer and the fully-connected layer. Experimental comparisons with state-of-the-art methods on the cross-media topic detection task show the effectiveness of our model.
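
The abstract does not say how the convolutional and fully-connected layers are fused, nor how features are mapped onto the simplex. The following is only a minimal sketch under stated assumptions: global max pooling over each convolutional channel, L2 normalization of each part, plain concatenation as the fusion step, and a softmax as a stand-in for a mapping onto the probability simplex. None of these choices is confirmed by the source, and every function name and toy value is hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit L2 norm (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def global_max_pool(conv_maps):
    """Reduce each channel's 2-D activation map to its single largest value."""
    return [max(max(row) for row in channel) for channel in conv_maps]


def fuse_features(conv_maps, fc_vector):
    """Assumed fusion: pooled conv features concatenated with FC features.

    Each part is L2-normalized first so that neither dominates by scale.
    """
    pooled = l2_normalize(global_max_pool(conv_maps))
    fc = l2_normalize(fc_vector)
    return pooled + fc


def to_simplex(scores):
    """Softmax: map arbitrary real scores to non-negative weights summing to 1."""
    shift = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    # Toy input: two 2x2 convolutional channels and a 3-dimensional FC vector.
    conv_maps = [[[0.0, 1.0], [2.0, 3.0]], [[4.0, 0.0], [0.0, 0.0]]]
    fc_vector = [1.0, 0.0, 0.0]
    fused = fuse_features(conv_maps, fc_vector)
    print("fused feature:", fused)
    print("point on simplex:", to_simplex(fused))
```

Normalizing each part before concatenation is one common way to keep layers with different activation ranges comparable; the paper may well use a different scheme.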

Index Terms

1. Information systems
  1. Information retrieval
    1. Document representation

Published in
MM '15: Proceedings of the 23rd ACM international conference on Multimedia
October 2015
1402 pages
ISBN: 9781450334594
DOI: 10.1145/2733373
General Chairs:
Xiaofang Zhou, The University of Queensland, Australia
Alan F. Smeaton, Dublin City University, Ireland
Qi Tian, The University of Texas at San Antonio, USA
Program Chairs:
Dick C.A. Bulterman, FXPAL, USA
Heng Tao Shen, The University of Queensland, Australia
Ketan Mayer-Patel, The University of North Carolina, USA
Shuicheng Yan, National University of Singapore, Singapore
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
convolutional neural networks
cross-media topic detection
topic model
Acceptance Rates
MM '15 Paper Acceptance Rate: 56 of 252 submissions, 22%
Overall Acceptance Rate: 995 of 4,171 submissions, 24%