ABSTRACT
Online heterogenous data is springing up while the data has the rich auxiliary information (e.g. pictures and videos) around the text. However, traditional topic models are suffering from the limitations to discover the topics effectively from the cross-media data. Incorporating with the convolutional neural network (CNN) feature, we propose a novel image dominant topic model, which projects both the text modality and the visual modality into a semantic simplex. Further, an improved CNN feature is introduced to capture more visual details by fusing the convolutional layer and fully-connected layer. Experimental comparisons with state-of-the-art methods in the cross-media topic detection task show the effectiveness of our model.
- D. M. Blei and M. I. Jordan. Modeling annotated data. In ACM SIGIR, pages 127--134, 2003. Google ScholarDigital Library
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarDigital Library
- J. Cao, Y. D. Zhang, Y. C. Song, Z. N. Chen, X. Zhang, and J. T. Li. Mcg-webv: A benchmark dataset for web video analysis. Beijing: Institute of Computing Technology, 10:324--334, 2009.Google Scholar
- J. Chang and D. M. Blei. Relational topic models for document networks. In AISTATS, pages 81--88, 2009.Google Scholar
- T. L. Griffiths and M. Steyvers. Finding scientific topics. NAS, 101(suppl 1):5228--5235, 2004.Google ScholarCross Ref
- T. Hofmann. Probabilistic latent semantic indexing. In ACM SIGIR, pages 50--57, 1999. Google ScholarDigital Library
- Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In ACM Multimedia, pages 675--678, 2014. Google ScholarDigital Library
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, pages 1097--1105, 2012.Google ScholarDigital Library
- T. P. Minka. Expectation propagation for approximate bayesian inference. In UAI, pages 362--369, 2001. Google ScholarDigital Library
- Z. X. Niu, G. Hua, X. B. Gao, and Q. Tian. Semi-supervised relational topic model for weakly annotated image recognition in social media. In IEEE CVPR, pages 4233--4240, 2014. Google ScholarDigital Library
- M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In UAI, pages 487--494, 2004. Google ScholarDigital Library
- C. Wang, D. M. Blei, and F. F. Li. Simultaneous image classification and annotation. In IEEE CVPR, pages 1903--1910, 2009.Google Scholar
- Y. Wang, J. Liu, J. S. Qu, Y. L. Huang, J. M. Chen, and X. Feng. Hashtag graph based topic model for tweet mining. In IEEE ICDM, pages 1025--1030, 2014. Google ScholarDigital Library
- M. D. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In ECCV, pages 818--833. 2014.Google ScholarCross Ref
- Y. Zheng, Y. J. Zhang, and L. Hugo. A deep and autoregressive approach for topic modeling of multimodal data. arXiv preprint:1409.3970, 2014.Google Scholar
Index Terms
- Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model
Recommendations
Image-regulated graph topic model for cross-media topic detection
ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and ServiceIn recent years, pictures and videos have become ubiquitous on the Internet, which encourage the development of algorithm that analyze their semantic contents for detecting topics. Among them, topic modeling plays an essential role in discovering topics ...
Topic-driven reader comments summarization
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge managementReaders of a news article often read its comments contributed by other readers. By reading comments, readers obtain not only complementary information about this news article but also the opinions from other readers. However, the existing ranking ...
Research on Multi-document Summarization Based on LDA Topic Model
IHMSC '14: Proceedings of the 2014 Sixth International Conference on Intelligent Human-Machine Systems and Cybernetics - Volume 02Compared with VSM (Vector Space Model) and graph-ranking models, LDA (Latent Dirichlet Allocation) Model can discover latent topics in the corpus and latent topics are beneficial to use sentence-ranking mechanisms to form a good summary. In the paper, ...
Comments