skip to main content
research-article

Learning to Recommend Descriptive Tags for Questions in Social Forums

Published:01 January 2014Publication History
Skip Abstract Section

Abstract

Around 40% of the questions in the emerging social-oriented question answering forums have at most one manually labeled tag, which is caused by incomprehensive question understanding or informal tagging behaviors. The incompleteness of question tags severely hinders all the tag-based manipulations, such as feeds for topic-followers, ontological knowledge organization, and other basic statistics. This article presents a novel scheme that is able to comprehensively learn descriptive tags for each question. Extensive evaluations on a representative real-world dataset demonstrate that our scheme yields significant gains for question annotation, and more importantly, the whole process of our approach is unsupervised and can be extended to handle large-scale data.

References

  1. Sameer Agarwal, Kristin Branson, and Serge Belongie. 2006. Higher order learning with graphs. In Proceedings of the International Conference on Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Morgan Ames and Mor Naaman. 2007. Why we tag: Motivations for annotation in mobile and online media. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Christopher H. Brooks and Nancy Montanez. 2006. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proceedings of the International Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Gustavo Carneiro and Nuno Vasconcelos. 2005. Formulating semantic image annotation as a supervised learning problem. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, and Christopher D. Manning. 2009. Discriminative reordering with Chinese grammatical relations features. In Proceedings of the Workshop on Syntax and Structure in Statistical Translation. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: A real-world Web image database from National University of Singapore. In Proceeding of the ACM International Conference on Image and Video Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Pinar Duygulu, Kobus Barnard, Nando de Freitas, and David Forsyth. 2002. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In Proceedings of the European Conference on Computer Vision. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Zhouyu Fu, Guojun Lu, Kai ming Ting, and Dengsheng Zhang. 2011. A survey of audio-based music classification and annotation. IEEE Trans. Multimedia 13, 2, 303--319. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yue Gao, Meng Wang, Zheng-Jun Zha, Jialie Shen, Xuelong Li, and Xindong Wu. 2012. Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22, 1, 363--376. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Scott A. Golder and Bernardo A. Huberman. 2006. Usage patterns of collaborative tagging systems. J. Inf. Sci. 32, 2, 198--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Winston H. Hsu, Lyndon S. Kennedy, and Shih-Fu Chang. 2007. Video search reranking through random walk over document-level context graph. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yuchi Huang, Qingshan Liu, Shaoting Zhang, and Dimitris N. Metaxas. 2010. Image retrieval via probabilistic hypergraph ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle Scholar
  13. Jiwoon Jeon, Victor Lavrenko, and R. Manmatha. 2003. Automatic image annotation and retrieval using cross-media relevance models. In Proceedings of the International ACM SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Feng Kang, Rong Jin, and Rahul Sukthankar. 2006. Correlated label propagation with application to multi-label learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Xirong Li, Cees G. M. Snoek, and Marcel Worring. 2009. Annotating images by harnessing worldwide user-tagged photos. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Dong Liu, Xian-Sheng Hua, Linjun Yang, Meng Wang, and Hong-Jiang Zhang. 2009. Tag ranking. In Proceedings of the International Conference on World Wide Web.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Dong Liu, Xian-Sheng Hua, Meng Wang, and Hong-Jiang Zhang. 2010. Image retagging. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Hans Peter Luhn. 1958. The automatic creation of literature abstracts. IBM J. Res. Develop. 2, 2, 159--165. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Gilad Mishne. 2006. Autotag: A collaborative approach to automated tag assignment for weblog posts. In Proceedings of the International Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Florent Monay and Daniel Gatica-Perez. 2004. Plsa-based image auto-annotation: Constraining the latent space. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Yasuhide Hironobu Mori, Hironobu Takahashi, and Ryuichi Oka. 1999. Image-to-word transformation based on dividing and vector quantizing images with words. In Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management.Google ScholarGoogle Scholar
  22. Sascha Narr, Ernesto William De Luca, and Sahin Albayrak. 2011. Extracting semantic annotations from Twitter. In Proceedings of the Workshop on Exploiting Semantic Annotations in Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Liqiang Nie, Meng Wang, Zheng-Jun Zha, Guangda Li, and Tat-Seng Chua. 2011. Multimedia answering: Enriching text QA with media information. In Proceedings of the International ACM SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua. 2012a. Oracle in image search: A content-based approach to performance prediction. ACM Trans. Inf. Syst. 30, 2, Article 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua. 2012b. Harvesting visual concepts for image search with complex queries. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Liqiang Nie, Meng Wang, Yue Gao, Zheng-Jun Zha, and Tat-Seng Chua. 2013. Beyond text QA: Multimedia answer generation by harvesting Web information. IEEE Trans. Multimedia 15, 2, 426--441. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, and Hong-Jiang Zhang. 2007. Correlative multi-label video annotation. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Börkur Sigurbjörnsson and Roelof van Zwol. 2008. Flickr tag recommendation based on collective knowledge. In Proceedings of the International Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Sanjay Sood, Sara Owsley, Kristian Hammond, and Larry Birnbaum. 2007. Tagassist: Automatic tag suggestion for blog posts. In Proceedings of the International Conference on Weblogs and Social Media.Google ScholarGoogle Scholar
  30. Shankara B. Subramanya and Huan Liu. 2008. Socialtagger - Collaborative tagging for blogs in the long tail. In Proceedings of the ACM Workshop on Search in Social Media. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Jinhui Tang, Haojie Li, Guo-Jun Qi, and Tat-Seng Chua. 2010. Image annotation by graph-based inference with integrated multiple/single instance representations. IEEE Trans. Multimedia 12, 2, 131--141. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Xinmei Tian, Linjun Yang, Jingdong Wang, Yichen Yang, Xiuqing Wu, and Xian-Sheng Hua. 2008. Bayesian video search reranking. In Proceedings of the ACM International Conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Kai Wang, Zhaoyan Ming, and Tat-Seng Chua. 2009. A syntactic tree matching approach to finding similar questions in community-based QA services. In Proceedings of the International ACM SIGIR Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Matthijs J. Warrens. 2010. Inequalities between multi-rater kappas. Adv. Data Anal. Classification 4, 4, 271--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Pengcheng Wu, Steven Chu-Hong Hoi, Peilin Zhao, and Ying He. 2011. Mining social images with distance metric learning for automated image tagging. In Proceedings of the ACM International Conference on Web Search and Data Mining. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Wei Wu, Bin Zhang, and Mari Ostendorf. 2010. Automatic generation of personalized annotation tags for twitter users. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Yu Xiang, Xiangdong Zhou, Tat-Seng Chua, and Chong-Wah Ngo. 2009. A revisit of generative model for automatic image annotation using markov random fields. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  38. Zhichen Xu, Yun Fu, Jianchang Mao, and Difu Su. 2006. Towards the Semantic Web: Collaborative tag suggestions. In Proceedings of the International Conference on World Wide Web.Google ScholarGoogle Scholar
  39. Rong Yan, Alexander Hauptmann, and Rong Jin. 2003. Multimedia search with pseudo-relevance feedback. In Proceedings of the International Conference on Image and Video. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Changbo Yang, Ming Dong, and Jing Hua. 2006. Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Yang Yang, Yi Yang, and Heng Tao Shen. 2013. Effective transfer tagging from image to video. ACM Trans. Multimedia Comput. Commun. Appl. 9, 2, Article 14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Jun Yu, Dacheng Tao, and Meng Wang. 2012. Adaptive hypergraph learning and its application in image classification. IEEE Trans. Image Process. 21, 7, 3262--3272. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf. 2004. Learning with local and global consistency. In Proceedings of the Advances in Neural Information Processing Systems Conference.Google ScholarGoogle Scholar
  44. Dengyong Zhou, Jiayuan Huang, and Bernhard Schölkopf. 2006. Learning with hypergraphs: Clustering, classification, and embedding. In Proceedings of the Advances in Neural Information Processing Systems Conference.Google ScholarGoogle Scholar

Index Terms

  1. Learning to Recommend Descriptive Tags for Questions in Social Forums

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Information Systems
      ACM Transactions on Information Systems  Volume 32, Issue 1
      January 2014
      123 pages
      ISSN:1046-8188
      EISSN:1558-2868
      DOI:10.1145/2576772
      Issue’s Table of Contents

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 January 2014
      • Accepted: 1 November 2013
      • Revised: 1 September 2013
      • Received: 1 February 2013
      Published in tois Volume 32, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader