ABSTRACT
Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet Allocation (LDA), called joint sentiment/topic model (JST), which detects sentiment and topic simultaneously from text. Unlike other machine learning approaches to sentiment classification which often require labeled corpora for classifier training, the proposed JST model is fully unsupervised. The model has been evaluated on the movie review dataset to classify the review sentiment polarity and minimum prior information have also been explored to further improve the sentiment classification accuracy. Preliminary experiments have shown promising results achieved by JST.
- A. Abbasi, H. Chen, and A. Salem. Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums. ACM Trans. Inf. Syst., 26(3):1--34, 2008. Google ScholarDigital Library
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, 2003. Google ScholarCross Ref
- J. Blitzer, M. Dredze, and F. Pereira. Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 440--447, Prague, Czech Republic, June 2007. Association for Computational Linguistics.Google Scholar
- Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. Identifying sources of opinions with conditional random fields and extraction patterns. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pages 355--362, Vancouver, British Columbia, Canada, October 2005. Association for Computational Linguistics. Google ScholarDigital Library
- K. Eguchi and V. Lavrenko. Sentiment retrieval using generative models. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pages 345--354, Sydney, Australia, July 2006. Association for Computational Linguistics. Google ScholarDigital Library
- T. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences, 101(90001):5228--5235, 2004.Google ScholarCross Ref
- S. Hayakawa and E. Ehrlich. Choose the right word: A contemporary guide to selecting the precise word for every situation. Collins, 1994.Google Scholar
- T. Hofmann. Probabilistic latent semantic indexing. In SIGIR '99: Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pages 50--57, New York, NY, USA, 1999. ACM. Google ScholarDigital Library
- N. Kaji and M. Kitsuregawa. Automatic construction of polarity-tagged corpus from html documents. In Proceedings of the COLING/ACL on Main conference poster sessions, pages 452--459, Morristown, NJ, USA, 2006. Association for Computational Linguistics. Google ScholarDigital Library
- A. Kennedy and D. Inkpen. Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence, 22(2):110--125, 2006.Google ScholarCross Ref
- S.-M. Kim and E. Hovy. Determining the sentiment of opinions. In COLING '04: Proceedings of the 20th international conference on Computational Linguistics, page 1367, Morristown, NJ, USA, 2004. Association for Computational Linguistics. Google ScholarDigital Library
- W. Li and A. McCallum. Pachinko allocation: Dag-structured mixture models of topic correlations. In ICML '06: Proceedings of the 23rd international conference on Machine learning, pages 577--584, 2006. Google ScholarDigital Library
- R. McDonald, K. Hannan, T. Neylon, M. Wells, and J. Reynar. Structured models for fine-to-coarse sentiment analysis. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 432--439, Prague, Czech Republic, June 2007. Association for Computational Linguistics.Google Scholar
- Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In WWW '07: Proceedings of the 16th international conference on World Wide Web, pages 171--180, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- B. Pang and L. Lee. A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, page 271, Morristown, NJ, USA, 2004. Association for Computational Linguistics. Google ScholarDigital Library
- B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: sentiment classification using machine learning techniques. In EMNLP '02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing, pages 79--86, Morristown, NJ, USA, 2002. Association for Computational Linguistics. Google ScholarDigital Library
- M. F. Porter. An algorithm for suffix stripping. pages 313--316, 1997. Google ScholarDigital Library
- M. Steyvers and T. Griffiths. Probabilistic Topic Models. Handbook of Latent Semantic Analysis, page 427, 2007.Google Scholar
- I. Titov and R. McDonald. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of ACL-08: HLT, pages 308--316, Columbus, Ohio, June 2008. Association for Computational Linguistics.Google Scholar
- I. Titov and R. McDonald. Modeling online reviews with multi-grain topic models. In WWW '08: Proceeding of the 17th international conference on World Wide Web, pages 111--120, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
- P. D. Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 417--424, Morristown, NJ, USA, 2001. Association for Computational Linguistics. Google ScholarDigital Library
- P. D. Turney and M. L. Littman. Unsupervised learning of semantic orientation from a hundred-billion-word corpus. CoRR, cs.LG/0212012, 2002.Google Scholar
- H. M. Wallach. Topic modeling: beyond bag-of-words. In ICML '06: Proceedings of the 23rd international conference on Machine learning, pages 977--984, 2006. Google ScholarDigital Library
- C. Whitelaw, N. Garg, and S. Argamon. Using appraisal groups for sentiment analysis. In CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management, pages 625--631, New York, NY, USA, 2005. ACM. Google ScholarDigital Library
- J. Zhao, K. Liu, and G. Wang. Adding redundant features for CRFs-based sentence sentiment classification. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 117--126, Honolulu, Hawaii, October 2008. Association for Computational Linguistics. Google ScholarDigital Library
Index Terms
- Joint sentiment/topic model for sentiment analysis
Recommendations
Topic sentiment change analysis
MLDM'11: Proceedings of the 7th international conference on Machine learning and data mining in pattern recognitionPublic opinions on a topic may change over time. Topic Sentiment change analysis is a new research problem consisting of two main components: (a) mining opinions on a certain topic, and (b) detect significant changes of sentiment of the opinions on the ...
Weakly Supervised Joint Sentiment-Topic Detection from Text
Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework called joint sentiment-topic (JST)...
Social sentiment sensor: a visualization system for topic detection and topic sentiment analysis on microblog
As a new form of social media, microblogging provides platform sharing, wherein users can share their feelings and ideas on certain topics. Bursty topics from microblogs are the results of the emerging issues that instantly attract more followers and ...
Comments