DOI: 10.1145/1341531.1341561
research-article

A holistic lexicon-based approach to opinion mining

Published: 11 February 2008

ABSTRACT

One of the important types of information on the Web is the opinions expressed in user-generated content, e.g., customer reviews of products, forum posts, and blogs. In this paper, we focus on customer reviews of products. In particular, we study the problem of determining the semantic orientations (positive, negative, or neutral) of opinions expressed on product features in reviews. This problem has many applications, e.g., opinion mining, summarization, and search. Most existing techniques use a list of opinion (bearing) words, also called an opinion lexicon, for this purpose. Opinion words are words that express desirable states (e.g., great, amazing) or undesirable states (e.g., bad, poor). These approaches, however, all have some major shortcomings. In this paper, we propose a holistic lexicon-based approach to solving the problem by exploiting external evidence and the linguistic conventions of natural language expressions. This approach allows the system to handle context-dependent opinion words, which cause major difficulties for existing algorithms. It also deals with many special words, phrases, and language constructs that affect opinions through their linguistic patterns, and it has an effective function for aggregating multiple conflicting opinion words in a sentence. A system called Opinion Observer, based on the proposed technique, has been implemented. Experimental results using a benchmark product review data set and some additional reviews show that the proposed technique is highly effective. It outperforms existing methods significantly.
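
The abstract mentions a function for aggregating multiple conflicting opinion words in a sentence but does not define it. The sketch below illustrates one plausible form of such a function, assuming that each opinion word contributes its orientation (+1 or -1) divided by its token distance to the product feature, so that nearer words count more. The tiny lexicon, the negation list, and the names `LEXICON`, `NEGATIONS`, and `feature_orientation` are hypothetical and chosen only for illustration; this is not the Opinion Observer implementation.

```python
import re

# Hypothetical toy lexicon: +1 for desirable states, -1 for undesirable ones.
LEXICON = {"great": 1, "amazing": 1, "good": 1, "bad": -1, "poor": -1}
# Hypothetical negation words that flip the orientation of the next opinion word.
NEGATIONS = {"not", "no", "never"}


def feature_orientation(sentence, feature):
    """Return 'positive', 'negative', or 'neutral' for one feature word."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    if feature not in tokens:
        return "neutral"
    f_pos = tokens.index(feature)

    score = 0.0
    for i, tok in enumerate(tokens):
        orientation = LEXICON.get(tok)
        if orientation is None or i == f_pos:
            continue  # not an opinion word, or the feature word itself
        if i > 0 and tokens[i - 1] in NEGATIONS:
            orientation = -orientation  # simple negation rule (an assumption)
        # Assumed weighting: closer opinion words influence the feature more.
        score += orientation / abs(i - f_pos)

    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


review = "The picture is great but the battery is poor"
print(feature_orientation(review, "picture"))
print(feature_orientation(review, "battery"))
```

In this sentence both opinion words apply to both features, yet the distance weighting lets "great" dominate for "picture" and "poor" dominate for "battery". Context-dependent opinion words and the special language constructs the abstract refers to are outside the scope of this sketch.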


Published in

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining
February 2008
270 pages
ISBN: 9781595939272
DOI: 10.1145/1341531

Copyright © 2008 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States


Acceptance Rates

Overall acceptance rate: 498 of 2,863 submissions (17%)
