skip to main content
10.1145/1835804.1835903acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Latent aspect rating analysis on review text data: a rating regression approach

Authors Info & Claims
Published:25 July 2010Publication History

ABSTRACT

In this paper, we define and study a new opinionated text data analysis problem called Latent Aspect Rating Analysis (LARA), which aims at analyzing opinions expressed about an entity in an online review at the level of topical aspects to discover each individual reviewer's latent opinion on each aspect as well as the relative emphasis on different aspects when forming the overall judgment of the entity. We propose a novel probabilistic rating regression model to solve this new text mining problem in a general way. Empirical experiments on a hotel review data set show that the proposed latent rating regression model can effectively solve the problem of LARA, and that the detailed analysis of opinions at the level of topical aspects enabled by the proposed model can support a wide range of application tasks, such as aspect opinion summarization, entity ranking based on aspect ratings, and analysis of reviewers rating behavior.

Skip Supplemental Material Section

Supplemental Material

kdd2010_wang_lar_01.mov

mov

70.7 MB

References

  1. Onix text retrieval toolkit stopword list. http://www.lextek.com/manuals/onix/stopwords1.html.Google ScholarGoogle Scholar
  2. D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. The Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Burges. A tutorial on support vector machines for pattern recognition. Data mining and knowledge discovery, 2(2):121--167, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/?cjlin/libsvm.Google ScholarGoogle Scholar
  5. H. Cui, V. Mittal, and M. Datar. Comparative experiments on sentiment classification for online product reviews. In Twenty-First National Conference on Artificial Intelligence, volume 21, page 1265, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. K. Dave, S. Lawrence, and D. M. Pennock. Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In WWW '03, pages 519--528, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Devitt and K. Ahmad. Sentiment polarity identification in financial news: A cohesion-based approach. In Proceedings of ACL'07, pages 984--991, 2007.Google ScholarGoogle Scholar
  8. A. Esuli and F. Sebastiani. SentiWordNet: A publicly available lexical resource for opinion mining. In Proceedings of LREC, volume 6, 2006.Google ScholarGoogle Scholar
  9. A. Goldberg and X. Zhu. Seeing stars when there arena2rt many stars: Graph-based semi-supervised learning for sentiment categorization. In HLT-NAACL 2006 Workshop on Textgraphs: Graph-based Algorithms for Natural Language Processing, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Hu and B. Liu. Mining and summarizing customer reviews. In W. Kim, R. Kohavi, J. Gehrke, and W. DuMouchel, editors, KDD, pages 168--177. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. Jarvelin and J. Kekalainen. IR evaluation methods for retrieving highly relevant documents. In Proceedings of SIGIR'00, pages 41--48. ACM, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. N. Jindal and B. Liu. Identifying comparative sentences in text documents. In Proceedings of SIGIR'06, pages 244--251, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Kim and C. Zhai. Generating Comparative Summaries of Contradictory Opinions in Text. In Proceedings of CIKM'09, pages 385--394, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Kim and E. Hovy. Determining the sentiment of opinions. In Proceedings of COLING, volume 4, pages 1367--1373, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. Lerman, S. Blair-Goldensohn, and R. T. McDonald. Sentiment summarization: Evaluating and learning user preferences. In EACL, pages 514--522, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Liu, M. Hu, and J. Cheng. Opinion observer: Analyzing and comparing opinions on the web. In WWW '05, pages 342--351, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In Proceedings of WWW'09, pages 131--140. ACM New York, NY, USA, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima. Mining product reputations on the web. In KDD '02, pages 341--349, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the ACL, pages 115--124, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? Sentiment classification using machine learning techniques. In EMNLP 2002, pages 79--86, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A.-M. Popescu and O. Etzioni. Extracting product features and opinions from reviews. In Proceedings of HLT '05, pages 339--346, Morristown, NJ, USA, 2005. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Porter. An algorithm for suffix stripping. Program, 14(3):130 -- 137, 1980.Google ScholarGoogle ScholarCross RefCross Ref
  23. B. Snyder and R. Barzilay. Multiple aspect ranking using the good grief algorithm. In Proceedings of NAACL HLT, pages 300--307, 2007.Google ScholarGoogle Scholar
  24. I. Titov and R. McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL '08, pages 308--316.Google ScholarGoogle Scholar
  25. Y. Yang and J. O.Pedersen. A comparative study on feature selection in text categorization. In Proceedings of ICML'97, pages 412 -- 420, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. L. Zhuang, F. Jing, and X. Zhu. Movie review mining and summarization. In Proceedings of CIKM 2006, page 50. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Latent aspect rating analysis on review text data: a rating regression approach

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 July 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader