skip to main content
10.5555/2145432.2145450dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free Access

Semi-supervised recursive autoencoders for predicting sentiment distributions

Published:27 July 2011Publication History

ABSTRACT

We introduce a novel machine learning framework based on recursive autoencoders for sentence-level prediction of sentiment label distributions. Our method learns vector space representations for multi-word phrases. In sentiment prediction tasks these representations outperform other state-of-the-art approaches on commonly used datasets, such as movie reviews, without using any pre-defined sentiment lexica or polarity shifting rules. We also evaluate the model's ability to predict sentiment distributions on a new dataset based on confessions from the experience project. The dataset consists of personal user stories annotated with multiple labels which, when aggregated, form a multinomial distribution that captures emotional reactions. Our algorithm can more accurately predict distributions over such labels compared to several competitive baselines.

References

  1. P. Beineke, T. Hastie, C. D. Manning, and S. Vaithyanathan. 2004. Exploring sentiment summarization. In Proceedings of the AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications.Google ScholarGoogle Scholar
  2. Y. Bengio, R. Ducharme, P. Vincent, and C. Janvin. 2003. A neural probabilistic language model. Journal of Machine Learning Research, 3:1137--1155. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. M. Blei, A. Y. Ng, and M. I. Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research., 3:993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y. Choi and C. Cardie. 2008. Learning with compositional semantics as structural inference for subsentential sentiment analysis. In EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. R. Collobert and J. Weston. 2008. A unified architecture for natural language processing: deep neural networks with multitask learning. In Proceedings of ICML, pages 160--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Das and M. Chen. 2001. Yahoo! for Amazon: Extracting market sentiment from stock message boards. In Proceedings of the Asia Pacific Finance Association Annual Conference (APFA).Google ScholarGoogle Scholar
  7. K. Dave, S. Lawrence, and D. M. Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of WWW, pages 519--528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. X. Ding, B. Liu, and P. S. Yu. 2008. A holistic lexicon-based approach to opinion mining. In Proceedings of the Conference on Web Search and Web Data Mining (WSDM). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. L. Elman. 1991. Distributed representations, simple recurrent networks, and grammatical structure. Machine Learning, 7(2--3):195--225. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Esuli and F. Sebastiani. 2007. Pageranking wordnet synsets: An application to opinion mining. In Proceedings of the Association for Computational Linguistics (ACL).Google ScholarGoogle Scholar
  11. C. Goller and A. Küchler. 1996. Learning task-dependent distributed representations by backpropagation through structure. In Proceedings of the International Conference on Neural Networks (ICNN-96).Google ScholarGoogle Scholar
  12. G. Grefenstette, Y. Qu, J. G. Shanahan, and D. A. Evans. 2004. Coupling niche browsers and affect analysis for an opinion mining application. In Proceedings of Recherche d'Information Assistée par Ordinateur (RIAO).Google ScholarGoogle Scholar
  13. D. Ikeda, H. Takamura, L. Ratinov, and M. Okumura. 2008. Learning to shift the polarity of words for sentiment classification. In IJCNLP.Google ScholarGoogle Scholar
  14. S. Kim and E. Hovy. 2007. Crystal: Analyzing predictive opinions on the web. In EMNLP-CoNLL.Google ScholarGoogle Scholar
  15. A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts. 2011. Learning accurate, compact, and interpretable tree annotation. In Proceedings of ACL.Google ScholarGoogle Scholar
  16. Y. Mao and G. Lebanon. 2007. Isotonic Conditional Random Fields and Local Sentiment Flow. In NIPS.Google ScholarGoogle Scholar
  17. P. Mirowski, M. Ranzato, and Y. LeCun. 2010. Dynamic auto-encoders for semantic indexing. In Proceedings of the NIPS 2010 Workshop on Deep Learning.Google ScholarGoogle Scholar
  18. T. Nakagawa, K. Inui, and S. Kurohashi. 2010. Dependency tree-based sentiment classification using CRFs with hidden variables. In NAACL, HLT. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Pang and L. Lee. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Pang and L. Lee. 2005. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In ACL, pages 115--124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. B. Pang and L. Lee. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1--2):1--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. Pang, L. Lee, and S. Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. W. Pennebaker, R. J. Booth, and M. E. Francis. 2007. Linguistic inquiry and word count: Liwc2007 operators manual. University of Texas.Google ScholarGoogle Scholar
  24. L. Polanyi and A. Zaenen. 2006. Contextual valence shifters.Google ScholarGoogle Scholar
  25. J. B. Pollack. 1990. Recursive distributed representations. Artificial Intelligence, 46:77--105, November. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. C. Potts. 2010. On the negativity of negation. In David Lutz and Nan Li, editors, Proceedings of Semantics and Linguistic Theory 20. CLC Publications, Ithaca, NY.Google ScholarGoogle Scholar
  27. B. Snyder and R. Barzilay. 2007. Multiple aspect ranking using the Good Grief algorithm. In HLT-NAACL.Google ScholarGoogle Scholar
  28. R. Socher, C. D. Manning, and A. Y. Ng. 2010. Learning continuous phrase representations and syntactic parsing with recursive neural networks. In Proceedings of the NIPS-2010 Deep Learning and Unsupervised Feature Learning Workshop.Google ScholarGoogle Scholar
  29. R. Socher, C. C. Lin, A. Y. Ng, and C. D. Manning. 2011. Parsing Natural Scenes and Natural Language with Recursive Neural Networks. In ICML.Google ScholarGoogle Scholar
  30. P. J. Stone. 1966. The General Inquirer: A Computer Approach to Content Analysis. The MIT Press.Google ScholarGoogle Scholar
  31. J. Turian, L. Ratinov, and Y. Bengio. 2010. Word representations: a simple and general method for semi-supervised learning. In Proceedings of ACL, pages 384--394. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. P. Turney. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. L. Velikovich, S. Blair-Goldensohn, K. Hannan, and R. McDonald. 2010. The viability of web-derived polarity lexicons. In NAACL, HLT. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. T. Voegtlin and P. Dominey. 2005. Linear Recursive Distributed Representations. Neural Networks, 18(7). Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. J. Wiebe, T. Wilson, and C. Cardie. 2005. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, 39.Google ScholarGoogle Scholar
  36. T. Wilson, J. Wiebe, and P. Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In HLT/EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. H. Yu and V. Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Semi-supervised recursive autoencoders for predicting sentiment distributions

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        EMNLP '11: Proceedings of the Conference on Empirical Methods in Natural Language Processing
        July 2011
        1647 pages
        ISBN:9781937284114

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 27 July 2011

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate73of234submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader