ABSTRACT
We propose a method for extracting semantic orientations of words: desirable or undesirable. Regarding semantic orientations as spins of electrons, we use the mean field approximation to compute the approximate probability function of the system instead of the intractable actual probability function. We also propose a criterion for parameter selection on the basis of magnetization. Given only a small number of seed words, the proposed method extracts semantic orientations with high accuracy in the experiments on English lexicon. The result is comparable to the best value ever reported.
- Adam L. Berger, Stephen Della Pietra, and Vincent J. Della Pietra. 1996. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39--71. Google ScholarDigital Library
- David Chandler. 1987. Introduction to Modern Statistical Mechanics. Oxford University Press.Google Scholar
- Jim Cowie, Joe Guthrie, and Louise Guthrie. 1992. Lexical disambiguation using simulated annealing. In Proceedings of the 14th conference on Computational linguistics, volume 1, pages 359--365. Google ScholarDigital Library
- Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database, Language, Speech, and Communication Series. MIT Press.Google Scholar
- Stuart Geman and Donald Geman. 1984. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6:721--741.Google ScholarDigital Library
- Vasileios Hatzivassiloglou and Kathleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. In Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and the Eighth Conference of the European Chapter of the Association for Computational Linguistics, pages 174--181. Google ScholarDigital Library
- Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining (KDD-2004), pages 168--177. Google ScholarDigital Library
- Yukito Iba. 1999. The nishimori line and bayesian statistics. Journal of Physics A: Mathematical and General, pages 3875--3888.Google Scholar
- Junichi Inoue and Domenico M. Carlucci. 2001. Image restoration using the q-ising spin glass. Physical Review E, 64:036121-1-036121-18.Google ScholarCross Ref
- Jaap Kamps, Maarten Marx, Robert J. Mokken, and Maarten de Rijke. 2004. Using wordnet to measure semantic orientation of adjectives. In Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), volume IV, pages 1115--1118.Google Scholar
- Nozomi Kobayashi, Takashi Inui, and Kentaro Inui. 2001. Dictionary-based acquisition of the lexical knowledge for p/n analysis (in Japanese). In Proceedings of Japanese Society for Artificial Intelligence, SLUD-33, pages 45--50.Google Scholar
- David J. C. Mackay. 2003. Information Theory, Inference and Learning Algorithms. Cambridge University Press. Google ScholarDigital Library
- Jose L. Marroquin. 1985. Optimal bayesian estimators for image segmentation and surface reconstruction. Technical Report A.I. Memo 839, Massachusetts Institute of Technology. Google ScholarDigital Library
- Ellen Riloff, Janyce Wiebe, and Theresa Wilson. 2003. Learning subjective nouns using extraction pattern bootstrapping. In Proceedings of the Seventh Conference on Natural Language Learning (CoNLL-03), pages 25--32. Google ScholarDigital Library
- Helmut Schmid. 1994. Probabilistic part-of-speech tagging using decision trees. In Proceedings of International Conference on New Methods in Language Processing, pages 44--49.Google Scholar
- Philip J. Stone, Dexter C. Dunphy, Marshall S. Smith, and Daniel M. Ogilvie. 1966. The General Inquirer: A Computer Approach to Content Analysis. The MIT Press.Google Scholar
- Kazuyuki Tanaka, Junichi Inoue, and Mike Titterington. 2003. Probabilistic image processing by means of the bethe approximation for the q-ising model. Journal of Physics A: Mathematical and General, 36:11023--11035.Google ScholarCross Ref
- Peter D. Turney and Michael L. Littman. 2003. Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems, 21(4):315--346. Google ScholarDigital Library
- Jean Veronis and Nancy M. Ide. 1990. Word sense disambiguation with very large neural networks extracted from machine readable dictionaries. In Proceedings of the 13th Conference on Computational Linguistics, volume 2, pages 389--394. Google ScholarDigital Library
- Janyce M. Wiebe. 2000. Learning subjective adjectives from corpora. In Proceedings of the 17th National Conference on Artificial Intelligence (AAAI-2000), pages 735--740. Google ScholarDigital Library
- Extracting semantic orientations of words using spin model
Recommendations
Annotating words using wordnet semantic glosses
ICONIP'12: Proceedings of the 19th international conference on Neural Information Processing - Volume Part IVAn approach to the word sense disambiguation (WSD) relaying on the WordNet synsets is proposed. The method uses semantically tagged glosses to perform a process similar to the spreading activation in semantic network, creating ranking of the most ...
Identifying the semantic orientation of foreign words
HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2We present a method for identifying the positive or negative semantic orientation of foreign words. Identifying the semantic orientation of words has numerous applications in the areas of text classification, analysis of product review, analysis of ...
Spanish all-words semantic class disambiguation using Cast3LB corpus
MICAI'06: Proceedings of the 5th Mexican international conference on Artificial IntelligenceIn this paper, an approach to semantic disambiguation based on machine learning and semantic classes for Spanish is presented. A critical issue in a corpus-based approach for Word Sense Disambiguation (WSD) is the lack of wide-coverage resources to ...
Comments