Article

Free Access

The computation of word associations: comparing syntagmatic and paradigmatic approaches

Author:
Reinhard Rapp

University of Mainz, FASK, Germersheim, Germany

University of Mainz, FASK, Germersheim, Germany
View Profile

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1August 2002Pages 1–7https://doi.org/10.3115/1072228.1072235

Published:24 August 2002Publication History

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1

Pages 1–7

ABSTRACT

It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze the distribution of words in large text corpora. According to the law of association by contiguity, the acquisition of word associations can be explained by Hebbian learning. The free word associations as produced by subjects on presentation of single stimulus words can thus be predicted by applying first-order statistics to the frequencies of word co-occurrences as observed in texts. The generation of synonyms can also be conducted on co-occurrence data but requires second-order statistics. The reason is that synonyms rarely occur together but appear in similar lexical neighborhoods. Both approaches are systematically compared and are validated on empirical data. It turns out that for both tasks the performance of the statistical system is comparable to the performance of human subjects.

References

Agarwal, R. (1995). Semantic Feature Extraction from Technical Texts with Limited Human Intervention. Dissertation, Mississippi State University. Google ScholarDigital Library
Berland, M., Charniak, E. (1999). Finding Parts in Very Large Corpora. In: Proceedings of ACL 1999, College Park. 57--64. Google ScholarDigital Library
de Saussure, F. (1916/1996). Cours de linguistique générale. Paris: Payot.Google Scholar
Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1), 61--74. Google ScholarDigital Library
Grefenstette, G. (1993). Evaluation techniques for automatic semantic extraction: comparing syntactic and window based approaches. In: Proceedings of the Workshop on Acquisition of Lexical Knowledge from Text, Columbus, Ohio.Google Scholar
Grefenstette, G. (1994). Explorations in Automatic Thesaurus Discovery. Dordrecht: Kluwer. Google ScholarDigital Library
Kiss, G. R., Armstrong, C., Milroy, R., Piper, J. (1973). An associative thesaurus of English and its computer analysis. In: A. Aitken, R. Beiley and N. Hamilton-Smith (eds.): The Computer and Literary Studies, Edinburgh: University Press.Google Scholar
Landauer, T. K.; Dumais, S. T. (1997). A solution to Plato's problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211--240.Google ScholarCross Ref
Lin, D. (1998). Automatic Retrieval and Clustering of Similar Words. In: Proceedings of COLING-ACL 1998, Montreal, Vol. 2, 768--773. Google ScholarDigital Library
Rapp, R. (1996). Die Berechnung von Assoziationen. Hildesheim: Olms.Google Scholar
Rapp, R. (1999). Automatic identification of word translation from unrelated English and German corpora. In: Proceedings of ACL 1999, College Park. 519--526. Google ScholarDigital Library
Ruge, G. (1992). Experiments on Linguistically Based Term Associations. Information Processing & Management 28(3), 317--332. Google ScholarDigital Library
Ruge, G. (1995). Wortbedeutung und Termassoziation. Hildesheim: Olms.Google Scholar
Salton, G.; McGill, M. (1983). Introduction to Modern Information Retrieval. New York: McGraw-Hill. Google ScholarDigital Library
Schütze, H. (1997). Ambiguity Resolution in Language Learning: Computational and Cognitive Models. Stanford: CSLI Publications.Google Scholar
Smadja, F. (1993). Retrieving collocations from text: Xtract. Computational Linguistics 19(1), 143--177. Google ScholarDigital Library
Wettler, M.; Rapp, R. (1993). Computation of word associations based on the co-occurrences of words in large corpora. In: Proceedings of the 1st Workshop on Very Large Corpora: Columbus, Ohio, 84--93.Google Scholar
Wettler, M., Rapp, R., Ferber, R. (1993). Freie Assoziationen und Kontiguitäten von Wörtern in Texten. Zeitschrift für Psychologie, 201, 99--108.Google Scholar

The computation of word associations: comparing syntagmatic and paradigmatic approaches
1. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Korean Word Associations: The Linked Structures for Language Learning
ICALT '09: Proceedings of the 2009 Ninth IEEE International Conference on Advanced Learning Technologies

This paper briefly reports on a collection of Korean Word Associations (KorWA). Then by constructing a Korean semantic network based on the data, it identifies some structural features and discusses about a potential support for semantic study and ...
Read More
Two-Word Collocation Extraction Using Monolingual Word Alignment Method

Statistical bilingual word alignment has been well studied in the field of machine translation. This article adapts the bilingual word alignment algorithm into a monolingual scenario to extract collocations from monolingual corpus, based on the fact ...
Read More
Parsing, word associations and typical predicate-argument relations
HLT '89: Proceedings of the workshop on Speech and Natural Language

There are a number of collocational constraints in natural languages that ought to play a more important role in natural language parsers. Thus, for example, it is hard for most parsers to take advantage of the fact that wine is typically drunk, ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1
August 2002
1184 pages
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 24 August 2002
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,537of1,537submissions,100%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 16
  Total Citations
  View Citations
- 1,252
  Total Downloads
- Downloads (Last 12 months)43
- Downloads (Last 6 weeks)9
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

The computation of word associations: comparing syntagmatic and paradigmatic approaches

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1

ABSTRACT

References

Cited By

Recommendations

Korean Word Associations: The Linked Structures for Language Learning

Two-Word Collocation Extraction Using Monolingual Word Alignment Method

Parsing, word associations and typical predicate-argument relations

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

The computation of word associations: comparing syntagmatic and paradigmatic approaches

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1

ABSTRACT

References

Cited By

Recommendations

Korean Word Associations: The Linked Structures for Language Learning

Two-Word Collocation Extraction Using Monolingual Word Alignment Method

Parsing, word associations and typical predicate-argument relations

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media