Corpus-based Techniques for Word Sense Disambiguation

Corpus-based Techniques for Word Sense DisambiguationDecember 1997

December 1997

1997 Technical Report

Author:
Gina Levow

Publisher:

Massachusetts Institute of Technology
201 Vassar Street, W59-200 Cambridge, MA
United States

Published:01 December 1997

Bibliometrics

Abstract

The need for robust and easily extensible systems for word sense disambiguation coupled with successes in training systems for a variety of tasks using large on-line corpora has led to extensive research into corpus-based statistical approaches to this problem. Promising results have been achieved by vector space representations of context, clustering combined with a semantic knowledge base, and decision lists based on collocational relations. We evaluate these techniques with respect to three important criteria: how their definition of context affects their ability to incorporate different types of disambiguating information, how they define similarity among senses, and how easily they can generalize to new senses. The strengths and weaknesses of these systems provide guidance for future systems which must capture and model a variety of disambiguating information, both syntactic and semantic.

Cited By

Contributors

Gina Anne Levow
University of Washington
- Publication Years1995 - 2020
- Publication counts34
- Citation count311
- Available for Download20
- Downloads (cumulative)5,415
- Downloads (12 months)541
- Downloads (6 weeks)88
- Average Downloads per Article271
- Average Citation per Article9
View Full Profile

Recommendations

A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation

Word Sense Disambiguation (WSD) aims to automatically predict the correct sense of a word used in a given context. All human languages exhibit word sense ambiguity, and resolving this ambiguity can be difficult. Standard benchmark resources are required ...
Read More
A word sense disambiguation corpus for Urdu
Abstract
The aim of word sense disambiguation (WSD) is to correctly identify the meaning of a word in context. All natural languages exhibit word sense ambiguities and these are often hard to resolve automatically. Consequently WSD is considered an ...
Read More
Word Sense Disambiguation Corpus Development for Romanian Language
Abstract
Research in the area of the interconnection of lexical resources represents a real challenge, because it addresses the difficult problem of semantic understanding and, more precisely, the disambiguation of the meaning of the words - Word Sense ...
Read More

Comments

Browse Reports

Sections

Cited By

A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation

A word sense disambiguation corpus for Urdu

Word Sense Disambiguation Corpus Development for Romanian Language

Save to Binder

Sections

Cited By

Save to Binder

Recommendations

A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation

A word sense disambiguation corpus for Urdu

Word Sense Disambiguation Corpus Development for Romanian Language