Robust semantic role labeling using parsing variations and semantic classes
Publisher:
  • University of Pennsylvania
  • Computer and Information Science Dept. 2000 South 33rd St. Philadelphia, PA
  • United States
ISBN: 978-0-549-12260-9
Order Number: AAI3271840
Pages: 149
Abstract

Correctly identifying semantic entities and disambiguating the relations between them and their predicates is an important and necessary step for successful natural language processing applications, such as text summarization, question answering, and machine translation. Researchers have studied this problem, semantic role labeling (SRL), as a machine learning task since 2000, when large-scale corpora annotated with arguments for a broad range of predicates became available. However, even after combining several SRL systems with an optimal global inference algorithm, SRL performance appears to have reached a plateau.

SRL systems typically rely on an upstream syntactic parser to gather argument candidates. We believe this one-way relationship is the bottleneck of semantic role labeling, and we attempt to tackle it by training parsers better suited to the SRL task. We incorporated semantic role annotation directly into the parse tree annotation and trained different types of parsers on this data. We found that our maximum-entropy-style parser (Ratnaparkhi, 1999) derived more benefit from the additional features than our Collins-style parser, based on Dan Bikel's implementation (Collins, 1999; Bikel, 2004). It also demonstrated better adaptability when ported to a different genre (the Brown corpus), outscoring the Collins-style parser on Brown SRL by 10%.
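
To make the idea concrete, the following is a minimal sketch, not the thesis code, of folding PropBank-style role markups into Treebank node labels so that a constituent parser can be retrained on semantically augmented trees. It assumes NLTK's Tree class, and the tree positions and role names in the toy example are illustrative.

    # Minimal sketch: append role suffixes (e.g. "-ARG0") to constituent
    # labels before parser training. Positions and roles below are
    # illustrative assumptions, not the thesis's actual annotation scheme.
    from nltk import Tree

    def augment_with_roles(tree, role_by_position):
        """Append a role suffix to every constituent whose tree position
        appears in role_by_position (a mapping from node positions to
        PropBank role names)."""
        for pos in tree.treepositions():
            node = tree[pos]
            if isinstance(node, Tree) and pos in role_by_position:
                node.set_label(node.label() + "-" + role_by_position[pos])
        return tree

    # Toy sentence: "The board approved the merger."
    t = Tree.fromstring(
        "(S (NP (DT The) (NN board)) (VP (VBD approved) (NP (DT the) (NN merger))))"
    )
    roles = {(0,): "ARG0", (1, 1): "ARG1"}  # subject NP and object NP
    print(augment_with_roles(t, roles))
    # -> (S (NP-ARG0 ...) (VP (VBD approved) (NP-ARG1 ...)))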

A thorough error analysis indicated that a better route to creating a suitable syntactic parser for the task of semantic role labeling is to create training data that is more consistent and less contradictory. We then carefully examined different types of Treebank/PropBank mismatches, and both the Treebank and PropBank teams made changes in order to bring the two resources into synchronization. A preliminary assessment of the merged data, comparing SRL performance on the old and new 300k data sets, indicates that the noisy-data problem may persist because the synchronization is not yet complete.
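
A brief illustration of how such mismatches can be detected may help here. The sketch below, which is not the thesis tooling, flags PropBank arguments whose token spans do not correspond to any single constituent in the Treebank parse; the example tree and spans are illustrative assumptions.

    # Minimal sketch: flag PropBank argument spans that do not line up with
    # any single Treebank constituent. Tree and spans are toy examples.
    from nltk import Tree

    def constituent_spans(tree, start=0, spans=None):
        """Recursively collect the (start, end) token span (end exclusive)
        covered by every constituent."""
        if spans is None:
            spans = set()
        offset = start
        for child in tree:
            if isinstance(child, Tree):
                constituent_spans(child, offset, spans)
                offset += len(child.leaves())
            else:
                offset += 1
        spans.add((start, offset))
        return spans

    def mismatched_args(tree, prop_args):
        """Return the arguments whose spans match no constituent."""
        spans = constituent_spans(tree)
        return [(label, span) for label, span in prop_args if span not in spans]

    t = Tree.fromstring(
        "(S (NP (DT The) (NN board)) (VP (VBD approved) (NP (DT the) (NN merger))))"
    )
    # Hypothetical argument spans over tokens 0..4; (2, 4) crosses brackets.
    args = [("ARG0", (0, 2)), ("ARG1", (2, 4))]
    print(mismatched_args(t, args))  # [('ARG1', (2, 4))]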

In order to achieve system robustness, we created a new set of semantic roles by transforming verb-specific PropBank roles into less verb-dependent thematic roles based on the mapping between PropBank and VerbNet. Our hypothesis is that a set of less verb-dependent roles should be easier to learn and should port better to different genres. We compared the performance of SRL systems trained on the different sets of semantic roles, and the results confirm the hypothesis: the new system ports better to novel text. On a subtask comparing one overloaded PropBank role to its mapped thematic roles, the new system trained on the WSJ corpus gains a 6% performance improvement on the test set extracted from the WSJ and a 10% improvement on the new genres from the Brown corpus.
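
The role transformation itself can be pictured as a table lookup keyed on a (roleset, PropBank argument) pair. The sketch below is a hedged illustration of that idea, not the thesis pipeline: the mapping entries stand in for the actual PropBank-to-VerbNet mapping and are assumptions made for the example.

    # Minimal sketch: replace verb-specific PropBank labels with less
    # verb-dependent thematic roles via a (roleset, argument) lookup.
    # The mapping entries are illustrative, not the real mapping data.
    PB_TO_VN = {
        ("give.01", "ARG0"): "Agent",
        ("give.01", "ARG1"): "Theme",
        ("give.01", "ARG2"): "Recipient",
        ("cut.01", "ARG2"): "Instrument",  # one way an overloaded ARG2 splits
    }

    def map_role(roleset, pb_role):
        """Return the thematic role for a (roleset, PropBank role) pair,
        falling back to the original label when no mapping is known."""
        return PB_TO_VN.get((roleset, pb_role), pb_role)

    labeled = [("give.01", "ARG2"), ("cut.01", "ARG2")]
    print([map_role(r, a) for r, a in labeled])  # ['Recipient', 'Instrument']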

Syntactic parsing is the bottleneck of semantic role labeling, and robustness is the ultimate goal. In this thesis, we investigate ways to train a better syntactic parser and to increase SRL system robustness. We demonstrate that parse trees augmented with semantic role markups can serve as suitable training data for the parser of an SRL system. Furthermore, we show that by resolving the discrepancies between the Penn Treebank and PropBank, it is possible to create a cleaner corpus for training both the parsers and the SRL systems. For system robustness, we propose a new set of semantic roles, transformed from the original argument roles based on the mapping between VerbNet and PropBank, that is easier to learn; the new roles are less verb-dependent than the original PropBank roles. As a result, the SRL system trained on the new roles achieves significantly better robustness than the original system.

Contributors
  • University of Colorado Boulder
  • University of Pennsylvania
