ABSTRACT
The Proposition Bank (PropBank) project is aimed at creating a corpus of text annotated with information about semantic propositions. The second phase of the project, PropBank II adds additional levels of semantic annotation which include eventuality variables, co-reference, coarse-grained sense tags, and discourse connectives. This paper presents the results of the parallel PropBank II project, which adds these richer layers of semantic annotation to the first 100K of the Chinese Treebank and its English translation. Our preliminary analysis supports the hypothesis that this additional annotation reconciles many of the surface differences between the two languages.
- Olga Babko-Malaya and Martha Palmer. 2005. Proposition Bank II: Delving Deeper. In Frontiers in Corpus Annotation, Workshop in conjunction with HLT/NAACL 2004, Boston, Massachusetts.Google Scholar
- Olga Babko-Malaya, Martha Palmer, Nianwen Xue, Aravind Joshi, and Seth Kulick. 2004. Exploiting Interactions between Different Types of Semantic Annotation. In Proceeding of ICWS-6, Tilburg, The Netherlands.Google Scholar
- C. Baker, C. Fillmore, and J. Lowe. 1998. The berkeley framenet project. In Proceedings of COLING-ACL, Singapore. Google ScholarDigital Library
- E. Charniak. 2001. Immediate-head Parsing for Language Models. In ACL-01. Google ScholarDigital Library
- Michael Collins. 1999. Head-driven Statistical Models for Natural Language Parsing. Ph.D. thesis, University of Pennsylvania. Google ScholarDigital Library
- M. Ellsworth, K. Erk, P. Kingsbury, and S. Pado. 2004. PropBank, SALSA and FrameNet: How design determines product. In Proceedings of the LREC 2004 Workshop on Building Lexical Resources from Semantically Annotated Corpora, Lisbon, Portugal.Google Scholar
- Charles J. Fillmore and B. T. Atkins. 1998. FrameNet and lexical relevantce. In Proceedings of the First International Conference on Language Resources and Evaluation, Granada, Spain.Google Scholar
- Eva Hajicova and Iyona Kucerova. 2002. Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study. In Proceedings of the Third International Conference on Language Resources and Evaluation, pages 846--851.Google Scholar
- Christopher R. Johnson, Charles J. Fillmore, Miriam R. L. Petruck, Collin Baker, Michael Ellsworth, Josef Ruppenhofer, and Esther J. Wood. 2002. FrameNet: Theory and Practice, Version 1.0, www.icsi.berkeley.edu/framenet.Google Scholar
- M. Marcus, B. Santorini, and M. A. Marcinkiewicz. 1993. Building a Large Annotated Corpus of English: the Penn Treebank. Computational Linguistics. Google ScholarDigital Library
- Mitchell Marcus, Grace Kim, Mary Ann Marcinkiewicz, et al. 1994. The Penn Treebank: Annotating Predicate Argument Structure. In Proc of ARPA speech and Natural language workshop.Google ScholarDigital Library
- E. Miltsakaki, R. Prasad, A. Joshi, and B. Webber. 2004. The Penn Discourse Treebank. In Proceedings of the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal.Google Scholar
- Martha Palmer, Olga Babko-Malaya, and Hoa Dang. 2004. Different Sense Granularities for Different Applications. In Proceedings of the 2nd Workshop on Scalable Natural Language Understanding Systems, Boston, Mass.Google Scholar
- Martha Palmer, Dan Gildea, and Paul Kingsbury. 2005. The proposition bank: An annotated corpus of semantic roles. Computational Linguistics, 31(1). Google ScholarDigital Library
- Martha Palmer, Hoa Trang Dang, and Christiane Fell-baum. to appear. Making fine-grained and coarsegrained sense distinctions, both manually and automatically. Journal of Natural Language Engineering.Google Scholar
- Nianwen Xue and Martha Palmer. 2003. Annotating the Propositions in the Penn Chinese Treebank. In The Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan. Google ScholarDigital Library
- Nianwen Xue, Fei Xia, Fu dong Chiou, and Martha Palmer. To appear. The Penn Chinese Treebank: Phrase Structure Annotation of a Large Corpus. Natural Language Engineering. Google ScholarDigital Library
- Nianwen Xue. To appear. Annotating the Discourse Connectives in the Chinese Treebank. In Proceedings of the ACL Workshop on Frontiers in Corpus Annotation, Ann Arbor, Michigan. Google ScholarDigital Library
- A parallel Proposition Bank II for Chinese and English
Recommendations
Automatic construction of English/Chinese parallel corpora
As the demand for global information increases significantly, multilingual corpora has become a valuable linguistic resource for applications to cross-lingual information retrieval and natural language processing. In order to cross the boundaries that ...
Building a Chinese-English wordnet for translingual applications
A WordNet-like linguistic resource is useful, but difficult to construct. This article proposes a method to integrate five linguistic resources, including English/Chinese sense-tagged corpora, English/Chinese thesauruses, and a bilingual dictionary. ...
Automatic construction of an English-Chinese bilingual FrameNet
HLT-NAACL-Short '04: Proceedings of HLT-NAACL 2004: Short PapersWe propose a method of automatically constructing an English-Chinese bilingual FrameNet where the English FrameNet lexical entries are linked to the appropriate Chinese word senses. This resource can be used in machine translation and cross-lingual IR ...
Comments