skip to main content
10.5555/1868720.1868732dlproceedingsArticle/Chapter ViewAbstractPublication PageslawConference Proceedingsconference-collections
research-article
Free Access

Complex predicates annotation in a corpus of Portuguese

Authors Info & Claims
Published:15 July 2010Publication History

ABSTRACT

We present an annotation scheme for the annotation of complex predicates, understood as constructions with more than one lexical unit, each contributing part of the information normally associated with a single predicate. We discuss our annotation guidelines of four types of complex predicates, and the treatment of several difficult cases, related to ambiguity, overlap and coordination. We then discuss the process of marking up the Portuguese CINTIL corpus of 1M tokens (written and spoken) with a new layer of information regarding complex predicates. We also present the outcomes of the annotation work and statistics on the types of CPs that we found in the corpus.

References

  1. }}A. Abeillé, D. Godard, and I. Sag, 1998. Complex Predicates in Nonderivational Syntax, volume 30 of Syntax and Semantics, chapter Two Kinds of Composition in French Complex predicates. San Diego Academic Press, San Diego.Google ScholarGoogle Scholar
  2. }}M. F. P. Bacelar do Nascimento, P. Marrafa, L. A. S. Pereira, R. Ribeiro, R. Veloso, and L. Wittmann. 1998. Le-parole - do corpus à modelização da informação lexical num sistema-multifunção. In Actas do XIII Encontro da Associação Portuguesa de Linguística, APL, pages 115--134, Lisboa.Google ScholarGoogle Scholar
  3. }}M. F. Bacelar do Nascimento, J. Bettencourt Gonçalves, R. Veloso, S. Antunes, F. Barreto, and R. Amaro, 2005. C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages, chapter The Portuguese Corpus, pages 163--207. Amsterdam/Philadelphia: John Benjamins Publishing Company, Studies in Corpus Linguistics. Editors: E. Cresti and M. Monegnia.Google ScholarGoogle Scholar
  4. }}M. F. Bacelar do Nascimento, 2000. Corpus, Méthodologie et Applications Linguistiques, chapter Corpus de Référence du Portugais Contemporain, pages 25--30. H. Champion et Presses Universitaires de Perpignan, Paris. Editor: M. Bilger.Google ScholarGoogle Scholar
  5. }}F. Barreto, A. Branco, E. Ferreira, A. Mendes, M. F. P. Bacelar do Nascimento, F. Nunes, and J. Silva. 2006. Open resources and tools for the shallow processing of portuguese. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC2006), Genoa, Italy.Google ScholarGoogle Scholar
  6. }}C. Bowern. 2006. Inter theorical approaches to complex verb constructions: position paper. In The Eleventh Biennal Rice University Linguistics Symposium.Google ScholarGoogle Scholar
  7. }}E. Carrilho and C. Magro, 2009. Syntactic Annotation System Manual of corpus CORDIAL-SIN. http://www.clul.ul.pt/sectores/variacao/cordialsin/Syntactic%20annotation%20manual.html.Google ScholarGoogle Scholar
  8. }}S. Cinková and V. Kolá&rcirc;ová. 2005. Nouns as components of support verb constructions in the prague dependency treebank. In Insight into Slovak and Czech Corpus Linguistics. Veda Bratislava.Google ScholarGoogle Scholar
  9. }}J. Cohen. 1960. A coefficient of agreement for nominal scales. Education and Psychological Measuremen, 20:37--46.Google ScholarGoogle ScholarCross RefCross Ref
  10. }}K. Erk, A. Kowalski, S. Padó, and M. Pinkal. 2003. Towards a resource for lexical semantics: A large german corpus with extensive semantic annotation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pages 537--544, Sapporo, Japan, July. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. }}C. Fellbaum, A. Geyken, A. Herold, F. Koerner, and G. Neumann. 2006. Corpus-based studies of german idioms and light verbs. International Journal of Lexicography, 19(4):349--360.Google ScholarGoogle ScholarCross RefCross Ref
  12. }}A. Gonçalves. 2002. The causee in the faire-inf construction of portuguese. Journal of Portuguese Linguistics.Google ScholarGoogle Scholar
  13. }}A. Gonçalves. 2003. Defectividade funcional e predicados complexos em estruturas de controlo do português. In I. Castro and I. Duarte, editors, Miscelnea de estudos em homenagem a Maria Helena Mira Mateus, volume I. Imprensa Nacional-Casa da Moeda.Google ScholarGoogle Scholar
  14. }}J. Grimshaw. 1988. Light verbs and marking. Linguistic Inquiry, 19(2):205--232.Google ScholarGoogle Scholar
  15. }}M. Gross. 1981. Les bases empiriques de la notion de prédicat sémantique. Langages, 63:7--52.Google ScholarGoogle ScholarCross RefCross Ref
  16. }}O. Jespersen. 1949. A Modern English Grammar on Historical Principles. Londres: George Allen & Unwin; Copenhaga: Ejnar Munksgaard.Google ScholarGoogle Scholar
  17. }}R. Johansson and P. Nugues. 2006. Automatic annotation for all semantic layers in FrameNet. In Proceedings of EACL-2006, Trento, Italy, April 15--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. }}C. R. Johnson and C. J. Fillmore. 2000. The framenet tagset for frame-semantic and syntactic coding of predicate-argument structure. In Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (ANLP-NAACL 2000), pages 56--62, Seattle WA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. }}R. Kayne. 1975. French Syntax: the Transformational Cycle. The MIT Press, Cambridge, Mass.Google ScholarGoogle Scholar
  20. }}M. Marcus, S. Santorini, and M. Marcinkiewicz. 1993. Building a Large Annotated Corpus of English: the Penn Treebank. Computational Linguistics, 19(2):313--330. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. }}M. Butt. 1995. The Structure of Complex Predicates in Urdu. Stanford, CA: CSLI Publications.Google ScholarGoogle Scholar
  22. }}A. Meyers. 2007. Annotation guidelines for nombank -- noun argument structure for propbank. Technical report, New York University. http://nlp.cs.nyu.edu/meyers/nombank/nombank-specs-2007.pdf.Google ScholarGoogle Scholar
  23. }}M. Mikulová, A. Bémová, J. Hajič, E. Hajicková, and J. Havelka et al. 2006. Annotation on the tectogrammatical level in the prague dependency treebank annotation manual. technical report. Technical Report UFAL CKL Technical Report TR-2006-35, ÚFAL MFF UK, Prague, Czech Rep.Google ScholarGoogle Scholar
  24. }}N. Xue. 2006. Annotating the predicate-argument structure of chinese nominalizations. In Proceedings of the LREC 2006, pages 1382--1387, Genoa, Italy.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image DL Hosted proceedings
    LAW IV '10: Proceedings of the Fourth Linguistic Annotation Workshop
    July 2010
    305 pages
    ISBN:9781932432725

    Publisher

    Association for Computational Linguistics

    United States

    Publication History

    • Published: 15 July 2010

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader