DOI: 10.3115/1073012.1073055

A language-independent shallow-parser compiler

Published: 06 July 2001

ABSTRACT

We present a rule-based shallow-parser compiler that can generate a robust shallow parser for any language, even in the absence of training data, from a very limited number of rules aimed at identifying constituent boundaries. We contrast our approach with other approaches to shallow parsing (i.e., finite-state and probabilistic methods). We present an evaluation of our tool on English (Penn Treebank) and French (the newspaper corpus "Le Monde") for several tasks (NP-chunking and "deeper" parsing).

Published in: ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, July 2001, 562 pages
Publisher: Association for Computational Linguistics, United States
Qualifiers: Article
Overall acceptance rate: 85 of 443 submissions, 19%
