skip to main content
10.5555/1654494.1654507dlproceedingsArticle/Chapter ViewAbstractPublication PagesparsingConference Proceedingsconference-collections
research-article
Free Access

A classifier-based parser with linear run-time complexity

Published:09 October 2005Publication History

ABSTRACT

We present a classifier-based parser that produces constituent trees in linear time. The parser uses a basic bottom-up shift-reduce algorithm, but employs a classifier to determine parser actions instead of a grammar. This can be seen as an extension of the deterministic dependency parser of Nivre and Scholz (2004) to full constituent parsing. We show that, with an appropriate feature set used in classification, a very simple one-path greedy parser can perform at the same level of accuracy as more complex parsers. We evaluate our parser on section 23 of the WSJ section of the Penn Treebank, and obtain precision and recall of 87.54% and 87.61%, respectively.

References

  1. Charniak, E. 2000. A maximum-entropy-inspired parser. Proceedings of the First Annual Meeting of the North American Chapter of the Association for Computational Linguistics. Seattle, WA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Collins, M. 1997. Three generative, lexicalized models for statistical parsing. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (pp. 16--23). Madrid, Spain. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Daelemans, W., Zavrel, J., van der Sloot, K., and van den Bosch, A. 2004. TiMBL: Tilburg Memory Based Learner, version 5.1, reference guide. ILK Research Group Technical Report Series no. 04--02, 2004.Google ScholarGoogle Scholar
  4. Gildea, D., and Palmer, M. 2002. The necessity of syntactic parsing for predicate argument recognition. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp. 239--246). Philadelphia, PA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Kalt, T. 2004. Induction of greedy controllers for deterministic treebank parsers. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. Barcelona, Spain.Google ScholarGoogle Scholar
  6. Kudo, T., and Matsumoto, Y. 2004. A boosting algorithm for classification of semi-structured text. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. Barcelona, Spain.Google ScholarGoogle Scholar
  7. Kudo, T., and Matsumoto, Y. 2001. Chunking with support vector machines. Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics. Pittsburgh, PA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Johnson, M. 1998. PCFG models of linguistic tree representations. Computational Linguistics, 24:613--632. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Marcus, M. P., Santorini, B., and Marcinkiewics, M. A. 1993. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Nivre, J., and Scholz, M. 2004. Deterministic dependency parsing of English text. Proceedings of the 20th International Conference on Computational Linguistics (pp. 64--70). Geneva, Switzerland. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ratnaparkhi, A. 1997. A linear observed time statistical parser based on maximum entropy models. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing. Providence, Rhode Island.Google ScholarGoogle Scholar
  12. Veenstra, J., van den Bosch, A. 2000. Single-classifier memory-based phrase chunking. Proceedings of Fourth Workshop on Computational Natural Language Learning (CoNLL 2000). Lisbon, Portugal. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Wong, A., and Wu. D. 1999. Learning a lightweight robust deterministic parser. Proceedings of the Sixth European Conference on Speech Communication and Technology. Budapest.Google ScholarGoogle ScholarCross RefCross Ref
  14. Yamada, H., and Matsumoto, Y. 2003. Statistical dependency analysis with support vector machines. Proceedings of the Eighth International Workshop on Parsing Technologies. Nancy, France.Google ScholarGoogle Scholar
  15. Zhang, T., Damerau, F., and Johnson, D. 2002. Text chunking using regularized winnow. Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics. Tolouse, France. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. A classifier-based parser with linear run-time complexity

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image DL Hosted proceedings
          Parsing '05: Proceedings of the Ninth International Workshop on Parsing Technology
          October 2005
          214 pages

          Publisher

          Association for Computational Linguistics

          United States

          Publication History

          • Published: 9 October 2005

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader