skip to main content
Natural language parsing as statistical pattern recognition
Publisher:
  • Stanford University
  • 408 Panama Mall, Suite 217
  • Stanford
  • CA
  • United States
Order Number:UMI Order No. GAX94-22102
Bibliometrics
Skip Abstract Section
Abstract

Traditional natural language parsers are based on rewrite rule systems developed in an arduous, time-consuming manner by grammarians. A majority of the grammarian's efforts are devoted to the disambiguation process, first hypothesizing rules which dictate constituent categories and relationships among words in ambiguous sentences, and then seeking exceptions and corrections to these rules.

In this work, I propose an automatic method for acquiring a statistical parser from a set of parsed sentences which takes advantage of some initial linguistic input, but avoids the pitfalls of the iterative and seemingly endless grammar development process. Based on distributionally-derived and linguistically-based features of language, this parser acquires a set of statistical decision trees which assign a probability distribution on the space of parse trees given the input sentence. These decision trees take advantage of significant amount of contextual information, potentially including all of the lexical information in the sentence, to produce highly accurate statistical models of the disambiguation process. By basing the disambiguation criteria selection on entropy reduction rather than human intuition, this parser development method is able to consider more sentences than a human grammarian can when making individual disambiguation rules.

In experiments between a parser, acquired using this statistical framework, and a grammarian's rule-based parser, developed over a ten-year period, both using the same training material and test sentences, the decision tree parser significantly outperformed the grammar-based parser on the accuracy measure which the grammarian was trying to maximize, achieving an accuracy of 78% compared to the grammar-based parser's 69%.

Cited By

  1. El-taher A, Abo Bakr H, Zidan I and Shaalan K (2014). An Arabic CCG approach for determining constituent types from Arabic Treebank, Journal of King Saud University - Computer and Information Sciences, 26:4, (441-449), Online publication date: 1-Dec-2014.
  2. Ivanova A, Oepen S, Øvrelid L and Flickinger D Who did what to whom? Proceedings of the Sixth Linguistic Annotation Workshop, (2-11)
  3. Luo X and Zhao B A statistical tree annotator and its applications Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, (1230-1238)
  4. Manning C Part-of-speech tagging from 97% to 100% Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I, (171-189)
  5. Tse D and Curran J Chinese CCGbank Proceedings of the 23rd International Conference on Computational Linguistics, (1083-1091)
  6. Filimonov D and Harper M A joint language model with fine-grain syntactic tags Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3, (1114-1123)
  7. Surdeanu M, Johansson R, Meyers A, Màrquez L and Nivre J The CoNLL-2008 shared task on joint parsing of syntactic and semantic dependencies Proceedings of the Twelfth Conference on Computational Natural Language Learning, (159-177)
  8. Johansson R and Nugues P The effect of syntactic representation on semantic role labeling Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1, (393-400)
  9. Schmid H and Laws F Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1, (777-784)
  10. Yih W, Goodman J, Vanderwende L and Suzuki H Multi-document summarization by maximizing informative content-words Proceedings of the 20th international joint conference on Artifical intelligence, (1776-1782)
  11. Wang M, Sagae K and Mitamura T A fast, accurate deterministic parser for Chinese Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, (425-432)
  12. Rus V, McCarthy P and Graesser A Analysis of a textual entailer Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing, (287-298)
  13. Rus V and Graesser A Deeper natural language processing for evaluating student answers in intelligent tutoring systems proceedings of the 21st national conference on Artificial intelligence - Volume 2, (1495-1500)
  14. Erdogan H, Sarikaya R, Chen S, Gao Y and Picheny M (2005). Using semantic analysis to improve speech recognition performance, Computer Speech and Language, 19:3, (321-343), Online publication date: 1-Jul-2005.
  15. O'Donovan R, Burke M, Cahill A, Van Genabith J and Way A (2005). Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks, Computational Linguistics, 31:3, (329-366), Online publication date: 1-Sep-2005.
  16. Cahill A, Burke M, O'Donovan R, van Genabith J and Way A Long-distance dependency resolution in automatically acquired wide-coverage PCFG-based LFG approximations Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, (319-es)
  17. O'Donovan R, Burke M, Cahill A, van Genabith J and Way A Large-scale induction and evaluation of lexical resources from the Penn-II treebank Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, (367-es)
  18. Sarikaya R, Gao Y and Picheny M A comparison of rule-based and statistical methods for semantic language modeling and confidence measurement Proceedings of HLT-NAACL 2004: Short Papers, (65-68)
  19. Gao Y, Zhou B, Diao Z, Sorensen J and Picheny M (2019). MARS, Machine Translation, 17:3, (185-212), Online publication date: 18-Dec-2002.
  20. Schmid H A generative probability model for unification-based grammars Proceedings of the 19th international conference on Computational linguistics - Volume 1, (1-7)
  21. Blaheta D Handling noisy training and testing data Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, (111-116)
  22. Gao Y, Sorensen J, Erdogan H, Sarikaya R, Liu F, Picheny M, Zhou B and Diao Z A trainable approach for multi-lingual speech-to-speech translation system Proceedings of the second international conference on Human Language Technology Research, (231-234)
  23. Kinyon A A language--independent shallow--parser compiler Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, (330-337)
  24. Van Uytsel D, Van Aelten F and Van Compernolle D A structured language model based on context-sensitive probabilistic left-corner parsing Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies, (1-8)
  25. Drábek E and Zhou Q Using co-occurrence statistics as an information source for partial parsing of Chinese Proceedings of the second workshop on Chinese language processing: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 12, (22-28)
  26. Cunningham H (2018). A definition and short history of Language Engineering, Natural Language Engineering, 5:1, (1-16), Online publication date: 1-Mar-1999.
  27. Daelemans W, Van Den Bosch A and Zavrel J (2019). Forgetting Exceptions is Harmful in Language Learning, Machine Language, 34:1-3, (11-41), Online publication date: 1-Feb-1999.
  28. Abney S, McAllester D and Pereira F Relating probabilistic grammars and automata Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, (542-549)
  29. Heeman P and Allen J (1999). Speech repairs, intonational phrases, and discourse markers, Computational Linguistics, 25:4, (527-571), Online publication date: 1-Dec-1999.
  30. McMahon J and Smith F (1998). A Review of Statistical Language Processing Techniques, Artificial Intelligence Review, 12:5, (347-391), Online publication date: 1-Oct-1998.
  31. Murthy S (1998). Automatic Construction of Decision Trees from Data, Data Mining and Knowledge Discovery, 2:4, (345-389), Online publication date: 1-Dec-1998.
  32. Wang Y and Waibel A Decoding algorithm in statistical machine translation Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, (366-372)
  33. Zavrel J and Daelemans W Memory-based learning Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, (436-443)
  34. Goodman J Parsing algorithms and metrics Proceedings of the 34th annual meeting on Association for Computational Linguistics, (177-183)
  35. Chen S and Goodman J An empirical study of smoothing techniques for language modeling Proceedings of the 34th annual meeting on Association for Computational Linguistics, (310-318)
  36. Ushioda A Hierarchical clustering of words Proceedings of the 16th conference on Computational linguistics - Volume 2, (1159-1162)
  37. McMahon J and Smith F (1996). Improving statistical language model performance with automatically generated word hierarchies, Computational Linguistics, 22:2, (217-247), Online publication date: 1-Jun-1996.
  38. Magerman D Statistical decision-tree models for parsing Proceedings of the 33rd annual meeting on Association for Computational Linguistics, (276-283)
  39. Magerman D (1995). Review of "Statistical language learning" by Eugene Charniak. The MIT Press 1993., Computational Linguistics, 21:1, (103-111), Online publication date: 1-Mar-1995.
  40. Ratnaparkhi A, Reynar J and Roukos S A maximum entropy model for prepositional phrase attachment Proceedings of the workshop on Human Language Technology, (250-255)
  41. Jelinek F, Lafferty J, Magerman D, Mercer R, Ratnaparkhi A and Roukos S Decision tree parsing using a hidden derivation model Proceedings of the workshop on Human Language Technology, (272-277)
Contributors
  • Advanced Telecommunications Research Institute International (ATR)

Index Terms

  1. Natural language parsing as statistical pattern recognition

    Recommendations