ABSTRACT
We present a classifier that discriminates between types of corrections made by teachers of English in student essays. We define a set of linguistically motivated feature templates for a log-linear classification model, train this classifier on sentence pairs extracted from the Cambridge Learner Corpus, and achieve 89% accuracy improving upon a 33% baseline. Furthermore, we incorporate our classifier into a novel application that takes as input a set of corrected essays that have been sentence aligned with their originals and outputs the individual corrections classified by error type. We report the F-Score of our implementation on this task.
- Robert Dale and Adam Kilgarriff. 2010. Helping our own: text massaging for computational linguistics as a new shared task. In Proceedings of the 6th International Natural Language Generation Conference, INLG '10, pages 263--267, Stroudsburg, PA, USA. Association for Computational Linguistics. Google ScholarDigital Library
- Erin Fitzgerald, Frederick Jelinek, and Keith Hall. 2009. Integrating sentence- and word-level error identification for disfluency correction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, EMNLP '09, pages 765--774, Stroudsburg, PA, USA. Association for Computational Linguistics. Google ScholarDigital Library
- Michael Gamon. 2011. High-order sequence modeling for language learner error detection. In Proceedings of the 6th Workshop on Innovative Use of NLP for Building Educational Applications, IUNLPBEA '11, pages 180--189, Stroudsburg, PA, USA. Association for Computational Linguistics. Google ScholarDigital Library
- Andrew Kachites McCallum. 2002. Mallet: A machine learning for language toolkit. http://www.cs.umass.edu/mccallum/mallet.Google Scholar
- D. Nicholls. 2003. The cambridge learner corpus: Error coding and analysis for lexicography and elt. In Proceedings of the Corpus Linguistics 2003 conference, pages 572--581.Google Scholar
- A. Rozovskaya, M. Sammons, J. Gioja, and D. Roth. 2011. University of illinois system in hoo text correction shared task.Google Scholar
- Sara Stymne. 2011. Blast: A tool for error analysis of machine translation output. In ACL (System Demonstrations), pages 56--61. Google ScholarDigital Library
- Randy West, Y. Albert Park, and Roger Levy. 2011. Bilingual random walk models for automated grammar correction of esl author-produced text. In Proceedings of the 6th Workshop on Innovative Use of NLP for Building Educational Applications, IUNLPBEA '11, pages 170--179, Stroudsburg, PA, USA. Association for Computational Linguistics. Google ScholarDigital Library
- Elif Yamangil and Stuart M. Shieber. 2010. Bayesian synchronous tree-substitution grammar induction and its application to sentence compression. In ACL, pages 937--947. Google ScholarDigital Library
- Helen Yannakoudakis, Ted Briscoe, and Ben Medlock. 2011. A new dataset and method for automatically grading esol texts. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, HLT '11, pages 180--189, Stroudsburg, PA, USA. Association for Computational Linguistics. Google ScholarDigital Library
- Correction detection and error type selection as an ESL educational aid
Recommendations
A hybrid model for spelling error detection and correction for Urdu language
AbstractDetecting and correcting misspelled words in a written text are of great importance in many natural language processing applications. Errors can be broadly classified into two groups, namely spelling error and contextual errors. Spelling errors ...
Adaptive Spelling Error Correction Models for Learner English
Spelling errors are a characteristic of learner English and degrade the performances of natural language processing systems targeting English learners. This paper describes a method specially designed for automatically correcting spelling errors in ...
Statistical Context-Sensitive Spelling Correction Using Typing Error Rate
CSE '13: Proceedings of the 2013 IEEE 16th International Conference on Computational Science and EngineeringError words that appear in Korean texts can be largely categorized into non-word spelling errors and context-sensitive spelling errors. Of the two, context-sensitive spelling errors are shown only when considering the meaning of the word in the given ...
Comments