In:Learner Corpora in Language Testing and Assessment
Edited by Marcus Callies and Sandra Götz
[Studies in Corpus Linguistics 70] 2015
► pp. 35–58
Avalingua
Natural language processing for automatic error detection
Published online: 9 April 2015
https://doi.org/10.1075/scl.70.02gam
https://doi.org/10.1075/scl.70.02gam
The objective of this article is to present an automatic tool for detecting and
classifying grammatical errors in written language as well as to describe the
evaluation protocol we have carried out to measure its performance on learner
corpora. The tool was designed to detect and analyse the linguistic errors found
in text essays, assess the writing proficiency, and propose solutions with the aim
of improving the linguistic skills of students. It makes use of natural language
processing and knowledge-rich linguistic resources. So far, the tool has been
implemented for the Galician language. The system has been evaluated on two
learner corpora reaching 91% precision and 65% recall (76% F-score) for the
task of detecting different types of grammatical errors, including spelling, lexical
and syntactic ones.
References (30)
Alegria, I., Aranberri, N., Fresno, V., Gamallo, P., Padró, Ll., San Vicente, I., Turmo, J. & Zubiaga, A. 2013. Introducción a la tarea compartida Tweet-Norm: Normalización léxica de tuits en español. In
Proceedings of the Tweet Normalisation Workshop at SEPLN-2013
, 38–46. Sociedad Española para el Procesamiento del Lenguaje Natural, <[URL]> (1 July 2014).
Bender, E.M., Flickinger, D., Oepen, S., Walsh, A. & Baldwin, T. 2004. ARBORETUM: Using a precision grammar for grammar checking in CALL. In
Proceedings of the InSTIL/ICALL Symposium on
Computer Assisted Learning
, Venice, <[URL]> (1 July 2014).
Chodorow, M., Gamon, M. & Tetreault, J. 2010. The utility of article and preposition error correction systems for English language learners: Feedback and assessment. Language Testing 27(3): 419–436, <[URL]> (1 July 2014).
Chodorow, M., Dickinson, M., Israel, R. & Tetreault, J. 2012. Problems in evaluating grammatical error detection systems. In Proceedings of the International Conference on Computational Linguistics (COLING 2012), M. Kay, C. Boitet (eds), 611–628. Mumbai: Association for Computational Linguistics.
Council of Europe. 2009. Relating Language Examinations to the Common European Framework of Reference for Languages: Learning, Teaching, Assessment (CEFR). A Manual. Strasbourg: Language Policy Division, <[URL]> (1 July 2014).
Dahlmeier, D. & Tou Ng, H. 2011. Grammatical error correction with alternating structure optimization. In
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011)
, 915–923. Portland, OR: Association for Computational Linguistics, [URL] (1 July 2014).
Dale, R., Anisimoff, I. & Narroway, G. 2012. HOO 2012: A report on the preposition and determiner error correction shared task. In Proceedings of the 7th Workshop on Innovative Use of NLP for Building Educational Applications, 54–62. Montréal Québec: Association for Computational Linguistics, <[URL]> (1 July 2014).
Dale, R. & Kilgarriff, A. 2010. Helping our own: Text messaging for computational linguistics as a new shared task. In Proceedings of the 6th International Natural Language Generation Conference (NLG’10), J.D. Kelleher, B. Mac Namee, I. van der Sluis (eds), 263–267, <[URL]> (1 July 2014).
. 2011. Helping our own: The HOO 2011 pilot shared task. In Proceedings of the 13th European Workshop on Natural Language Generation (NLG’11) at EMNLP 2011, A. Belz, R. Evans, A. Gatt & K. Striegnitz (eds), 242–249. Nancy: Association for Computational Linguistics, <[URL]> (1 July 2014).
Ferris, D. 1999. The case for grammar correction in L2 writing classes: A response to Truscott (1996). Journal of Second Language Writing 8(1): 1–11.
Gamallo, P., Garcia, M. & Pichel, J.R. 2013a. A method to lexical normalisation of tweets. In
Tweet Normalisation Workshop at SEPLN-2013
, 81–85, <[URL]> (1 July 2014).
Gamallo P., Garcia, M., González, I., Muñoz. M. & Del Río, I. 2013b. Learning verb inflection using Cilenis conjugators. Eurocall Review 21(1): 12–19, <[URL]> (1 July 2014).
Gamallo, P. & González, I. 2011. A grammatical formalism based on patterns of part-of-speech tags. In International Journal of Corpus Linguistics 16(1): 45–71.
Gamon, M. 2010. Using mostly native data to correct errors in learners’ writing: A meta-classifier approach. In
Proceedings of HLT ‘10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2010), Association for Computational Linguistics (eds), 163–171. Stroudsburg PA: ACM Digital Library.
Garcia, M. & Gamallo, P. 2010. Using morphosyntactic post-processing to improve POS-tagging accuracy. In Proceedings of the 9th
International Conference on Computational Processing of Portuguese Language (PROPOR 2010).
Extended Activities Proceedings
, Porto Alegre, <[URL]> (1 July 2014).
Han, N., Chodorow, J.R. & Leacock, C. 2006. Detecting errors in English article usage by non-native speakers. Natural Language Engineering 12(2): 115–129.
Hartshorn, K.J., Evans, N.W., Merrill, P.F., Sudweeks, R.R., Strong-Krause, D. & Anderson, N.J. 2010. Effects of dynamic corrective feedback on ESL wiring accuracy. TESOL Quarterly 44(1): 84–109.
Hyland, K. & Hyland, F. 2006. State of the art article: Feedback on second language students’ writing. Language Teaching 39(2): 83–101.
Leacock, C., Chodorow, M., Gamon, M. & Tetreault J. 2010. Automated Grammatical Error Detection for Language Learners. San Rafael CA: Morgan & Claypool Publishers.
Liou, H.-C. 1991. Development of an English grammar checker: A progress report. CALICO Journal 9(1): 57–70.
Liu, Y. 2008. The effects of error feedback in second language writing. Arizona working papers in SLA & Teaching 15: 65–79.
Ng, H., Wu, S., Wu, Y., Hadiwinoto, C. & Tetreault, J. 2013. The CoNLL-2013 Shared Task on Grammatical Error Correction. In
Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task (CoNLL-2013 Shared Task), 1–14. Sofia: Association for Computational Linguistics, <[URL]> (1 July 2014).
Padró, L. & Stanilovsky, E. 2012. FreeLing 3.0: Towards wider multilinguality. In
Proceedings of the Language Resources and Evaluation Conference (LREC 2012). Istanbul: European Language and Resources Association, <[URL]> (1 July 2014).
Russell, J. & Spada, N. 2006. The effectiveness of corrective feedback for the acquisition of L2 grammar: A metaanalysis of the research. In Synthesizing Research on Language Learning and Teaching [Language Learning & Language Teaching 13], J.M. Norris & L. Ortega (eds), 133–164. Amsterdam: John Benjamins.
Tetreault, J. & Chodorow, M. 2008. The ups and downs of preposition error detection in ESL writing. In
Proceedings of the International Conference on Computational Linguistics (COLING 2008), 865–872. Manchester: Association for Computational Linguistics, <[URL]> (1 July 2014).
Truscott, J. 1996. The case against grammar correction in L2 writing classes. Language Learning 46(2): 327–369.
Truscott, J. & Hsu, A.Y. 2008. Error correction, revision, and learning. Journal of Second Language Writing 17(4): 292–305.
Vandeventer, A. 2001. Creating a grammar checker for CALL by constraint relaxation: A feasibility study. ReCALL 04/2001 13(1): 110–120.
Ware, P.D. & Warschauer, M. 2006. Electronic feedback and second language writing. In Feedback in Second Language Writing: Contexts and Issues, K. Hyland & F. Hyland (eds), 105–122. Cambridge: CUP.
Yannakoudakis, H., Briscoe, T. & Medlock, B. 2011. A new dataset and method for automatically grading ESOL texts. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011), Association for Computational Linguistics (eds), 180–189, <[URL]> (1 July 2014).
Cited by (5)
Cited by five other publications
Nuñez Cortés, Juan Antonio & Iria Da Cunha Fanego
Zhang, Fuzhuang, Lan Yu, Jun Shen & Muhammad Arif
Da Cunha, Iria
Gamallo, Pablo, Marcos Garcia, Cesar Pineiro, Rodrigo Martinez-Castano & Juan C. Pichel
This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
