Using learner corpora for automatic error detection and correction

Gamon, Michael; Chodorow, Martin; Leacock, Claudia; Tetreault, Joel

doi:10.1075/scl.59.09gam

In:Automatic Treatment and Analysis of Learner Corpus Data
Edited by Ana Díaz-Negrillo, Nicolas Ballier and Paul Thompson
[Studies in Corpus Linguistics 59] 2013
► pp. 127–150

Get fulltext from our e-platform

Download Book PDF

Using learner corpora for automatic error detection and correction

Published online: 18 December 2013

https://doi.org/10.1075/scl.59.09gam

In this chapter we discuss the use and importance of learner corpora for the development and evaluation of automatic systems for learner error detection and correction. We argue that learner corpora are crucial in three main areas in this process. First, these corpora play an important role in identifying and quantifying common error types, in order to prioritize development of error-specific algorithms. Second, learner corpora provide valuable training data for machine-learned approaches which are dominant in the field of natural language processing today. Finally, the evaluation of error detection and correction systems is most reliable and realistic when performed on real learner data.

Cited by (7)

Cited by seven other publications

Order by:

Vu, Phu & Lan Vu

2025. Enhancing collaborative writing with AI-enhanced feedback in graduate-level action research courses. Artificial Intelligence in Education ► pp. 1 ff.

Ahmed, Abdelhamid M., Xiao Zhang, Lameya M. Rezk & Wajdi Zaghouani

2024. Building an Annotated L1 Arabic/L2 English Bilingual Writer Corpus: The Qatari Corpus of Argumentative Writing (QCAW). Corpus-based Studies across Humanities 1:1 ► pp. 183 ff.

Leńko-Szymańska, Agnieszka & Sandra Götz

2022. Complexity, accuracy and fluency in learner corpus research. In Complexity, Accuracy and Fluency in Learner Corpus Research [Studies in Corpus Linguistics, 104], ► pp. 1 ff.

Pan, Jun, Billy Tak-Ming Wong & Honghua Wang

2022. Navigating learner data in translator and interpreter training. Babel. Revue internationale de la traduction / International Journal of Translation 68:2 ► pp. 236 ff.

Detey, Sylvain & Isabelle Racine

2017. Towards a perceptually assessed corpus of non-native French. International Journal of Learner Corpus Research 3:2 ► pp. 223 ff.

MALMASI, SHERVIN & MARK DRAS

2017. Multilingual native language identification. Natural Language Engineering 23:2 ► pp. 163 ff.

Park, Kwanghyun

2014. Corpora and Language Assessment: The State of the Art. Language Assessment Quarterly 11:1 ► pp. 27 ff.

This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.