In this chapter we discuss the use and importance of learner corpora for the development and evaluation of automatic systems for learner error detection and correction. We argue that learner corpora are crucial in three main areas in this process. First, these corpora play an important role in identifying and quantifying common error types, in order to prioritize development of error-specific algorithms. Second, learner corpora provide valuable training data for machine-learned approaches which are dominant in the field of natural language processing today. Finally, the evaluation of error detection and correction systems is most reliable and realistic when performed on real learner data.
Ahmed, Abdelhamid M., Xiao Zhang, Lameya M. Rezk & Wajdi Zaghouani
2024. Building an Annotated L1 Arabic/L2 English Bilingual Writer Corpus: The Qatari Corpus of Argumentative Writing (QCAW). Corpus-based Studies across Humanities 1:1 ► pp. 183 ff.
2017. Multilingual native language identification. Natural Language Engineering 23:2 ► pp. 163 ff.
Park, Kwanghyun
2014. Corpora and Language Assessment: The State of the Art. Language Assessment Quarterly 11:1 ► pp. 27 ff.
This list is based on CrossRef data as of 29 october 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.