Detecting innovations in a parsed corpus of learner English
In research on L2 English, recent corpus-based studies indicate that some nonstandard forms are shared by indigenized (ESL) and foreign (EFL) varieties of English, which challenges the idea of a clear dichotomy between innovation and error. We present a data-driven large-scale method to detect innovations, test it on verb + preposition structures (including phrasal verbs) and adjective + preposition structures, and describe similarities and differences between EFL and ESL. We use a dependency-parsed version of the International Corpus of Learner English to automatically extract potential innovations, defined as patterns of overuse compared to the British National Corpus as reference corpus. We measure overuse by means of collocation measures like O/E or T-score, and compare our results with similar results for ESL. In both quantitative and qualitative analyses, we detect similarities between the two varieties (e.g. discuss about) and dissimilarities (e.g. accuse for, only distinctive for EFL). We report more verb/adjective + preposition combinations than previous studies and discuss the roles of analogy and transfer.
Cited by (3)
Cited by three other publications
Schneider, Gerold
2023.
Detecting and Analysing Learner Difficulties Using a Learner Corpus Without Error Tagging. In
Demystifying Corpus Linguistics for English Language Teaching,
► pp. 229 ff.
Ranta, Elina
2022.
From learners to users—errors, innovations, and universals.
ELT Journal 76:3
► pp. 311 ff.
This list is based on CrossRef data as of 27 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.