Detecting innovations in a parsed corpus of learner English

Schneider, Gerold; Gilquin, Gaëtanelle

doi:10.1075/bct.98.03sch

Part of

Rethinking Linguistic Creativity in Non-native Englishes
Edited by Sandra C. Deshors, Sandra Götz and Samantha Laporte
[Benjamins Current Topics 98] 2018
pp. 47–74

Gerold Schneider | University of Konstanz & University of Zurich

Gaëtanelle Gilquin | University of Louvain

In research on L2 English, recent corpus-based studies indicate that some nonstandard forms are shared by indigenized (ESL) and foreign (EFL) varieties of English, which challenges the idea of a clear dichotomy between innovation and error. We present a data-driven, large-scale method to detect innovations, test it on verb + preposition structures (including phrasal verbs) and adjective + preposition structures, and describe similarities and differences between EFL and ESL. We use a dependency-parsed version of the International Corpus of Learner English to automatically extract potential innovations, defined as patterns of overuse compared to the British National Corpus as the reference corpus. We measure overuse by means of collocation measures such as O/E or T-score, and compare our results with similar results for ESL. In both quantitative and qualitative analyses, we detect similarities between the two varieties (e.g. discuss about) and dissimilarities (e.g. accuse for, distinctive only for EFL). We report more verb/adjective + preposition combinations than previous studies and discuss the roles of analogy and transfer.
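
To make the overuse measures named in the abstract concrete, here is a minimal Python sketch of O/E and T-score for a single verb + preposition pair. It assumes the common textbook definitions, E = f(head) × f(prep) / N and T = (O − E) / √O, which the abstract does not spell out, so it may differ from the chapter's actual implementation. The function names and all counts are hypothetical and are not taken from ICLE or the BNC.

```python
import math


def expected(f_head: int, f_prep: int, n_pairs: int) -> float:
    """Expected co-occurrence count if head and preposition were independent."""
    return f_head * f_prep / n_pairs


def o_over_e(observed: int, f_head: int, f_prep: int, n_pairs: int) -> float:
    """Ratio of observed to expected co-occurrences (O/E)."""
    return observed / expected(f_head, f_prep, n_pairs)


def t_score(observed: int, f_head: int, f_prep: int, n_pairs: int) -> float:
    """T-score in its common collocation form: (O - E) / sqrt(O)."""
    if observed <= 0:
        raise ValueError("T-score is undefined when the pair is unobserved")
    return (observed - expected(f_head, f_prep, n_pairs)) / math.sqrt(observed)


# Hypothetical counts for one verb + preposition pair (assumption, not real data).
learner = dict(observed=40, f_head=200, f_prep=5_000, n_pairs=100_000)
reference = dict(observed=10, f_head=2_000, f_prep=50_000, n_pairs=1_000_000)

for name, counts in (("learner", learner), ("reference", reference)):
    print(name, round(o_over_e(**counts), 2), round(t_score(**counts), 2))
```

With these invented counts, the pair occurs more often than chance predicts in the learner data and less often than chance predicts in the reference data. That contrast is the kind of pattern the abstract describes as potential overuse.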

Keywords: Cognitive Linguistics, collocations, corpus linguistics, data-driven approach, English as a Foreign Language (EFL), English as a Second Language (ESL), Error Analysis, Learner English, linguistic innovations, verb-preposition constructions

Published online: 19 July 2018

Cited by 2 other publications

Ranta, Elina

2022. From learners to users—errors, innovations, and universals. ELT Journal 76:3, pp. 311 ff.

Schneider, Gerold

2023. Detecting and Analysing Learner Difficulties Using a Learner Corpus Without Error Tagging. In Demystifying Corpus Linguistics for English Language Teaching, pp. 229 ff.
