In our earlier work on three Asian Englishes and British English, we showed how lexico-syntactic co-occurrence preferences for three argument structure constructions revealed differences between varieties that correlated well with Schneider’s (2003, 2007) model of evolutionary stages. Here, we turn to lexical co-occurrence preferences and investigate if and to what degree n-grams distinguish between different modes and varieties in the same components of the International Corpus of English. Our approach to n-grams differs from previous work in that we neither use raw frequencies nor (problematic) MI-values but the newly proposed measure of lexical gravity (cf. Daudaravičius & Marcinkevičienė 2004), which takes type frequencies into consideration. We show how lexical gravity can be extended to handle n-grams with n ≥ 3 and apply this method to our n-gram data; in addition, we suggest a new concept for describing the tendency of a word to occur in significant n-grams: lexical stickiness.
2017. Frequency Consolidation Among Word N-Grams. In Computational and Corpus-Based Phraseology [Lecture Notes in Computer Science, 10596], ► pp. 432 ff.
Chai, Christine P.
2023. Comparison of text preprocessing methods. Natural Language Engineering 29:3 ► pp. 509 ff.
Dattner, Elitzur
2019. The Hebrew dative: Usage patterns as discourse profile constructions. Linguistics 57:5 ► pp. 1073 ff.
DUNN, JONATHAN
2017. Computational learning of construction grammars. Language and Cognition 9:2 ► pp. 254 ff.
2019. N-gram measures and L2 writing proficiency. System 80 ► pp. 176 ff.
Gries, Stefan Th.
2022. Multi-word units (and tokenization more generally): a multi-dimensional and largely information-theoretic approach. Lexis :19
Gries, Stefan Th. & Sandra C. Deshors
2015. EFL and/vs. ESL?. International Journal of Learner Corpus Research 1:1 ► pp. 130 ff.
Gries, Stefan Th. & Philip Durrant
2020. Analyzing Co-occurrence Data. In A Practical Handbook of Corpus Linguistics, ► pp. 141 ff.
Götz, Sandra
2017. Non-Canonical Syntax in South Asian Varieties of English: A Corpus-Based Pilot Study on Fronting. Zeitschrift für Anglistik und Amerikanistik 65:3 ► pp. 265 ff.
Hilpert, Martin
2022. Review of Laporte, Samantha. 2021. Corpora, Constructions, New Englishes. A Constructional and Variationist Approach to Verb Patterning. Amsterdam: John Benjamins. ISBN: 978-9-027-20850-7. https://doi.org/10.1075/scl.100. Research in Corpus Linguistics 10:2 ► pp. 147 ff.
Meng, Fanqi, Yujie Zheng, Songbin Bao & Jingdong Wang
2021. 2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT), ► pp. 886 ff.
Pan, Fan, Randi Reppen & Douglas Biber
2016. Comparing patterns of L1 versus L2 English academic professionals: Lexical bundles in Telecommunications research journals. Journal of English for Academic Purposes 21 ► pp. 60 ff.
PARVIAINEN, HANNA
2016. The invariant tagisn't itin Asian Englishes. World Englishes 35:1 ► pp. 98 ff.
SCHNEIDER, EDGAR W.
2014. New reflections on the evolutionary dynamics of world Englishes. World Englishes 33:1 ► pp. 9 ff.
Schneider, Ulrike
2018. ΔP as a measure of collocation strength. Corpus Linguistics and Linguistic Theory 0:0
Spina, Stefania & Elena Tanganelli
2012. Les collocations comme indice pour distinguer les genres textuels. Corpus :11
Staples, Shelley, Jesse Egbert, Douglas Biber & Alyson McClair
2013. Formulaic sequences and EAP writing development: Lexical bundles in the TOEFL iBT writing section. Journal of English for Academic Purposes 12:3 ► pp. 214 ff.
2018. Multi-word Expressions: A Novel Computational Approach to Their Bottom-Up Statistical Extraction. In Lexical Collocation Analysis [Quantitative Methods in the Humanities and Social Sciences, ], ► pp. 85 ff.
This list is based on CrossRef data as of 24 september 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.