Alangari, Manal, Sylvia Jaworska & Jacqueline Laws
2020.
Who's afraid of phrasal verbs? The use of phrasal verbs in expert academic writing in the discipline of linguistics.
Journal of English for Academic Purposes 43
► pp. 100814 ff.

Altmeyer, Stefan, Constantin Klein, Barbara Keller, Christopher F. Silver, Ralph W. Hood & Heinz Streib

Aydın, Özgür
2017.
Kitap Tanıtımı.
Dilbilim Araştırmaları Dergisi 28:2
► pp. 93 ff.

Baumann, Andreas & Katharina Sekanina
2022.
Accounting for the relationship between lexical prevalence and acquisition with Bayesian networks and population dynamics.
Linguistics Vanguard 8:1
► pp. 209 ff.

Bernaisch, Tobias, Stefan Th. Gries & Benedikt Heller
2022.
Theoretical models and statistical modelling of linguistic epicentres.
World Englishes 41:3
► pp. 333 ff.

Biber, Douglas, Randi Reppen, Erin Schnur & Romy Ghanem

Bittar, André, Sumithra Velupillai, Angus Roberts & Rina Dutta
2021.
Using General-purpose Sentiment Lexicons for Suicide Risk Assessment in Electronic Health Records: Corpus-Based Analysis.
JMIR Medical Informatics 9:4
► pp. e22397 ff.

Brezina, Vaclav
2018.
Statistics in Corpus Linguistics,

Brezina, Vaclav, Tony McEnery & Stephen Wattam

Brown, David West
2018.
English and Empire,

Bubenhofer, Noah, Willi Lange, Saburo Okamura & Joachim Scharloth
2015.
Wortschätze in Lehrbüchern für Deutsch als Fremdsprache – Möglichkeiten und Grenzen frequenzorientierter Ansätze. In
Linguistik und Schulbuchforschung,
► pp. 85 ff.

Buerki, Andreas
2020.
Formulaic Language and Linguistic Change,

Burch, Brent & Jesse Egbert
2020.
Zero-inflated beta distribution applied to word frequency and lexical dispersion in corpus linguistics.
Journal of Applied Statistics 47:2
► pp. 337 ff.

Burch, Brent & Jesse Egbert
2023.
Word Use Equivalence and Hierarchical Word Tiers.
Journal of Quantitative Linguistics 30:1
► pp. 104 ff.

Candarli, Duygu
2021.
A longitudinal study of multi-word constructions in L2 academic writing: the effects of frequency and dispersion.
Reading and Writing 34:5
► pp. 1191 ff.

Chen, Meilin
2017.
Phraseology in English as an Academic Lingua Franca. In
Handbook of Research on Individualism and Identity in the Globalized Digital Age [
Advances in Human and Social Aspects of Technology],
► pp. 478 ff.

Chesley, Paula & R. Harald Baayen
2010.
Predicting new words from newer words: Lexical borrowings in French.
Linguistics 48:6

Cotos, Elena & Yoo-Ree Chung
2019.
Functional language in curriculum genres: Implications for testing international teaching assistants.
Journal of English for Academic Purposes 41
► pp. 100766 ff.

Cotos, Elena & Yoo‐Ree Chung
2018.
Domain Description: Validating the Interpretation of the TOEFL iBT® Speaking Scores for International Teaching Assistant Screening and Certification Purposes.
ETS Research Report Series 2018:1
► pp. 1 ff.

Crossley, Scott, Kristopher Kyle & Thomas Salsbury
2016.
A Usage-Based Investigation of L2 Lexical Acquisition: The Role of Input and Output.
The Modern Language Journal 100:3
► pp. 702 ff.

Crossley, Scott, Tom Salsbury, Ashley Titak & Danielle McNamara

Crossley, Scott A., Nicholas Subtirelu & Tom Salsbury
2013.
Frequency Effects or Context Effects in Second Language Word Learning.
Studies in Second Language Acquisition 35:4
► pp. 727 ff.

Csomay, Eniko & Alexandra Prades
2018.
Academic vocabulary in ESL student papers: A corpus-based study.
Journal of English for Academic Purposes 33
► pp. 100 ff.

Cvrček, Václav & Masako Fidler
2022.
No keyword is an island: in search of covert associations.
Corpora 17:2
► pp. 259 ff.

Dang, Thi Ngoc Yen
2018.
The nature of vocabulary in academic speech of hard and soft-sciences.
English for Specific Purposes 51
► pp. 69 ff.

Dang, Thi Ngoc Yen, Averil Coxhead & Stuart Webb
2017.
The Academic Spoken Word List.
Language Learning 67:4
► pp. 959 ff.

De Troij, Robbert & Freek Van de Velde
2020.
Beyond Mere Text Frequency: Assessing Subtle Grammaticalization by Different Quantitative Measures. A Case Study on the Dutch Soort Construction.
Languages 5:4
► pp. 55 ff.

Degraeuwe, Jasper & Patrick Goethals
2020.
La selección temática del vocabulario para fines didácticos: evaluación de un acercamiento cuantitativo.
Revista de Lingüística y Lenguas Aplicadas 15:1
► pp. 1 ff.

Delmonte, Rodolfo
2020.
Venses @ HaSpeeDe2 & SardiStance: Multilevel Deep Linguistically Based Supervised Approach to Classification. In
EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020,
► pp. 121 ff.

Desagulier, Guillaume
2019.
Can word vectors help corpus linguists?
Studia Neophilologica 91:2
► pp. 219 ff.

Deshors, Sandra C.
2017.
Zooming in on Verbs in the Progressive: A Collostructional and Correspondence Analysis Approach.
Journal of English Linguistics 45:3
► pp. 260 ff.

Divjak, Dagmar
2017.
The Role of Lexical Frequency in the Acceptability of Syntactic Variants: Evidence From that-Clauses in Polish.
Cognitive Science 41:2
► pp. 354 ff.

Divjak, Dagmar
2019.
Frequency in Language,

Dunn, Jonathan
2017.
Computational learning of construction grammars.
Language and Cognition 9:2
► pp. 254 ff.

Durrant, P.
2014.
Discipline and Level Specificity in University Students' Written Vocabulary.
Applied Linguistics 35:3
► pp. 328 ff.

Dushku, Silvana & Youngshil Paek
2021.
Investigating ESL learners’ awareness of semantic prosody across proficiency levels.
Language Awareness 30:3
► pp. 234 ff.

Ebeling, Jarle & Signe O. Ebeling
2018.
Comparing n-gram-based functional categories in original versus translated texts.
Corpora 13:3
► pp. 347 ff.

Edwards, Alison & Rutger-Jan Lange

Egbert, Jesse, Brent Burch & Douglas Biber

Fitzgerald, Jill, Jackie Eunjung Relyea, Jeff Elmore & Elfrieda H. Hiebert
2021.
Has the Presence of First‐Grade Core Reading Program Academic Vocabulary Changed Across Six Decades?
Reading Research Quarterly 56:4
► pp. 737 ff.

Furkó, Péter B.
2020.
Discourse Markers in Natural Conversations, Scripted Conversations and Political Interviews: Core and Peripheral Uses. In
Discourse Markers and Beyond,
► pp. 39 ff.

Furkó, Péter B.
2020.
The Use of Discourse Markers in Business English Textbooks: Issues in L2 Communicative Competence and Learners’ Input. In
Discourse Markers and Beyond,
► pp. 119 ff.

Gabrielatos, Costas
2019.
If-Conditionals and Modality: Frequency Patterns and Theoretical Explanations.
Journal of English Linguistics 47:4
► pp. 301 ff.

Gabrielatos, Costas, Eivind Nessa Torgersen, Sebastian Hoffmann & Susan Fox
2010.
A Corpus-Based Sociolinguistic Study of Indefinite Article Forms in London English.
Journal of English Linguistics 38:4
► pp. 297 ff.

Gerlach, Martin, Hanyu Shi & Luís A. Nunes Amaral
2019.
A universal information theoretic approach to the identification of stopwords.
Nature Machine Intelligence 1:12
► pp. 606 ff.

Gillings, Mathew, Gerlinde Mautner & Paul Baker
2023.
Corpus-Assisted Discourse Studies,

Graves, Michael F., Jeff Elmore & Jill Fitzgerald
2019.
The Vocabulary of Core Reading Programs.
The Elementary School Journal 119:3
► pp. 386 ff.

Gries, Stefan Th.
2015.
More (old and new) misunderstandings of collostructional analysis: On Schmid and Küchenhoff (2013).
Cognitive Linguistics 26:3
► pp. 505 ff.

Gries, Stefan Th.
2021.
A new approach to (key) keywords analysis: Using frequency, and now also dispersion.
Research in Corpus Linguistics 9:2
► pp. 1 ff.

Gries, Stefan Th.
2022.
Multi-word units (and tokenization more generally): a multi-dimensional and largely information-theoretic approach.
Lexis 19

Gries, Stefan Th.
2020.
Analyzing Dispersion. In
A Practical Handbook of Corpus Linguistics,
► pp. 99 ff.

Gries, Stefan Th. & Philip Durrant
2020.
Analyzing Co-occurrence Data. In
A Practical Handbook of Corpus Linguistics,
► pp. 141 ff.

Gries, Stefan Th. & Nick C. Ellis
2015.
Statistical Measures for Usage-Based Linguistics.
Language Learning 65:S1
► pp. 228 ff.

Grindrod, Jumbly
2022.
Justification: Insights from Corpora.
Episteme
► pp. 1 ff.

Gyeahyung Jeon & 문병열
2018.
A Study on Statistical Techniques for Semantic Description of Grammatical Elements.
Language & Information Society 33
► pp. 219 ff.

Güven, Selçuk, Naama Friedmann & Claudio Mulatti
2021.
Vowel dyslexia in Turkish: A window to the complex structure of the sublexical route.
PLOS ONE 16:3
► pp. e0249016 ff.

Hashimoto, Brett J. & Jesse Egbert
2019.
More Than Frequency? Exploring Predictors of Word Difficulty for Second Language Learners.
Language Learning 69:4
► pp. 839 ff.

Heffernan, Kevin
2021.
Correlating cognitive effort and noun role in spoken Japanese.
Journal of Japanese Linguistics 37:2
► pp. 181 ff.

Heffernan, Kevin, Yusuke Imanishi & Masaru Honda
2018.
Showcasing the interaction of generative and emergent linguistic knowledge with case marker omission in spoken Japanese.
Glossa: a journal of general linguistics 3:1

Hilpert, Martin & David Correia Saavedra
2017.
Why are grammatical elements more evenly dispersed than lexical elements? Assessing the roles of text frequency and semantic generality.
Corpora 12:3
► pp. 369 ff.

Hodge, Gabrielle & Trevor Johnston
2014.
Points, Depictions, Gestures and Enactment: Partly Lexical and Non-Lexical Signs as Core Elements of Single Clause-Like Units in Auslan (Australian Sign Language).
Australian Journal of Linguistics 34:2
► pp. 262 ff.

Hollis, Geoff
2020.
Delineating linguistic contexts, and the validity of context diversity as a measure of a word's contextual variability.
Journal of Memory and Language 114
► pp. 104146 ff.

Hsu, Chan-Chia
2020.
Exploring recurrent frames in written Chinese.
Corpora 15:3
► pp. 291 ff.

Hsu, Chan-Chia & Shu-Kai Hsieh

Jones, Michael N., Melody Dye & Brendan T. Johns
2017.
Context as an Organizing Principle of the Lexicon [
Psychology of Learning and Motivation, 67],
► pp. 239 ff.

Kamrotov, Mikhail, Ekaterina Talalakina & Denis Stukal
2022.
Technical vocabulary in languages for special purposes: The corpus-based Russian economics word list.
Lingua 273
► pp. 103326 ff.

Kern, Roman & Michael Granitzer
2009.
Proceedings of the International Conference on Management of Emergent Digital EcoSystems,
► pp. 167 ff.

Kern, Roman & Michael Granitzer
2010.
German Encyclopedia Alignment Based on Information Retrieval Techniques. In
Research and Advanced Technology for Digital Libraries [
Lecture Notes in Computer Science, 6273],
► pp. 315 ff.

Kern, Roman, Christin Seifert & Michael Granitzer
2010.
A hybrid system for German encyclopedia alignment.
International Journal on Digital Libraries 11:2
► pp. 75 ff.

Koplenig, Alexander
2015.
The Impact of Lacking Metadata for the Measurement of Cultural and Linguistic Change Using the Google Ngram Data Sets—Reconstructing the Composition of the German Corpus in Times of WWII.
Digital Scholarship in the Humanities
► pp. fqv037 ff.

Koplenig, Alexander & Gareth J. Baxter
2019.
A non-parametric significance test to compare corpora.
PLOS ONE 14:9
► pp. e0222703 ff.

Kusseling, Françoise & Deryle Lonsdale
2013.
A Corpus-Based Assessment of French CEFR Lexical Content.
The Canadian Modern Language Review 69:4
► pp. 436 ff.

Kyle, Kristopher & Scott Crossley
2016.
The relationship between lexical sophistication and independent and source-based writing.
Journal of Second Language Writing 34
► pp. 12 ff.

Kyle, Kristopher & Scott A. Crossley
2015.
Automatically Assessing Lexical Sophistication: Indices, Tools, Findings, and Application.
TESOL Quarterly 49:4
► pp. 757 ff.

Kyröläinen, Aki-Juhani & Veronika Laippala
2023.
Predictive keywords: Using machine learning to explain document characteristics.
Frontiers in Artificial Intelligence 5

Levshina, Natalia
2017.
Online film subtitles as a corpus: an n-gram approach.
Corpora 12:3
► pp. 311 ff.

Lijffijt, Jefrey & Stefan Th. Gries

Lijffijt, Jefrey, Terttu Nevalainen, Tanja Säily, Panagiotis Papapetrou, Kai Puolamäki & Heikki Mannila
2016.
Significance testing of word frequencies in corpora.
Digital Scholarship in the Humanities 31:2
► pp. 374 ff.

Lijffijt, Jefrey, Panagiotis Papapetrou & Kai Puolamäki
2012.
Size Matters: Finding the Most Informative Set of Window Lengths. In
Machine Learning and Knowledge Discovery in Databases [
Lecture Notes in Computer Science, 7524],
► pp. 451 ff.

Lijffijt, Jefrey, Panagiotis Papapetrou & Kai Puolamäki
2015.
Size matters: choosing the most informative set of window lengths for mining patterns in event sequences.
Data Mining and Knowledge Discovery 29:6
► pp. 1838 ff.

Lin, You-Min & Michelle Y. Chen
2020.
Understanding writing quality change: A longitudinal study of repeaters of a high-stakes standardized English proficiency test.
Language Testing 37:4
► pp. 523 ff.

Lorenzo-Dus, Nuria, Anina Kinzel & Matteo Di Cristofaro
2020.
The communicative modus operandi of online child sexual groomers: Recurring patterns in their language use.
Journal of Pragmatics 155
► pp. 15 ff.

Lorenzo-Dus, Nuria & Lella Nouri
2021.
The discourse of the US alt-right online – a case study of the Traditionalist Worker Party blog.
Critical Discourse Studies 18:4
► pp. 410 ff.

Lukin, Annabelle
2019.
War and Violence: Etymology, Definitions, Frequencies, Collocations. In
War and Its Ideologies [
The M.A.K. Halliday Library Functional Linguistics Series],
► pp. 81 ff.

McCallum, Lee
2019.
Assessing Second Language Proficiency Under ‘Unequal’ Perspectives: A Call for Research in the MENA Region. In
English Language Teaching Research in the Middle East and North Africa,
► pp. 3 ff.

McGrath, Darby & Cassi Liardét
2022.
A corpus-assisted analysis of grammatical metaphors in successful student writing.
Journal of English for Academic Purposes 56
► pp. 101090 ff.

McGrath, Darby & Cassi Liardét
2023.
Grammatical metaphor across disciplines: Variation, frequency, and dispersion.
English for Specific Purposes 69
► pp. 33 ff.

Meyer, Thomas George
2020.
Difference as privilege: identity, citizenship and the recontextualisation of human rights in Japan’s social studies curriculum.
Critical Studies in Education 61:1
► pp. 17 ff.

Meyer, Thomas George
2023.
Corpus Approaches to the Sociology of Curricula: A Methodological Case Study of Human Rights Learning in Japan.
Applied Corpus Linguistics
► pp. 100057 ff.

Miller, Don
2020.
Analysing Frequency Lists. In
A Practical Handbook of Corpus Linguistics,
► pp. 77 ff.

Miller, Don
2022.
Replication as a means of assessing corpus representativeness and the generalizability of specialized word lists.
Applied Corpus Linguistics 2:3
► pp. 100027 ff.

Mineiro, Ana, Inmaculada Concepción Báez-Montero, Mara Moita, Isabel Galhano-Rodrigues & Alexandre Castro-Caldas
2021.
Disentangling Pantomime From Early Sign in a New Sign Language: Window Into Language Evolution Research.
Frontiers in Psychology 12

Neels, Jakob
2020.
Lifespan change in grammaticalisation as frequency-sensitive automation: William Faulkner and the let alone construction.
Cognitive Linguistics 31:2
► pp. 339 ff.

Nelson, Robert
2018.
How ‘chunky’ is language? Some estimates based on Sinclair's Idiom Principle.
Corpora 13:3
► pp. 431 ff.

Nelson, Robert N.
2023.
Too Noisy at the Bottom: Why Gries’ (2008, 2020) Dispersion Measures Cannot Identify Unbiased Distributions of Words.
Journal of Quantitative Linguistics
► pp. 1 ff.

Noël, Dirk & Johan van der Auwera

Omidian, Taha & Anna Siyanova-Chanturia
2021.
Parameters of variation in the use of words in empirical research writing.
English for Specific Purposes 62
► pp. 15 ff.

Pham, Hien, Benjamin V. Tucker & R. Harald Baayen
2019.
Constructing two Vietnamese corpora and building a lexical database.
Language Resources and Evaluation 53:3
► pp. 465 ff.

Pimas, Oliver, Stefan Klampfl, Thomas Kohl, Roman Kern & Mark Kröll
2016.
Generating Tailored Classification Schemas for German Patents. In
Natural Language Processing and Information Systems [
Lecture Notes in Computer Science, 9612],
► pp. 230 ff.

Rastelli, Stefano
2019.
The discontinuity model: Statistical and grammatical learning in adult second-language acquisition.
Language Acquisition 26:4
► pp. 387 ff.

Rauhut, Alexander
2021.
Exploring the Effect of Conversion on the Distribution of Inflectional Suffixes: A Multivariate Corpus Study.
Zeitschrift für Anglistik und Amerikanistik 69:3
► pp. 267 ff.

Reynolds, Barry Lee & Chen Ding
2022.
Effects of word-related factors on first and second language English readers’ incidental acquisition of vocabulary through reading an authentic novel.
English Teaching: Practice & Critique 21:2
► pp. 171 ff.

Rácz, Péter, Viktória Papp & Jennifer Hay
2016.
Frequency and Corpora. In
The Cambridge Handbook of Morphology,
► pp. 685 ff.

Schröter, Julian, Keli Du, Julia Dudar, Cora Rok & Christof Schöch
2021.
From Keyness to Distinctiveness – Triangulation and Evaluation in Computational Literary Studies.
Journal of Literary Theory 15:1-2
► pp. 81 ff.

Serigos, Jacqueline
2022.
Using automated methods to explore the social stratification of anglicisms in Spanish.
Corpus Linguistics and Linguistic Theory 18:2
► pp. 391 ff.

Siyanova-Chanturia, Anna & Stefania Spina
2015.
Investigation of Native Speaker and Second Language Learner Intuition of Collocation Frequency.
Language Learning 65:3
► pp. 533 ff.

Sun, Linlin & David Correia Saavedra
2020.
Measuring grammatical status in Chinese through quantitative corpus analysis.
Corpora 15:3
► pp. 317 ff.

Sönning, Lukas
2023.
Evaluation of keyness metrics: performance and reliability.
Corpus Linguistics and Linguistic Theory

Tidwell, Jacqueline
2019.
From a Smoking Gun to Spent Fuel: Principled Subsampling Methods for Building Big Language Data Corpora from Monitor Corpora.
Data 4:2
► pp. 48 ff.

Tonkin, E.L.
2016.
A Day at Work (with Text). In
Working with Text,
► pp. 23 ff.

Van Hoey, Thomas
2023.
ABB, a salient prototype of collocate–ideophone constructions in Mandarin Chinese.
Cognitive Linguistics

Velde, Freek Van de & Alek Keersmaekers

Vessey, Rachelle
2016.
Approaches to Language Ideology. In
Language and Canadian Media,
► pp. 59 ff.

Vitta, Joseph P., Christopher Nicklin & Simon W. Albright
2023.
Academic word difficulty and multidimensional lexical sophistication: An English‐for‐academic‐purposes‐focused conceptual replication of Hashimoto and Egbert (2019).
The Modern Language Journal

Wahl, Alexander & Stefan Th. Gries

Wait, Charles, Tafadzwa Ruzive & Pierre le Roux
2017.
The Influence of Financial Market Development on Economic Growth in BRICS Countries.
International Journal of Management and Economics 53:1
► pp. 7 ff.

Wan, Minyu, Qi Su, Rong Xiang & Chu-Ren Huang
2023.
Data-driven analytics of COVID-19 ‘infodemic’.
International Journal of Data Science and Analytics 15:3
► pp. 313 ff.

Wang, Zhong, Weiwei Fan & Alex Chengyu Fang
2022.
Lexical Input in the Grammatical Expression of Stance: A Collexeme Analysis of the INTRODUCTORY IT PATTERN.
Frontiers in Psychology 12

Wild, Kate, Adam Kilgarriff & David Tugwell
2013.
The Oxford Children’s Corpus: Using a Children’s Corpus in Lexicography.
International Journal of Lexicography 26:2
► pp. 190 ff.

Wilson, Andrew
2012.
Using corpora in depth psychology: a trigram-based analysis of a corpus of fetish fantasies.
Corpora 7:1
► pp. 69 ff.

Winter, Bodo & Martine Grice
2021.
Independence and generalizability in linguistics.
Linguistics 59:5
► pp. 1251 ff.

Wu, Baimei, Andrew K.F. Cheung & Jie Xing

Xia, Detong, Yudi Chen & Hye K. Pae
2022.
Lexical and grammatical collocations in beginning and intermediate L2 argumentative essays: a bigram study.
International Review of Applied Linguistics in Language Teaching

Xie, Wenxiu, Meng Ji, Mengdan Zhao, Tianqi Zhou, Fan Yang, Xiaobo Qian, Chi-Yin Chow, Kam-Yiu Lam & Tianyong Hao
2021.
Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers.
Frontiers in Psychiatry 12

Zhang, Haomin, Yuting Han, Xing Zhang & Liuran Cui
2022.
Frequency, Dispersion and Abstractness in the Lexical Sophistication Analysis of A Learner-Based Word Bank: Dimensionality Reduction and Identification.
Journal of Quantitative Linguistics 29:2
► pp. 195 ff.

Öksüz, Doğuş, Vaclav Brezina & Patrick Rebuschat
2021.
Collocational Processing in L1 and L2: The Effects of Word Frequency, Collocational Frequency, and Association.
Language Learning 71:1
► pp. 55 ff.

김미란, Jungha Hong & Jae-Woong Choe
2014.
Distributional characteristics in Korean onset-nucleus sequences and hierarchical clustering of Korean vowels.
Studies in Phonetics, Phonology, and Morphology 20:1
► pp. 23 ff.

[no author supplied]
2013.
Web Corpus Construction [
Synthesis Lectures on Human Language Technologies],

[no author supplied]
2018.
Vocabulary. In
Statistics in Corpus Linguistics,
► pp. 38 ff.

[no author supplied]
2020.
References. In
Introduction to Corpus Linguistics,
► pp. 233 ff.

This list is based on CrossRef data as of 22 May 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.