References (32)
References
Baayen, R. H. (2001). Word frequency distributions. Springer Dordrecht. Google Scholar logo with link to Google Scholar
Biber, D., Reppen, R., Schnur, E., & Ghanem, R. (2016). On the (non)utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics, 21(4). 439–464. Google Scholar logo with link to Google Scholar
Burch, B., Egbert, J., & Biber, D. (2017). Measuring and interpreting lexical dispersion in corpus linguistics. Journal of Research Design and Statistics in Linguistics and Communication Science, 3(2). 189–216. Google Scholar logo with link to Google Scholar
Carroll, J. B. (1970). An alternative to Juilland’s usage coefficient for lexical frequencies and a proposal for a standard frequency index. Computer Studies in the Humanities and Verbal Behaviour, 3(2). 61–65. [URL]
Church, K. W., & Gale, W. A. (1995). Poisson mixtures. Natural Language Engineering, 1(2). 163–190. Google Scholar logo with link to Google Scholar
Davies, M. (2008). The Corpus of Contemporary American English (COCA). [URL]
Egbert, J., & Burch, B. (2023). Which words matter most? Operationalizing lexical prevalence for rank-ordered word lists. Applied Linguistics, 44(1). 103–126. Google Scholar logo with link to Google Scholar
Egbert, J., Burch, B., & Biber, D. (2020). Lexical dispersion and corpus design. International Journal of Corpus Linguistics, 25(1). 89–115. Google Scholar logo with link to Google Scholar
Francis, W. N., & Kučera, H. (1964). Manual of information to accompany a standard corpus of present-day edited American English for use with digital computers. Department of Linguistics, Brown University. [URL]
Greenbaum, S., & Nelson, G. (1996). The International Corpus of English (ICE) project. World Englishes, 15(1). 3–15. Google Scholar logo with link to Google Scholar
Gries, S. Th. (2008). Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4). 403–437. Google Scholar logo with link to Google Scholar
(2020a). Ten lectures on corpus linguistics with R: Applications for usage-based and psycholinguistic research. Brill.Google Scholar logo with link to Google Scholar
(2020b). Analyzing dispersion. In M. Paquot & S. Th. Gries (Eds.), A practical handbook of corpus linguistics (pp. 99–118). Springer. Google Scholar logo with link to Google Scholar
(2021). A new approach to (key) keywords analysis: Using frequency, and now also dispersion. Research in Corpus Linguistics, 9(2). 1–33. Google Scholar logo with link to Google Scholar
Halvorsen, K. T. (1991). Value splitting involving more factors. In D. C. Hoaglin, F. Mosteller, & J. W. Tukey (Eds.), Fundamentals of exploratory analysis of variance (pp. 72–113). Wiley. Google Scholar logo with link to Google Scholar
Juilland, A. G., & E. Chang-Rodríguez. (1964). Frequency dictionary of Spanish words. Mouton de Gruyter. Google Scholar logo with link to Google Scholar
Katz, S. M. (1996). Distribution of content words and phrases in text and language modelling. Natural Language Engineering, 2(1). 15–59. Google Scholar logo with link to Google Scholar
Keniston, H. (1920). Common words in Spanish. Hispania, 3(2). 85–96. Google Scholar logo with link to Google Scholar
Long, J. S. (1997). Regression models for categorical and limited dependent variables. Sage.Google Scholar logo with link to Google Scholar
Lyne, A. A. (1985). The vocabulary of French business correspondence. Slatkine-Champion.Google Scholar logo with link to Google Scholar
Mosteller, F., & D. L. Wallace. (1984). Applied Bayesian inference: The case of The Federalist Papers. Springer. Google Scholar logo with link to Google Scholar
R Core Team. (2022). R: A language and environment for statistical computing (Version 4.3.1). R Foundation for Statistical Computing. [URL]
Rigby, R. A. & M. D. Stasinopoulos. (2005). Generalized additive models for location, scale and shape. Applied Statistics, 54(3). 507–554. Google Scholar logo with link to Google Scholar
Rosengren, I. (1971). The quantitative concept of language and its relation to the structure of frequency dictionaries. Études de linguistique appliquée (Nouvelle Série), 11, 103–127.Google Scholar logo with link to Google Scholar
Sarkar, D. (2008). Lattice: Multivariate data visualization with R. Springer. Google Scholar logo with link to Google Scholar
Sönning, L. (2023a). The negative binomial distribution: A visual explanation. Statistics for linguist(ic)s. [URL]
(2023b). Different parameterizations of the negative binomial distribution. Statistics for linguist(ic)s. [URL]
(2025). Advancing our understanding of dispersion measures in corpus research. Corpora, 20(1). 3–35. Google Scholar logo with link to Google Scholar
Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer. Google Scholar logo with link to Google Scholar
Winter, B., & P.-C. Bürkner. (2021). Poisson regression for linguists: A tutorial introduction to modelling count data with brms. Language and Linguistics Compass, 15(11), e12439. Google Scholar logo with link to Google Scholar
Mobile Menu Logo with link to supplementary files background Layer 1 prag Twitter_Logo_Blue