Aatu Liimatta
List of John Benjamins publications for which Aatu Liimatta plays a role.
Text length and short texts: An overview of the problem Challenges in Corpus Linguistics: Rethinking corpus compilation and analysis, Kaunisto, Mark and Marco Schilk (eds.), pp. 106–125 | Chapter
2024 Variation in text length is an unavoidable confounder in quantitative text-analytic corpus-linguistic studies. Texts can be difficult to compare across text lengths, particularly if many of them are short, due to the difficulty of calculating meaningful frequencies for the lexical items and… read more
Register variation across text lengths: Evidence from social media International Journal of Corpus Linguistics 28:2, pp. 202–231 | Article
2023 This paper explores variation in lexico-grammatical register features across text lengths in a large-scale sample of Reddit comments. Very short texts are known to be problematic for many statistical methods, so understanding their nature is important for the corpus-linguistic study of social… read more
Do registers have different functions for text length? A case study of Reddit Register and social media, Clarke, Isobelle and Jack Grieve (eds.), pp. 263–287 | Article
2022 Similar to lexical and grammatical choices, the length of a text is also guided by situational constraints and functional needs. Consequently, texts of different lengths are associated with different communicative functions. This study explores the role of register in the functions which are… read more
Chapter 5. Using lengthwise scaling to compare feature frequencies across text lengths on Reddit Corpus Approaches to Social Media, Rüdiger, Sofia and Daria Dayter (eds.), pp. 111–130 | Chapter
2020 Texts of different lengths can be difficult to compare using quantitative methods. This is particularly true if many of the texts are extremely short, as is commonly the case with social media comments, where the median text length may be only a few dozen words. In this paper, I explore… read more
Exploring register variation on Reddit: A multi-dimensional study of language use on a social media website Register Studies 1:2, pp. 269–295 | Article
2019 While the language of the internet has been an increasingly popular research topic, there remain many understudied areas and topics which deserve more attention. This study explores register variation within the social media website Reddit using the multi-dimensional approach developed by… read more