Corpus Approaches to Social Media
From Twitter to Reddit, Facebook, and WhatsApp – social media is a part of modern everyday life. Studying the language used on social media platforms presents great opportunities as well as challenges to corpus linguists. The contributions in Corpus Approaches to Social Media address technical, ethical, and methodological issues by showcasing in-depth social media studies as conducted by corpus scholars. The chapters are based on a variety of social media platforms and include corpus perspectives on the language of online communities, linguistic variation in short media texts, and the role of images in computer-mediated communication. The detailed accounts of the methodological aspects of working with social media corpora are a particularly strong point of the collection. The volume features research applying traditional corpus linguistic methods to social media data as well as novel research methods for the analysis of multimodal material and atypical corpus texts.
[Studies in Corpus Linguistics, 98] 2020. vi, 210 pp.
Publishing status: Available
Published online on 22 October 2020
© John Benjamins
Table of Contents
Introduction. The expanding landscape of corpus-based studies of social media language | Sofia Rüdiger and Daria Dayter | pp. 1–12
Part I. Using corpus methods to investigate communities on social media
Chapter 1. Towards a digital sociolinguistics: Communities of Practice on Reddit | Sven Leuckert and Martin Leuckert | pp. 15–40
Chapter 2. The control and censorship of linguistic resources in an online Community of Practice | Lisa Donlan | pp. 41–62
Chapter 3. Talking about women: Elicitation, manual tagging, and semantic tagging in a study of pick-up artists’ referential strategies | Daria Dayter and Sofia Rüdiger | pp. 63–86
Part II. Linguistic variation in short social media texts
Chapter 4. Patterns of intra-individual variation in a Swiss WhatsApp corpus: Analysing real-time change and long-term accommodation | Samuel Felder | pp. 89–110
Chapter 5. Using lengthwise scaling to compare feature frequencies across text lengths on Reddit | Aatu Liimatta | pp. 111–130
Chapter 6. Double trouble: Are 280-character tweets comparable to 140-character tweets? | Martin Eberl | pp. 131–146
Part III. The role of images
Chapter 7. Constructing corpora from images and text: An introduction to Visual Constituent Analysis | Alex Christiansen, William Dance and Alexander Wild | pp. 149–174
Chapter 8. Working with images and emoji in the 🦆 Dukki Facebook Corpus | Luke C. Collins | pp. 175–196
Part IV. Discussion
Chapter 9. New developments in corpus approaches to social media: A response | Claire Hardaker | pp. 199–208
Index | pp. 209–210
Cited by six other publications
Broś, Karolina
Santamaría Urbieta, Alexandra, Elena Alcalde Peñalver & Peter Bannister
Howe, Chad
Sinnott, Richard O., Qi Li, Abdul Mohammad & Luca Morandini
Johansson, Marjut, Sanna-Kaisa Tanskanen & Jan Chovanec
This list is based on CrossRef data as of 19 July 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
Subjects
Main BIC Subject
CFX: Computational linguistics
Main BISAC Subject
LAN009050: LANGUAGE ARTS & DISCIPLINES / Linguistics / Sociolinguistics