This paper examines the potential large diachronic corpora hold for
the study of social change. Resources such as COHA or Google Books
allow us to detect shifts in the frequencies of linguistic elements, which can then be
interpreted as reflections of developments in society. This paper addresses the
practicalities of this approach in two parts. The first, theoretical part surveys a series of
problems that need to be controlled for in analyses of diachronic textual data. The
second, empirical part applies these considerations in a case study of the English
make-causative over the past 150 years. Examining the variables of
animacy and verb semantics, the study explores whether the diminishing social value of
interpersonal authority is reflected in changing patterns of language use.
Article outline
1. Introduction
2. Five pitfalls in the analysis of diachronic corpus data
2.1 Corpus frequencies (semasiological frequencies) are not always equivalent to frequencies of entities and events in the real world (onomasiological frequencies)
2.2 Corpus frequencies of polysemous words need to be broken down into sense-specific and construction-specific frequencies
2.3 Correlations in large datasets may be spurious
2.4 Comparisons of frequency trends in diachronic corpora require adequate statistical treatment
2.5 It is not always easy to disentangle social change and linguistic change
3. Giving in to temptation: A case study of the English make-causative
3.1 The English make-causative construction
3.2 Corpus data and descriptive statistics
3.3 Using distributional semantics to study the development of the make-causative