Short term diachronic shifts in part-of-speech frequencies: A comparison of the tagged LOB and F-LOB corpora

Mair, Christian; Hundt, Marianne; Leech, Geoffrey N.; Smith, Nicholas

doi:10.1075/ijcl.7.2.05mai

Article published In:

International Journal of Corpus Linguistics
Vol. 7:2 (2002) ► pp.245–264

Short term diachronic shifts in part-of-speech frequencies

A comparison of the tagged LOB and F-LOB corpora

Christian Mair | Freiburg University, Germany

Marianne Hundt | Freiburg University, Germany

Geoffrey N. Leech | Lancaster University, UK

Nicholas Smith | Lancaster University, UK

The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.

Keywords: nominal style, corpus, British English (Modern), part-of-speech tagging, (recent) diachronic change

Published online: 4 April 2003

https://doi.org/10.1075/ijcl.7.2.05mai

Cited by (23)

Cited by 23 other publications

Order by:

Schneider, Gerold

2022. Chapter 7. Syntactic changes in verbal clauses and noun phrases from 1500 onwards. In English Historical Linguistics [Current Issues in Linguistic Theory, 358], ► pp. 164 ff.

Mair, Christian & Geoffrey N. Leech

2020. Current Changes in English Syntax. In The Handbook of English Linguistics, ► pp. 249 ff.

Yao, Xinyue & Peter Collins

2019. Developments in Australian, British, and American English Grammar from 1931 to 2006: An Aggregate, Comparative Approach to Dialectal Variation and Change. Journal of English Linguistics 47:2 ► pp. 120 ff.

González-Díaz, Victorina

2018. Chapter 3. Adjective stacking in Early Modern English. In The Noun Phrase in English [Linguistik Aktuell/Linguistics Today, 246], ► pp. 47 ff.

Koplenig, Alexander

2018. Using the parameters of the Zipf–Mandelbrot law to measure diachronic lexical, syntactical and stylistic changes – a large-scale corpus analysis. Corpus Linguistics and Linguistic Theory 14:1 ► pp. 1 ff.

Berg, Thomas

2017. Compounding in German and English. Languages in Contrast 17:1 ► pp. 43 ff.

Křen, Michal

2017. Grammatical Change Trends in Contemporary Czech Newspapers. Journal of Linguistics/Jazykovedný casopis 68:2 ► pp. 238 ff.

Säily, Tanja, Turo Vartiainen & Harri Siirtola

2017. Exploring part-of-speech frequencies in a sociohistorical corpus of English. In Exploring Future Paths for Historical Sociolinguistics [Advances in Historical Sociolinguistics, 7], ► pp. 23 ff.

DE SMET, HENDRIK & EVELYN VANCAYZEELE

2015. Like a rolling stone: the changing use of English premodifying present participles. English Language and Linguistics 19:1 ► pp. 131 ff.

Elsness, Johan

2014. The present perfect and the preterite in Late Modern and Contemporary English. In Corpus Interrogation and Grammatical Patterns [Studies in Corpus Linguistics, 63], ► pp. 81 ff.

Engwall, Lars, Enno Aljets, Tina Hedmo & Raphaël Ramuz

2014. Computer Corpus Linguistics: An Innovation in the Humanities. In Organizational Transformation and Scientific Change: The Impact of Institutional Restructuring on Universities and Intellectual Innovation [Research in the Sociology of Organizations, 42], ► pp. 331 ff.

Engwall, Lars, Enno Aljets, Tina Hedmo & Raphaël Ramuz

2014. Computer Corpus Linguistics: An Innovation in the Humanities. In Organizational Transformation and Scientific Change: The Impact of Institutional Restructuring on Universities and Intellectual Innovation [Research in the Sociology of Organizations, 42], ► pp. 331 ff.

Jatowt, Adam & Kevin Duh

2014. IEEE/ACM Joint Conference on Digital Libraries, ► pp. 229 ff.

Stajner, Sanja, Ruslan Mitkov & Geoffrey Leech

2013. Natural Language Processing methodology for tracking diachronic changes in the 20th century English language. Journal of Research Design and Statistics in Linguistics and Communication Science 1:1 ► pp. 71 ff.

Štajner, Sanja & Richard Evans

2013. Can Statistical Tests Be Used for Feature Selection in Diachronic Text Classification?. In Statistical Language and Speech Processing [Lecture Notes in Computer Science, 7978], ► pp. 273 ff.

Saily, T., T. Nevalainen & H. Siirtola

2011. Variation in noun and pronoun frequencies in a sociohistorical corpus of English. Literary and Linguistic Computing 26:2 ► pp. 167 ff.

Partington, Alan

2010. Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS) on UK newspapers: an overview of the project. Corpora 5:2 ► pp. 83 ff.

Zeldes, Amir

2010. Tony McEnery, Richard Xiao & Yukio Tono. 2006. Corpus-Based Language Studies. An Advanced Resource Book (Routledge Applied Linguistics). London, New York: Routledge. xx, 386 S. Zeitschrift für Rezensionen zur germanistischen Sprachwissenschaft 2:2

Szmrecsanyi, Benedikt

2009. Typological parameters of intralingual variability: Grammatical analyticity versus syntheticity in varieties of English. Language Variation and Change 21:3 ► pp. 319 ff.

Yadava, Yogendra P., Andrew Hardie, Ram Raj Lohani, Bhim N. Regmi, Srishtee Gurung, Amar Gurung, Tony McEnery, Jens Allwood & Pat Hall

2008. Construction and annotation of a corpus of contemporary Nepali. Corpora 3:2 ► pp. 213 ff.

Anderwald, Lieselotte & Susanne Wagner

2007. FRED — The Freiburg English Dialect Corpus: Applying Corpus-Linguistic Research Tools to the Analysis of Dialect Data. In Creating and Digitizing Language Corpora, ► pp. 35 ff.

Tagliamonte, Sali A.

2006. “So cool, right?”: Canadian English Entering the 21st Century. Canadian Journal of Linguistics/Revue canadienne de linguistique 51:2-3 ► pp. 309 ff.

[no author supplied]

2014. Educated Fiji English [Varieties of English Around the World, G47],

This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.