Short term diachronic shifts in part-of-speech frequencies
A comparison of the tagged LOB and F-LOB corpora
Christian Mair | Freiburg University, Germany
Marianne Hundt | Freiburg University, Germany
Geoffrey N. Leech | Lancaster University, UK
Nicholas Smith | Lancaster University, UK
The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.
Keywords: nominal style, corpus, British English (Modern), part-of-speech tagging, (recent) diachronic change
Published online: 04 April 2003
https://doi.org/10.1075/ijcl.7.2.05mai
https://doi.org/10.1075/ijcl.7.2.05mai
Cited by
Cited by 20 other publications
Anderwald, Lieselotte & Susanne Wagner
DE SMET, HENDRIK & EVELYN VANCAYZEELE
Elsness, Johan
Engwall, Lars, Enno Aljets, Tina Hedmo & Raphaël Ramuz
Engwall, Lars, Enno Aljets, Tina Hedmo & Raphaël Ramuz
González-Díaz, Victorina
Partington, Alan
Saily, T., T. Nevalainen & H. Siirtola
Stajner, Sanja, Ruslan Mitkov & Geoffrey Leech
Szmrecsanyi, Benedikt
Säily, Tanja, Turo Vartiainen & Harri Siirtola
Tagliamonte, Sali A.
Yadava, Yogendra P., Andrew Hardie, Ram Raj Lohani, Bhim N. Regmi, Srishtee Gurung, Amar Gurung, Tony McEnery, Jens Allwood & Pat Hall
Yao, Xinyue & Peter Collins
Zeldes, Amir
Štajner, Sanja & Richard Evans
This list is based on CrossRef data as of 10 april 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.