Chapter 7
Syntactic changes in verbal clauses and noun phrases from 1500 onwards
Can the promise of data-driven methods hold in historical linguistics? Can they detect salient syntactic changes and open new research avenues? I first use data-driven measures to detect patterns in the ARCHER corpus. Secondly, I qualitatively interpret the differences and build hypotheses. Thirdly, I validate these, with the Penn Corpora, investigating frequency and creativity. We observe several trends – among them: verbal (‘Doric’) style is decreasing, nominal (‘Attic’) style is increasing. A cascade from full to non-finite clause, and from paratactic to hypotactic style unfolds. Furthermore, constituent order is increasingly becoming fixed, strengthening the principle of dependency length minimisation. While data-driven approaches entail a complex interpretation step, their holistic perspective goes beyond well-trodden envelopes of variation to more global language models.
Article outline
- 1.Introduction
- 2.Preparing the corpora
- 2.1Data
- 2.2Methods of data preparation
- 2.3Evaluation of spelling normalisation, tagger and parser
- 2.4Querying the manually annotated corpora
- 3.Findings from the diagnostic step on ARCHER
- 4.Participial clauses
- 5.From verbal to nominal
- 5.1Verb/noun ratio
- 5.2Verb and noun modification
- 5.3Changes in productivity
- 6.Noun compression
- 7.From full clause to non-finite clause
- 8.Information structure
- 9.Conclusions
- Author queries
-
Notes
-
References
References
Ananiadou, Sophia, Douglas B. Kell & Jun-ichi Tsujii
2006 Text mining and its potential applications in systems biology.
Trends in Biotechnology 24(12). 571–579.
Arppe, Antti, Gaëtanelle Gilquin, Dylan Glynn, Martin Hilpert & Arne Zeschel
2010 Cognitive corpus linguistics: Five points of debate on current theory and methodology.
Corpora 5(1). 1–27.
Baron, Alistair & Paul Rayson
2008 VARD 2: A tool for dealing with spelling variation in historical corpora.
Proceedings of the Postgraduate Conference in Corpus Linguistics. Birmingham, UK: Aston University.
Behaghel, Otto
1909 Beziehungen zwischen Umfang und Reihenfolge von Satzgliedern.
Indogermanische Forschungen 25. 110–142.
Biber, Douglas
2003 Compressed noun-phrase structures in newspaper discourse: The competing demands of popularization vs. economy. In
Jean Aitchison &
Diana Lewis (eds.),
New media language, 169–181. London: Routledge.
Biber, Douglas, Edward Finegan & Dwight Atkinson
1994 ARCHER and its challenges: Compiling and exploring a representative corpus of historical English registers. In
Udo Fries,
Peter Schneider &
Gunnel Tottie (eds.),
Creating and using English language corpora: Papers from the 14th International Conference on English Language Research on Computerized Corpora, Zurich 1993, 1–13. Amsterdam: Rodopi.
Bresnan, Joan, Anna Cueni, Tatiana Nikitina & Harald Baayen
2007 Predicting the dative alternation. In
Gerlof Bouma,
Irene Krämer &
Joost Zwarts (eds.),
Cognitive foundations of interpretation, 69–94. Amsterdam: Royal Netherlands Academy of Science.
De Smet, Hendrik
2013 Spreading patterns: Diffusional change in the English system of complementation (
Oxford Studies in the History of English). New York: Oxford University Press.
Fanego, Teresa
2010 Variation in sentential complements in eighteenth- and nineteenth-century English: A processing-based explanation. In
Raymond Hickey (ed.),
Eighteenth-century English: Ideology and change, 200–220. Cambridge: Cambridge University Press.
Fonteyn, Lauren & Hubert Cuyckens
2013 The development of free adjuncts in English and Dutch.
Leuven Working Papers in Linguistics 2. 160–195.
Futrell, Richard, Kyle Mahowald & Edward Gibson
2015 Large-scale evidence of dependency length minimization in 37 languages.
Proceedings of the National Academy of Sciences of the USA 112(33). 10336.
Gildea, Daniel & David Temperley
2010 Do grammars minimize dependency length? Cognitive Science 34(2). 286–310.
Givón, Talmy
1979 On understanding grammar. New York: Academic Press.
Green, Clarence
2017 Patterns and developments in the English clause system. Singapore: Springer Nature.
Gulordava, Kristina & Paola Merlo
2015 Diachronic trends in word order freedom and dependency length in dependency-annotated corpora of Latin and Ancient Greek. In
Proceedings of the 3rd International Conference on Dependency Linguistics (Depling 2015), 121–130. Uppsala: Uppsala University.
Halliday, Michael A. K.
2001 Literacy and linguistics: Relationships between spoken and written language. In
Anne Burns &
Caroline Coffin (eds.),
Analyzing English in a global context: A reader, 181–193. New York: Routledge.
Halliday, Michael A. K.
2004 The language of science. Collected works of
M. A. K. Halliday, Volume 5 (
Jonathan J. Webster, ed.). London: Continuum.
Hawkins, John A.
1994 A performance theory of order and constituency. Cambridge: Cambridge University Press.
Hawkins, John A.
2004 Efficiency and complexity in grammars. Oxford: Oxford University Press.
Hilpert, Martin & Stefan Th. Gries
2017 Quantitative approaches to diachronic corpus linguistics. In
Merja Kytö &
Päivi Pahta (eds.),
The Cambridge handbook of English historical linguistics, 36–53. Cambridge: Cambridge University Press.
Hundt, Marianne, David Denison & Gerold Schneider
2012 Retrieving relatives from historical data.
Literary and Linguistic Computing 27(1). 3–16.
Ibaraki, Seishirou
2009 The development of the determiner system in the history of English.
English Linguistics 26. 67–95.
Kastovsky, Dieter
2006 Vocabulary. In
Richard Hogg &
David Denison (eds.),
A history of the English language, 199–270. Cambridge: Cambridge University Press.
Kroch, Anthony, Ann Taylor & Beatrice Santorini
2000 The Penn-Helsinki Parsed Corpus of Middle English (PPCME2). Department of Linguistics, University of Pennsylvania. CD-ROM, 2nd edn.,
release 4.
Kroch, Anthony, Beatrice Santorini & Lauren Delfs
2004 The Penn-Helsinki Parsed Corpus of Early Modern English (PPCEME). Department of Linguistics, University of Pennsylvania. CD-ROM, 1st edn.,
release 3.
Kroch, Anthony, Beatrice Santorini & Ariel Diertani
2016 The Penn Parsed Corpus of Modern British English (PPCMBE2). Department of Linguistics, University of Pennsylvania. CD-ROM, 2nd edn.,
release 1.
Leech, Geoffrey, Marianne Hundt, Christian Mair & Nicholas Smith
2009 Change in contemporary English: A grammatical study. Cambridge: Cambridge University Press.
Lehmann, Hans Martin & Gerold Schneider
2012 BNC Dependency Bank 1.0. In
Signe Oksefjell Ebeling,
Jarle Ebeling &
Hilde Hasselgård (eds.),
Studies in variation, contacts and change in English, Volume 12: Aspects of corpus linguistics: compilation, annotation, analysis. Helsinki: VARIENG.
Levy, Roger & Galen Andrew
2006 Tregex and Tsurgeon: Tools for querying and manipulating tree data structures. In
Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006). Genoa, Italy: European Language Resources Association.
Levy, Roger & Florian T. Jaeger
2007 Speakers optimize information density through syntactic reduction. In
Bernhard Schölkopf,
John C. Platt &
Thomas Hofmann (eds.),
Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, 849–856. Cambridge, MA: MIT Press.
Los, Bettelou
2005 The rise of the to-infinitive. Oxford: Oxford University Press.
Los, Bettelou
2012 The Loss of verb-second and the switch from bounded to unbounded systems. In
Anneli Meurman-Solin,
Maria José López-Couso &
Bettelou Los (eds.),
Information structure and syntactic change in the history of English (
Oxford Studies in the History of English), 21–46. Oxford: Oxford University Press.
Los, Bettelou & Gea Dreschler
2012 The loss of local anchoring: From adverbial local anchors to permissive subjects. In
Terttu Nevalainen &
Elizabeth C. Traugott (eds.),
The Oxford handbook of the history of English (
Oxford Handbooks in Linguistics), 859–871. New York: Oxford University Press.
Mair, Christian
2006 The 19th century as a critical period in the formation of the modern English system of nonfinite complement clauses: The case of remember. In
Merja Kytö &
Erik Smitterberg (eds.),
Nineteenth-century English: Stability and change, 215–228. Cambridge: Cambridge University Press.
Mair, Christian, Marianne Hundt, Geoffrey N. Leech & Nicholas Smith
Marcus, Mitchell P., Beatrice Santorini & Mary Ann Marcinkiewicz
1993 Building a large annotated corpus of English: The Penn Treebank.
Computational Linguistics 19. 313–330.
Nevalainen, Terttu
2006 An introduction to Early Modern English. Edinburgh: Edinburgh University Press.
Rissanen, Matti
1999 Syntax. In
Roger Lass (ed.),
The Cambridge history of the English language, Volume 3: Early Modern English 1476–1776, 187–331. Cambridge: Cambridge University Press.
Rohdenburg, Günter
1995 On the replacement of finite complement clauses by infinitives in English.
English Studies 76. 367–388.
Säily, Tanja, Terttu Nevalainen & Harri Siirtola
2011 Variation in noun and pronoun frequencies in a sociohistorical corpus of English.
Literary and Linguistic Computing 26. 167–188.
Schmid, Helmut
1994 Probabilistic part-of-speech tagging using decision trees. In
Proceedings of the International Conference on New Methods in Language Processing, Manchester, UK,
[URL]
Schneider, Gerold
2008 Hybrid long-distance functional dependency parsing. Zurich: University of Zurich Doctoral thesis.
Schneider, Gerold & Gintare Grigonyte
Schneider, Gerold, Marianne Hundt & Rahel Oppliger
2016 Part-of-speech in historical corpora: Tagger evaluation and ensemble systems on ARCHER. In
Proceedings of KONVENS 2016. Bochum, Germany: Bochumer Linguistische Arbeitsberichte.
Schneider, Gerold, Hans Martin Lehmann & Peter Schneider
2015 Parsing Early Modern English corpora.
Digital Scholarship in the Humanities 30(3). 423–439.
Schneider, Gerold, Eva Pettersson & Michael Percillier
2017 Comparing rule-based and SMT-based spelling normalisation for English historical texts. In
Gerlof Bouma &
Yvonne Adesam (eds.),
Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language, 40–46. Linköping: Linköping University Electronic Press.
Shannon, Claude E.
1951 Prediction and entropy of printed English.
The Bell System Technical Journal 30. 50–64.
Sinclair, John McHardy & Ronald Carter
2004 Trust the text: Language, corpus and discourse. Routledge, London.
Yamashita, Hiroko & Franklin Chang
2001 ‘Long before short’ preference in the production of a head-final language.
Cognition 81. B45–B55.
Zehentner, Eva Maria
2019 Competition in language change: The rise of the English dative alternation. Berlin: De Gruyter.
Cited by
Cited by 1 other publications
Zehentner, Eva, Marianne Hundt, Gerold Schneider & Melanie Röthlisberger
This list is based on CrossRef data as of 20 march 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.