The nature of graphs and graphemes in Middle Dutch writing and the problem of parsing

Wulf, Chris De

doi:10.1075/nowele.00069.wul

Article published In:

NOWELE
Vol. 75:2 (2022) ► pp.223–256

The nature of graphs and graphemes in Middle Dutch writing and the problem of parsing

Chris De Wulf | Universität Zürich | Vrije Universiteit Brussel

This article focuses on the practical problems that go along with the sequencing of older non-standardised spelling forms into consonant and vowel graphemes. The issue of segmental parsing is important in research on the development of Dutch spelling, mostly drawing from thirteenth and fourteenth century charter spellings. The research aims are introduced in part 1, and then the problems of defining graphs and then syllables are discussed in part 2. These problems stem from the theoretical graphemic level, but have a very direct impact on the practical level, as they hinder the automatic parsing of tokens in the corpus, as discussed in part 3. The goal of this article is to provide partial solutions in parsing of non-standardised language data for graphemic research.

Article outline

1.Introduction
- 1.1Development of Dutch Orthography
- 1.2Scope of the project
- 1.3Problems
2.Theoretical outset
- 2.1Terminology
  - 2.1.1Grapheme vs. graph
  - 2.1.2Graph base vs. graph extension
  - 2.1.3Syllable opposition
- 2.2Segmentation of graphs/graphemes
3.Finding graphs in the Middle Dutch text corpus
- 3.1Letter-based graph parsing
- 3.2Syllable-based graph parsing
- 3.3Lemma-based graph parsing
4.Summary and outlook
Acknowledgements
Notes
References

Published online: 22 November 2022

https://doi.org/10.1075/nowele.00069.wul

References (70)

Bach, H.

1937–1943: Die thüringisch-sächsische Kanzleisprache bis 1325. Bd. I: Vokalismus. Bd. II: Druckschwache Silben. Konsonantismus. Formenlehre. Copenhagen: Levin & Munksgaard/E. Munksgaard.

Beaulieux, C.

1967 Histoire de l’orthographe française. Nouveau tirage. Tome Premier: formation de l’orthographe des origins au milieu du XVIe siècle. Tome second: les accents et autres signes auxiliaires. Paris: Champion.

Benders, J. F.

2002 Bestuursstructuur en schrijfcultuur. Een analyse van de bestuurlijke verschriftelijking in Deventer tot het eind van de 15de eeuw. PhD dissertation, University of Groningen, Groningen. [URL] (26 April, 2022).

Benskin, M.

1997 Texts from a township in late medieval Ireland. Collegium Medievale 101. 91–173.

2001 The language of the English texts. In T. Hunt (ed.), Three receptaria from Medieval England, 193–230. Oxford: Society for the Study of Medieval Languages and Literature.

Benskin, M. & M. Laing

2013– An electronic version of a Linguistic Atlas of Late Mediaeval English with webscripts by V. Karaiskos & K. Williamson. Edinburgh: University of Edinburgh. [URL] (29 July, 2022).

Berteloot, A.

1984 Bijdrage tot een klankatlas van het dertiende-eeuwse Middelnederlands. 21 Parts. Gent: Leonard-Willems-Halettfonds.

Boesch, B.

1946 Untersuchungen zur alemannischen Urkundensprache des 15. Jahrhunderts. Laut- und Formenlehre. Bern: Francke.

Bouma, G. & B. J. H. Hermans

2012 Syllabification in Middle Dutch. In F. Mambrini, M. Passarotti & C. Sporleder (eds.), Proceedings of the second workshop on annotation of corpora for research in the humanities (ACRH-2), 27–38. Lisbon: Edições Colibri.

Catach, N.

2001/2008 Histoire de l’orthographe française. Édition posthume, réalisée par Renée Honvault avec la collaboration de Irène Rosier-Catach. Paris: Champion.

Condorelli, M.

2020 From the early modern era to an international research area. In M. Condorelli (ed.), Advances in historical orthography, c. 1500–1800, 1–15. Cambridge: Cambridge University Press.

CPWNL = Corpus Materiaal Pieter Willems

1880– Compiled by P. Willems. [URL] (22 November, 2021).

CGR13 = Corpus of 13th century Dutch charters

1977– Compiled by M. Gysseling, W. J. J. Pijnenburg & P. van Reenen. [URL] (25 June, 2021–29 July, 2022).

CRM14 = Corpus of 14th century Dutch charters

1993– Compiled by P. van Reenen & M. Mulder. [URL] (25 June, 2021–29 July, 2022).

De Wulf, C.

2014 Van af naar van . Amsterdamer Beiträge zur älteren Germanistik 711. 149–192.

2018 Een veertiende-eeuwse oorkonde uit Saaftinge of Kortrijk. In T. Colleman, J. De Caluwe, V. De Tier, A.-S. Ghyselen, L. Triest, R. Vandenberghe & U. Vogl. (eds.), Woorden om te bewaren. Huldeboek voor Jacques Van Keymeulen, 387–406. Gent: Skribis.

2019 Klankatlas van het veertiende-eeuwse Middelnederlands. Het dialectvocalisme in de spelling van lokale oorkonden. Gent: Koninklijke Academie voor Nederlandse Taal en Letteren. With online supplements at [URL] (26 April–29 July, 2022).

2021 Spellingonderzoek in ontwikkeling. Taal & Tongval 731. 111–136.

De Wulf, C. & K. Bischof

2021 Der Bochouter Schöffenbrief (1249). Im Vergleich mit den privatrechtlichen Urkundendatenbanken in der niederländischen historischen Sprachwissenschaft. In S. Höffler & K. Müller (eds.), Rechtsetzung als Textkonstitution. Sprachwissenschaftliche Zugänge zu modernen und historischen Rechtsquellen, 211–242. Zürich/St.Gallen: Dike Verlag.

De Wulf, C., J. Goossens, J. Taeldeman & G. Verleyen

1998–2005 Fonologische Atlas van de Nederlandse Dialecten (FAND). Part I–IV1. Gent: Koninklijke Academie voor Nederlandse Taal- en Letterkunde.

Dürscheid, C.

2016 Einführung in die Schriftlinguistik. Göttingen: Vandenhoeck & Ruprecht.

Elmentaler, M.

2003 Struktur und Wandel vormoderner Schreibsprachen. Berlin: De Gruyter.

2018 Historische Graphematik des Deutschen. Tübingen: Narr.

FAND = C. De Wulf et al.

1998–2005.

Fleischer, W.

1969 Die Entwicklung des neuhochdeutschen Graphemsystems. In E. Agricola, W. Fleischer & H. Protze (eds.), Kleine Enzyklopädie. Die deutsche Sprache. Erster Band, 228–234. Leipzig: VEB Bibliographisches Institut.

Fuhrhop, N. & J. Peters

2013 Einführung in die Phonologie und Graphematik. Stuttgart: Metzler.

Gallmann, P.

1985 Graphische Elemente der geschriebenen Sprache. Grundlagen für eine Reform der Orthographie. Tübingen: Niemeyer.

Glaser, E.

1985 Graphische Studien zum Schreibsprachwandel vom 13. bis 16. Jahrhundert. Vergleich verschiedener Handschriften des Augsburger Stadtbuches. Heidelberg: Winter Verlag.

Greule, A., J. Meier & A. Ziegler

(eds.) 2012 Kanzleisprachenforschung. Ein internationales Handbuch. Berlin: De Gruyter.

Haverals, W., F. Karsdorp & M. Kestemont

2019 Data-driven syllabification for Middle Dutch. Digital Medievalist 12(1). 1–23.

Hoebeke, M.

1968 De Middeleeuwse oorkondentaal te Oudenaarde: 13e eeuw–± 1500 (spelling en klankleer). Gent: Secretariaat van de Koninklijke Vlaamse Academie voor Taal- en Letterkunde.

Kettmann, G.

1967 Die kursächsische Kanzleisprache zwischen 1486 und 1546. Studien zum Aufbau und zur Entwicklung. Berlin: Akademie-Verlag.

LAEME = M. Laing

2013–.

Laing, M.

2013– A Linguistic Atlas of Early Middle English, 1150–1325 (LAEME), Version 3.2 (With R. Lass (Introduction) and webscripts by K. Willamson, V. Karaiskos & S. Branchaw.) Edinburgh: University of Edinburgh. [URL] (29 July, 2022).

Laing, M. & R. Lass

2003 Tales of the 1001 nists: the phonological implications of litteral substitution sets in some thirteenth-century South-West Midlands texts. English Language and Linguistics 7(2). 257–278.

Laing, M., R. Lass

2006 Early Middle English dialectology: Problems and prospects. In A. van Kemenade & B. Los (eds.), The handbook of the history of English, 417–451. Oxford: Blackwell.

LALME = A. McIntosh et al.

1986.

Larsen, N.-E.

2001 Grafematische analyse van een Middelnederlandse tekst. Het grafeemsysteem van de Vroegmiddelnederlandse statuten van de Gentse leprozerie uit 1236. Amsterdam: Rozenberg.

Lass, R. & M. Laing

2013 The early Middle English reflexes of Germanic *ik ‘I’: Unpacking the changes. Folia Linguistica Historica 341. 93–113.

Maas, U.

1997 Orthographische Regularitäten, Regeln und ihre Deregulierung. Am Beispiel der Dehnungszeichen im Deutschen. In G. Augst, K. Blüml, D. Nerius & H. Sitta (eds.), Zur Neuregelung der deutschen Orthographie. Begründung und Kritik, 337–364. Tübingen: Niemeyer.

Marynissen, A., D. Bock & A. Terhalle

2021 Op weg naar een geschreven eenheidstaal: de ontdialectisering van de schrijftaal bij Gheraert Leeu, drukker in Gouda en Antwerpen. Taal & Tongval 731. 245–295.

Marynissen, A. & T. A. J. M. Janssen

2014 Vroegnieuwnederlands voor commercie en cultuur. In F. Van de Velde, H. Smessaert, F. Van Eynde & S. Verbrugge (eds.), Patroon en argument. Een dubbelfeestbundel bij het emeritaat van William Van Belle en Joop van der Horst, 500–512. Leuven: Universitaire Pers.

McIntosh, A., M. L. Samuels & M. Benskin

(eds.) 1986 A Linguistic Atlas of Late Mediaeval English (LALME) (41 vols.). Aberdeen: Aberdeen University Press.

Meijering, H.

1974 De willekeuren van de Opstalsboom (1323). Een filologisch-historische monografie. Groningen: V.R.B. Offsetdrukkerij.

Meletis, D.

2015 Graphetik. Form und Materialität von Schrift. Glückstadt: Verlag Werner Hülsbusch.

Meletis, D. & C. Dürscheid

2022 Writing systems and their use. An overview of grapholinguistics. (Trends in Linguistics. Studies and Monographs 369). Berlin: De Gruyter.

(5 August, 2022).

Middelnederlands.nl = CGR13 + CRM14

[URL] (25 June, 2021–29 July, 2022).

Mooijaart, M. A.

1992 Atlas van Vroegmiddelnederlandse taalvarianten. Utrecht: LEd.

Moser, H.

1977 Die Kanzlei Kaiser Maximilians I. Graphematik eines Schreibusus. Teil I: Untersuchungen. Teil II: Texte. Innsbruck: Institut für Deutsche Philologie der Universität Innsbruck.

1987 Geredete Graphie. Zur Entstehung orthoepischer Normvorstellungen im Frühneuhochdeutschen. Zeitschrift für deutsche Philologie 1061. 379–399.

Moser, V.

1929 Frühneuhochdeutsche Grammatik / 1,1 1, Lautlehre; 1: Orthographie, Betonung, Stammsilbenvokale. Heidelberg: Winter.

Neef, M.

2005 Die Graphematik des Deutschen. Tübingen: Niemeyer.

Neef, M. & B. Primus

2001 Stumme Zeugen der Autonomie. Eine Replik auf Ossner. Linguistische Berichte 1871. 353–378.

Nunn, A. M.

1998 Dutch orthography; A systematic investigation of the spelling of Dutch words. Den Haag: LOT.

Paradis, C. & J.-F. Prunet

2000 Nasal vowels as two segments: Evidence from borrowings. Language 761. 324–357.

Peersman, C.

2007 De opkomst van de volkstalen in de oorkonden van de abdij van Ninove. Casestudy van een bewogen eeuw (1250–1350). Handelingen van de Koninklijke Zuid-Nederlandse Maatschappij voor Taal- en Letterkunde en Geschiedenis LXI1 2007 (appeared 2008). 129–145.

Pijnenburg, W. J. J.

1997 Vroegmiddelnederlands (circa 1200–1350). In M. C. van den Toorn, W. J. J. Pijnenburg, J. A. van Leuvensteijn & J. M. van der Horst (eds.), Geschiedenis van de Nederlandse taal, 69–145. Amsterdam: Amsterdam University Press.

Pleij, H.

2010 Printing as a long-term revolution. In H. Wijsman (ed.), Books in transition at the time of Philip the Fair: Manuscripts and printed books in the late fifteenth and early sixteenth century Low Countries, 297–307. Turnhout: Brepols.

Pletinckx, L.

2003 Woordenboek van het Asses: bijdrage tot de studie van de West-Brabants dialect. Asse: Koninklijke heemkring Ascania.

Rem, M.

2003 De taal van de klerken uit de Hollandse grafelijke kanselarij (1300–1340). Naar een lokaliseringsprocedure voor het veertiende-eeuwse Middelnederlands. Amsterdam: Stichting Neerlandistiek VU.

Rutten, G., R. Vosters, W. Vandenbussche & M. van der Wal

2014 Norms and usage in language history, 1600–1900: A sociolinguistic and comparative perspective. Amsterdam: Benjamins.

Schmitt, L. E.

1936 Die deutsche Urkundensprache in der Kanzlei Kaiser Karls IV. (1346–1378). Halle/Salle: Niemeyer.

Smessaert, H. & W. Decoster

2019 Basisbegrippen fonetiek en fonologie. 4th edn. Leuven: Acco.

Van Bree, C.

2016 Leerboek voor de historische grammatica van het Nederlands. Deel 1: Gotisch grammatica, inleiding, klankleer. 2nd edn. [URL] (26 April, 2022).

Vandenbussche, W.

2007 Shared standardization factors in the history of sixteen Germanic languages. In C. Fandrych & R. Salverda (eds.), Standard, Variation und Sprachwandel in germanischen Sprachen, 25–36. Tübingen: Narr.

Van Reenen, P. Th. & M. Mulder

1993 Een gegevensbank van 14de-eeuwse Middelnederlandse dialecten op computer. Lexikos 31. 259–279.

Van Loey, A.

1949 Middelnederlandse spraakkunst. 2. Klankleer. Groningen: Wolters.

Van Loon, J.

2014 Historische fonologie van het Nederlands. Antwerpen: Universitas.

Voeste, A.

2008 Orthographie und Innovation. Die Segmentierung des Wortes im 16. Jahrhundert. Hildesheim/New York: Olms.

Wolf, N.

1975 Regionale und überregionale Norm im späten Mittelalter. Graphematische und lexikalische Untersuchungen zu deutschen und niederländischen Schriftdialekten. Innsbruck: Inst. für dt. Philologie der Univ.