Ján Mačutek | Mathematical Institute of Slovak Academy of Sciences | Constantine the Philosopher University in Nitra
Ten chapters from a Russian novel and its translations into Croatian, Serbian, and Ukrainian are automatically syllabified following the same approach in all four languages. Syllable frequencies and syllable length are modelled by probability distributions which are commonly used for frequency and length of words (the Zipf-Mandelbrot distribution and the Dacey-Poisson distribution, respectively). We show that Zipf’s law of brevity, according to which the more frequent words tend to be shorter, can be extended to syllables. We suggest a generalization of the Menzerath-Altmann law, a relation between word length and the mean syllable length. The generalized version of the law is valid for both word types and word tokens.
Altmann, Gabriel. 1980. Prolegomena to Menzerath’s law. In Rüdiger Grotjahn (ed.), Glottometrika 2 (Quantitative Linguistics 3), 1–10. Bochum: Brockmeyer.
Altmann, Gabriel & Michael H. Schwibbe. 1989. Das Menzerathsche Gesetz in informationsverarbeitenden Systemen. Hildesheim: Georg Olms.
Antić, Gordana, Emmerich Kelih & Peter Grzybek. 2006. Zero-syllable words in determining word length. In Peter Grzybek (ed.), Contributions to the science of text and language. Word length studies and related issues (Text, Speech and Language Technology 31), 117–156. Dordrecht: Springer.
Bentz, Christian & Ramon Ferrer-i-Cancho. 2016. Zipf’s law of abbreviation as a language universal. In Christian Bentz, Gerhard Jäger & Igor Yanovich (eds.), Proceedings of the Leiden workshop on capturing phylogenetic algorithms for linguistics. Tübingen: University of Tübingen. [URL]. (17January, 2020.)
Best, Karl-Heinz. 2005. Wortlänge. In Reinhard Köhler, Gabriel Altmann & Rajmund G. Piotrowski (eds.), Quantitative linguistics. An international handbook (Handbooks of Linguistics and Communications Science 27), 260–273. Berlin: de Gruyter.
Blevins, Juliette. 1995. The syllable in the phonological theory. In John Goldsmith (ed.), The handbook of phonological theory, 206–244. Oxford: Blackwell.
Cairns, Charles & Eric Raimy. 2011. Introduction. In Charles E. Cairns & Eric Raimy (eds.), Handbook of the syllable (Brill’s Handbooks in Linguistics 1), 1–30. Leiden: Brill.
Casas, Bernardino, Antoni Hernández-Fernández, Neus Català, Ramon Ferrer-i-Cancho & Jaume Baixeries. 2019. Polysemy and brevity versus frequency in language. Computer Speech & Language 58. 19–50.
Clements, George N.1990. The role of the sonority cycle in core syllabification. In John Kingston & Mary E. Beckman (eds.), Papers in laboratory phonology I: Between the grammar and the physics of speech, 283–333. Cambridge: Cambridge University Press.
Cramer, Irene M.2005. Das Menzerathsche Gesetz. In Reinhard Köhler, Gabriel Altmann & Rajmund G. Piotrowski (eds.), Quantitative linguistics. An international handbook (Handbooks of Linguistics and Communications Science 27), 659–688. Berlin: de Gruyter.
Ferrer-i-Cancho, Ramon & Antoni Hernández-Fernández. 2013. The failure of the law of brevity in two New World primates. Statistical caveats. Glottotheory 4(1). 45–55.
Grzybek, Peter. 1999. Randbemerkungen zur Korrelation von Wort- und Silbenlänge im Kroatischen. In Branko Tošović (ed.), Die grammatischen Korrelationen, 67–77. Graz: Institut für Slawistik.
Haugen, Einar. 1956. The syllable in linguistic description. In Morris Halle, Horace G. Lunt, Hugh McLean & Cornelis H. van Schooneveld (eds.), For Roman Jakobson: Essays on the occasion of his sixtieth birthday, 213–221. The Hague: Mouton.
Hernández-Fernández, Antoni, Bernardino Casas, Ramon Ferrer-i-Cancho & Jaume Baixeries. 2016. Testing the robustness of laws of polysemy and brevity versus frequency. In Pavel Král & Carlos Martín-Vide (eds.), Statistical language and speech processing (Lecture Notes in Computer Science 9918), 19–29. Cham: Springer.
Izsák, János. 2006. Some practical aspects of fitting and testing the Zipf-Mandelbrot model. A short essay. Scientometrics 65. 107–120.
Kelih, Emmerich. 2009. Slawisches Parallel-Textkorpus: Projektvorstellung von “Kak zakaljalas’ stal’ (KZS)”. In Emmerich Kelih, Viktor Levickij & Gabriel Altmann (eds.), Methods of text analysis, 106–124. Chernivtsi: ČNU.
Kelih, Emmerich. 2010. Parameter interpretation of Menzerath’s law: Evidence from Serbian. In Peter Grzybek, Emmerich Kelih & Ján Mačutek (eds.), Text and language. Structures, functions, interrelations, quantitative perspectives, 71–78. Vienna: Praesens.
Kelih, Emmerich. 2012. Die Silbe in slawischen Sprachen. Von der Optimalitätstheorie zu einer funktionalen Interpretation (Specimina Philologiae Slavicae 168). Munich: Otto Sagner.
Köhler, Reinhard. 2005. Synergetic linguistics. In Reinhard Köhler, Gabriel Altmann & Rajmund G. Piotrowski (eds.), Quantitative linguistics. An international handbook (Handbooks of Linguistics and Communications Science 27), 760–775. Berlin: de Gruyter.
Köhler, Reinhard. 2011. Laws of language. In Patrick C. Hogan (ed.), The Cambridge encyclopedia of the language sciences, 424–426. Cambridge: Cambridge University Press.
Mačutek, Ján, Jan Chromý & Michaela Koščová. 2019. Menzerath-Altmann law and prothetic /v/ in spoken Czech. Journal of Quantitative Linguistics 26. 66–80.
Mačutek, Ján & Andrij Rovenchak. 2011. Canonical word forms: Menzerath-Altmann law, phonemic length and syllabic length. In Emmerich Kelih, Victor Levickij & Yuliya Matskulyak (eds.), Issues in quantitative linguistics 2 (Studies in Quantitative Linguistics 11), 136–147. Lüdenscheid: RAM-Verlag.
Mačutek, Ján & Gejza Wimmer. 2013. Evaluating goodness-of-fit of discrete distribution models in quantitative linguistics. Journal of Quantitative Linguistics 20. 227–240.
Menzerath, Paul. 1954. Die Architektonik des deutschen Wortschatzes. Bonn: Dümmler.
Mikros, Georgios & Jiří Milička. 2014. Distribution of the Menzerath’s law on the syllable level in Greek texts. In Gabriel Altmann, Radek Čech, Ján Mačutek & Ludmila Uhlířová (eds.), Empirical approaches to text and language analysis (Studies in Quantitative Linguistics 17), 180–189. Lüdenscheid: RAM-Verlag.
Piper, Predrag & Ivan Klajn. 2013. Normativna gramatika srpskog jezika. Novi Sad: Matica srpska.
Ponomariv, Oleksandr D. (ed.) 2001. Sučasna ukrajins’ka mova. Kyjiv: Lybid’.
Popescu, Ioan-Iovitz, Peter Grzybek, Bijapur D. Jayaram, Reinhard Köhler, Viktor Krupa, Ján Mačutek, Regina Pustet, Ludmila Uhlířová & Matummal N. Vidya. 2009. Word frequency studies (Quantitative Linguistics 64). Berlin: de Gruyter.
Popescu, Ioan-Iovitz, Sven Naumann, Emmerich Kelih, Andrij Rovenchak, Haruko Sanada, Anja Overbeck, Reginald Smith, Radek Čech, Panchanan Mohanty, Andrew Wilson & Gabriel Altmann. 2013. Word length: aspects and languages. In Reinhard Köhler & Gabriel Altmann (eds.), Issues in quantitative linguistics 3 (Studies in Quantitative Linguistics 13), 224–281. Lüdenscheid: RAM-Verlag.
Radojičić, Marija, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek & Lívia Leššová. 2019. Frequency and length of syllables in Serbian. Glottometrics 45. 114–123.
Roberts, Aaron H.1965. A statistical linguistic analysis of American English (Janua Linguarum Series Practica 8). The Hague: Mouton.
Strauss, Udo, Fengxiang Fan & Gabriel Altmann. 2008. Problems in quantitative linguistics 1 (Studies in Quantitative Linguistics 1). Lüdenscheid: RAM-Verlag.
Wimmer, Gejza & Gabriel Altmann. 1999. Thesaurus of univariate discrete probability distributions. Essen: Stamm.
Zipf, George K.1949. Human behavior and the principle of least effort. Cambridge, MA: Addison-Wesley.
Zörnig, Peter, Kamil Stachowski, Anna Rácová, Yunhua Qu, Michal Místecký, Kuizi Ma, Mihaiela Lupea, Emmerich Kelih, Volker Gröller, Hanna Gnatchuk, Alfiya Galieva, Sergey Andreev & Gabriel Altmann. 2019. Quantitative insights into syllable structure (Studies in Quantitative Linguistics 30). Lüdenscheid: RAM-Verlag.
Cited by (1)
Cited by one other publication
Motalová, Tereza, Ján Mačutek & Radek Čech
2023. Word Length in Chinese: The Menzerath-Altmann Law is Valid After All. Journal of Quantitative Linguistics 30:3-4 ► pp. 304 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.