Computational quantitative syntax
The case of Universal 18
Accounting for the constraints on the possible word orders of a sentence in a language and across the world languages is a core challenge for syntactic theory. In the spirit of computational quantitative syntax, in this paper we present quantitative evidence about Universal 18. We show that corpus data confirms a dispreference for the word order combination where adjectives precede but numerals follow the nouns (Adj-N and N-Num). We then investigate if this dispreference is better explained as a constraint expressed at the level of the dominant orders or at the level of individual structures. Corpus counts support the latter interpretation. Finally, we propose a formal model of how this bias against Adj-N-Num orders can be integrated in the grammar.
Article outline
- 1.Introduction
- 2.Typological data on Universal 18
- 3.Accounts of language universals and Universal 18
- 3.1Structure-level accounts
- 3.2Grammar-level accounts
- 4.Our approach to Universal 18
- 4.1Cross-linguistic corpus data
- Corpus data for Latin and Ancient Greek
- Preprocessing and collection of counts
- Collected counts
- 4.2Results and discussion
- 5.Towards a model explaining Universal 18
- 5.1Comparison of models 1 and 2
- 6.Conclusions
-
Notes
-
References
References
Abels, Klaus
2013 “
On 2-3-1.”
UCL Working Papers in Linguistics 25.
Baker, Mark
2001 The Atoms of Language: The Mind’s Hidden Rules of Grammar. New York, NY: Basic Books.
Bamman, David, and Gregory R. Crane
2011 “
The Ancient Greek and Latin Dependency Treebanks.” In
Language Technology for Cultural Heritage (
Theory and Applications of Natural Language Processing), ed. by
Caroline Sporleder,
Antal Bosch, and
Kalliopi Zervanou, 79–98. Berlin Heidelberg: Springer.
Biberauer, Theresa Anders Holmberg, and Ian Roberts
2014 “
A Syntactic Universal and its Consequences”.
Linguistic Inquiry 45 (2): 169–225.
Bresnan, Joan, Anna Cueni, Tatiana Nikitina, and Harald Baayen
2007 “
Predicting the Dative Alternation.” In
Cognitive Foundations of Interpretation, ed. by
G. Boume,
I. Kraemer, and
J. Zwarts, 69–94. Amsterdam: Royal Netherlands Academy of Science.
Bresnan, Joan, Shipra Dingare, and Christopher D. Manning
2001 “
Soft Constraints Mirror Hard Constraints: Voice and Person in English and Lummi.” In
Proceedings of the LFG 01 Conference, 13–32. CSLI.
Buchholz, Sabine, and Erwin Marsi
2006 “
CoNLL-X Shared Task on Multilingual Dependency Parsing.” In
Proceedings of the Tenth Conference on Computational Natural Language Learning, CoNLL-X ‘06, 149–164. Stroudsburg, PA, USA: Association for Computational Linguistics.
[URL].
Cinque, Guglielmo
2005 “
Deriving Greenberg’s Universal 20 and its Exceptions”.
Linguistic Inquiry 36 (3): 315–332.
Culbertson, Jennifer, and David Adger
2014 “
Language Learners Privilege Structured Meaning Over Surface Frequency”.
Proceedings of the National Academy of Sciences of the United States of America 111 (16): 5842–5847.
Culbertson, Jennifer, and Paul Smolensky
2012 “
A Bayesian Model of Biases in Artificial Language Learning: The Case of a Word-Order Universal”.
Cognitive Science 36 (8): 1468–1498.
Culbertson, Jennifer, Paul Smolensky, and Géraldine Legendre
2012 “
Learning Biases Predict a Word Order Universal”.
Cognition 122 (3): 306–329.
Cysouw, Michael
2010 “
Dealing with Diversity: Towards an Explanation of NP Word Order Frequencies”.
Linguistic Typology 14 (2): 253–287.
Dryer, Matthew S.
1992 “
The Greenbergian Word Order Correlations”.
Language 68: 81–138.
Dryer, Matthew S., and Martin Haspelmath
(eds.) 2011 The World Atlas of Language Structures Online. Munich: Max Planck Digital Library.
[URL]
Dunn, Michael, Simon J. Greenhill, Stephen C. Levinson, and Russell D. Gray
2011 “
Evolved Structure of Language Shows Lineage-Specific Trends in Word-Order Universals”.
Nature 473 (7345): 79–82.
Greenberg, Joseph H.
1963 “
Some Universals of Grammar with Particular Reference to the Order of Meaningful Elements.” In
Universals of Language, ed. by
Joseph H. Greenberg, 73–113. MIT Press.
Gulordava, Kristina, and Paola Merlo
2015 “
Structural and Lexical Factors in Adjective Placement in Complex Noun Phrases across Romance Languages”. In
Proceedings of the Nineteenth Conference on Computational Natural Language Learning
, 247–257. Beijing, China.
Gulordava, Kristina, Paola Merlo, and Benoit Crabbé
2015 “
Dependency Length Minimisation Effects in Short Spans: A Large-Scale Analysis of Adjective Placement in Complex Noun Phrases.” In
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing
(Volume 2: Short Papers), 477–482. Beijing, China.
Hajič, Jan, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Martí, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Nianwen Xue, and Yi Zhang
2009 “
The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages.” In
Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, CoNLL ‘09
, 1–18. Stroudsburg, PA, USA: Association for Computational Linguistics.
[URL]
Haspelmath, Martin
2006 “
Against Markedness (and What to Replace it With)”.
Journal of Linguistics 42 (01): 25–70.
Haug, Dag T. T., and Marius L. Jøhndal
2008 “
Creating a Parallel Treebank of the Old Indo-European Bible Translations.” In
Proceedings of the 2nd Workshop on Language Technology for Cultural Heritage Data
, 27–34. Marrakech, Morocco.
Hawkins, John A.
1983 Word Order Universals. New York: Academic Press.
Hawkins, John A.
1994 A Performance Theory of Order and Constituency. Cambridge: Cambridge University Press.
Hawkins, John A.
2009 “
Language Universals and the Performance-Grammar Correspondence Hypothesis.” In
Language Universals, Chapter 4, ed. by
M. H. Christiansen,
C. Collins, and
S. Edelman, 54–79. Oxford University Press.
Hurford, James R.
2003 “
The Interaction between Numerals and Nouns.” In
Noun Phrase Structure in the Languages of EuropeVolume 20–7 of Empirical Approaches to Language Typology, ed. by
F. Plank. Mouton de Gruyter.
McDonald, Ryan, Joakim Nivre, Yvonne Quirmbach-Brundage, Yoav Goldberg, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar Täckström, Claudia Bedini, Núria Bertomeu Castelló, and Jungmee Lee
2013 “
Universal dependency annotation for multilingual parsing.” In
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics
(Volume 2: Short Papers), 92–97. Association for Computational Linguistics.
[URL]
Merlo, Paola, and Suzanne Stevenson
2001 “
Automatic Verb Classification Based on Statistical Distributions of Argument Structure”.
Computational Linguistics 27 (3): 373–408.
Merlo, Paola, and Suzanne Stevenson
2004 “
Structure and frequency in verb classification.” In
Proceedings of Incontro di Grammatica Generativa XXX
, ed. by
Laura Brugé,
Giuliana Giusti,
Nicola Munaro,
Walter Schweikert, and
Giuseppina Turano, Venice, Italy.
Ouwayda, Sarah
2014 “
Where Number Lies: Plural Marking, Numerals, and the Collective-Distributive Distinction.” Ph.D. thesis, University of Southern California.
Petrov, Slav, Dipanjan Das, and Ryan McDonald
2012 “
A Universal Part-of-Speech Tagset.” In
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012), ed. by
Nicoletta Calzolari,
Khalid Choukri,
Thierry Declerck,
Mehmet Uğur Doğan,
Bente Maegaard,
Joseph Mariani,
Jan Odijk, and
Stelios Piperidis, 2089–2096. Istanbul, Turkey, European Language Resources Association (ELRA).
Rijkhoff, Jan
1998 “
Order in the Noun Phrase of the Languages of Europe.” In
Constituent Order in the Languages of Europe, Volume 20–1 of Empirical Approaches to Language Typology, ed. by
Anna Siewierska, 321–382. Mouton de Gruyter.
Samardžić, Tanja, and Paola Merlo
2018 “
Probability of External Causation: An Empirical Account of Cross-Linguistic Variation in Lexical Causatives”.
Linguistics 56 (5): 895–938.
Samardžić, Tanja, and Paola Merlo
2012 “
The Meaning of Lexical Causatives in Cross-Linguistic Variation”.
Linguistic Issues in Language Technology 7 (1).
Schwarzschild, Roger
2009 “
Stubborn Distributivity, Multiparticipant Nouns and the Count/Mass Distinction.” In
Proceedings of the 40th Annual Meeting of the North East Linguistic Society, Volume 39, 1–18. Mass.: MIT.
Sheehan, Michelle, Theresa Biberauer, Ian Roberts, and Anders Holmberg
2017 The Final-Over-Final Condition: A Syntactic Universal, chapter The Final-Over-Final Condition and the Head-Final Filter. MIT Press.
Svenonius, Peter
2007 “
1…3-2.” In
Oxford Handbook of Linguistic Interfaces, ed. by
Gillian Ramchand, and
Charles Reiss, 239–288. Oxford University Press.
Williams, Edwin
1982 “
Another Argument that Passive is Transformational”.
Linguistic Inquiry 13: 160–163.
Cited by
Cited by 1 other publications
Merlo, Paola & Giuseppe Samo
2022.
Exploring T3 languages with quantitative computational syntax.
Theoretical Linguistics 48:1-2
► pp. 73 ff.
This list is based on CrossRef data as of 23 march 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.