A probabilistic assessment of the Indo-Aryan Inner–Outer Hypothesis
Chundra A. Cathcart | University of Zurich
This paper uses a novel data-driven probabilistic approach to address the century-old Inner-Outer hypothesis of
Indo-Aryan. I develop a Bayesian hierarchical mixed-membership model to assess the validity of this hypothesis using a large data
set of automatically extracted sound changes operating between Old Indo-Aryan and Modern Indo-Aryan speech varieties. I employ
different prior distributions in order to model sound change, one of which, the Logistic Normal distribution, has not received
much attention in linguistics outside of Natural Language Processing, despite its many attractive features. I find evidence for
cohesive dialect groups that have made their imprint on contemporary Indo-Aryan languages, and find that when a Logistic Normal
prior is used, the distribution of dialect components across languages is largely compatible with a core-periphery pattern similar
to that proposed under the Inner-Outer hypothesis.
Keywords: quantitative linguistics, Bayesian modeling, historical linguistics, language contact, Indo-Aryan, sound change
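The two kinds of prior contrasted in the abstract, and the mixed-membership idea, can be illustrated with a minimal sketch. This is not the paper's model or code: the function names, the number of dialect groups, the outcome labels, and all parameter values are hypothetical, and the Logistic Normal here uses independent (diagonal) Gaussians, whereas the general form allows a full covariance matrix.

```python
import math
import random


def softmax(xs):
    """Map real-valued scores to a probability vector."""
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def sample_logistic_normal(mu, sigma, rng):
    """Draw Gaussians and push them through softmax (diagonal covariance assumed)."""
    return softmax([rng.gauss(m, s) for m, s in zip(mu, sigma)])


def sample_dirichlet(alpha, rng):
    """Draw from a Dirichlet by normalizing independent Gamma variates."""
    gammas = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(gammas)
    return [g / total for g in gammas]


def sample_outcomes(theta, phi, outcomes, n, rng):
    """Mixed membership: each observation first picks a group, then an outcome."""
    results = []
    for _ in range(n):
        z = rng.choices(range(len(theta)), weights=theta, k=1)[0]
        results.append(rng.choices(outcomes, weights=phi[z], k=1)[0])
    return results


rng = random.Random(0)
K = 3  # hypothetical number of dialect groups
outcomes = ["retained", "changed"]  # hypothetical outcomes of one sound change

# Language-level distribution over dialect groups under each prior.
theta_ln = sample_logistic_normal([0.0] * K, [1.0] * K, rng)
theta_dir = sample_dirichlet([0.5] * K, rng)

# Group-level distributions over the outcomes of the sound change.
phi = [sample_dirichlet([1.0] * len(outcomes), rng) for _ in range(K)]

changes = sample_outcomes(theta_ln, phi, outcomes, 10, rng)
```

Both samplers return a point on the probability simplex. The difference is in how the prior is parameterized: the Dirichlet is controlled by concentration parameters alone, while the Logistic Normal is built from Gaussian draws, which is what makes it possible to encode correlations between components when a full covariance matrix is used.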
Article outline
- 1. Introduction
- 2. Background
  - 2.1 Indo-Aryan dialectal variation
    - 2.1.1 Pre-Old Indo-Aryan period
    - 2.1.2 Old Indo-Aryan period
    - 2.1.3 Middle Indo-Aryan period
    - 2.1.4 New Indo-Aryan period
  - 2.2 Proposed Indo-Aryan dialectal groupings
- 3. Rationale
  - 3.1 Bayesian models in linguistics and related fields
  - 3.2 Operationalizing the Inner-Outer Hypothesis
- 4. Data
- 5. Modeling sound change
  - 5.1 Prior distributions over sound change probabilities
- 6. Generative model
- 7. Implementation and inference
- 8. Results
  - 8.1 Sparsity of language-group distributions
  - 8.2 Language-group distributions
  - 8.3 Sound change distributions
  - 8.4 Posterior predictive checks
    - 8.4.1 Entropy
    - 8.4.2 Accuracy
- 9. Discussion and outlook
- 10. Conclusion
- Acknowledgements
- Notes
- Appendix (supplementary material)
  - Dirichlet model sound change probabilities
  - Logistic normal model sound change probabilities
  - Accuracy scores for sound change distributions for simulated data
- References
Published online: 25 May 2020
https://doi.org/10.1075/jhl.18038.cat