Article published in:
Journal of Historical Linguistics: Online-First Articles
Reply to Kassian et al. (2023). Calibrated weighted permutation test detects ancient language connections in the Circumpolar area (Chukotian-Nivkh and Yukaghir-Samoyedic)
Kassian et al. (2023) propose a method to detect ancient linguistic
relationships by means of a weighted permutation test applied to word lists. The method provides evidence for an ancient
historical connection between Chukotko-Kamchatkan languages and Nivkh languages, and between Samoyedic languages and Yukaghir. In
this paper, I argue that while the word lists collected by the authors represent a valid dataset for language comparison and are
supported by extensive etymological work, their statistical tests constitute a departure from the significance testing framework
that has propelled contemporary literature on long-range comparison. I then advance some proposals on how their tests can be
integrated and compared with those traditionally accepted in the literature.
Keywords: long-range comparison, significance testing of linguistic relationships, classical comparative method, Chukotko-Kamchatkan, Nivkh, Yukaghir, Samoyedic
Article outline
- 1. Introduction
- 2. Beyond reasonable doubt
- 3. Word lists
- 4. Semantic matching criteria
- 5. Metrics
- 6. Statistical tests
- 7. Results
- 8. Conclusion
- References
Published online: 29 November 2024
https://doi.org/10.1075/jhl.00018.ceo
References (21)
Baxter, William H. & Alexis Manaster Ramer. 2006. Beyond
lumping and splitting: Probabilistic issues in historical
linguistics. In Colin Renfrew, April McMahon & Larry Trask (eds.), Time
depth in historical
linguistics, vol. 1, 167–188. Cambridge, England: McDonald Institute for Archaeological Research.
Ceolin, Andrea. 2019. Significance
testing of the Altaic
family. Diachronica 36(3). 299–336.
Ceolin, Andrea, Cristina Guardiano, Monica Alexandrina Irimia & Giuseppe Longobardi. 2020. Formal
syntax and deep history. Frontiers in
Psychology 11. 488871.
Ceolin, Andrea, Cristina Guardiano, Giuseppe Longobardi, Monica Alexandrina Irimia, Luca Bortolussi & Andrea Sgarro. 2021. At
the boundaries of syntactic prehistory. Philosophical Transactions of the Royal Society
B 376(1824). 20200197.
Dellert, Johannes & Armin Buch. 2018. A
new approach to concept basicness and stability as a window to the robustness of concept list
rankings. Language Dynamics and
Change 8(2). 157–181.
Dolgopolsky, Aaron B. 1986. A probabilistic hypothesis
concerning the oldest relationships among the language families in Northern
Eurasia. In Vitalij V. Shevoroshkin & Thomas L. Markey (eds.), Typology,
relationship and time, 27–50. Ann Arbor: Karoma.
Heggarty, Paul, Cormac Anderson, Matthew Scarborough, Benedict King, Remco Bouckaert, Lechosław Jocz & Martin Joachim Kümmel. 2023. Language
trees with sampled ancestors support a hybrid model for the origin of Indo-European
languages. Science 381(6656).
Holman, Eric W., Søren Wichmann, Cecil H. Brown, Viveka Velupillai, André Müller & Dik Bakker. 2008. Explorations
in automated language classification. Folia
Linguistica 42(3–4). 331–354.
Kassian, Alexei, George Starostin, Anna Dybo & Vasiliy Chernov. 2010. The
Swadesh wordlist. An attempt at semantic specification. Вопросы языкового
родства 16(59). 46–89.
Kassian, Alexei, Mikhail Zhivlov & George Starostin. 2015. Proto-Indo-European-Uralic
comparison from the probabilistic point of view. Journal of Indo-European
Studies 43(3–4). 301–347.
Kassian, Alexei S., George Starostin, Ilya M. Egorov, Ekaterina S. Logunova & Anna V. Dybo. 2021. Permutation
test applied to lexical reconstructions partially supports the Altaic linguistic
macrofamily. Evolutionary Human
Sciences 3. e32.
Kassian, Alexei S., George Starostin, Mikhail Zhivlov & Sergey A. Spirin. 2023. Calibrated
weighted permutation test detects ancient language connections in the Circumpolar area (Chukotian-Nivkh and
Yukaghir-Samoyedic). Journal of Historical Linguistics.
Kessler, Brett. 2001. The
significance of word lists. Stanford, California: Center for the Study of Language and Information.
Kessler, Brett. 2007. Word
similarity metrics and multilateral comparison. In Proceedings of
Ninth Meeting of the ACL Special Interest Group in Computational Morphology and
Phonology, 6–14. Association for Computational Linguistics.
Kessler, Brett. 2015. Response
to Kassian et al. 2015. Proto-Indo-European-Uralic comparison from the probabilistic point of
view. Journal of Indo-European
Studies 43(3–4). 357–367.
Kessler, Brett & Annukka Lehtonen. 2006. Multilateral
comparison and significance testing of the Indo-Uralic
question. In Peter Forster & Colin Renfrew (eds.), Phylogenetic
methods and the prehistory of
languages, 33–42. Cambridge, England: McDonald Institute for Archaeological Research.
Nichols, Johanna. 1995. The
comparative method as heuristic. In Mark Durie & Malcolm Ross (eds.), The
comparative method reviewed: Regularity and irregularity in language
change, 39–71. Oxford University Press.
Oswalt, Robert L. 1970. The detection of remote
linguistic relationships. Computer Studies in the Humanities and Verbal
Behavior 3(3). 117–129.
Ringe, Donald A. 1992. On calculating the factor of
chance in language comparison. Transactions of the American Philosophical
Society 82(1). 1–110.
Ringe, Donald A. 1998. Probabilistic evidence for
Indo-Uralic. In Joseph Salmons & Brian Joseph (eds.), Nostratic:
Sifting the
evidence, 153–197. Amsterdam: John Benjamins.
Tadmor, Uri, Martin Haspelmath & Bradley Taylor. 2010. Borrowability
and the notion of basic
vocabulary. Diachronica 27(2). 226–246.