Article published in:Language Processing and Grammars: The role of functionally oriented computational models
Edited by Brian Nolan and Carlos Periñán-Pascual
[Studies in Language Companion Series 150] 2014
► pp. 13–38
From the extraction of continuous features in parallel texts to visual analytics of heterogeneous areal-typological datasets
The aim of this paper is twofold. First, we show that functionally motivated procedural approaches may help to automatically extract typological features from texts. This idea is illustrated with measuring cross-linguistic variation in the domain of morphological typology based on parallel texts. Second, we demonstrate that the methodology developed in the field of visual analytics allows for detecting patterns or regularities in the automatically extracted features. At the heart of our approach lies an extended sunburst visualization, which enables a cross-comparison of a large number of features within the context of language genealogy and areal information. We provide evidence of the usefulness of the present approach with case studies where the visualizations of the extracted features reveal interesting insights.
Published online: 01 April 2014
2012 Some challenges and directions for the visualization of language and linguistic data. Paper presented at the AVML 2012 conference in York.
Cysouw, Michael & Wälchli, Bernhard
Dryer, Matthew S. & Haspelmath, Martin
eds). The World Atlas of Language Structures Online . Munich: Max Planck Digital Library. http://wals.info/>
Greenberg, Joseph H.
Hardwick, Roma & Healey, Joan
Hoel, Hanna Marie, Ikaheimonen, Tarja & Nozawa, Michiyo
1994 Mende Grammar Essentials . http://www.sil.org/pacific/png/pubs/928474531144/Mende_Grammar_Essentials.pdf>
Hopper, Paul J.
Keim, Daniel A., Mansmann, Florian, Schneidewind, Jörn, Thomas, Jim & Ziegler, Hartmut
Lock, Arnold Hugo
Mayer, Thomas, Rohrdantz, Christian, Butt, Miriam, Plank, Frans & Keim, Daniel A.
McElhanon, Kenneth A.
Multitree: A Digital Library of Language Relationships
Ypsilanti MI: Institute for Language Information and Technology (LINGUIST List), Eastern Michigan University. http://multitree.org/>
Popescu, Ioan-Iovitz, Mačutek, Ján & Altmann, Gabriel
Rohrdantz, Christian, Hund, Michael, Mayer, Thomas, Wälchli, Bernhard & Keim, Daniel A.
Stasko, John & Zhang, Eugene
Forthcoming. Algorithmic typology and going from known to similar unknown categories within and across languages. In Aggregating Dialectology, Typology, and Register Analysis: Linguistic Variation in Text and Speech, Within and Across Languages , Benedikt Szmrecsanyi & Bernhard Wälchli (eds). Berlin: de Gruyter.
Ward, Matthew, Grinstein, Georges & Keim, Daniel A.
Wichmann, Søren, Müller, André, Velupillai, Viveka, Wett, Annkathrin, Brown, Cecil H., Molochieva, Zarina, Bishoffberger, Julia, Holman, Eric W., Sauppe, Sebastian, Brown, Pamela, Bakker, Dik, List, Johann-Mattis, Egorov, Dmitry, Belyaev, Oleg, Urban, Matthias, Hammarström, Harald, Carrizo, Agustina, Mailhammer, Robert, Geyer, Helen, Beck, David, Korovina, Evgenia, Epps, Pattie, Valenzuela, Pilar & Grant, Anthony
2012 The ASJP Database (version 15).
Cited by 1 other publications
Gutierrez-Vasques, Ximena & Victor Mijangos
This list is based on CrossRef data as of 14 may 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.