Text Variation Explorer
Towards interactive visualization tools for corpus linguistics
This paper reviews the gap between current methods of text visualization and the needs of corpus-linguistic research, and introduces a tool that takes a step towards bridging that gap. Current text visualization methods tend to treat the problem as a data-encoding issue only, and do not strive for interactive, tightly coupled representations of text that would foster discovery. The paper argues that such visualizations should always be linked for effortless movement between the text and its visualization, and that the visualization controls should provide continuous and immediate feedback to facilitate exploration. We introduce a tool, Text Variation Explorer (TVE), to demonstrate the aforementioned requirements. TVE allows visual and interactive examining of the behaviour of linguistic parameters affected by text window size and overlap, and in addition, performs interactive principal component analysis based on a user-given set of words.
Keywords: interactive exploration, text visualization
Published online: 01 September 2014
Biber, D., Connor, U. & Upton, T.A.
Card, S.K., Mackinlay, J.D. & Shneiderman, B.
Craft, B. & Cairns, P.
Culy, C. & Lyding, V.
2014 The International Corpus of English. http://ice-corpora.net/ice/index.htm (accessed February 2014).
Kaufer, D., Geisler, C., Vlachos, P. & Ishizaki, S.
Keim, D.A. & Oelke, D.
Lijffijt, J., Papapetrou, P. & Puolamäki, K.
2012 “Size matters: Finding the most informative set of window lengths”. In P.A. Flach, T. De Bie & N. Christianini (Eds.), Proceedings of the European Conference of Machine Learning and Principles and Practices of Knowledge Discovery in Databases (ECML-PKDD 2012), Part II.Berlin and Heidelberg: Springer, 451–466.
Pike, W.A., Stasko, J., Chang, R. & O’Connell, T.A.
R Development Core Team
2011 R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. Available at: http://www.R-project.org (accessed March 2012).
2012 Text Variation Explorer (TVE). Available at: http://www.uta.fi/sis/tauchi/virg/projects/dammoc/tve.html (accessed April 2012).
2013 TVE Video Tutorial. Available at: http://www.uta.fi/sis/tauchi/virg/projects/dammoc/tve/TVE.mp4 (accessed February 2013).
Siirtola, H., Nevalainen, T., Säily, T. & Räihä, K.-J.
2011 “Visualisation of text corpora: A case study of the PCEEC”. In T. Nevalainen & S.M. Fitzmaurice (Eds.), How to Deal with Data: Problems and Approaches to the Investigation of the English Language over Time and Space. Helsinki: VARIENG. http://www.helsinki.fi/varieng/series/volumes/07/siirtola_et_al/ (accessed February 2013).
2011 Mondrian – Interactive Statistical Data Visualization in Java. http://stats.math.uni-augsburg.de/mondrian/ (accessed February 2014).
Theus, M. & Urbanek, S.
Cited by 8 other publications
Kosmajac, Dijana, Vlado Keselj & Evangelos Milios
Kucher, Kostiantyn, Teri Schamp-Bjerede, Andreas Kerren, Carita Paradis & Magnus Sahlgren
Siirtola, Harri, Poika Isokoski, Tanja Saily & Terttu Nevalainen
Siirtola, Harri, Tanja Saily & Terttu Nevalainen
Säily, Tanja, Arja Nurmi, Minna Palander-Collin & Anita Auer
Säily, Tanja, Turo Vartiainen & Harri Siirtola
This list is based on CrossRef data as of 09 april 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.