Chapter published in:
Language and Text: Data, models, information and applicationsEdited by Adam Pawłowski, Jan Mačutek, Sheila Embleton and George Mikros
[Current Issues in Linguistic Theory 356] 2021
► pp. 137–144
The perils of big data
Sheila Embleton | York University
Dorin Uritescu | York University
Eric S. Wheeler | York University
The use of large amounts of data and the technologies to process them are characteristic of modern research. However, such practices come with risks of misleading the researcher. While there is much that could be said on this topic, here briefly is our cautionary tale to others, based on our direct experiences.
Keywords: big data, research practices, statistical packages, Romanian, dialects, Crișana, shibboleths
Article outline
- 1.Motivation
- 2.Some background
- 3.The muddle in the middle
- 4.Faith and reason
- 5.Data, and more data
- 6.In short
-
Acknowledgements -
References
Published online: 22 December 2021
https://doi.org/10.1075/cilt.356.09emb
https://doi.org/10.1075/cilt.356.09emb
References
Embleton, Sheila, Dorin Uritescu & Eric S. Wheeler
2002, 2007a Romanian Online Dialect Atlas. http://vpacademic.yorku.ca/romanian (now at http://pi.library.yorku.ca/dspace/ under the “dialectology” community, “RODA” collection)
2011 Defining dialect regions with interpretations. Advancing the multidimensional scaling approach. Paper presented at Methods In Dialectology 14 Conference, London, Canada, August 2–6.
Embleton, Sheila & Eric S. Wheeler
Kettunen, Lauri
McGuire, Patricia
2019 October 27. How higher education’s data obsession leads us astray. The Chronicle of Higher Education. https://www-chronicle-com.ezproxy.library.yorku.ca/article/How-Higher-Education-s-Data/247409. Accessed October 31, 2019.
Stan, Ionel & Dorin Uritescu
Uritescu, Dorin