Working with language data
Practical, technical and scientific considerations
Article outline
- 1. Introduction
- 2. Important concepts
- 2.1 Data and metadata
- 2.2 Legal aspects
- 2.3 Ethical aspects
- 2.4 Data formats
- 2.5 Versioning
- 2.6 Digitization
- 2.7 Archiving
- 2.8 Discoverability
- 2.9 Reproducibility
- 2.10 Citation
- 3. Summary of recommendations
- 4. Research history
- 5. Further reading
- Acknowledgments
- Notes
- References
- Appendix
References
Aarts, Jan. 2011. “Corpus analysis.” In Vol. 15, Handbook of Pragmatics, ed. by Jan-Ola Östman, and Jef Verschueren, 1–14. Amsterdam: John Benjamins Publishing Company.
Andreassen, Helene N., Andrea L. Berez-Kroeker, Lauren Collister, Philipp Conzett, Christopher Cox, Koenraad De Smedt, Bradley McDonnell, and Research Data Alliance Linguistic Data Interest Group. 2019. Tromsø Recommendations for Citation of Research Data in Linguistics. RDA Linguistics Data Interest Group.
Austin, Peter K. 2006. “Data and language documentation.” In Essentials of Language Documentation, ed. by Jost Gippert, Ulrike Mosel, and Nikolaus Himmelmann, 87–112. Berlin: Mouton de Gruyter.
Austin, Peter K. 2013. “Language documentation and meta-documentation.” In Keeping Languages Alive: Documentation, Pedagogy and Revitalisation, ed. by Mari C. Jones, and Sarah Ogilvie, 3–15. Cambridge: Cambridge University Press.
Berez-Kroeker, Andrea L., Lauren Gawne, Susan Smythe Kung, Barbara F. Kelly, Tyler Heston, Gary Holton, Peter Pulsifer, et al. 2017. “Reproducible research in linguistics: A position statement on data citation and attribution in our field.” Linguistics 56 (1), 1–18.
Berez-Kroeker, Andrea L., Bradley McDonnell, Eve Koller, and Lauren B. Collister (eds). 2022. The Open Handbook of Linguistic Data Management. Cambridge, MA: The MIT Press.
Bird, Steven, and Gary Simons. 2003. “Seven dimensions of portability for language documentation and description.” Language 79 (3), 557–582.
Conzett, Philipp, and Koenraad De Smedt. 2022. “Guidance for citing linguistic data.” In The Open Handbook of Linguistic Data Management, ed. by Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller, and Lauren B. Collister, Ch. 11. Cambridge, MA: The MIT Press.
Darnell, Regna. 2003. “Franz Boas.” In Vol. 9, Handbook of Pragmatics, ed. by Jan-Ola Östman, and Jef Verschueren. Amsterdam: John Benjamins Publishing Company.
Gippert, Jost, Ulrike Mosel, and Nikolaus Himmelmann (eds). 2006. Essentials of Language Documentation. Berlin: Mouton de Gruyter.
Henke, Ryan, and Andrea L. Berez-Kroeker. 2016. “A brief history of archiving in language documentation, with an annotated bibliography.” Language Documentation & Conservation 10, 411–457.
Himmelmann, Nikolaus P. 1998. “Documentary and descriptive linguistics.” Linguistics. An Interdisciplinary Journal of the Language Sciences 36, 161–195.
Himmelmann, Nikolaus P. 2006. “Language documentation. What is it and what is it good for?” In Essentials of Language Documentation, ed. by Jost Gippert, Ulrike Mosel, and Nikolaus Himmelmann, 1–30. Berlin: Mouton de Gruyter.
Jucker, Andreas H. 2013. “Corpus pragmatics.” In Vol. 17, Handbook of Pragmatics, ed. by Jan-Ola Östman, and Jef Verschueren, 1–18. Amsterdam: John Benjamins Publishing Company.
Seyfeddinipur, Mandana, Felix Ameka, Lissant Bolton, Jonathan Blumtritt, Brian Carpenter, Hilaria Cruz, Sebastian Drude, et al. 2019. “Public access to research data in language documentation. Challenges and possible strategies.” Language Documentation & Conservation 13, 545–563. URL: [URL]
Wilkinson, Mark D., Michel Dumontier, IJsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg, et al. 2016. “The FAIR guiding principles for scientific data management and stewardship.” Scientific Data 3.
Woodbury, Anthony C. 2003. “Defining documentary linguistics.” In Vol. 1, Language Documentation and Description, ed. by Peter K. Austin, 35–51. London: SOAS, University of London.