Data curation for a VALID Archive of Dutch Language Impairment Data
Henk van den Heuvel | Radboud University
Eric Sanders | Radboud University
Jetske Klatter-Folmer | Radboud University
Roeland van Hout | Radboud University
Paula Fikkert | Radboud University
Anne Baker | University of Amsterdam
Jan de Jong | University of Amsterdam
Frank Wijnen | Utrecht University
Paul Trilsbeek | Max Planck Institute for Psycholinguistics, Nijmegen
The VALID Data Archive is an open multimedia data archive that brings together data from children and adults with language and/or communication problems. In a pilot project funded by CLARIN-NL, five existing data sets were curated. This pilot enabled us to gain experience in conserving different kinds of pathological language data in a searchable and persistent manner. These data sets reflect current research in language pathology rather well, both in the range of designs and in the variety of pathological problems, such as Specific Language Impairment, deafness, dyslexia, and ADHD. In this paper, we present the VALID initiative, explain the curation process, and discuss the materials of the data sets.
Keywords: language and communication impairments, interoperability, SLI, deafness, data curation, data sharing, dyslexia, ADHD
Published online: 10 November 2014