Developing corpus interoperability for phonetic investigation of learner corpora

Ballier, Nicolas; Martin, Philippe

doi:10.1075/scl.59.05bal

Part of

Automatic Treatment and Analysis of Learner Corpus Data
Edited by Ana Díaz-Negrillo, Nicolas Ballier and Paul Thompson
[Studies in Corpus Linguistics 59] 2013
► pp. 33–64

Developing corpus interoperability for phonetic investigation of learner corpora

Nicolas Ballier

Philippe Martin

Although automatic analysis and computer-aided annotation tools are being developed, spoken learner corpora are still smaller and less numerous than written learner corpora. This chapter gives a critical overview of some of the phonetic research questions addressed by spoken learner corpora in relation to their annotation schemes and software. Some of their annotation schemes and guidelines are presented and assessed. Corpus design and tools are discussed in relation to some two of the challenges of spoken learner corpora: comparability of data and the potential contribution to prosodic modeling. It is argued that reusability of annotated spoken data and critical statistics should be the real order of the day.

Published online: 18 December 2013

https://doi.org/10.1075/scl.59.05bal

Cited by

Cited by 1 other publications

Wang, Wei. & Jinsong Zhang

2023. Factors predicting human performance in error annotation for non-native speech corpus. Speech Communication 149 ► pp. 38 ff.

This list is based on CrossRef data as of 19 june 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.