Chapter published in:In Search of Basic Units of Spoken Language: A corpus-driven approach
Edited by Shlomo Izre'el, Heliana Mello, Alessandro Panunzi and Tommaso Raso
[Studies in Corpus Linguistics 94] 2020
► pp. 285–299
Cross-linguistic comparison of automatic detection of speech breaks in read and narrated speech in four languages
This chapter tests an algorithm for the automatic detection of speech breaks in read and narrated speech in Brazilian Portuguese (BP), European Portuguese (EP), French, and German. The algorithm is independent of previous transcription or linguistic analysis (syllable, phone labeling and segmentation), requiring only the audio file. It operates in two stages: vowel onsets detection firstly, followed by V-to-V duration intervals normalization for smoothed duration z-scores. Peaks over 2.5 of the latter were considered speech breaks. Compared to human segmentation, hits for reading (70%) were higher than for narration (60%). Crosslinguistic results show EP and French having the highest proportion of hits. A test with the English Navy audio file reveals a hit proportion similar to German.
Keywords: automatic speech segmentation, duration, prosodic boundary, cross-linguistic comparison
Published online: 18 June 2020
Avanzi, M., Lacheret, A., & Victorri, B.
Barbosa, P. A.
Boersma, P. & Weenink, D.
(2017) Praat: Doing phonetics by computer (Version 6.0.29) [Computer software]. Retrieved from www.praat.org
Botinis, A., Granström, B., & Möbius, B.
Chistovich, L. A., & Ogorodnikova, E. A.
Cummins, F., & Port, R.
Eriksson, A., & Heldner, M.
Godfrey, J. J., Holliman, E. C., & McDaniel, J.
Gotoy, Y., & Renals, S.
(2004) Automatic detection of sentence boundaries, disfluencies, and conversational fillers in spontaneous speech (Unpublished doctoral dissertation). University of Washington. Retrieved from https://ssli.ee.washington.edu/papers/grad/theses/jkim-ms-thesis.pdf
Lacheret-Dujour, A., Simon, A., Goldman, J., & Avanzi, M.
Mettouchi, A., Lacheret-Dujour, A., Silber-Varod, V., & Izre’el, S.
Mittman, M. M., & Barbosa, P. A.
Ni, C. J., Zhang, A. Y., Liu, W. J., & Xu, B.
Raso, T., Barbosa, P. A., Cavalcante, F. A., & Mittmann, M. M.
this volume). Segmentation and analysis of the two English excerpts: The Brazilian team proposal. In S. Izre’el, H. Mello, A. Panunzi, & T. Raso Eds. In search of basic units of spoken language: A corpus-driven approach Amsterdam John Benjamins
Shriberg, E., Stolcke, A., Hakkani-Tür, D., & Tür, G.
Tamburini, F., & Wagner, P.
Teixeira, B., Barbosa, P., & Raso, T.
(2018) Automatic detection of prosodic boundaries in Brazilian Portuguese spontaneous speech. In A. Villavicencio, M. Viviane, A. Abad, H. Caseli, P. Gamallo, C. Ramisch, H. R. Gonçalo Oliveira & G. H. Paetzold (Eds.), Computational processing of the Portuguese language. PROPOR 2018 (pp. 429–437). Canela, Brazil. Cham: Springer.
Cited by 1 other publications
Izre'el, Shlomo, Heliana Mello, Alessandro Panunzi & Tommaso Raso
This list is based on CrossRef data as of 18 september 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.