Article published in:
Babel Vol. 68:5 (2022), pp. 668–696
Neutral voices in audio descriptions
What does it mean?
A neutral delivery has often been considered to be the norm in audio description, but it is unclear what a
‘neutral voice’ means. This article begins with a discussion of neutrality in prosody and with a contextualization of AD voicing.
It then presents an acoustic analysis of a corpus of audio descriptions in Catalan, English, and Spanish. Based on the results of
this analysis, a perception test is designed, and its results are discussed here. The perception test involves participants with
sight loss (31 in Spanish, 35 in Catalan, 40 in English) and without sight loss (29 in Spanish, 46 in Catalan, 31 in English) who
are asked to define what a neutral voice is for them. Respondents are also asked to select the male and female voices that they
consider most neutral. The qualitative analysis of the replies, together with the selection made by participants for both male and
female voices across three different languages, sheds some light on how neutrality (or non-neutrality) could be defined. The study
does not aim to determine what acoustic features voices should have in the context of audio description but tries to better
understand what a neutral voice is, considering that this has traditionally been a frequent expression in research on audio
description delivery.
Article outline
- 1. Introduction
- 2. Voice in audio description
- 3. Neutrality in prosody
- 4. Experimental method
- 4.1 Acoustic analysis: Materials
- 4.2 Acoustic analysis
- 4.3 Selection of voices for the perception test
- 4.4 Perception test
- 5. Results
- 5.1 Perception tests
- Spanish audio describers
- English audio describers
- Catalan audio describers
- 5.2 Defining a neutral voice: Qualitative results
- 6. Conclusions
- Acknowledgements
- References