Neutral voices in audio descriptions: What does it mean?

Machuca, María Jesús; Matamala, Anna

doi:10.1075/babel.00287.mac

Article published In:

Babel
Vol. 68:5 (2022) ► pp.668–696

Neutral voices in audio descriptions

What does it mean?

María Jesús Machuca | Universitat Autònoma de Barcelona

Anna Matamala | Universitat Autònoma de Barcelona

A neutral delivery has often been considered to be the norm in audio description, but it is unclear what a ‘neutral voice’ means. This article begins with a discussion of neutrality in prosody and with a contextualization of AD voicing. It then presents an acoustic analysis of a corpus of audio descriptions in Catalan, English, and Spanish. Based on the results of this analysis, a perception test is designed, and its results are discussed here. The perception test involves participants with sight loss (31 in Spanish, 35 in Catalan, 40 in English) and without sight loss (29 in Spanish, 46 in Catalan, 31 in English) who are asked to define what a neutral voice is for them. Respondents are also asked to select the male and female voices that they consider most neutral. The qualitative analysis of the replies, together with the selection made by participants for both male and female voices across three different languages, sheds some light on how neutrality (or non-neutrality) could be defined. The study does not aim to determine what acoustic features voices should have in the context of audio description but tries to better understand what a neutral voice is, considering that this has traditionally been a frequent expression in research on audio description delivery.

Keywords: audio description, prosody, audiovisual translation, media accessibility, perception

Article outline

1.Introduction
2.Voice in audio description
3.Neutrality in prosody
4.Experimental method
- 4.1Acoustic analysis: Materials
- 4.2Acoustic analysis
- 4.3Selection of voices for the perception test
- 4.4Perception test
5.Results
- 5.1Perception tests
  - Spanish audio describers
  - English audio describers
  - Catalan audio describers
- 5.2Defining a neutral voice: Qualitative results
6.Conclusions
Acknowledgements
References

Published online: 22 September 2022

https://doi.org/10.1075/babel.00287.mac

References (51)

References

AENOR. 2005. Norma UNE:153030. Audiodescripción para personas con discapacidad visual. Requisitos para la audiodescripción y elaboración de audioguías [Audio description for visually impaired people. Guidelines for audio description procedures and for the preparation of audio guides]. AENOR.

BAI. 2012. BAI Guidelines – Audiodescription. [URL]

Banse, Rainer, and Klaus R. Scherer. 1996. “Acoustic Profiles in Vocal Emotion Expression.” Journal of Personality and Social Psychology 70 (3): 614–636.

Boersma, Paul, and David Weenik. 2020. Praat: Doing Phonetics by Computer [Computer Program]. Electronic version, v.6.1.37: [URL]

Busso, Carlos, Sungbok Lee, and Shrikanth. S. Narayanan. 2007. “Using Neutral Speech Models for Emotional Speech Analysis.” In Proceedings of INTERSPEECH, 2225–2228. [URL]

Cabeza-Cáceres, Cristóbal. 2013. “Audiodescripció i recepció. Efecte de la velocitat de narració, l’entonació i l’explicitació en la comprensió fílmica” [Audio description and reception. The effect of speed of narration, intonation and explicitation on film comprehension]. Ph.D. diss., Universitat Autònoma de Barcelona. [URL]

Cornew, Lauren, Leslie J. Carver, and Tracy Love. 2009. “There’s More to Emotion than Meets the Eye: A Processing Bias for Neutral Content in the Domain of Emotional Prosody.” Cognition and Emotion 24 (7): 1133–1152.

Crangle, Colleen, Rui Wang, Marcos Perreau-Guimaraes, Michelle U. Nguyen, Duc T. Nguyen, and Patrick Suppes. 2019. “Machine Learning for the Recognition of Emotion in the Speech of Couples in Psychotherapy Using the Stanford Suppes Brain Lab Psychotherapy Dataset.” ArXiv. [URL]

Crystal, David. 2008. A Dictionary of Linguistics and Phonetics, 6th ed. Malden, MA and Oxford, UK: Blackwell.

De Araújo Carvalho, Wilson J., Bruna Alves Leao, and Charleston Teixeria Palmeira. 2017. “Locuçao e audiodescriçao nos estudios de traduçao audiovisual” [Voicing and audio description in audiovisual translation studies]. Trabalhos em Lingüística Aplicada 56 (2).

Fernández-Torné, Anna, and Anna Matamala. 2015. “Text-to-Speech vs. Human Voiced Aaudio Descriptions: A Reception Study in Films Dubbed into Catalan.” JosTrans: Journal of Specialized Translation 241: 61–88.

Frick, Robert W. 1985. “Communicating Emotion: The Role of Prosodic Features.” Psychological Bulletin 97 (3): 412–429.

Fryer, Louise. 2016. An Introduction to Audio Description: A Practical Guide. Abingdon, Oxon and New York: Routledge.

Grabe, Esther, and Ee Ling Low. 2002. “Durational Variability in Speech and the Rhythm Class Hypothesis.” In Laboratory Phonology 7, edited by Carlos Gussenhoven and Natasha Warner, 515–546. Berlin and New York: De Gruyter Mouton.

Hareli, Shlomo, Noga Shomrat, and Ursula Hess. 2009. “Emotional versus Neutral Expressions and Perceptions of Social Dominance and Submissiveness.” Emotion 9 (3): 378–384.

Hirvonen, Maija, and Mari Wilund. 2021. “From Image to Text to Speech: The Effects of Speech Prosody on Information Sequencing in Audio Description.” Text and Talk 41 (3): 309–334.

Iglesias-Fernández, Emilia. 2010. “La dimensión paralingüística de la audiodescripción: un acercamiento multidisciplinar [The paralinguistic dimension of audio description: a multidisciplinary approach].” In Un corpus de cine. Teoría y práctica de la audiodescripción [A corpus of cinema. Theory and practice of audio description], edited by Catalina Jiménez, Ana Rodríguez, and Claudia Seibel, 205–222. Granada: Ediciones Tragacanto.

Iglesias-Fernández, Emilia, Silvia Martínez Martínez, and Antonio J. Chica Núñez. 2015. “Cross-fertilization between Reception Studies in Audio Description and Interpreting Quality Assessment: The Role of the Describer’s Voice.” In Audiovisual translation in a Global Context: Mapping an Ever-changing Lanscape, edited by Jorge Díaz-Cintas, and Rocío Baños Piñero, 72–95. Bassingstoke: Palgrave Macmillan.

Iturregui-Gallardo, Gonzalo, and Anna Matamala. 2021. “Audio Subtitling: Dubbing and Voice-Over Effects and Their Impact on User Experience.” Perspectives: Studies in Translatology 29 (1): 64–83.

ISO. 2015. Information Technology – User Interface Component Accessibility – Part 21: Guidance on Audio Descriptions (ISO/IEC TS 20071-21).

ITC. 2000. ITC Guidance on Standards for Audio Description. [URL]

Kadiri, Sudarsana R., and Paavo Alku. 2020. “Excitation Features of Speech for Speaker Specific Emotion Detection.” IEEE Access, 81: 60382–60391.

Kamiloğlu, Roza G., Agneta H. Fischer, and Disa A. Sauter. 2020. “Good Vibrations: A Review of Vocal Expressions of Positive Emotions.” Psychon Bull Review 271: 237–265.

Keating, Patricia, and Christina Esposito. 2006. “Linguistic Voice Quality.” UCLA Working Papers in Phonetics 1051: 85–91.

Kraxenberger, Maria, Winfried Menninghaus, Anna Roth, and Mathias Scharinger. 2018. “Prosody-based Sound-Emotion Associations in Poetry.” Frontiers in Psychology 9:1284.

Limbach, Christiane. 2012. “La neutralidad en la audiodescripción fílmica desde un punto de vista traductológico” [Neutrality in filmic audio description from a translation point of view]. Ph.D. diss., Universidad de Granada. [URL]

López, Mariana, Gavin Kearney, and Krisztian Hofstadter. 2021. “Enhancing Audio description: Inclusive Cinematic Experiences Through Sound Design.” JosTrans: Journal of Audiovisual Translation 4 (1): 157–182.

Machuca, María J., Anna Matamala, and Antonio Ríos. 2020a. “Prosodic Features in Spanish Audio Descriptions of the VIW Corpus.” MonTI 121: 53–77.

. 2020b. “Los audiodescriptores: voces neutras y voces agradables” [Audio describers: neutral voices and pleasant voices]. Loquens 7 (2): e076.

Maszerowska, Anna, Anna Matamala, and Pilar Orero. (eds.). 2014. Audio Description: New Perspectives Illustrated. Amsterdam and Philadelphia: John Benjamins Publishing Company.

Matamala, Anna. 2018. “One Short Film, Different Audio Descriptions: Analyzing the Language of Audio Descriptions Created by Students and Professionals.” Onomazéin 411: 185–207.

Matamala, Anna, Olga Soler-Vilageliu, Gonzalo Iturregui-Gallardo, Anna Jankowska, Jorge L. Méndez-Ulrich, and Anna Serrano. 2020. “Electrodermal Activity as a Measure of Emotions in Media Accessibility Research: Methodological Considerations.” JosTrans: Journal of Specialized Translation 331: 129–151.

Matamala, Anna, and Marta Villegas. 2016. “Building an Audio Description Multilingual Multimodal Corpus: The VIW Project.” In Proceedings: Multimodal Corpora: Computer Vision and Language processing- MMC2016, edited by Jens Edlund, Dirk Heylen, and Patrizia Paggio, 29–32. ELRA. [URL]

Mazur, Iwona, and Agnieszka Chmiel. 2012. “Audio Description Made to Measure: Reflections on Interpretation in Audio Description Based on the Pear Tree Project Data.” In Audiovisual Translation and Media Accessibility at the Crossroads, edited by Aline Remael, Pilar Orero, and Mary Carroll, 173–188. Amsterdam: Rodopi.

Morisset, Laure, and Frédéric Gonant. 2008. La charte de l’audiodescription [The audio description sheet]. Ministère des Affaires Sociales et de la Santé. [URL]

Netflix. N.d. Netflix Audio Description Style Guide v1.0. [URL]

Nordström, Henrik. 2019. “Emotional Communication in the Human Voice.” Ph.D. diss., Stockholm University. [URL]

Pell, Marc D., Silke Paulmann, Chinar Dara, Areej Alasseri, and Sonja A. Kotz. 2009. “Factors in the Recognition of Vocally Expressed Emotions: A Comparison of Four Languages.” Journal of Phonetics 37 (4): 417–435.

Quinto, Lena, William F. Thompson, and Felicity L. Keating. 2013. “Emotional Communication in Speech and Music: The Role of Melodic and Rhythmic Contrasts.” Frontiers in Psychology 41.

Remael, Aline, Nina Reviers, and Gert Vercauteren. 2015. Pictures Painted in Words. ADLAB Audio Description Guidelines. Trieste: Edizione Università di Trieste.

Rodríguez González de Antona, Alicia. 2017. “Audiodescripción y propuesta de estandarización de volumen de audio: hacia una producción sistematizada” [Audio description and audio volume standardization proposal: towards a systematized production]. Ph.D. diss., Universitat Autònoma de Barcelona. [URL]

Scherer, Klaus R. 1986. “Vocal Affect Expression: A Review and a Model for Future Research.” Psychological Bulletin 991: 143.

2003. “Vocal Communication of Emotion: A Review of Research Paradigms.” Speech Communication 401: 227–256.

2013. “Emotion in Action, Interaction, Music, and Speech.” In Language, Music, and the Brain: A Mysterious Relationship, edited by Michael A. Arbib, 107–139. Cambridge, MA: MIT Press.

Snyder, Joel. 2014. The Visual Made Verbal: A Comprehensive Training Manual and Guide to the History and Applications of Audio Description. Arlington, VA: American Council for the Blind.

Stolarski, Łukasz. 2015. “Pitch Patterns in Vocal Expression of ‘Happiness’ and ‘Sadness’ in the Reading aloud of Prose on the Basis of Selected Audiobooks.” Research in Language 131: 141–162.

Szarkowska, Agnieszka. 2011. “Text-to-Speech Audio Description: Towards Wider Availability of Audio dDescription.” JosTrans: Journal of Specialized Translation 151: 142–162.

Szymańska, Barbara, and Tomasz Strzymiński. 2010. “Standardy tworzenia audiodeskrypcji do produkcji audiowizualnych” [Standards for creating audio description for audiovisual productions]. Audio Deskrypcja [Audio description]. [URL]

Ververidis, Dimitrios, and Constantine Kotropoulos. 2006. “Emotional Speech Recognition: Resources, Features, and Methods.” Speech Communication 48 (9): 1162–1181.

Villoslada, Ana. 2018. “La lengua de la audiodescripción del guion audiodescrito de Ocho apellidos vascos” [The language of the audio description of the audio described script of Ocho apellidos vascos]. TRANS: revista de traductología 221: 61–80.

Walczak, Agnieszka, and Louise Fryer. 2018. “Vocal Delivery of Audio Description by Genre: Measuring Users’ Presence.” Perspectives: Studies in Translatology 26 (1): 69–83.

Cited by (2)

Cited by two other publications

Fernández Alonso, Inés, María J. Machuca & Anna Matamala

2024. Voces neutras y alteración tonal. Círculo de Lingüística Aplicada a la Comunicación 97 ► pp. 195 ff.

Jiang, Lucy, Crescentia Jung, Mahika Phutane, Abigale Stangl & Shiri Azenkot

2024. Proceedings of the CHI Conference on Human Factors in Computing Systems , ► pp. 1 ff.

This list is based on CrossRef data as of 9 october 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.