When French becomes Canadian French
The curious case of localizing COVID-19 terms with Microsoft Translator
In late 2020, the free online translation tool Microsoft Translator began to offer the option of translating into
“French (Canada)” as a target language, alongside the previously offered “French”. Using a list of ten COVID-19 terms previously
identified by
Bowker (2020) as having different equivalents in Canadian French and
European French, we evaluate the ability of Microsoft Translator to localize these terms into the two varieties of French. The
findings indicate that while this tool does a good job of localizing the terms into Canadian French, it also uses a high number of
Canadian French terms when the target language is set to “French”. One potential reason for this may be that the corpus used to
train the tool for “French” contains a disproportionate number of examples from Canadian sources, and so there may be a problem of
bias where the tool is amplifying Canadian French in the machine translation output.
Article outline
- 1.Introduction
- 2.Data-driven approaches to MT
- 3.Method
- 3.1Testing for influence of geographic location or explicit contextual clues
- 3.2Experiment
- 4.Results
- 5.Discussion
- 6.Concluding remarks
-
References
References (43)
References
Académie française. 2020, May 7. “Le covid 19 ou la covid 19.” Available at: [URL]
Carpuat, Marine. 2014. “Mixed-language code-switching in the Canadian Hansard.” In Proceedings of the First Workshop on Computational Approaches to Code-Switching, 107–115. Association for Computational Linguistics. Available at: [URL]. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Castilho, Sheila, Joss Moorkens, Federico Gaspari, Rico Sennrich, Vilelmini Sosoni, Panayota Georgakopoulou, Pintu Lohar, Andy Way, Antonio Valerio Miceli Barone, and Maria Gialama. 2017. “A Comparative Quality Evaluation of PBSMT and NMT Using Professional Translators.” In Proceedings of the 16th Machine Translation Summit (MT Summit XVI), 116–31. Available at: [URL]
DeepL. 2020, August 23. “DeepL Translator now offers both American and British English!” DeepL Blog. Available at: [URL]
DeepL Translator (2022): [URL]
Desjardins, Renée. 2022. “Hello/Bonjour won’t cut it in a health crisis: An analysis of language policy and translation strategy across Manitoban websites and social media during COVID-19.” In Translation and Social Media Communication in the Age of the Pandemic, ed. by Tong King Lee and Dingkun Wang, 78–97. London: Routledge. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Dubuc, Robert. 2002. Manuel pratique de terminologie (4e édition) Brossard: Linguatech.![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
French Language Services Act (1990). Available at: [URL]
Google Translate (2022): [URL]
Hajek, John, Maria Karidakis, Riccardo Amorati, Yu Hao, Medha Sengupta, Anthony Pym, and Robyn Woodward-Kron. 2022. “Understanding the experiences and communication needs of culturally and linguistically diverse communities during the COVID-19 pandemic.” Report prepared for the Victorian Department of Families, Fairness and Housing, Australia. Available at: [URL]
Inter-Active Terminology for Europe (IATE) (2020): [URL]
Isabelle, Pierre. 1993. “Bi-text: Toward a New Generation of Support Tools for Translation and Terminology.” Technical report prepared for the Centre for Information Technologies Innovation, Canada. Available at: [URL]
Koehn, Philipp. 2005. “Europarl: A parallel corpus for statistical machine translation.” In Proceedings of Machine Translation Summit
X
1, 79–86. Available at: [URL]
Kumar, Sachin, Antonios Anastasopoulos, Shuly Wintner, and Yulia Tevetkov. 2021. “Machine Translation into Low-resource Language Varieties.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Vol 2 (Short Papers), 110–121. Available at: [URL]. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Lakew, Surafel M., Aliia Erofeeva, and Marcello Federico. 2018. “Neural Machine Translation into Language Varieties.” In Proceedings of the Third Conference on Machine Translation (WMT), 156–164. Available at: [URL]. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Läubli, Samuel, Chantal Amrhein, Patrick Düggelin, Beatriz Gonzalez, Alena Zwahlen, and Martin Volk. 2019. “Post-editing productivity with neural machine translation: An empirical assessment of speed and quality in the banking and finance domain.” In Proceedings of MT Summit XVII, Vol. 1, 267–272. Available at: [URL]
Ma, Xiaoyi. 1999. “Parallel Text Collections at the Linguistic Data Consortium.” In Proceedings of Machine Translation Summit VII, 346–348. Available at: [URL]
McCulloch, Gretchen. 2020, May 31. “Covid-19 is history’s biggest translation challenge.” WIRED. Available at: [URL]
Microsoft. 2020a, December 16. “Microsoft Translator’s highlights of 2020.” Microsoft Translator Blog. Available at: [URL]
Microsoft. 2020b, October 20. “C’est tiguidou ça! Translator adds Canadian French.” Microsoft Translator Blog. Available at: [URL]
Microsoft. 2020c, April 29. “Portuguese: A tale of two dialects.” Microsoft Translator Blog. Available at: [URL]
Microsoft Translator (2022): [URL]
Mitchell, Christine. 2022, March 8. “How Canada Accidentally Helped Crack Computer Translation.” The Walrus. Available at: [URL]
Office québécois de la langue française (OQLF) 2022, April 12. Fiche et articles en lien avec la COVID-19. Grand dictionnaire terminologique. Available at: [URL]
Organisation internationale de la Francophonie. 2019. La Langue française dans le monde. Paris: Gallimard. Available at: [URL]
Official Languages Act (1988). Available at: [URL]
Pérez-Ortiz, Juan Antonio, Mikel L. Forcada, and Felipe Sánchez-Martínez. 2022. “How neural machine translation works.” Machine Translation for Everyone: Empowering Users in the Age of Artificial Intelligence, ed. by Dorothy Kenny, 141–164. Berlin: Language Science Press. Available at: [URL] ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Radio-Canada. 2020, October 27. “COVID-19: Une garderie frustrée par la communication unilingue de la province.” Radio-Canada website. Available at: [URL]
Saint, Elizabeth. 2021. Faire participer le public à l’aménagement terminologique : proposition et application du modèle d’externalisation ouverte « En bons termes », PhD thesis, University of Ottawa. Available at: ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Standing Committee on Official Languages, Government of Canada. 2021. “Translation Bureau.” Available at: [URL]
Statistics Canada. 2017. “English, French and official language minorities in Canada: Census of Population 2016.” Available at: [URL]
TERMIUM Plus (2020): [URL]
Translation Bureau. 2020. Glossary on the COVID-19 pandemic. Available at: [URL]
Translation Bureau. 2022. Internal report on TERMIUM Plus usage (April 2021-March 2022). Terminology Standardization Division, Linguistic Services, Translation Bureau, Public Services and Procurement, Government of Canada.![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Vanmassenhove, Eva, Christian Hardmeier, and Andy Way. 2018. “Getting gender right in neural machine translation.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), 3003–3008. Available at: [URL]. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Vanmassenhove, Eva, Dimitar Shterionov, and Matthew Gwillam. 2021. “Machine Translationese: Effects of Algorithmic Bias on Linguistic Complexity in Machine Translation.” In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 2203–2213. Available at: [URL]. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
Volansky, Vered, Noam Ordan, and Shuly Wintner. 2015. “On the features of translationese.” Digital Scholarship in the Humanities 30(1): 98–118. ![DOI logo](https://benjamins.com/logos/doi-logo.svg)
![Google Scholar](https://benjamins.com/logos/google-scholar.svg)
Willing, Jon. 2022, Feb 25. “Vaccine clinics generate French-language complaints.” Ottawa Citizen. Available at: [URL]
World Intellectual Proper Organization (WIPO). 2020. WIPO Pearl COVID-19 Glossary. Geneva: WIPO. Available at: [URL]
Cited by (2)
Cited by two other publications
Wermuth, Maria-Cornelia & Paul Sambre
Techawongstien, Koraya & Narongdej Phanthaphoommee
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.