Machine translation and legal terminology
Data-driven approaches to contextual accuracy
This chapter addresses machine translation (MT) with an eye to legal terminology. The translation
of legal terms and phrasemes may be fraught with contextual complexities, and context has long been the Achilles’ heel
of MT. Nevertheless, neural MT (NMT) and statistical MT (SMT) have made considerable progress in recent years, thanks
to data-driven approaches making use of potentially related corpora to overcome contextual obstacles. Such approaches
and the potential frozenness or repetitiveness of legal terms and phrases may allow MT to overcome some of these
obstacles. This chapter reviews contextual complexities surrounding legal terminology, NMT and SMT architectures, and
research on MT and legal translation to determine what might be expected from data-driven MT in this context.
Article outline
- 1.Introduction
- 2.Translation of legal terminology
- Target language (TL) equivalents
- ST comprehension
- TT drafting
- 3.Data-driven MT: SMT and NMT
- 4.Research on machine translation of legal texts: Legal terminology
- 5.Conclusions
-
Notes
-
References
References (91)
References
Alcaraz Varó, Enrique. 2009. “Isomorphism and Anisomorphism in the Translation of Legal Texts.” In Translation Issues in Language and Law, edited by Frances Olsen, Alexander Lorz, and Dieter Stein, 182–192. New York: Palgrave Macmillan.
Alcaraz Varó, Enrique and Brian Hughes. 2002. Legal Translation Explained. Manchester: St. Jerome Publishing.
Arnejšek, Mateja and Alenka Unk. 2020. “Multidimensional Assessment of the eTranslation Output for English-Slovene.” In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 3–5
November, Lisbon, edited by André Martins, Helena Moniz, Sara Fumega, Bruno Martins, Fernando Batista, Luisa Coheur, Carla Parra, Isabel Trancoso, Marco Turchi, Arianna Bisazza, Joss Moorkens, Ana Guerberof, Mary Nurminen, Lena Marg, and Mikel L. Forcada, 383–392. Lisbon: European Association for Machine Translation. [URL]
Bahdanau, Dzmitry, Kyung H. Cho, and Yoshua Bengio. 2015. “Neural Machine Translation by Jointly Learning to Align and Translate.” In
Proceedings of the International Conference on Learning Representations
, 7–8 May 2015, San Diego, CA, USA. [URL]
Bar-Hillel, Yehoshua. 1960. “The Present Status of Automatic Translation of Languages.” Advances in Computers 1:91–163.
Bendana, Lola and Alan Melby. 2012. Almost Everything You Ever Wanted to Know about Translation. Toronto: Multi-Languages Corporation.
Bentivogli, Luisa, Arianna Bisazza, Mauro Cettolo, and Marcello Federico. 2016. “Neural versus Phrase-Based Machine Translation Quality: A Case Study.” In
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
, 1–5 November 2016, Austin, Texas, USA, 257–267. Red Hook, New York: Curran Associates.
Bestué, Carmen and Mariana Orozco Jutorán. 2010. Translation of Legal Texts. Brownsville, Texas: University of Texas at Brownsville and Texas Southmost College.
Biel, Łucja. 2007. “Translation and Multilingual EU Legislation as a Subgenre of Legal Translation.” In Court Interpreting and Legal Translation in the Enlarged Europe 2006, edited by Danuta Kierzkowska, 144–163. Warsaw: Translegis.
Biel, Łucja. 2008. “Legal Terminology in Translation Practice: Dictionaries, Googling or Discussion
Forums?” Skase 3(1):22–38.
Biel, Łucja. 2017. “Researching Legal Translation: A Multi-perspective and Mixed-method Framework for Legal
Translation.” Revista de Llengua i Dret, Journal of Language and Law 68:76–188.
Bojar, Ondrej, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, and Marcos Zampieri. 2016. “Findings of the 2016 Conference on Machine Translation.” In Proceedings of the First Conference on Machine Translation of the Association for Computational
Linguistics, Volume 2: Shared Task Papers, 11–12 August 2016, Berlin, Germany, edited by Ondřej Bojar, Christian Buck, Rajen Chatterjee, Christian Federmann, Liane Guillou, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Aurélie Névéol, Mariana Neves, Pavel Pecina, Martin Popel, Philipp Koehn, Christof Monz, Matteo Negri, Matt Post, Lucia Specia, Karin Verspoor, Jörg Tiedemann, and Marco Turchi, 131–198. Berlin: Association for Computational Linguistics. [URL].
Borja Albi, Anabel. 2005. “¿Es posible traducir realidades jurídicas? Restricciones y prioridades en la traducción de
documentos de sucesiones británicos al español.” In La traducción y la interpretación en las relaciones jurídicas internacionales, edited by Esther Monzó Nebot and Anabel Borja Albi, 65–89. Castelló de la Plana: Publicacions de la Universitat Jaume I.
Borja Albi, Anabel. 2013. “Freelance Translation for Multinational Corporations and Law Firms.” In Legal Translation in Context, edited by Anabel Borja Albi and Fernando Prieto Ramos, 53–74. Bern: Peter Lang.
Cao, Deborah. 2007. Translating Law. Clevedon: Multilingual Matters.
Castilho, Sheila, Joss Moorkens, Federico Gaspari, Iacer Calixto, John Tinsley, and Andy Way. 2017a. “Is Neural Machine Translation the New State of the Art? The Prague Bulletin of Mathematical Linguistics 108(1):109–120.
Castilho, Sheila, Joss Moorkens, Federico Gaspari, Rico Sennrich, Vilelmini Sosoni, Panayota Georgakopoulou, Pintu Lohar, Andy Way, Antonio V. Miceli Barone, and Maria Gialama. 2017b. “A Comparative Quality Evaluation of PBSMT and NMT Using Professional Translators.” In Proceedings of MT Summit XVI. Volume 1: Research Track, 18–22 September 2017, Nagoya, Japan, 116–131. [URL]
Chromá, Marta. 2011. “Synonymy and Polysemy in Legal Terminology and Their Application to Bilingual and Bijural
Translation.” Research in Language 9(1):31–50.
Chromá, Marta. 2014. “Making Sense in Legal Translation.” Semiotica 201:121–144.
Crego, Josep, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurélien Coquard, Yongchao Deng, Satoshi Enoue, Chiyo Geiss, Joshua Johanson, Ardas Khalsa, Raoum Khiari, Byeongil Ko, Catherine Kobus, Jean Lorieux, Leidiana Martins, Dang C. Nguyen, Alexandra Priori, Thomas Riccardi, Natalia Segal, Christophe Servan, Cyril Tiquet, Bo Wang, Jin Yang, Dakun Zhang, Jing Zhou, and Peter Zoldan. 2016. “Systran’s Pure Neural Machine Translation Systems.” CoRR, abs/1610.05540. [URL]
Desmet, Luca. 2021. An Exploratory Study of Professional Post-Edits by English-Dutch DGT Translators. Unpublished MA thesis. Ghent University.
Dik, Hugo. 2020. CTRL+V for Verdict: An Analysis of Dutch to English Legal Machine Translation. Unpublished MA thesis. Leiden University.
Duro Moreno, Miguel. 2012. “El modelo de los entornos de la traducción: aplicaciones didácticas a la traducción
jurídica.” In À propos de l’enseignement de la traduction et l’interprétation en Europe/Sobre la enseñanza de la
traducción y la interpretación en Europa, edited by Emilio Ortega Arjonilla, Christian Balliu, Esperanza Alarcón Navío, and Ana Belén Martínez López, 279–288. Granada: Comares.
Farzindar, Atefeh and Guy Lapalme. 2009. “Machine Translation of Legal Information and Its Evaluation.” In Advances in Artificial Intelligence, edited by Yong Gao and Nathalie Japkowicz, 64–73. Berlin: Springer.
Forcada, Mikel L. 2010. “Machine Translation Today.” In Handbook of Translation Studies, Vol. 1, edited by Yves Gambier and Luc van Doorslaer, 215–223. Amsterdam: John Benjamins.
Franco Aixelá, Javier. 2022. “Anisomorphisms.” In @ ENTI (Enciclopedia de traducción e interpretación). AIETI (Asociación Ibérica de Estudios de Traducción e Interpretación).
García, Ignacio. 2010. “The Proper Place of Professionals (and Non-Professionals and Machines) in Web
Translation.” Tradumàtica 8:1–7.
García, Ignacio. 2011. “Translating by Post-Editing: Is it the Way Forward?” Machine Translation 25:217–237.
Garrido Nombela, Ramón. 1996. “La traducción en la Comunidad Europea y el lenguaje jurídico comunitario.” Hieronymus 3:35–41.
Glanert, Simone. 2008. “Speaking Language to Law: The Case of Europe.” Legal Studies 28(2):161–171.
Glanert, Simone. 2014. “Law-in-Translation: An Assemblage in Motion.” The Translator 20(3):255–272.
Gotti, Fabrizio, Atefeh Farzindar, Guy Lapalme, and Elliott Macklovitch. 2008. “Automatic Translation of Court Judgments.” In Proceedings of The Eighth Conference of the Association for Machine Translation in the Americas:
Government and Commercial Uses of MT, 21–25 October 2008, Waikiki, Hawaii, USA, 370–379. [URL]
Groot, Gerard-René de and Conrad J. P. van Laer. 2008. “The Quality of Legal Dictionaries: An Assessment.” Maastricht Faculty of Law Working Paper 6, [URL].
Harvey, Malcolm. 2002. “What’s so Special about Legal Translation?” Meta 47(2):177–185.
Heiss, Christine and Marcello Soffritti. 2018. “DeepL Traduttore e didattica della traduzione dall’italiano in tedesco.” In Translation and Interpreting for Language Learners (TAIL), special issue of inTRAlinea, edited by Laurie Anderson, Laura Gavioli, and Federico Zanetting. [URL]
Juncal, Julio A. 2009. “Del papel a la pantalla, de la utopía a la realidad.” Panace@ 10(29):13–15.
Junczys-Dowmunt, Marcin, Tomasz Dwojak, and Hieu Hoang. 2016. “Is Neural Machine Translation Ready for Deployment? A Case Study on 30 Translation
Directions.” In Proceedings of the 13th International Conference on Spoken Language Translation, 8–9 December 2016,
Seattle, WA, USA, edited by Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Rolando Cattoni, and Marcello Federico. [URL]
Kenny, Dorothy and Stephen Doherty. 2014. “Statistical Machine Translation in the Translation Curriculum: Overcoming Obstacles and Empowering
Translators.” The Interpreter and Translator Trainer 8(2):276–294.
Killman, Jeffrey. 2014. “Vocabulary Accuracy of Statistical Machine Translation in the Legal Context.” In Proceedings of the Third Workshop on Post-Editing Technology and Practice (WPTP-3), The 11th Conference of
the Association for Machine Translation in the Americas, 22–26 October 2014, Vancouver, BC, Canada, edited by Sharon O’Brien, Michel Simard, and Lucia Specia, 85–98. Vancouver: AMTA.
Kim-Prieto, Dennis C. 2008. “
En la tierra del ciego, el tuerco es rey: Problems with Current English-Spanish Legal
Dictionaries, and Notes toward a Critical Comparative Legal Lexicography.” Law Library Journal 100(2):251–278.
Kjær, Anne L. 1990. “Context-Conditioned Word Combinations in Legal Language.” Journal of the International Institute for Terminology Research 1(1/2):21–32.
Kockaert, Hendrik J., Tom Vanallemeersch, and Frieda Steurs. 2008. “Term-Based Context Extraction in Legal Terminology: A Case Study in Belgium.” In Proceedings of Current Trends in Terminology, International Conference on Terminology, 9–10 November 2007,
Szombathely, Hungary. [URL]
Koehn, Philipp. 2005. “Europarl: A Parallel Corpus for Statistical Machine Translation.” In Proceedings of Machine Translation Summit X, 13–15 September 2005, Phuket, Thailand, 79–86. [URL]
Koehn, Philipp. 2010. Statistical Machine Translation. New York: Cambridge University Press.
Koehn, Philipp. 2020. Neural Machine Translation. Cambridge: Cambridge University Press.
Koehn, Philipp, Franz J. Och, and Daniel Marcu. 2003. “Statistical Phrase-Based Translation.” In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the
Association for Computational Linguistics, 27 May–1 June, Edmonton, Canada, 127–133. [URL].
Koehn, Philipp and Rebecca Knowles. 2017. “Six Challenges for Neural Machine Translation.” In Proceedings of the First Workshop on Neural Machine Translation of the Association for Computational
Linguistics, 4 August 2017, Vancouver, BC, Canada, edited by Thang Luong, Alexandra Birch, Graham Neubig, and Andrew Finch, 28–39. Stroudsburg, Pennsylvania, USA: Association for Computational Linguistics.
Lesznyák, Ágnes. 2019. “Hungarian Translators’ Perceptions of Neural Machine Translation in the European
Commission.” In Proceedings of MT Summit XVIII: Translator, Project and User Tracks, 19–23 August, Dublin, 16–22. [URL]
Macken, Lieve, Daniel Prou, and Arda Tezcan. 2020. “Quantifying the Effect of Machine Translation in a High-Quality Human Translation Production
Process.” Informatics 7(2):12.
Matulewska, Aleksandra. 2013. Legilinguistic Translatology: A Parametric Approach to Legal Translation. Bern: Peter Lang.
McAuliffe, Karen. 2009. “Translation at the Court of Justice of the European Communities.” In Translation Issues in Language and Law, edited by Frances Olson, Alexander Lorz, and Dieter Stein, 99–115. London: Palgrave Macmillan.
Melby, Alan and Christopher Foster. 2010. “Context in Translation: Definition, Access and Teamwork.” Translation & Interpreting 2(2):1–15.
Mileto, Fiorenza. 2019. “Post-Editing and Legal Translation.” Digital Humanities Journal 1.
Molina, Lucía and Amparo Hurtado Albir. 2002. “Translation Techniques Revisited: A Dynamic Functional Approach.” Meta 47(4):498–512.
Moorkens, Joss. 2018. “What to Expect from Neural Machine Translation: A Practical In-Class Translation Evaluation
Exercise.” The Interpreter and Translator Trainer 12(4):375–387.
Orts Llopis, María Ángeles. 2007. “The Untranslatability of Law? Lexical Differences in Spanish and American Contract
Law.” European Journal of English Studies 11(1):17–28.
Papineni, Kishore, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. “BLEU: A Method for Automatic Evaluation of Machine Translation.” In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, July,
Philadelphia, 311–318. [URL]
Pasteur, Olivier. 2013. “Technology at the service of specialized translators at International Organizations.” In Legal Translation in Context, edited by Anabel Borja Albi and Fernando Prieto Ramos, 283–297. Bern: Peter Lang.
Popović, Maja. 2017. “Comparing Language Related Issues for NMT and PBMT between German and English.” The Prague Bulletin of Mathematical Linguistics 108(1):209–220.
Prieto Ramos, Fernando. 2014. “International and Supranational Law in Translation: From Multilingual Lawmaking to
Adjudication.” The Translator 20(3):313–331.
Prieto Ramos, Fernando. 2016. “Parameters for Problem-Solving in Legal Translation: Implications for Legal Lexicography and
Institutional Terminology Management.” In The Ashgate Handbook of Legal Translation, edited by Le Cheng, King Kui Sin, and Anne Wagner, 121–134. London: Routledge.
Pym, Anthony. 2000. “The European Union and its Future Languages: Questions for Language Policies and Translation
Theories.” Across Languages and Cultures 1(1):1–11.
Pym, Anthony. 2011. “What Technology does to Translating.” Translation & Interpreting 1:1–9.
Robertson, Colin. 2011. “Multilingual Legislation in the European Union: EU and National Legislative-Language Styles and
Terminology.” Research in Language 9(1):51–67.
Roiss, Silvia. 2021. “Y las máquinas rompieron a traducir… Consideraciones didácticas en relación con la traducción
automática de referencias culturales en el ámbito jurídico.” TRANS. Revista de Traductología 25:491–505.
Rossi, Caroline and Jean-Pierre Chevrot. 2019. “Uses and Perceptions of Machine Translation at the European Commission.” The Journal of Specialised Translation 31:177–200.
Şahin, Mehmet and Nilgün Dungan. 2014. “Translation Testing and Evaluation: A Study on Methods and Needs.” Translation & Interpreting 6(2):67–90.
Šarčević, Susan. 1997. New Approach to Legal Translation. The Hague: Kluwer Law International.
Šarčević, Susan. 2000. “Legal Translation and Translation Theory: A Receiver-Oriented Approach” In La traduction juridique: Histoire, théorie(s) et pratique/Legal Translation: History, Theory/ies,
Practice (Proceedings, Geneva, 17–19 February 2000), 329–347. Bern/Geneva: ASTTI/ETI.
Šarčević, Susan. 2018. “Challenges to Legal Translators in Institutional Settings.” In Institutional Translation for International Governance, edited by Fernando Prieto Ramos, 9–24. London: Bloomsbury.
Sennrich, Rico, Barry Haddow, and Alexandra Birch. 2016. “Edinburgh Neural Machine Translation Systems for WMT 16.” In Proceedings of the First Conference on Machine Translation (WMT16), 11–12 August, Berlin, Germany, edited by Ondřej Bojar, Christian Buck, Rajen Chatterjee, Christian Federmann, Liane Guillou, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Aurélie Névéol, Mariana Neves, Pavel Pecina, Martin Popel, Philipp Koehn, Christof Monz, Matteo Negri, Matt Post, Lucia Specia, Karin Verspoor, Jörg Tiedemann, and Marco Turchi, 371–376.
Simonnæs, Ingrid. 2016. “Legal Language: Pragmatic Approaches to Its Interconnectivity with Legal Interpretation and Legal
Translation.” Meta 61(2):421–438.
Stefaniak, Karolina. 2020. “Evaluating the Usefulness of Neural Machine Translation for the Polish Translators in the European
Commission.” In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 3–5
November, Lisbon, edited by André Martins, Helena Moniz, Sara Fumega, Bruno Martins, Fernando Batista, Luisa Coheur, Carla Parra, Isabel Trancoso, Marco Turchi, Arianna Bisazza, Joss Moorkens, Ana Guerberof, Mary Nurminen, Lena Marg, and Mikel L. Forcada. [URL]
Sutskever, Ilya, Oriol Vinyals, and Quoc V. Le. 2014. “Sequence to Sequence Learning with Neural Networks.” In Advances in Neural Information Processing Systems, edited by Zoubin Ghahramani, Max Welling, Corinna Cortes, Neil D. Lawrence, and Kilian Q. Weinberger, 3104–3112. Montréal: NIPS.
Thiry, Bernard. 2009. “Análisis crítico de algunos diccionarios jurídicos publicados.” EntreCulturas 1:443–468.
Toral, Antonio and Víctor M. Sánchez-Cartagena. 2017. “A Multifaceted Evaluation of Neural versus Phrase-Based Machine Translation for 9 Language
Directions.” In Proceedings of the 15th Conference of the European Chapter of the Association for Computational
Linguistics: Volume 1, Long Papers, 3–7 April 2017, Valencia, Spain, edited by Mirella Lapata, Phil Blunsom, and Alexander Koller, 1063–1073. Stroudsburg, Pennsylvania, USA: Association for Computational Linguistics.
Toral, Antonio, Sheila Castilho, Ke Hu, and Andy Way. 2018. “Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine
Translation.” In Proceeding of the Third Conference on Machine Translation of the Association for Computational
Linguistics, Volume 1: Research Papers, 31 October–1 November, Brussels, 113–123. [URL].
Vanallemeersch, Tom and Hendrick J. Kockaert. 2010. “Automatic Detection of Inconsistent Phraseology Translation.” Southern African Linguistics and Applied Language Studies 28(3):283–290.
Vardaro, Jennifer, Moritz Schaeffer, and Silvia Hansen-Schirra. 2019. “Translation Quality and Error Recognition in Professional Neural Machine Translation
Post-Editing.” Informatics 6(3):41.
Way, Andy. 2020. “Machine Translation: Where Are We at Today?” In The Bloomsbury Companion to Language Industry Studies, edited by Erik Angelone, Maureen Ehrensberger-Dow, and Gary Massey, 311–332. New York: Bloomsbury Academic.
Way, Catherine. 2016. “The Challenges and Opportunities of Legal Translation and Translator Training in the 21st
Century.” International Journal of Communication 10:1009–1029.
Wiesmann, Eva. 2019. “Machine Translation in the Field of Law: A Study of the Translation of Italian Legal Texts into
German.” Comparative Legilinguistics 37:117–153.
Wu, Yonghui, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, and Jeffrey Dean. 2016. “Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine
Translation.” CoRR, abs/1609.08144. arXiv:1609.08144
Yates, Sarah. 2006. “Scaling the Tower of Babel Fish: An Analysis of the Machine Translation of Legal
Information.” Law Library Journal 98(3):481–500.
Cited by (1)
Cited by one other publication
Killman, Jeffrey
2024.
Machine translation literacy in the legal translation context: a SWOT analysis perspective.
The Interpreter and Translator Trainer 18:2
► pp. 271 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.