From theoretical hypotheses to practical tools: Computational phraseology and translation studies

Colson, Jean-Pierre

doi:10.1075/ivitra.24.04col

Part of

Computational Phraseology
Edited by Gloria Corpas Pastor and Jean-Pierre Colson
[IVITRA Research in Linguistics and Literature 24] 2020
► pp. 65–82

Computational phraseology and translation studies

From theoretical hypotheses to practical tools

Jean-Pierre Colson | Université catholique de Louvain

The notion of phraseology is now used across a wide range of linguistic disciplines but it is conspicuously absent from most studies in the area of Translation Studies (e.g. Delisle, 2003; Baker and Saldanha, 2011). The paradox is that many practical difficulties encountered by translators and interpreters are directly related to phraseology in the broad sense (Colson, 2008, 2013), and this can also clearly be seen in the failure of machine translation systems to deal efficiently with the translation of phraseological units (PUs).

We argue that phraseology and translation studies have much to gain from cross fertilisation, because both disciplines are regularly criticised for their lack of coherent terminological description and for the insufficient number of reproducible experiments they involve.

Decoding phraseology in the source text is far from easy for translators and interpreters, all the more so as they are usually not native speakers of the source language. Finding a natural formulation in the target language and avoiding translationese requires an excellent mastery of the phraseology of the target language. Even experienced professionals sometimes fail to detect the fixed or semi-fixed character of a source text construction. We argue that algorithms derived from text mining and information retrieval techniques can be efficient and (computationally) cost-effective in order to build up unfiltered collections of recurrent fixed or semi-fixed phrases, from which translators could gain information about the number of PUs in the source text. Such an algorithm has been proposed in Colson (2016) and has been implemented in a web application enabling translators and language professionals to automatically retrieve most PUs from a source text. Other tools should be developed in order to bridge the gap between the findings of computational phraseology and the practice of translation and interpreting.

Keywords: phraseology, computational linguistics, translation, interpreting, text mining

Article outline

1.Introduction
2.Phraseology and translation studies
3.Problems posed by phraseology to human translation
4.Problems posed by phraseology to machine translation
5.Theoretical hypotheses
6.Towards new practical tools
7.Conclusion
Notes
References

Published online: 8 May 2020

https://doi.org/10.1075/ivitra.24.04col

References (34)

References

Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern Information Retrieval. New York: ACM Press, Addison Wesley.

Baker, M., & Saldanha, G. (Eds). (2011). Routledge Encyclopedia of Translation Studies. New York, NY: Routledge.

Baroni, M., Bernardini, S., Ferrares, A. & Zanchetta, E. (2009). The WaCky Wide Web: A collection of very large linguistically processed Web-crawled corpora. Journal of Language Resources and Evaluation, 43, 209–26.

.

Barreiro, A., Monti, J., Batista, F., & Orliac, B. (2013). When multiword go bad in machine translation. In J. Monti, R. Mitkov, G. Corpas Pastor, & V. Secretan (Eds.), Workshop Proceedings: Multi-word units in machine translation and translation technologies, Nice 14th Machine Translation Summit (pp. 26–33).

Burger, H., Dobrovol’skij D., Kühn, P., & Norrick, N. R. (Eds.) (2007). Phraseologie / Phraseology. Ein internationals Handbuch der zeitgenössischen Forschung / An International Handbook of Contemporary Research. Berlin/New York: de Gruyter.

Colson, J.-P. (2008). Cross-linguistic phraseological studies: An overview. In S. Granger, & F. Meunier (Eds.), Phraseology. An interdisciplinary perspective (pp. 191–206). Amsterdam/Philadelphia: John Benjamins.

(2010). Fraseologie en (ver)taalcompetentie. In C. Van de Poel, & W. Segers (Eds.), Tolk- en vertaalcompetentie. Onderwijs- entoetsvormen (pp. 111–124). Leuven: Acco.

(2011). La traduction spécialisée basée sur les corpus : une expériencedans le domaine informatique. In I. Sfar, & Mejri, S. (Eds.), La traduction de textes spécialisés : retour sur des lieux communs (pp. 115–123). Synergies Tunisie 2.

(2013). Pratique traduisante et idiomaticité :l’importance des structures semi-figées. In P. Mogorrón Huerta, D. Gallego Hernández, & M. Tolosa Igualada (Eds.), Fraseología, Opacidad y Traducción (pp. 207–218). Frankfurt am Main: Peter Lang.

(2016). Set phrases around globalization: an experiment in corpus-based computational phraseology. In F. Alonso Almeida, I. Ortega Barrera, E. Quintana Toledo, & M. E. Sánchez Cuervo (Eds.), Input a Word, Analyze the World. Selected Approaches to Corpus Linguistics (pp. 141–152). Newcastle: Cambridge Scholars Publishing.

Corpas Pastor, G. (2000). Acerca de la (in)traducibilidad de la fraseología. In G. Corpas Pastor (Ed.), Las lenguas de Europa: Estudios de fraseología, fraseografía y traducción (pp. 483–522). Granada: Comares.

(2007). Europäismen– von Natur aus phraseologische Äquivalente? Von blauem Blut und sangreazul. In M. Emsel, & J. Cuartero Ota (Eds.), Brücken: Übersetzen und interkulturelle Kommunikationen. Festschrift für Gerd Wotjakzum 65. Geburtstag(pp. 65–77). Frankfurt: Peter Lang.

Corpas Pastor, G. (2008). Investigar con corpus en traducción: los retos de un nuevo paradigma (Studien zur romanische Sprachwissenschaft und interkulturellen Kommunikation, 49). Frankfurt: Peter Lang.

(2013). Detección, descripción y contraste de las unidades fraseológicas mediante tecnologías lingüísticas. In I. Olza, & E. Manero (Eds.), Fraseopragmática (pp. 335–373). Berlin: Frank & Timme.

Delisle, J. (2003). La traduction raisonnée. Ottawa: Presses de l’Université d’Ottawa.

Erman, B., & Warren, B. (2000). The idiom principle and the open-choice principle. Text, 20(1), 29–62.

Firth, J. R. (1957). A synopsis of linguistic theory, 1930–1955. In Philological Society (Ed.), Studies in Linguistic Analysis (pp. 1–32). Oxford: Basil Blackwell.

Granger, S. & Meunier, F. (Eds). (2008). Phraseology. An interdisciplinary perspective. Amsterdam/Philadelphia: John Benjamins.

Gries, S. Th. (2013). 50-something years of work on collocations. What is or should be next… International Journal of Corpus Linguistics, 18(1), 137–165.

Hanks, P. (2010). Lexicography, Terminology, and Phraseology. In Proceedings of Euralex 2010. Leeuwarden.

Jackendoff, R. S. (1995). The boundaries of the lexicon. In M. Everaert, E. van der Linden, A. Schenk, & R. Schroeder (Eds.), Idioms: Structural and psychological perspectives (pp. 133–165). Hillsdale, NJ: Lawrence Erlbaum Associates.

Kruger, A., Wallmach, K., & Munday, J. (Eds.) (2011). Corpus-Based Translation Studies. Research and Applications. London: Bloomsbury.

Leiva Rojo, J. (2013). La traducción de unidades fraseológicas (alemán-español/español-alemán) como parámetro para la evaluación y revisión de traducciones. In C. Mellado Blanco, P. Buján, N. M. Iglesias, M. C. Losada, & A. Mansilla (Eds.), La fraseología del alemán y el español: lexicografía y traducción (ELS, Etudes Linguistiques / Linguistische Studien, Band 11)(pp. 31–42). München: Peniope.

Lewis, M. (1993). The Lexical Approach. Hove: Language Teaching Publications.

(Ed.) (1997). Implementing the Lexical Approach. Hove: Language Teaching Publications.

Mel’čuk, I. (1995). Phrasemes in language and phraseology in linguistics. In M. Everaert, E. van der Linden, A. Schenk, & R. Schroeder (Eds.), Idioms: Structural and psychological perspectives (pp. 167–232). Hillsdale, NJ: Lawrence Erlbaum Associates.

Mellado Blanco, C. (2013). La gramaticalización de las restricciones y preferencias de uso de las unidades fraseológicas del español y alemán desde un enfoque cognitivo-pragmático. In I. Olza, & E. Manero (Eds.), Fraseopragmática(pp. 303–331). Berlin: Frank & Timme.

Monti, J., Mitkov, R., Corpas Pastor G., & Seretan, V. (Eds.) (2013). Workshop Proceedings: Multi-word units in machine translation and translation technologies, Nice 14th Machine Translation Summit.

Omazié, M. (2005). Cognitive linguistic theories in phraseology. Jezikoslovlje, 6(1), 37–56.

Schnur, E. (2014). Phraseological Signaling of Discourse Organization in Academic Lectures: A Comparison of Lexical Bundles in Authentic Lectures and EAP Listening Materials. Yearbook of Phraseology, 5(1), 95–121.

Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford: Oxford University Press.

Teubert, W. (2002). The role of parallel corpora in translation and multilingual lexicography. In B. Altenberg, & S. Granger (Eds.), Lexis in Contrast: Corpus-based approaches (pp. 189–214). Amsterdam / Philadelphia: John Benjamins.

Tirkkonen-Condit, S. (2002). Translationese – a myth or an empirical fact? Target, 14, 207–220.

Wray, A. (2009). Formulaic Language: Pushing the Boundaries. Oxford: Oxford University Press.

Cited by (3)

Cited by three other publications

Bekhta, Ivan, Nataliya Bondarchuk, Nataliia Hrytsiv, Olena Levchenko, Olha Matviienkiv & Oksana Melnychuk

2022. 2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT), ► pp. 103 ff.

Giczela-Pastwa, Justyna

2021. Developing phraseological competence in L2 legal translator trainees: a proposal of a data mining technique applied in translation from an LLD into ELF. The Interpreter and Translator Trainer 15:2 ► pp. 187 ff.

Monti, Johanna, Violeta Seretan, Gloria Corpas Pastor & Ruslan Mitkov

2018. Multiword units in machine translation and translation technology. In Multiword Units in Machine Translation and Translation Technology [Current Issues in Linguistic Theory, 341], ► pp. 2 ff.

This list is based on CrossRef data as of 28 june 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.