Publications

Publication details [#7329]

Langlais, Philippe and Michael Carl. 2004. General-purpose statistical translation engine and domain specific texts: would it work? In Daille, Béatrice, Kyo Kageura, Hiroshi Nakagawa and Lee-Feng Chien, eds. Recent trends in computational terminology. Special issue of Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 10 (1): 135–157.

Publication type

Article in Special issue

Publication language

English

Journal DOI

10.1075/term

Abstract

Past decades have witnessed exciting work in the field of statistical machine translation (SMT). However, accurate evaluation of its potential in real-life contexts is still an open question. In this study, the authors investigate the behavior of an SMT engine faced with a corpus very different from the one it was trained on. They show that terminological databases are obvious resources that should be used to boost the performance of a statistical engine. They propose and evaluate one way of integrating terminology into an SMT engine, which yields a significant reduction in word error rate.

Source: Based on abstract in journal