Publications
Publication details [#38508]
Zhang, Xiaojun. 2021. Machine Translation Problems at Discourse Level: pro-drop language and large-context machine translation. In Wang, Caiwen and Binghan Zheng (郑冰寒), eds. Empirical Studies of Translation and Interpreting: the post-structuralist approach (Routledge Advances in Translation and Interpreting Studies). London: Routledge. pp. 198–216.
Publication type
Article in journal/book
Publication language
English
Abstract
This chapter introduces novel discourse-processing methods for statistical machine translation (SMT) and neural machine translation (NMT) architectures. It describes the first attempt at investigating the potential for implicitly incorporating discourse information into machine translation (MT), and addresses the research question of MT problems at the discourse level, in particular the influence of global context on SMT and NMT performance. Focusing on two typical discourse-level MT scenarios, pro-drop languages and large-context NMT, the chapter presents novel approaches to improving translation quality for these problems. Experimental results show that identifying dropped pronouns is crucial to improving translation performance, and demonstrate that the novel model significantly outperforms a strong attention-based large-context NMT baseline system.
Source: Publisher information