Universals in machine translation?
A corpus-based study of Chinese-English translations by WeChat Translate
By examining and comparing the linguistic patterns in a self-built corpus of Chinese-English translations produced by WeChat Translate, the latest online machine translation app from the most popular social media platform (WeChat) in China, this study explores such questions as whether or not and to what extent simplification and normalization (hypothesized Translation Universals) exhibit themselves in these translations. The results show that, whereas simplification cannot be substantiated, the tendency of normalization to occur in the WeChat translations can be confirmed. The research finds that these results are caused by the operating mechanism of machine translation (MT) systems. Certain salient words tend to prime WeChat’s MT system to repetitively resort to typical language patterns, which leads to a significant overuse of lexical chunks. It is hoped that the present study can shed new light on the development of MT systems and encourage more corpus-based product-oriented research on MT.
- 2.Translation Universals
- 3.Corpora and methodology
- 3.1Corpora compilation
- 4.Findings and discussion
- 4.1Results for simplification
- 4.2Results for normalization
- 4.2.1NV Ratio
- 4.2.2Correction of punctuation
- 4.2.3The overuse of typical grammatical patterns