Publications

Publication details [#3352]

Chuang, Thomas C., G. N. You and Jason S. Chang. 2002. Adaptive bilingual sentence alignment. In Richardson, Stephen D., ed. Machine Translation: from research to real users (Lecture Notes in Computer Science 2499). Cham: Springer. pp. 21–30.
Publication type
Article in jnl/bk
Publication language
English
Source language
Target language

Abstract

The authors present a new approach to the problem of aligning English and Chinese sentences in a bilingual corpus based on adaptive learning. While using length information alone produces surprisingly good results for aligning bilingual French and English sentences with success rates well over 95%, it does not fare as well for the alignment of English and Chinese sentences. The crux of the problem lies in greater variability of lengths and match types of the matched sentences. The authors propose to cope with such variability via a two-pass scheme under which model parameters can be learned from the data at hand. Experiments show that under the approach bilingual English-Chinese texts can be aligned effectively across diverse domains, genres and translation directions with accuracy rates approaching 99%.
Source : Bitra