Publication details [#4413]

Gao, Yuqing, Bowen Zhou, Zijian Diao, Jeffrey Sorensen and Michael Picheny. 2002. MARS: a statistical semantic parsing and generation-based multilingual automatic translation system. Machine Translation 17 (3) : 185–212.
Publication type
Article in jnl/bk
Publication language
Source language
Target language


In this article MARS (Multilingual Automatic tRanslation System) is presented, a research prototype speech-to-speech translation system. MARS is aimed at two-way conversational spoken language translation between English and Mandarin Chinese for limited domains, such as air travel reservations. In MARS, machine translation is embedded within a complex speech processing task, and the translation performance is highly effected by the performance of other components, such as the recognizer and semantic parser, etc. All components in the proposed system are statistically trained using an appropriate training corpus. The speech signal is first recognized by an automatic speech recognizer (ASR). Next, the ASR-transcribed text is analyzed by a semantic parser, which uses a statistical decision-tree model that does not require hand-crafted grammars or rules. Furthermore, the parser provides semantic information that helps further re-scoring of the speech recognition hypotheses. The semantic content extracted by the parser is formatted into a language-independent tree structure, which is used for an interlingua based translation. A Maximum Entropy based sentence-level natural language generation (NLG) approach is used to generate sentences in the target language from the semantic tree representations. In a final instance, the generated target sentence is synthesized into speech by a speech synthesizer.
Source : Based on abstract in journal