A directional parallel corpus of Chinese source texts and English translations: Chapter 2. ZHEN

Gu, Yi; Frankenberg-Garcia, Ana

doi:10.1075/btl.158.02gu

Part of

Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
Edited by Julia Lavid-López, Carmen Maíz-Arévalo and Juan Rafael Zamorano-Mansilla
[Benjamins Translation Library 158] 2021
► pp. 49–74

Chapter 2
ZHEN

A directional parallel corpus of Chinese source texts and English translations

Yi Gu | Centre for Translation Studies, University of Surrey

Ana Frankenberg-Garcia | Centre for Translation Studies, University of Surrey

Most Chinese-English parallel corpora consist of English source texts translated into Chinese. This chapter discusses the need for corpora representative of the under-resourced Chinese into English translation direction. After a brief overview of the current Chinese-English translation scenario and an analysis of existing parallel corpora for this language pair, we discuss problems in mining contemporary Chinese to English translations and issues in Chinese to English parallel text alignment. We then introduce ZHEN, a corpus of circa one-million characters of contemporary simplified Chinese source texts from a range of text types aligned with authentic translations into English. Its aim is to contribute to our understanding of Chinese to English translation norms and of features of English translated from Chinese.

Keywords: parallel corpora, parallel text alignment, Chinese-English translation, ZHEN corpus

Article outline

1.Introduction
2.A brief overview of the Chinese-English translation scenario
3.Chinese-English parallel corpora
4.The ZHEN corpus
- 4.1Text selection
- 4.2Parallel text alignment
  - 4.2.1Alignment criteria
  - 4.2.2Alignment tools
- 4.3Corpus compilation
5.Applications of ZHEN
6.Conclusion and future directions
Notes
References
Appendix

Published online: 8 December 2021

https://doi.org/10.1075/btl.158.02gu

References (37)

References

2020. [URL]. Accessed February 6. [URL]

Abrahamsen, Eric. 2020. “Chinese Literature in Translation.” Paper Republic. Accessed February 6. [URL]

Anthony, Lawrence. 2012. “AntPConc.” Waseda University, Tokyo.

Barlow, Michael. 2002. “ParaConc: Concordance software for multilingual parallel corpora.” Proceedings of the Third International Conference on Language Resources and Evaluation. Workshop on Language Resources in Translation Work and Research.

Frankenberg-Garcia, Ana. & Santos, D. 2003. “Introducing COMPARA: the Portuguese-English Parallel Corpus”. In Corpora in Translator Education, ed. by Federico Zanettin, Silvia Bernardini & Dominic Stewart. Manchester: St. Jerome, pp 71–87.

Frankenberg-Garcia, Ana. 2009. “Are Translations Longer than Source Texts?: A Corpus-Based Study of Explicitation.” In Corpus Use and Translating, ed. by Allison Beeby, Patricia Rodríguez & Patricia Sánchez-Gijón. 47–58. Amsterdam: John Benjamins.

. 2019. “A Corpus Study of Splitting and Joining Sentences in Translation.” Corpora 14 (1): 1–30.

Gu, Yi. 2017. Chinglish Manifestations in Chinese to English Simultaneous Interpreting, Master dissertation. University of Surrey, England.

. 2019. Understanding Chinese to English Translation through the Compilation and Analysis of a Chinese-English Parallel Corpus (ZHEN Corpus), Unpublished PhD confirmation report. University of Surrey, England.

Hu, Dexiang. 2006. “Reflections on the Cultural Factors of Translating into and from Chinese” Journal of Hainan University 24(3): 355–359.

Johansson, Stig. 1998. “Why Change the Subject? On Changes in Subject Selection in Translation from English into Norwegian.” Target 16(1): 29–52.

Izquierdo, Marlén, Knut Hofland, and Øystein Reigem. 2008. “The ACTRES Parallel Corpus: an English–Spanish Translation Corpus.” Corpora 3 (1): 31–41.

Kilgarriff, Adam, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, and Vít Suchomel. 2014. “The Sketch Engine: Ten Years On.” Lexicography 1 (1): 7–36.

Lao, Long. 1996. “Diudiao huanxiang lianxi shijian [My view on translatology].” Chinese Translators Journal. 17(2):38–41.

Liu, Cixin. 2014. The three-body problem (Vol. 1). Macmillan.

Lixun, W. 2001. “Exploring parallel concordancing in English and Chinese.” Language Learning & Technology, 5(3), 174–184.

Mauranen, Anna. 2004. “Corpora, Universals and Interference.” Translation Universals Benjamins, 65–82.

McEnery, Tony, Richard Xiao, and Yukio Tono. 2006. Corpus-Based Language Studies: An Advanced Resource Book. Routledge. London.

Mingxing He. 2013. “Zhongguo dangdai wenxue haiwai chuban chuanbo liushi nian [60 years of overseas publication and dissemination of Contemporary Chinese Literature]”. Publish Guangjiao 7, 18–21.

MultiUN: A Multilingual corpus from United Nation Documents, Andreas Eisele and Yu Chen, LREC 2010

ODS Databases. 2020. United Nations. United Nations. Accessed February 6. [URL]

Pan, Jun, & Wong, Tak Ming. 2018. “A corpus-driven study of contrastive markers in Cantonese‒English political interpreting.” BRAIN, 9(2):168–176.

Pinkham, J. 2000. Zhongshi yingyu zhi jian [Lessons Drawn from Others’ Mistakes of Chingish]. Beijing: Foreign Language Teaching and Research Press.

Scott, Mike. 2008. “Developing WordSmith.” International Journal of English Studies 8.1, 95–106.

Tiedemann, J. 2012. Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’2012), 2214–2218.

Translators Association of China (TAC). [URL]. Accessed 25 November 2019.

Varga, Dániel, Péter Halácsy, András Kornai, Viktor Nagy, László Németh, and Viktor Trón. 2007. “Parallel Corpora for Medium Density Languages.” Recent Advances in Natural Language Processing IV Current Issues in Linguistic Theory, 247–58.

Wang, Kefei. 2004a. Shuangyu Duiying Yuliaoku: Yanzhi yu Yingyong [A parallel corpus: Compilation and application]. Beijing: Foreign Language Teaching and Research Press.

. 2004b. “Shuangyu pingxing yuliaoku zai fanyi jiaoxue shang de yongtu [The use of parallel corpora in translator training]”. Computer-Assisted Foreign Language Education 6: 27–32.

. 2009. “yingyihan yuyan tezheng tantao” [The features of translated Chinese from English]”. Computer-Assisted Foreign Language Education 1.

Xiao, Richard. 2008. “Theory-driven Corpus Research: Using Corpora to Inform Aspect Theory.” In Corpus Linguistics: An International Handbook, ed. by Anke Lüdeling & Merja Kytö, 977–1007, Berlin: Mouton de Gruyter.

Xiao, Richard, He, Lianzhen, & Yue, Ming. 2008. The ZJU Corpus of Translational Chinese (ZCTC). Zhejiang University.

Xiao, Richard. 2010. “How Different Is Translated Chinese from Native Chinese?: A Corpus-Based Study of Translation Universals.” International Journal of Corpus Linguistics 15 (1): 5–35.

Yang, Zhengquan. 1999. “Xu [Introduction].” In Zhongguo waiwenju wushinian dashiji [A chronology of China Foreign Languages Publishing and Distribution Administration (1949–1999)], ed by Yannian Dai and Rinong Chen. Beijing: New Star Press, I–VI.

Zanettin, Federico, 2000. “Parallel Corpora in Translation Studies: Issues in Corpus Design and Analysis.” In Intercultural Faultlines, ed. by Maeve Olohan, 105–118, London: Routledge.

. 2014. Translation-Driven Corpora: Corpus Resources for Descriptive and Applied Translation Studies. Routledge.

Ziemski, M., Junczys-Dowmunt, M., and Pouliquen, B. 2016. The United Nations Parallel Corpus, Language Resources and Evaluation (LREC’16), Portorož, Slovenia.

Cited by (1)

Cited by one other publication

Liu, Lei & Min Zhu

2023. Bertalign: Improved word embedding-based sentence alignment for Chinese–English parallel corpora of literary texts. Digital Scholarship in the Humanities 38:2 ► pp. 621 ff.

This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Chapter 2ZHEN

A directional parallel corpus of Chinese source texts and English translations

Cited by one other publication

Chapter 2
ZHEN