Most Chinese-English parallel corpora consist of English source texts translated into Chinese. This chapter discusses the need for corpora representative of the under-resourced Chinese into English translation direction. After a brief overview of the current Chinese-English translation scenario and an analysis of existing parallel corpora for this language pair, we discuss problems in mining contemporary Chinese to English translations and issues in Chinese to English parallel text alignment. We then introduce ZHEN, a corpus of circa one-million characters of contemporary simplified Chinese source texts from a range of text types aligned with authentic translations into English. Its aim is to contribute to our understanding of Chinese to English translation norms and of features of English translated from Chinese.
Barlow, Michael. 2002. “ParaConc: Concordance software for multilingual parallel corpora.” Proceedings of the Third International Conference on Language Resources and Evaluation. Workshop on Language Resources in Translation Work and Research.
Frankenberg-Garcia, Ana. & Santos, D.2003. “Introducing COMPARA: the Portuguese-English Parallel Corpus”. In Corpora in Translator Education, ed. by Federico Zanettin, Silvia Bernardini & Dominic Stewart. Manchester: St. Jerome, pp 71–87.
Frankenberg-Garcia, Ana. 2019. “A Corpus Study of Splitting and Joining Sentences in Translation.” Corpora 14 (1): 1–30.
Gu, Yi. 2017. Chinglish Manifestations in Chinese to English Simultaneous Interpreting, Master dissertation. University of Surrey, England.
Gu, Yi. 2019. Understanding Chinese to English Translation through the Compilation and Analysis of a Chinese-English Parallel Corpus (ZHEN Corpus), Unpublished PhD confirmation report. University of Surrey, England.
Hu, Dexiang. 2006. “Reflections on the Cultural Factors of Translating into and from Chinese” Journal of Hainan University 24(3): 355–359.
Izquierdo, Marlén, Knut Hofland, and Øystein Reigem. 2008. “The ACTRES Parallel Corpus: an English–Spanish Translation Corpus.” Corpora 3 (1): 31–41.
Kilgarriff, Adam, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, and Vít Suchomel. 2014. “The Sketch Engine: Ten Years On.” Lexicography 1 (1): 7–36.
Lao, Long. 1996. “Diudiao huanxiang lianxi shijian [My view on translatology].” Chinese Translators Journal. 17(2):38–41.
Liu, Cixin. 2014. The three-body problem (Vol. 1). Macmillan.
Lixun, W.2001. “Exploring parallel concordancing in English and Chinese.” Language Learning & Technology, 5(3), 174–184.
McEnery, Tony, Richard Xiao, and Yukio Tono. 2006. Corpus-Based Language Studies: An Advanced Resource Book. Routledge. London.
Mingxing He. 2013. “Zhongguo dangdai wenxue haiwai chuban chuanbo liushi nian [60 years of overseas publication and dissemination of Contemporary Chinese Literature]”. Publish Guangjiao 7, 18–21.
MultiUN: A Multilingual corpus from United Nation Documents, Andreas Eisele and Yu Chen, LREC 2010
ODS Databases. 2020. United Nations. United Nations. Accessed February 6. [URL]
Pan, Jun, & Wong, Tak Ming. 2018. “A corpus-driven study of contrastive markers in Cantonese‒English political interpreting.” BRAIN, 9(2):168–176.
Pinkham, J.2000. Zhongshi yingyu zhi jian [Lessons Drawn from Others’ Mistakes of Chingish]. Beijing: Foreign Language Teaching and Research Press.
Scott, Mike. 2008. “Developing WordSmith.” International Journal of English Studies 8.1, 95–106.
Tiedemann, J.2012. Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’2012), 2214–2218.
Translators Association of China (TAC). [URL]. Accessed 25 November 2019.
Varga, Dániel, Péter Halácsy, András Kornai, Viktor Nagy, László Németh, and Viktor Trón. 2007. “Parallel Corpora for Medium Density Languages.” Recent Advances in Natural Language Processing IV Current Issues in Linguistic Theory, 247–58.
Wang, Kefei. 2004a. Shuangyu Duiying Yuliaoku: Yanzhi yu Yingyong [A parallel corpus: Compilation and application]. Beijing: Foreign Language Teaching and Research Press.
Wang, Kefei. 2004b. “Shuangyu pingxing yuliaoku zai fanyi jiaoxue shang de yongtu [The use of parallel corpora in translator training]”. Computer-Assisted Foreign Language Education 6: 27–32.
Wang, Kefei. 2009. “yingyihan yuyan tezheng tantao” [The features of translated Chinese from English]”. Computer-Assisted Foreign Language Education 1.
Xiao, Richard. 2008. “Theory-driven Corpus Research: Using Corpora to Inform Aspect Theory.” In Corpus Linguistics: An International Handbook, ed. by Anke Lüdeling & Merja Kytö, 977–1007, Berlin: Mouton de Gruyter.
Xiao, Richard, He, Lianzhen, & Yue, Ming. 2008. The ZJU Corpus of Translational Chinese (ZCTC). Zhejiang University.
Yang, Zhengquan. 1999. “Xu [Introduction].” In Zhongguo waiwenju wushinian dashiji [A chronology of China Foreign Languages Publishing and Distribution Administration (1949–1999)], ed by Yannian Dai and Rinong Chen. Beijing: New Star Press, I–VI.
Zanettin, Federico, 2000. “Parallel Corpora in Translation Studies: Issues in Corpus Design and Analysis.” In Intercultural Faultlines, ed. by Maeve Olohan, 105–118, London: Routledge.
Zanettin, Federico. 2014. Translation-Driven Corpora: Corpus Resources for Descriptive and Applied Translation Studies. Routledge.
Ziemski, M., Junczys-Dowmunt, M., and Pouliquen, B.2016. The United Nations Parallel Corpus, Language Resources and Evaluation (LREC’16), Portorož, Slovenia.
Cited by (1)
Cited by one other publication
Liu, Lei & Min Zhu
2023. Bertalign: Improved word embedding-based sentence alignment for Chinese–English parallel corpora of literary texts. Digital Scholarship in the Humanities 38:2 ► pp. 621 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.