Chapter published in:Parallel Corpora for Contrastive and Translation Studies: New resources and applications
Edited by Irene Doval and M. Teresa Sánchez Nieto
[Studies in Corpus Linguistics 90] 2019
► pp. 93–101
A parallel corpus of 40 languages
This chapter presents the current version of InterCorp, a parallel corpus created at the Faculty of Arts, Charles University in Prague. The corpus contains texts in Czech aligned with one or more foreign-language version(s), including Czech and 39 other languages. The chapter analyses its structure and technical parameters, and describes some technical tools used with the corpus (Kontext, a corpus query interface, and InterText, a parallel text alignment editor created specifically for the project). Similarly, the contribution discusses Treq (Translation Equivalents Database), a collection of bilingual Czech-foreign language dictionaries built automatically from InterCorp. In the last section of the chapter, the possibilities for methodological and linguistic exploitation of the corpus are discussed.
Keywords: parallel corpus, InterCorp, comparison of languages, Spanish, Czech National Corpus
Published online: 20 March 2019
Čermák, František, Corness, Patrick & Klégr, Aleš
Čermák, František & Rosen, Alexandr
2016 Kontext. http://kontext.korpus.cz (18 November 2017).
Och, Franz Josef & Ney, Hermann
Repository of bibliographical items based on the Czech National Corpus
2017 <https://www.korpus.cz/biblio (18 November 2017).
Rosen, Alexander & Vavřín, Martin
2016 Korpus InterCorp, version 9 of 9 Sep 2016. Institute of the Czech National Corpus, Charles University, Prague 2014. http://www.korpus.cz (18 November 2017).
Rosen, Alexandr & Vavřín, Martin
Škrabal, Michal & Vavřín, Martin
Štichauer, Pavel & Čermák, Petr
2017 <http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/ (18 November 2017).
2016 Intertext, Parallel Text Alignment Editor. http://wanthalf.saga.cz/intertext (18 November 2017).
Cited by 1 other publications
This list is based on CrossRef data as of 15 june 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.