Publications
Publication details [#55748]
Čermák, František and Alexandr Rosen. 2012. The case of InterCorp, a multilingual parallel corpus. International Journal of Corpus Linguistics 17(3): 411–427.
Publication type
Article in journal
Publication language
English
Keywords
Language as a subject
Place, Publisher
John Benjamins
Journal DOI
10.1075/ijcl
Annotation
This paper introduces InterCorp, a parallel corpus including texts in Czech and 27 other languages, available for online searches via a web interface. After discussing some issues and merits of a multilingual resource, the paper argues that such a resource plays an important role, especially for languages with fewer native speakers, supporting both comparative research and studies of the language from the perspective of other languages. The paper proceeds with an overview of the corpus: the strategy and criteria for including new texts, the representation of available languages and text types, linguistic annotation, and a sketch of pre-processing issues. Finally, it presents the search interface and suggests some research opportunities.