Publications

Found 1 recordActive filters:

Publication details [#42340]

Ulc, Michal. 2005. The Representativeness of Czech corpora. International Journal of Corpus Linguistics 10 (3) : 357–366.
Publication type
Article in journal
Publication language
English
Language as a subject
Place, Publisher
John Benjamins
Journal DOI
10.1075/ijcl

Annotation

The attempt to balance corpora with respect to their future usage led to the introduction of the termexpectations(Králík 2001b). On the bases of several statistical inquiries of such expectations, the textual structure ofSYN2000,which is the synchronic part of the Czech National Corpus (CNC), was proposed and realised. The present article explains the original composition briefly and discusses two new inquiries concerning expectations(A-2001andC-2001).Important corrections for future work on the CNC are suggested. The expectations concerning newspapers changed radically during 1996–2001. Within the same period, an obvious rise of interest in fiction can be detected. The reasons for these developments can be traced to trends in Czech society. Thus, we have proposed a considerable reduction in the proportion of newspaper texts and a large increase in the proportion of fiction texts. According to new searches, more detailed percentages for specific subject areas are suggested.