The attempt to balance corpora with respect to their future usage led to the introduction of the termexpectations(Králík 2001b). On the bases of several statistical inquiries of such expectations, the textual structure ofSYN2000,which is the synchronic part of the Czech National Corpus (CNC), was proposed and realised. The present article explains the original composition briefly and discusses two new inquiries concerning expectations(A-2001andC-2001).Important corrections for future work on the CNC are suggested. The expectations concerning newspapers changed radically during 1996–2001. Within the same period, an obvious rise of interest in fiction can be detected. The reasons for these developments can be traced to trends in Czech society. Thus, we have proposed a considerable reduction in the proportion of newspaper texts and a large increase in the proportion of fiction texts. According to new searches, more detailed percentages for specific subject areas are suggested.
2018. Frequency data from corpora partially explain native-speaker ratings and choices in overabundant paradigm cells. Corpus Linguistics and Linguistic Theory 14:2 ► pp. 197 ff.
Bibiri, Anca Diana, Speranţa Cecilia Bolea, Liviu Andrei Scutelnicu, Alex Mihai Moruz, Laura Pistol & Dan Cristea
2015. Proceedings of the 7th Balkan Conference on Informatics Conference, ► pp. 1 ff.
Bijankhan, Mahmood, Javad Sheykhzadegan, Mohammad Bahrani & Masood Ghayoomi
2011. Lessons from building a Persian written corpus: Peykare. Language Resources and Evaluation 45:2 ► pp. 143 ff.
2011. Corpus Academicum Lithuanicum: Design Criteria, Methodology, Application. In Human Language Technology. Challenges for Computer Science and Linguistics [Lecture Notes in Computer Science, 6562], ► pp. 412 ff.
Králík, Jan & Ludmila Uhlířová
2007. The Czech Academic Corpus (CAC), its history and presence*. Journal of Quantitative Linguistics 14:2-3 ► pp. 265 ff.
This list is based on CrossRef data as of 17 october 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.