New generation corpus-based frequency dictionaries: The case of Czech

Čermák, František; Křen, Michal

doi:10.1075/ijcl.10.4.04cer

Article published in:

International Journal of Corpus Linguistics
Vol. 10:4 (2005), pp. 453–467

František Čermák | Charles University, Prague

Michal Křen | Charles University, Prague

Following a brief outline of the history of frequency dictionaries, a renewed need for such dictionaries, based on modern large corpora, is observed. A linguistic background and some criticism of what has been done so far are offered. It is stressed that, alongside the traditional and predominant paradigmatic approach, at least some attention should also be paid to syntagmatic or combinatorial aspects of the lexis, as these are readily available in modern corpora. Against this background, a solution to this need and a projection of corpus possibilities are presented as realized in the recent large Frequency Dictionary of Czech (Frekvenční slovník češtiny). A brief discussion of the linguistic and lexicographic problems is followed by a step-by-step description of the technical solution.

Keywords: syntagmatic properties of words, representativeness, corpus, Czech National Corpus, frequency dictionary, quantitative linguistics, language core, homonymy, disambiguation, variants

Published online: 7 November 2005
