Evolution and present situation of corpus research in China
In this paper, the author introduces in detail the development and present situation of corpus linguistics in China: earlier corpora, large-scale & authentic text corpora, national corpora, speech corpora, bilingual corpora and corpora of minority languages in China. The various processing techniques for corpora are also introduced: automatic word segmentation of Chinese text, automatic PoS tagging, automatic tagging of phrase structure and automatic alignment of bilingual corpora. This paper is a bird’s-eye view of corpus linguistics of China. Finally, the author discusses several problems in present corpus research: standardization of corpus specifications, commonly sharing of language resources, knowledge properties, etc.
Cited by 9 other publications
Cai, Qing, Marc Brysbaert & Antoni Rodriguez-Fornells
. SUBTLEX-CH: Chinese Word and Character Frequencies Based on Film Subtitles
. PLoS ONE
pp. e10729 ff.
. Corpora: Chinese-Language
. In The Encyclopedia of Applied Linguistics
. Corpus linguistics in Chinese contexts
pp. 118 ff.
. Reading Chinese online entertainment news: Metaphor and language play
. Journal of Pragmatics
pp. 3473 ff.
Wong, Hai Ming, Susan M. Bridges, Cynthia K.Y. Yiu, Colman P.J. McGrath, Terry K. Au & Divya S. Parthasarathy
. Development and validation of Hong Kong Rapid Estimate of Adult Literacy in Dentistry
. Journal of Investigative and Clinical Dentistry
pp. 118 ff.
. The Corpus Approach to the Teaching and Learning of Chinese as an L1 and an L2 in Retrospect
. In Computational and Corpus Approaches to Chinese Language Learning
[Chinese Language Learning Sciences
, ], ►
pp. 33 ff.
. The SNU Korean Learner Corpus of English: Compilation and Application
. English Language and Linguistics
pp. 203 ff.
This list is based on CrossRef data as of 21 march 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.