Jianxin Wang | Beijing University of Posts and Telecom
This paper discusses some of the new developments in corpus linguistics in China. In the area of Chinese corpus compilation, it presents large-scale text databases, representative corpora, annotated corpora, lexical databases for information processing, and phonological, dialectal, spoken and other specialized corpora. In connection with the analysis and annotation of Chinese corpora, it describes the characteristics of the Chinese language, word segmentation, tagging, parsing, and some corpus analysis systems. Concerning English corpus studies, it describes some corpora of English as a Foreign Language and related corpus-based research. On this basis, tentative conclusions are drawn.