Exploiting a Large Spoken Corpus
An End-user's Way to the BNC
The British National Corpus (BNC) contains a spoken component of about 10 million words, consisting of spoken language of various kinds produced by different speakers in a variety of situations. Starting from an end-user s perspective, this paper surveys the potential of this resource and some possible problems one might encounter if not fully versed in the details of the compilation and coding plans. Among the issues touched upon are questions relating to the composition of the component, the transcription principles employed, and points relating to the nature and coverage of the mark-up. By way of illustration, examples are drawn from a case study of the variant forms gonna and going to.
Cited by (3)
Cited by three other publications
Nokkonen, Soili
2010.
‘How many taxis there needs to be?’ The sociolinguistic variation of need to in spoken British English.
Corpora 5:1
► pp. 45 ff.
Butler, Christopher S.
2009.
Criteria of adequacy in functional linguistics.
FLIN 43:1
► pp. 1 ff.
[no author supplied]
2000.
References. In
Emerging English Modals,
► pp. 299 ff.
This list is based on CrossRef data as of 5 august 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.