Article published in:Learner Corpora in Language Testing and Assessment
Edited by Marcus Callies and Sandra Götz
[Studies in Corpus Linguistics 70] 2015
► pp. 85–112
First steps in assigning proficiency to texts in a learner corpus of computer-mediated communication
This chapter presents a new method for assigning proficiency levels to texts in a learner corpus of computer-mediated communication (CMC). The CMC comes from learner comments on news articles that form part of an English language course for university students in Japan. The rationale for using the CMC discourse as the basis of a learner corpus will be discussed, followed by a justification of using a text-centred approach of assigning proficiency. The use of binary decision trees to account for the complexity, accuracy and fluency evident in the texts will be described, followed by a snapshot of the results from using the method so far. The chapter concludes with the suggestion that while some of the details may need refining, in principle the method could be of use in categorizing the proficiency of texts in other learner corpora.
Published online: 09 April 2015
British Broadcasting Corporation (BBC)
2001–2014 Have Your Say, http://www.bbc.co.uk/news/have_your_say (5 July 2014).
2014 Web Vocabprofile. An adaptation of Heatley, Nation & Coxhead’s (2002) Range , http://www.lextutor.ca/vp (5 July 2014).
Council of Europe.
2008 The Corpus of Contemporary American English: 425 million words, 1990–present, http://corpus.byu.edu/coca (5 July 2014).
Du, H.S. & Wagner, C.
Erbaggio, P., Gopalakrishnan, S., Hobbs, S. & Liu, H.
Fulcher, G., Davidson, F. & Kemp, J.
Heatley, A., Nation, P. & Coxhead, A.
2002 RANGE and FREQUENCY programs, http://www.victoria.ac.nz/lals/staff/paul-nation (5 July 2014).
Hillocks Jr, G.
Housen, A. & Kuiken, F.
Hsu, C.-L. & Lin, J.C.-C.
Jarvis, S. & Pavlenko, A.
2010–2014 News Based English, http://www.newsbased.com (5 July 2014).
Marchand, T. & Akutsu, S.
Forthcoming. The compilation and use of a CMC learner corpus for Japanese university students. In Studies in Learner Corpus Linguistics: Research and Applications for Foreign Language Teaching and Assessment, E. Castello, K. Ackerley & F. Coccetta (eds) Frankfurt Peter Lang
Marchand, T. & Rowlett, B.
Mizrahi, E. & Laufer, B.
2010 Lexical competence of highly advanced L2 users: Is their collocation knowledge as good as their productive vocabulary size? Paper presented at EUROSLA 20.
2007 A corpus-driven approach to genre analysis: The reinvestigation of academic, newspaper and literary texts. ELR Journal 1(2), http://ejournals.org.uk/ELR/article/2007/2 (5 July 2014).
Norris, J.M. & Ortega, L.
Skehan, P. & Foster, P.
Upshur, J.A. & Turner, C.E.
2013 American teacher in Japan under fire for lesson’s on Japan’s history of discrimination, http://www.washingtonpost.com/blogs/worldviews/wp/2013/02/22/american-teacher-in-japan-under-fire-for-lessons-on-japans-history-of-discrimination (13 October 2013).