Publications

Publication details [#59584]

Lu, Xiaofei and Kwanghyun Park. 2015. Automatic analysis of thematic structure in written English. International Journal of Corpus Linguistics 20 (1) : 81–101.
Publication type
Article in journal
Publication language
English
Language as a subject
Place, Publisher
John Benjamins
Journal DOI
10.1075/ijcl

Annotation

This paper proposes and describes a computational system for the automatic analysis of thematic structure, as defined in Systemic Functional Linguistics, in written English. The system takes an English text as input and produces as output an analysis of the thematic structure of each sentence in the text. The system is evaluated using data from The Wall Street Journal section of the Penn Treebank (Marcus et al. 1993) and the British Academic Written English corpus (Gardner & Nesi 2013). An experiment using these data shows that the system achieves a high degree of reliability in regard to both identifying theme-rheme boundaries and determining several of the linguistic properties of the identified themes, including syntactic nodes, theme function, markedness, mood types, and theme roles. To illustrate how the system is used, the paper describes an example application designed to compare collections of novice and expert academic writing in terms of thematic structure.