Article published in: Directions in Empirical Literary Studies: In honor of Willie van Peer
Edited by Sonia Zyngier, Marisa Bortolussi, Anna Chesnokova and Jan Auracher
[Linguistic Approaches to Literature 5] 2008
pp. 271–282
Empirical evaluation: Towards an automated index of lexical variety
This chapter proposes an objective approach to the formal analysis of literary prose in English in order to investigate the relation between lexical density and judgments of canonicity. Based on the concepts of literariness proposed by the Russian Formalists and lexical variety, a mathematical index is designed, relating three variables which take the materiality of text into consideration: (a) relative frequency of lexical bundles, (b) lexical bundle type/token ratio, and (c) word type/token ratio. The index is described and illustrated with 46 canonical and non-canonical literary works. Statistical analysis shows no significant relation between lexical richness and decisions of what has been classified as canonical, indicating that these judgments may be influenced by factors other than the text itself.
Keywords: canonicity, Corpus Linguistics, empirical study, lexical bundles, lexical variety, literary discourse
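The three variables named in the abstract can be illustrated with a short sketch. It is not the chapter's index: the abstract does not give the formula that relates the variables, so the code only computes the three quantities separately. The tokenizer, the bundle length `n`, the recurrence threshold `min_count`, and the function names are all assumptions made for illustration; corpus studies of lexical bundles typically use longer sequences and frequency cut-offs normalized per million words.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and extract word tokens (a simplifying assumption)."""
    return re.findall(r"[a-z']+", text.lower())


def lexical_measures(text, n=3, min_count=2):
    """Return the three quantities named in the abstract for one text.

    n and min_count are hypothetical parameters: a "bundle" here is any
    n-word sequence that occurs at least min_count times in the text.
    """
    tokens = tokenize(text)
    if len(tokens) < n:
        raise ValueError("text is shorter than one n-gram")

    # Every contiguous n-word sequence in the text.
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    counts = Counter(ngrams)

    # Keep only the recurrent sequences and count their occurrences.
    bundles = {gram: c for gram, c in counts.items() if c >= min_count}
    bundle_tokens = sum(bundles.values())

    return {
        # (c) distinct words divided by running words
        "word_ttr": len(set(tokens)) / len(tokens),
        # (a) share of all n-gram positions occupied by a bundle
        "bundle_rel_freq": bundle_tokens / len(ngrams),
        # (b) distinct bundles divided by bundle occurrences
        "bundle_ttr": len(bundles) / bundle_tokens if bundle_tokens else 0.0,
    }
```

Calling `lexical_measures("the cat sat on the mat the cat sat on the rug")` finds three recurrent three-word sequences among ten positions. Type/token ratios fall as texts get longer, so any comparison across works would need samples of equal length or a length-corrected measure.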
Published online: 15 May 2008
Cited by 1 other publication
Melissourgou, Maria N. & Katerina T. Frantzi