A Corpus Linguistic Approach to Literary Language and Characterization

Virginia Woolf's The Waves

ISBN 9789027234070 | EUR 99.00 | USD 149.00
ISBN 9789027270429 | EUR 99.00 | USD 149.00
This book focusses on computer methodologies as a way of investigating language and character in literary texts. Both theoretical and practical, it surveys investigations into characterization in literary linguistics and personality in social psychology, before carrying out a computational analysis of Virginia Woolf’s experimental novel The Waves. Frequencies of grammatical and semantic categories in the language of the six speaking characters are analyzed using Wmatrix software developed by UCREL at Lancaster University. The quantitative analysis is supplemented by a qualitative analysis into recurring patterns of metaphor. The author concludes that these analyses successfully differentiate all six characters, both synchronically and diachronically, and claims that this methodology is also applicable to the study of personality in non-literary language. The book, written in a clear and accessible style, will be of interest to post-graduate students and academics in linguistics, stylistics, literary studies, psychology and also computational approaches.
[Linguistic Approaches to Literature, 18]  2014.  xxi, 277 pp.
Publishing status: Available
Table of Contents
“This book presents an original and systematic approach to characterisation in Virginia Woolf’s The Waves, and provides an excellent demonstration of how corpus-based methods can address issues relevant to both stylisticians and literary critics.”
“This interesting and clearly written book describes a quantitative study of the individual characters’ spoken words in Virginia Woolf’s “The Waves”. Both technical details of the computer packages and statistical tests used and a suitable literary analysis of the book are brought together, including a discussion of why the narrative structure of “The Waves” makes it particularly amenable to a quantitative analysis. All the steps in the analysis are carefully explained, including the pre-processing of the original corpus, part of speech and semantic tagging using the Wmatrix tool developed by Paul Rayson at Lancaster University, the visualisation of the output in Microsoft Excel, and a statistical analysis of character idiolects using the Log-Likelihood measure. The analysis is both synchronic, comparing the speeches of the individual characters who appear in the book, and diachronic, following changes in the characters’ speech throughout their lifetimes. The findings show that the characters are indeed differentiated and that these differences are more pronounced during their adolescence and in later life. Since previous critics using traditional analyses have claimed to have found no language differences between the six characters, this book shows how a macroscopic, quantitative analysis of a literary work can provide evidence for language distinctions that a “naked eye” reading might miss. The methodology described in this book is of general applicability to corpus studies of literary texts, and thus will inspire future work in computational stylometry. However, rather than comparing the frequencies of lexical words to compare speech samples, Balossi has taken the relatively novel approach of comparing the frequencies of parts of speech and semantic fields. A particularly useful feature of this book is the thorough use of footnotes to guide the reader to each of the online linguistic resources mentioned in the book.”
“This book contains an in-depth and comprehensive analysis of Virginia Woolf's The Waves. It exploits the full power of a multidisciplinary approach using corpus linguistics and computational methods to investigate the linguistic differentiation of the six characters in the novel and goes well beyond the simple counting of word frequencies or function words. But the author also goes further to locate the study in the appropriate contexts of the cognitive model of characterization, authorship attribution, cognitive metaphor theory, stylistics and literary criticism. The author is to be congratulated on taking all these disparate theoretical and practical elements and weaving them together in such an interesting book. Not only will this book appeal to those interested in the work of Woolf but also those who wish to learn how to apply similar methods more widely in the study of characterization in literature.”
“A Corpus Linguistic Approach to Literary Language and Characterization Virginia Woolf's The Waves is a strong and valuable contribution to the computational study of literature. I especially like the thoroughness of the book’s treatment of earlier criticism on The Waves. The close attention to that criticism helps to focus the study, which also effectively presents some strong corrective arguments against some strains within it. The book is an excellent partial answer to Stephen Ramsay’s claim that trying to solve the question of whether the voices are different is a category mistake: it shows rather definitively that there “really”are different voices. I will want to go back over the book more closely later, but I especially appreciated chapter 7: in spite of some doubts I have about the semantic categorization that the chapter is based upon (even after the reasonable corrections offered there), I think the closer attention to the lexis of the novel in chapter 7 makes it more satisfying than the analysis of categories in chapter 6. The multiple methods and theories used in the book seem a bit surprising at first, but it makes very good use of a variety of approaches and does a good job of wrapping them up into an effective and persuasive final package.”
Cited by

Cited by 10 other publications

Brooke, Julian, Adam Hammond & Graeme Hirst
2016. Using Models of Lexical Style to Quantify Free Indirect Discourse in Modernist Fiction. Digital Scholarship in the Humanities  pp. fqv072 ff. Crossref logo
Guo, Hua
2017. Speech presentation: A preliminary review from multiple perspectives. Journal of Language and Cultural Education 5:1  pp. 182 ff. Crossref logo
Jones, Henry
2019. Shifting characterizations of the ‘Common People’ in modern English retranslations of Thucydides’ History of the Peloponnesian War: a corpus-based analysis. Palgrave Communications 5:1 Crossref logo
Jones, Henry
2020. Retranslating Thucydides as a scientific historian. Target. International Journal of Translation Studies 32:1  pp. 59 ff. Crossref logo
Montoro, Rocío
2015. The Year’s Work in stylistics 2014. Language and Literature: International Journal of Stylistics 24:4  pp. 355 ff. Crossref logo
OMAR, Abdulfattah
2021. Identifying themes in fiction: A centroid-based lexical clustering approach. Dil ve Dilbilimi Çalışmaları Dergisi  pp. 580 ff. Crossref logo
Piao, Scott, Fraser Dallachy, Alistair Baron, Jane Demmen, Steve Wattam, Philip Durkin, James McCracken, Paul Rayson & Marc Alexander
2017. A time-sensitive historical thesaurus-based semantic tagger for deep semantic annotation. Computer Speech & Language 46  pp. 113 ff. Crossref logo
Reali, Florencia
2020. Emotion metaphors in James Joyce’s A Portrait of the artist as a young man . Journal of Literary Semantics 49:1  pp. 41 ff. Crossref logo
Sherstinova, Tatiana & Margarita Kirina
2022.  In Digital Transformation and Global Society [Communications in Computer and Information Science, 1503],  pp. 332 ff. Crossref logo
Smith, Nicholas & Cathleen Waters
2019. Variation and change in a specialized register. International Journal of Corpus Linguistics 24:2  pp. 169 ff. Crossref logo

This list is based on CrossRef data as of 26 april 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects & Metadata
BIC Subject: DSBH – Literary studies: from c 1900 -
BISAC Subject: LIT004120 – LITERARY CRITICISM / European / English, Irish, Scottish, Welsh
ONIX Metadata
ONIX 2.1
ONIX 3.0
U.S. Library of Congress Control Number:  2014001873 | Marc record