Literary Detective Work on the Computer

| University of Wolverhampton
HardboundAvailable
ISBN 9789027249999 | EUR 99.00 | USD 149.00
 
e-Book
ISBN 9789027270139 | EUR 99.00 | USD 149.00
 
Computational linguistics can be used to uncover mysteries in text which are not always obvious to visual inspection. For example, the computer analysis of writing style can show who might be the true author of a text in cases of disputed authorship or suspected plagiarism. The theoretical background to authorship attribution is presented in a step by step manner, and comprehensive reviews of the field are given in two specialist areas, the writings of William Shakespeare and his contemporaries, and the various writing styles seen in religious texts. The final chapter looks at the progress computers have made in the decipherment of lost languages. This book is written for students and researchers of general linguistics, computational and corpus linguistics, and computer forensics. It will inspire future researchers to study these topics for themselves, and gives sufficient details of the methods and resources to get them started.
[Natural Language Processing, 12]  2014.  x, 283 pp.
Publishing status: Available
Table of Contents
Preface
ix–x
Chapter 1. Author identification
1–58
Chapter 2. Plagiarism and spam filtering
59–98
Chapter 3. Computer studies of Shakespearean authorship
99–148
Chapter 4. Stylometric analysis of religious texts
149–206
Chapter 5. Computers and decipherment
207–258
References
259–280
Index
281–283
“Interesting, packed and wide-ranging.”
“This book will prove a valuable resource for anyone wishing to gain a working knowledge of the methods and achievements of computational stylometry. It covers a wide range of studies in the field, explaining the main results and the techniques used to find them in an accessible manner. A strong point is that it includes a number of worked examples showing, with the aid of small-scale data sets, how some of the more important quantitative methods can be implemented. A further strength is its use of the public-domain system R to illustrate how certain important ideas could be put into practice.

Both newcomers and experienced bardographers will find much of interest in Chapter 3, which gives a dispassionate, empirically grounded overview of a number of key studies of the authorship of the Shakespearean canon. It also includes a clear step-by-step exposition of how Bayes's Rule may be used in investigations of this kind. Chapter 5, on decipherment, contains fascinating accounts of the attempts to decipher the Rongorongo glyphs of Rapanui (Easter Island) and the ancient seals of the lost Indus Valley civilization, among others -- introducing the basic notions of Information Theory and Markov modelling as it does so.”
“In my view this is an excellent and much-missed overview of, and introduction to, the use of statistical tests, methods and approaches to language decipherment and recognition. The in-depth discussions of the methods employed in the so-far unsuccessful decipherment of Rongorongo and the Indus Valley texts is an especially engaging read.”
“The chapter, illustrated with several examples of problems in New Testament Studies, will provide a good introduction and overview of the subject area, targeted at computer scientists who are specialists neither in statistics nor in biblical studies. The emphasis on the methods of multivariate statistical analysis, such as correspondence analysis, is a welcome feature.”
“Very comprehensive and easy to read.”
“This book is a valuable repository of techniques, methods, tasks, cases, and background relevant to

computational stylometry. I admire the way in which Oakes’ interpretation of his own research and that of others supports deeper understanding of the tasks tackled.”
Cited by

Cited by other publications

No author info given
2014.  In The Synoptic Problem and Statistics,  pp. 191 ff. Crossref logo
No author info given
2015. BOOKS RECEIVED 2014–15. New Testament Studies 61:4  pp. 615 ff. Crossref logo
Altmeyer, Stefan, Constantin Klein, Barbara Keller, Christopher F. Silver, Ralph W. Hood & Heinz Streib
2015. Subjective definitions of spirituality and religion. International Journal of Corpus Linguistics 20:4  pp. 526 ff. Crossref logo
Franklin, Emma & Michael Oakes
2016. Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts. Corpora 11:3  pp. 299 ff. Crossref logo
Grieve, Jack, Isobelle Clarke, Emily Chiang, Hannah Gideon, Annina Heini, Andrea Nini & Emily Waibel
2019. Attributing the Bixby Letter using n-gram tracing. Digital Scholarship in the Humanities 34:3  pp. 493 ff. Crossref logo
Klaussner, Carmen, John Nerbonne & Çağrı Çöltekin
2015. Finding characteristic features in stylometric analysis. Digital Scholarship in the Humanities  pp. fqv048 ff. Crossref logo
Mealand, David L.
2016. The Seams and Summaries of Luke and of Acts. Journal for the Study of the New Testament 38:4  pp. 482 ff. Crossref logo
Može, Sara & Emad Mohamed
2019.  In Computational and Corpus-Based Phraseology [Lecture Notes in Computer Science, 11755],  pp. 315 ff. Crossref logo
Neal, Tempestt, Kalaivani Sundararajan, Aneez Fatima, Yiming Yan, Yingfei Xiang & Damon Woodard
2018. Surveying Stylometry Techniques and Applications. ACM Computing Surveys 50:6  pp. 1 ff. Crossref logo
Nini, Andrea
2018. An authorship analysis of the Jack the Ripper letters. Digital Scholarship in the Humanities 33:3  pp. 621 ff. Crossref logo
Schneider, Gerold
2020.  In Corpora and the Changing Society [Studies in Corpus Linguistics, 96],  pp. 29 ff. Crossref logo
Urbina Nájera, Argelia B., Jorge de la Calleja & Ma. Auxilio Medina
2017. Associating students and teachers for tutoring in higher education using clustering and data mining. Computer Applications in Engineering Education 25:5  pp. 823 ff. Crossref logo

This list is based on CrossRef data as of 25 june 2020. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

ReferencesThe requested document (/db/data/shared.benjamins.com/references/nlp/nlp.12.refs.xml) was not found
Subjects
BIC Subject: CFX – Computational linguistics
BISAC Subject: LAN009000 – LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number:  2014007366