Chapter published in:
Language and Text: Data, models, information and applications
Edited by Adam Pawłowski, Jan Mačutek, Sheila Embleton and George Mikros
[Current Issues in Linguistic Theory 356] 2021
► pp. 225238
Chiang, Holly, Yifan Ge & Connie Wu
2015Classification of book genres by cover and title. http://​cs229​.stanford​.edu​/proj2015​/127​_report​.pdf (7 September 2020).
Goodman, Joshua
2001Classes for fast maximum entropy training. In 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, I-561–I-564. Salt Late City, UT: IEEE. https://​arxiv​.org​/pdf​/cs​/0108006​.pdf (7 September 2020.) Crossref
Grave, Edouard, Piotr Bojanowski, Prakhar Gupta, Armand Joulin & Tomas Mikolov
2018Learning word vectors for 157 languages. In Nicoletta Calzolari et al. (eds.), Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan: European Language Resources Association (ELRA). https://​www​.aclweb​.org​/anthology​/L18​-1550​.pdf (7 September 2020).
Harris, Zellig S.
1954Distributional structure. WORD 10(2–3). 146–162. CrossrefGoogle Scholar
Hastie, Trevor, Robert Tibshirani & Jerome Friedman
2013The elements of statistical learning: Data mining, inference and prediction (Springer series in statistics). New York: Springer.Google Scholar
Joulin, Armand, Edouard Grave, Piotr Bojanowski & Tomas Mikolov
2017Bag of tricks for efficient text classification. In Mirella Lapata, Phil Blunsom & Alexander Koller (eds.), Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, 427–431. Valencia, Spain: Association for Computational Linguistics. https://​www​.aclweb​.org​/anthology​/E17​-2068​.pdf (7 September 2020) Crossref
Le, Quoc & Tomas Mikolov
2014Distributed representations of sentences and documents. In Eric P. Xing & Tony Jebara (eds.), Proceedings of the 31st International Conference on Machine Learning, 1188–1196. Bejing: JMLR. http://​proceedings​.mlr​.press​/v32​/le14​.pdf (7 September 2020).
Mikolov, Tomas, Wen-tau Yih & Geoffrey Zweig
2013Linguistic regularities in continuous space word representations. In Lucy Vanderwende, Hal Daumé III & Katrin Kirchhoff, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Atlanta: Association for Computational Linguistics, 746–751. https://​www​.aclweb​.org​/anthology​/N13​-1090​.pdf (7 September 2020).
Mikros, George K.
2013Systematic stylometric differences in men and women authors: a corpus-based study. In Reinhard Köhler & Gabriel Altmann (eds.), Issues in Quantitative Linguistics 3: Dedicated to Karl-Heinz Best on the Occasion of His 70th Birthday, 206–223, Lüdenscheid: RAM–Verlag. https://​www​.academia​.edu​/3429459​/Systematic​_stylometric​_differences​_in​_men​_and​_women​_authors​_a​_corpus​-based​_study (7 September 2020.)
Mikros, George K. & Kostas Perifanos
2013Authorship attribution in Greek tweets using author’s multilevel n-gram profiles. In Eduard Hovy, Vita Markman, Craig Martell & David Uthus (eds.), AAAI Spring Symposium: Analyzing Microtext. https://​www​.aaai​.org​/ocs​/index​.php​/SSS​/SSS13​/paper​/viewFile​/5714​/5914 (7 September 2020.)
Ozsarfati, Eran, Egemen Sahin, Can J. Saul & Alper Yilmaz
2019Book genre classification based on titles with comparative machine learning algorithms. In IEEE 4th International Conference on Computer and Communication Systems (ICCCS), 14–20. Singapore: IEEE Press. CrossrefGoogle Scholar
Rybicki, Jan
2016Vive la différence: Tracing the (authorial) gender signal by multivariate analysis of word frequencies. Digital Scholarship in the Humanities 31(4). 746–761. CrossrefGoogle Scholar
Schwartz, Roy, Oren Tsur, Ari Rappoport & Moshe Koppel
2013Authorship attribution of micro-messages. In David Yarowsky, Timothy Baldwin, Anna Korhonen, Karen Livescu & Steven Bethard (eds.), Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Seattle, WA: Association for Computational Linguistics. https://​www​.aclweb​.org​/anthology​/D13​-1193​.pdf (7 September 2020.)
Silessi, Shannon, Cihan Varol & Murat Karabatak
2016Identifying gender from SMS text messages. In 15th IEEE International Conference on Machine Learning and Applications (ICMLA), 488–491. Anaheim, CA: IEEE. CrossrefGoogle Scholar
Walkowiak, Tomasz & Maciej Piasecki
2018Stylometry analysis of literary texts in Polish. In Leszek Rutkowski, Rafał Scherer, Marcin Korytkowski, Witold Pedrycz, Ryszard Tadeusiewicz & Jacek M. Zadura (eds.) Artificial Intelligence and Soft Computing (Lecture notes in Artificial Intelligence 10842), 777–787. Cham: Springer. CrossrefGoogle Scholar