Pablo Martínez |
300.000 Km/s
|
Institute of Advanced Studies of Catalonia
This paper addresses the quantification of literary production in Spanish between 1900 and 1945.
To obtain quantitative data, we must consider catalogue records both of main libraries and of newspaper and
periodicals libraries. The latter preserve literary magazines that feature articles and translations from peripheral
authors who were overshadowed or forgotten by historical narrative. Given the absence of proper cataloguing in
newspaper and periodicals libraries, it is necessary to develop a workflow that enables massive and accurate searches.
The aim of the proposed methodology is to build a meta catalogue that allows to search for authors and works together
with translators or illustrators. Only this way will we be able to obtain an overall image of that literary
period.
2003 “Features for Neural Net Based Region Identification of Newspaper Documents”. In Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings, 1:403–7. IEEE Comput. Soc..
Chen, Xiaotian
2012 “Google Books and WorldCat: A Comparison of Their Content”. Online Information Review 36 (4): 507–516. .
Cristianini, Nello, Thomas Lansdall-Welfare, and Gaetano Dato
2018 “Large-Scale Content Analysis of Historical Newspapers in the Town of Gorizia
1873–1914”. Historical Methods: A Journal of Quantitative and Interdisciplinary History 51 (3): 139–64.
Gulordava, Kristina, and Marco Baroni
2011 “A Distributional Similarity Approach to the Detection of Semantic Change in the Google Books Ngram
Corpus”. In Proceedings of the GEMS 2011 Workshop on Geometrical Models of Natural Language Semantics, edited by Sebastian Pado and Yves Peirsman, 67–71. Edinburgh. [URL]
Heilbron, Johan
1999 “Towards a Sociology of Translation: Book Translations as a Cultural World-System”. European Journal of Social Theory 2 (4): 429–44.
Juola, P.
2013 “Using the Google N-Gram Corpus to Measure Cultural Complexity”. Literary and Linguistic Computing 28 (4): 668–75.
Koplenig, Alexander
2015 “The Impact of Lacking Metadata for the Measurement of Cultural and Linguistic Change Using the
Google Ngram Data Sets – Reconstructing the Composition of the German Corpus in Times of WWII”. Digital Scholarship in the Humanities 32 (1): 169–88.
Lamarca Morell, Dolors, and Eugènia Serra Aranda
2005 “L’estratègia de La Biblioteca de Catalunya En Projectes Digitals”. Item: Revista de Biblioteconomia i Documentació, no. 41: 41–53. doi: [URL].
Lansdall-Welfare, Thomas, Saatviga Sudhahar, James Thompson, Justin Lewis, FindMyPast Newspaper Team, and Nello Cristianini
2017 “Content Analysis of 150 Years of British Periodicals.” Proceedings of the National Academy of Sciences of the United States of America 114 (4): E457–65.
Michel, Jean-Baptiste, Yuan Kui Shen, Aviva Presser Aiden, Adrian Veres, Matthew K. Gray, The Google Books Team, Joseph P. Pickett, et al.
2011 “Quantitative Analysis of Culture Using Millions of Digitized Books.” Science (New York, N.Y.) 331 (6014): 176–82.
Paaß, Gerhard, and Iuliu Konya
2011 “Machine Learning for Document Structure Recognition”. Studies in Computational Intelligence 370: 221–47.
Pechenick, Eitan Adam, Christopher M. Danforth, and Peter Sheridan Dodds
2015 “Characterizing the Google Books Corpus: Strong Limits to Inferences of Socio-Cultural and
Linguistic Evolution”. Edited by Alain Barrat. PLOS ONE 10 (10): e0137041. .
Tahmasebi, Nina, Lars Borin, Gabriele Capannini, Devdatt Dubhashi, Peter Exner, Markus Forsberg, Gerhard Gossen, et al.
2015 “Visions and Open Challenges for a Knowledge-Based Culturomics”. International Journal on Digital Libraries 15 (2–4): 169–87.
Zhang, Wei, and Timothy L. Andersen
2003 “Using Artificial Neural Networks to Identify Headings in Newspaper Documents”. In Proceedings of the International Joint Conference on Neural Networks, 2003, 3:2283–87. IEEE.
This list is based on CrossRef data as of 27 september 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.