Chapter published in:Parallel Corpora for Contrastive and Translation Studies: New resources and applications
Edited by Irene Doval and M. Teresa Sánchez Nieto
[Studies in Corpus Linguistics 90] 2019
► pp. 197–214
Indexation and analysis of a parallel corpus using CQPweb
The COVALT PAR_ES Corpus (EN/FR/DE > ES)
This contribution presents a section of the Corpus Valencià de Literatura Traduïda (COVALT), created by the research group of the same name (Department of Translation and Communication, Universitat Jaume I, Spain). The COVALT corpus is a four-million word corpus made up of narrative works originally written in English, French, and German and their Catalan translations published in the autonomous community of Valencia between 1990 and 2000. Since the members of the Covalt group are interested in translation research, and more specifically in the investigation of translated Catalan and Spanish, this corpus has recently been extended to include translations into Spanish published in Spain (COVALT PAR_ES corpus). This chapter presents the COVALT PAR_ES corpus, as well as its process of compilation and analysis with CQPweb.
- 2.The corpora
- 3.Corpus compilation and indexation
- 3.1Preparation of texts
- 3.2Uploading the files to CQPweb
- Step 1: Creating directories
- Step 2: Encoding and indexing corpora in CWB
- Step 3: Aligning the subcorpora
- Step 4: Copying the files to CQPweb
- Step 5: Activating the corpora on the web interface
- 4.Corpus analysis
Published online: 20 March 2019
Badia, Toni, Pujol, Manel, Tuells, Antoni, Vivaldi, Jorge, Yzaguirre, Lluís de & Cabré, Mª Teresa
Christ, Oliver, Schulze, Bruno, M., Hofmann, Anja & König, Esther
1999 The Open IMS Corpus Workbench. Corpus Query Processor. User’s Manual. Stuttgart: University of Stuttgart. http://corpora.dslo.unibo.it/TCORIS/cqpman.pdf (29 March 2017).
Doval, Irene, Fernández Lanza, Santiago, Jiménez Juliá, Tomás, Liste Lamas, Elsa & Lübke, Barbara
This volume. Corpus PaGeS: A multifunctional resource for language learning, translation and cross-linguistic research. In Parallel Corpora for Contrastive and Translation Studies: New Resources and Applications [Studies in Corpus Linguistics 90], Irene Doval & M. Teresa Sánchez eds Amsterdam John Benjamins
Evert, Stefan & The CWB Development Team
2016: The IMS Open Corpus WorkBench (CWB). Corpus Encoding Tutorial. http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf (20 October 2016).
Frankenberg-Garcia, Anna & Santos, Diana
Gómez Guinovart, Xavier & Sacau Fontela, Elena
Guzman, Josep R.
2012 CQPweb – combining power, flexibility and usability in a corpus analysis tool. International Journal of Corpus Linguistics 17(3): 380–409. http://www.lancaster.ac.uk/staff/hardiea/cqpweb-paper.pdf (27 April 2017).
2014 The IMS Open Corpus Workbench (CWB) CQPweb System Administrator’s Manual. http://cwb.sourceforge.net/files/CQPwebAdminManual.pdf (8 October 2016).
2016 The IMS Open Corpus Workbench (CWB). CQPweb System Administrator’s Manual. http://cwb.sourceforge.net/ (20 November 2016).
2013a Tracing marked collocation in translated and non-translated literary language: A case study based on a parallel and comparable corpus. In Tracks and Treks in Translation Studies [Benjamins Translation Library 108], Catherine Way, Sonia Vandepitte, Reine Meylaerts & Magdalena Bartłomiejczyk (eds), 167–188. Amsterdam: John Benjamins.
Martínez Vilinsky, Bárbara
Mikhailov, Mikhail & Cooper, Robert
Oster, Ulrike & van Lawick, Heike
Oster, Ulrike & Molés-Cases, Teresa
Przepiórkowski, Adam, Górski, Rafał L., Łaziński, Marek & Pezik, Piotr
Sanjurjo-González, Hugo & Izquierdo, Marlen
This volume. P-ACTRES 2.0: A parallel corpus for cross-linguistic research. In Parallel Corpora for Contrastive and Translation Studies: New Resources and Applications [Studies in Corpus Linguistics 90], Irene Doval & M. Teresa Sánchez eds Amsterdam John Benjamins
Cited by 1 other publications
Doval, Irene, Santiago Fernández Lanza, Tomás Jiménez Juliá, Elsa Liste Lamas & Barbara Lübke
This list is based on CrossRef data as of 01 april 2022. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.