Corpus-based contrastive studies
Beginnings, developments and directions
This article outlines the beginnings of corpus-based contrastive studies with special reference to the development of parallel corpora that took place in Scandinavia in the early 1990s under the direction of Stig Johansson. It then discusses multilingual corpus types and methodological issues of their exploration, including the tertium comparationis for contrastive studies based on different types of corpora. Some glimpses are offered of recent developments and current trends in the field, including the widening scope of corpus-based contrastive analysis, concerning language pairs as well as the kinds of topics studied and the methods used. The paper ends by identifying and discussing some challenges for the field and indicating prospects and directions for its future.
Article outline
- 1. Introduction
- 2. Beginnings and early developments
- 3. Using corpora in contrastive studies
- 3.1 Types of multilingual corpora
- 3.2 Multilingual corpora and contrastive analysis methodology
- 4. Ongoing developments
- 4.1 The development of corpus-based CA through the lens of Languages in Contrast
- 4.2 Widening the scope: Current trends
- 5. Challenges for corpus-based contrastive linguistics
- 6. Future directions
- Notes
- References