Correspondence analysis is an exploratory technique for complex categorical data, typical of corpus-driven research. It identifies patterns of association and disassociation in those data. For instance, it can map the correlations between different uses of a linguistic form and its various social and/or morpho-syntactic contexts. The technique presents its results in the form of a two-dimensional plot, which visualises these relationships in an intuitive manner. These plots offer rich representations of the relations between different facets of complex data. Using R, this chapter explains how the technique works and offers a step-by-step explanation of its application and the interpretation of its results. The technique is also compared to the better-known and comparable cluster analysis.
(2002) Categorical data analysis (2nd ed.). Hoboken: John Wiley.
Arppe, A
2006Frequency considerations in morphology. Finnish verbs differ, too. SKY Journal of Linguistics, 19, 175–189.
Baayen, R.H
(2008) Analyzing linguistic data: A practical introduction to statistics using R. Cambridge: Cambridge University Press.
Baayen, R.H
(2011) languageR: Data sets and functions with “Analyzing Linguistic Data: A practical introduction to statistics”. R package version 1.1. Retrieved from [URL].
Benzécri, J.P
(1992) Correspondence analysis handbook. New York:Marcel Dekker.
De Leeuw, J., & Mair, P
(2009a) Simple and canonical correspondence analysis using the R package anacor. Journal of Statistical Software, 31, 1–18. Retrieved from [URL].
De Leeuw, J., & Mair, P
(2009b) Gifi methods for optimal scaling in R: The package homals. Journal of Statistical Software, 31, 1–20. Retrieved from [URL].
Delaere, I., Plevoets, K., & De Sutter, G
(Submitted) Measuring text type variation through profile-based correspondence analysis: How far apart are translated and non-translated Dutch? Target. International Journal of Translation Studies.
Divjak, D
(2010) Structuring the lexicon: A clustered model for near-synonymy. Berlin & New York: Mouton de Gruyter.
Dray, S., & Dufour, A.-B
(2007) The ade4 package: Implementing the duality diagram for ecologists. Journal of Statistical Software, 22, 1–20.
(2011) Cognitive Linguistic methods for literature: A usage-based approach to metanarrative and metalepsis. In A. Kwiatkowska (Ed.), Texts and minds: Papers in cognitive poetics and rhetoric (pp. 85–102). Frankfurt/Main: Peter Lang.
(2010) Synonymy, lexical fields, and grammatical constructions: A study in usage-based Cognitive Semantics. In H.-J. Schmid, & S. Handl (Eds.), Cognitive foundations of linguistic usage-patterns: Empirical studies (pp. 89–118). Berlin & New York: Mouton de Gruyter.
Glynn, D
(2014a) The conceptual profile of the lexeme home: A multifactorial diachronic analysis. In J.E. Díaz-Vera (Ed.), Metaphor and metonymy across time and cultures (pp. 265–293). Berlin & New York: Mouton de Gruyter.
Glynn, D
(2014b) The social nature of anger: Multivariate corpus evidence for context effects upon conceptual structure. In I. Novakova, P. Blumenthal, & D. Siepmann (Eds.), Emotions in discourse (pp. 69–82). Frankfurt/Main: Peter Lang.
Glynn, D
(In press) Cognitive socio-semantics: The theoretical and analytical role of context in meaning. Review of Cognitive Linguistics.
Gower, J., Gardner-Lubbe, S., & le Roux, N
(2010) Understanding biplots. Chichester: Wiley.
Greenacre, M., & Blasius, J
(Eds.) (2006) Multiple correspondence analysis and related methods. London: Chapman & Hall.
Greenacre, M., & Nenadić, O
(2010) ca: Simple, multiple and joint correspondence analysis. R package version 0.33. Retrieved from [URL].
Greenacre, M
(1984) Theory and applications of correspondence analysis. London: Academic Press.
Greenacre, M
(2006) From simple to multiple correspondence analysis. InM. Greenacre, & J. Blasius(Eds.), Multiple correspondence analysis and related methods (pp. 41–76). London: Chapman & Hall.
Greenacre, M
(2007) Correspondence analysis in practice. London: Chapman & Hall.
Greenacre, M
(2010) Biplots in practice. Bilbao: Fundación BBVA.
Husson, F., Lê, S., & Pagès, J
(2011) Exploratory multivariate analysis by example using R. London: Chapman & Hall.
Kowalczyk, T., Pleszczynska, E., & Ruland, F
(Eds.) (2004) Grade models and methods for data analysis. München: Springer.
Krawczak, K
(2014a) Shame and its near-synonyms in English: A multivariate corpus-driven approach to social emotions. In I. Novakova, P. Blumenthal, & D. Siepmann (Eds.), Emotions in discourse (pp. 84–94). Frankfurt/Main: Peter Lang.
Krawczak, K
(2014b) Epistemic stance predicates in English: A quantitative corpus-driven study of subjectivity. In D. Glynn, & M. Sjölin (Eds.), Subjectivity and epistemicity: Corpus, discourse, and literary approaches to stance (pp. 355–386). Lund: Lund University Press.
Krawczak, K., & Glynn, D
(2011) Context and cognition: A corpus-driven approach to parenthetical uses of mental predicates. In K. Kosecki, & J. Badio (Eds.), Cognitive processes in language (pp. 87–99). Frankfurt/Main: Peter Lang.
Krawczak, K., & Kokorniak, I
(2012) Subjective construal of think in Polish. Poznań Studies in Contemporary Linguistics, 48, 439–472.
Le Roux, B., & Rouanet, H
(2005) Geometric data analysis: From correspondence analysis to structured data analysis. London: Kluwer.
(2008) FactoMineR: An R package for multivariate analysis. Journal of Statistical Software, 25, 1–18.
Murtagh, F
(2005) Correspondence analysis and data coding with R and Java. London: Chapman & Hall.
Nenadić, O., & Greenacre, M
(2007) Correspondence analysis in R, with two- and three-dimensional graphics: The ca package. Journal of Statistical Software, 20. Retrieved from [URL].
Oksanen, J., Blanchet, G., Kindt, R., Legendre, P., O’Hara, R.B., Simpson, G.L., Solymos, P., Henry, M., Stevens, H., & Wagner, H
(2011) vegan: Community ecology package. R package version 1.17-11. Retrieved from [URL].
Oksanen, J
(2006) Multivariate analysis of ecological communities in R: vegan tutorial. Retrieved from [URL].
Campana, Ilaria, Ivan Farace, Miriam Paraboschi & Antonella Arcangeli
2023. Analysis of environmental, social, and anthropogenic factors as potential drivers of breaching behavior in the Mediterranean fin whale. Marine Mammal Science
Chen, Qiaoyun
2022. English and Chinese existential constructions in contrast: A corpus-based semantic study. Poznan Studies in Contemporary Linguistics 58:4 ► pp. 717 ff.
Clarke, Isobelle
2018. Stylistic Variation in Twitter Trolling. In Online Harassment [Human–Computer Interaction Series, ], ► pp. 151 ff.
Dahlgren, Sonja, Alek Keersmaekers & Joanne Stolk
2022. Language contact in historical documents: the identification and co-occurrence of Egyptian transfer features in Greek documentary papyri. Journal of Historical Sociolinguistics 8:2 ► pp. 325 ff.
2017. Zooming in on Verbs in the Progressive: A Collostructional and Correspondence Analysis Approach. Journal of English Linguistics 45:3 ► pp. 260 ff.
2015. Less is more: possibility and necessity as centres of gravity in a usage-based classification of core modals in Polish. Russian Linguistics 39:3 ► pp. 327 ff.
Dou, Jinmeng & Meichun Liu
2023. Exploring color metaphor with Behavioral Profiles: A usage-based analysis on the metaphorical meanings of the Chinese color term bái “white”. Lingua 289 ► pp. 103539 ff.
2017. Les prépositions à et de et la complémentation verbale. Langages N° 206:2 ► pp. 65 ff.
Fang, Lumin
2022. Constituency map of the alternative for Germany (AfD) vote in 2017: analysing characteristic differences via multiple correspondence analysis. Journal of Contemporary European Studies 30:2 ► pp. 313 ff.
Flach, Susanne
2020. Schemas and the frequency/acceptability mismatch: Corpus distribution predicts sentence judgments. Cognitive Linguistics 31:4 ► pp. 609 ff.
Gaio, Mario, Carmen Ferrajolo, Alessia Zinzi, Consiglia Riccardi, Pasquale Di Filippo, Ludovica Carangelo, Gorizio Pieretti, Francesco Rossi, Giovanni Francesco Nicoletti & Annalisa Capuano
2021. Association of Direct Oral Anticoagulants (DOACs) and Warfarin With Haemorrhagic Risk by Applying Correspondence Analysis to Data From the Italian Pharmacovigilance Database – A Case Study. Frontiers in Pharmacology 12
Garraffoni, André R. S., Fabrício C. Alcântara & Hélio H. Checon
2017. Evaluating the anesthetization and fixation efficacy of “soft” and “hard” freshwater benthic meiofauna: what is the best method for specimen preservation?. Limnology 18:2 ► pp. 209 ff.
2022. Death, enemies, and illness: How English and Russian metaphorically conceptualise boredom. Yearbook of the German Cognitive Linguistics Association 10:1 ► pp. 33 ff.
Hartmann, Stefan
2021. Diachronic Cognitive Linguistics. Yearbook of the German Cognitive Linguistics Association 9:1 ► pp. 1 ff.
2019. From Athenian fleet to prophetic eschatology. Correlating formal features to themes of discourse in Ancient Greek. Folia Linguistica 53:s40-s2 ► pp. 355 ff.
Jannusch, Tim, Darren Shannon, Michaele Völler, Finbarr Murphy & Martin Mullins
2021. Smartphone Use While Driving: An Investigation of Young Novice Driver (YND) Behaviour. Transportation Research Part F: Traffic Psychology and Behaviour 77 ► pp. 209 ff.
Jin, Junjie & Fuyin Thomas Li
2023. A multifactorial aspectual analysis of verb concatenation with imperfective markers zhe in Mandarin. Corpus Linguistics and Linguistic Theory 0:0
2014. 2014 IEEE International Professional Communication Conference (IPCC), ► pp. 1 ff.
Lam, Chris
2016. Correspondence Analysis: A Statistical Technique Ripe for Technical and Professional Communication Researchers. IEEE Transactions on Professional Communication 59:3 ► pp. 299 ff.
2023. Towards a dynamic behavioral profile of the Mandarin Chinese temperature term re: a diachronic semasiological approach. Corpus Linguistics and Linguistic Theory 19:2 ► pp. 289 ff.
Podhorodecka, Joanna
2021. Real-life pseudo-passives: The usage and discourse functions of adjunct-based passive constructions. Poznan Studies in Contemporary Linguistics 57:1 ► pp. 33 ff.
Rogos-Hebda, Anna
2020. It’s Raining Immigrants! HELLelujah!: The Metaphors of Immigration in Early American Magazines (1828–1959). Anglica. An International Journal of English Studies :29/2
Shao, Bin, Yingying Cai & Graeme Trousdale
2019. A Multivariate Analysis of Diachronic Variation inA Bunch of noun: A Construction Grammar Account. Journal of English Linguistics 47:2 ► pp. 150 ff.
Tahi, Mathias, Caudou Trebissou, Fabienne Ribeyre, Boguinard Sahin Guiraud, Désiré N’ da Pokou & Christian Cilas
2019. Variation in yield over time in a cacao factorial mating design: changes in heritability and longitudinal data analyses over 13 consecutive years. Euphytica 215:6
Tizón-Couto, David & David Lorenz
2021. Variables are valuable: making a case for deductive modeling. Linguistics 59:5 ► pp. 1279 ff.
Wyroślak, Piotr
2022. No big deal: Situation-backgrounding uses of the Polish dative reflexive pronoun sobie/se
. Yearbook of the German Cognitive Linguistics Association 10:1 ► pp. 77 ff.
2022. A corpus-based study of congruent and metaphorical patterns of modality in English. Studia Neophilologica► pp. 1 ff.
This list is based on CrossRef data as of 12 may 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.