Collocation and colligation

Tomas Lehecka

Collocation and colligation are two closely related concepts associated with the distributional properties of linguistic items in actual language use. Specifically, collocation refers to the likelihood of co-occurrence of (two or more) lexical items, and colligation to the likelihood of co-occurrence of grammatical categories. Both terms have been attributed to J. R. Firth (1957: 194–195; 1968: 181–183; see Östman and Simon-Vandenbergen 2005 and Shore 2010 for a summary of Firth’s work). Since the terms were introduced, collocation in particular has become a fundamental concept in usage-based studies in many linguistic fields, most notably lexical syntax and semantics. Typically, collocations and colligations are studied in large electronic corpora, which allow for statistical analyses of the co-occurrence patterns of linguistic items.
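As a rough illustration of what a statistical analysis of co-occurrence patterns can look like, the following minimal sketch scores adjacent word pairs with pointwise mutual information (PMI). PMI is only one of many association measures and is chosen here for brevity; the text above does not prescribe any particular measure. The function name `pmi_scores`, the parameter `min_count` and the toy corpus are hypothetical and serve illustration only.

```python
import math
from collections import Counter


def pmi_scores(tokens, min_count=1):
    """Return PMI (log base 2) for each adjacent word pair in `tokens`.

    Illustrative assumption: probabilities are plain relative frequencies,
    with no smoothing and no significance testing.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_unigrams = sum(unigrams.values())
    n_bigrams = sum(bigrams.values())

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # skip pairs below the (hypothetical) frequency cut-off
        p_pair = count / n_bigrams
        p_w1 = unigrams[w1] / n_unigrams
        p_w2 = unigrams[w2] / n_unigrams
        # PMI compares the observed pair probability with the probability
        # expected if the two words occurred independently of each other.
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


# Toy corpus, far too small for real conclusions.
toy_corpus = (
    "strong tea and strong coffee but powerful computers and powerful engines"
).split()
scores = pmi_scores(toy_corpus)

# Print pairs from the highest to the lowest association score.
for pair, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(pair, round(score, 3))
```

On real data, a corpus of millions of tokens and a frequency cut-off would be needed, since PMI tends to overrate pairs that occur only once or twice. An analogous count over part-of-speech tags instead of word forms would give a first approximation of colligational preferences.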


Arppe, A.
2008 Univariate, Bivariate and Multivariate Methods in Corpus-based Lexicography: A Study of Synonymy. University of Helsinki. http://urn.fi/URN:ISBN:978-952-10-5175-3 [10.10.2014]
Arppe, A. and J. Järvikivi
2007 “Every method counts: Combining corpus-based and experimental evidence in the study of synonymy.” Corpus Linguistics and Linguistic Theory 3(2): 131–159.
Aston, G. and L. Burnard
1998 The BNC Handbook. Exploring the British National Corpus with SARA. Edinburgh: Edinburgh University Press.
Atkins, B.T.S.
1987 “Semantic ID tags: Corpus evidence for dictionary senses.” Proceedings of the Third Annual Conference of the UW Centre for the New Oxford English Dictionary, 17–36. Waterloo: University of Waterloo.
Atkins, B.T.S. and B. Levin
1995 “Building on a corpus: A linguistic and lexicographical look at some near-synonyms.” International Journal of Lexicography 8(2): 85–114.
Bartsch, S.
2004 Structural and Functional Properties of Collocations in English: A Corpus Study of Lexical and Pragmatic Constraints on Lexical Co-occurrence. Tübingen: Narr.
Benson, M., E. Benson and R.F. Ilson
1986 Lexicographic Description of English. Amsterdam: John Benjamins.
Berry-Rogghe, G.
1973 “The computation of collocations and their relevance to lexical studies.” In The Computer and Literary Studies, ed. by A.J. Aitken, R.W. Bailey and N. Hamilton-Smith, 103–112. Edinburgh: Edinburgh University Press.
Biber, D.
2009 “A corpus-driven approach to formulaic language in English: Multi-word patterns in speech and writing.” International Journal of Corpus Linguistics 14(3): 275–311.
Biber, D. and S. Conrad
1999 “Lexical bundles in conversation and academic prose.” In Out of Corpora. Studies in Honour of Stig Johansson, ed. by H. Hasselgård and S. Oksefjell, 182–190. Amsterdam: Rodopi.
Biber, D., S. Conrad and R. Reppen
1998 Corpus Linguistics: Investigating Language Structure and Use. Cambridge: Cambridge University Press.
Butler, C.S.
2004 “Corpus studies and functional linguistic theories.” Functions of Language 11(2): 147–186.
Bybee, J. and P. Hopper
(eds.) 2001 Frequency and the Emergence of Linguistic Structure. Amsterdam: John Benjamins.
Choueka, Y.
1988 “Looking for needles in a haystack.” In Proceedings, RIAO Conference on User-oriented Context Based Text and Image Handling, 609–623. Cambridge, MA.
Church, K., W. Gale, P. Hanks and D. Hindle
1991 “Using statistics in lexical analysis.” In Lexical Acquisition: Exploiting On-line Resources to Build a Lexicon, ed. by U. Zernik, 115–164. Hillsdale: Lawrence Erlbaum.
Church, K., W. Gale, P. Hanks, D. Hindle and R. Moon
1994 “Lexical substitutability.” In Computational Approaches to the Lexicon, ed. by B.T.S. Atkins and A. Zampolli, 153–177. Oxford: Oxford University Press.
Clear, J.
1993 “From Firth principles: Computational tools for the study of collocation.” In Text and Technology: In Honour of John Sinclair, ed. by M. Baker, G. Francis and E. Tognini-Bonelli, 271–292. Philadelphia: John Benjamins.
Dilts, P.
2009 “Good nouns, bad nouns: What the corpus says and what native speakers think.” Language and Computers 71(1): 103–117.
Dilts, P. and J. Newman
2006 “A note on quantifying ‘good’ and ‘bad’ prosodies.” Corpus Linguistics and Linguistic Theory 2(2): 233–242.
Divjak, D. and S.T. Gries
2008 “Clusters in the mind? Converging evidence from near synonymy in Russian.” The Mental Lexicon 3(2): 188–213.
2006 “Ways of trying in Russian: Clustering behavioral profiles.” Corpus Linguistics and Linguistic Theory 2(1): 23–60.
Evert, S.
2009 “Corpora and collocations.” In Corpus Linguistics: An International Handbook, Vol. 2, ed. by A. Lüdeling and M. Kytö, 1212–1248. Berlin, New York: Mouton de Gruyter.
2005 The Statistics of Word Cooccurrences: Word Pairs and Collocations. Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart.
2004 Association Measures. www.collocations.de/AM/ [10.11.2014]
Fillmore, C.J.
1988 “The mechanisms of ‘Construction Grammar’.” In Proceedings of the Fourteenth Annual Meeting of the Berkeley Linguistics Society, ed. by S. Axmaker, A. Jaisser and H. Singmaster, 35–55. Berkeley: Berkeley Linguistics Society.
Fillmore, C.J., C.R. Johnson and M.R.L. Petruck
2003 “Background to FrameNet.” International Journal of Lexicography 16(3): 235–250.
Firth, J.R.
1968 Selected Papers of J. R. Firth 1952–59. London: Longmans.
1957 Papers in Linguistics 1934–1951. London: Oxford University Press.
Fried, M. and J. Östman
2003 “The explicit and the implicit in the Suasion Frame.” In Proceedings of CIL17, ed. by E. Hajičová, A. Kotěšovcová and J. Mírovský, 1–22. Prague: Matfyzpress.
Goldberg, A.E.
2006 Constructions at Work: The Nature of Generalization in Language. Oxford: Oxford University Press.
Greaves, C. and M. Warren
2010 “What can a corpus tell us about multi-word units.” In The Routledge Handbook of Corpus Linguistics, ed. by M. McCarthy and A. O’Keeffe, 212–226. Abingdon: Routledge.
Gries, S.T.
2015 “Some current quantitative problems in corpus linguistics and a sketch of some solutions.” Language and Linguistics 16(1): 93–117.
2010a “Behavioral profiles: A fine-grained and quantitative approach in corpus-based lexical semantics.” The Mental Lexicon 5(3): 323–346.
2003 “Testing the sub-test: An analysis of English -ic and -ical adjectives.” International Journal of Corpus Linguistics 8(1): 31–61.
2001 “A corpus-linguistic analysis of English -ic vs. -ical adjectives.” ICAME Journal 25: 65–108.
Gries, S.T. and A. Stefanowitsch
2004a “Co-varying collexemes in the into-causative.” In Language, Culture, and Mind, ed. by M. Achard and S. Kemmer, 225–236. Stanford, CA: CSLI.
2004b “Extending collostructional analysis: A corpus-based perspective on ‘alternations’.” International Journal of Corpus Linguistics 9(1): 97–129.
Gries, S.T. and N. Otani
2010 “Behavioral profiles: A corpus-based perspective on synonymy and antonymy.” ICAME Journal 34: 121–150.
Halliday, M.A.K.
1966 “Lexis as a linguistic level.” In In Memory of J. R. Firth, ed. by C.E. Bazell, J.C. Catford, M.A.K. Halliday and R.H. Robins, 148–163. London: Longman.
Hausmann, F.J.
2003 “Was sind eigentlich Kollokationen?” In Wortverbindungen – mehr oder weniger fest, ed. by K. Steyer, 309–334. Berlin: Walter de Gruyter.
Hoey, M.
2009 “Corpus-driven approaches to grammar.” In Exploring the Lexis-Grammar Interface, ed. by U. Römer and R. Schulze, 33–48. Amsterdam: John Benjamins.
2005 Lexical Priming: A New Theory of Words and Language. London: Routledge.
2004 “The textual priming of lexis.” In Corpora and Language Learners, ed. by G. Aston, S. Bernardini and D. Stewart, 21–42. Amsterdam: John Benjamins.
1997 “From concordance to text structure: New uses for computer corpora.” In PALC ’97. Proceedings of Practical Applications of Linguistic Corpora conference, ed. by B. Lewandowska-Tomaszczyk and J. Melia, 2–23. Lodz: Lodz University Press.
1991 Patterns of Lexis in Text. Oxford: Oxford University Press.
Hoey, M. and M.B. O’Donnell
2008 “The beginning of something important? Corpus evidence on the text beginnings of hard news stories.” In Corpus Linguistics, Computer Tools and Applications: State of the Art, ed. by B. Lewandowska-Tomaszczyk. PALC 2007: 189–212.
Hopper, P.
1998 “Emergent grammar.” In The New Psychology of Language: Cognitive and Functional Approaches to Language Structure, ed. by M. Tomasello, 155–176. Hillsdale, NJ: Erlbaum.
1988 “Emergent grammar and the a priori grammar postulate.” In Linguistics in Context: Connecting Observation and Understanding. Lectures from the 1985 LSA/TESOL and NEH Institutes, ed. by D. Tannen, 117–134. Norwood, NJ: Ablex.
Hunston, S.
2007 “Semantic prosody revisited.” International Journal of Corpus Linguistics 12(2): 249–268.
Jantunen, J.H.
2004 Synonymia ja käännössuomi. Korpusnäkökulma samamerkityksisyyden kontekstuaalisuuteen ja käännöskielen leksikaalisiin erityispiirteisiin. (University of Joensuu publications in the humanities 35.) Joensuu: Joensuun Yliopisto. http://epublications.uef.fi/pub/urn_isbn_952-458-479-4/urn_isbn_952-458-479-4.pdf [20.8.2014]
Jones, S. and J. Sinclair
1974 “English lexical collocations: A study in computational linguistics.” Cahiers de Lexicologie 24(1): 15–61.
Kennedy, G.
1991 “Between and through: The company they keep and the functions they serve.” In English Corpus Linguistics: Studies in Honour of Jan Svartvik, ed. by K. Aijmer and B. Altenberg, 95–110. London: Longman.
Kenny, D.
2000 “Lexical hide-and-seek: Looking for creativity in a parallel corpus.” In Intercultural Faultlines. Research Models in Translation Studies I. Textual and Cognitive Aspects, ed. by M. Olohan, 93–104. Manchester: St. Jerome Publishing.
Kjellmer, G.
2003 “Synonymy and corpus work: On almost and nearly.” ICAME Journal 27: 19–27.
1996 “Idiomen, kollokationerna och lexikonet.” Lexico-Nordica 3: 79–90.
1987 “Aspects of English collocations.” In Corpus Linguistics and Beyond: Proceedings of the Seventh International Conference on English Language Research on Computerised Corpora, ed. by W. Meijs. Amsterdam: Rodopi.
1984 “Some thoughts on collocational distinctiveness.” In Corpus Linguistics: Recent Developments in the Use of Computer Corpora in English Language Research, ed. by J. Aarts and W. Meijs, 163–171. Amsterdam: Rodopi.
(ed.) 1994 A Dictionary of English Collocations, Vol. 1–3. Oxford: Clarendon Press.
Lea, D.
(ed.) 2002 Oxford Collocations Dictionary for Students of English. Oxford: Oxford University Press.
Leech, G.
1974 Semantics: The Study of Meaning. Harmondsworth: Penguin Books.
Lehecka, T.
2012a Interrelaterade lexikala egenskaper. Engelska adjektivimporter i en svensk tidningskorpus. (Nordica Helsingiensia 32.) Helsingfors: Helsingfors universitet. http://urn.fi/URN:ISBN:978-952-10-8530-7 [20.8.2014]
2012b “Probabilistisk syntaktisk analys av engelska adjektiv i svensk tidningstext.” Maal og Minne 104(1): 72–109.
2013 “Kollokationer och kolligationer: Om förhållandet mellan adjektivens semantiska och syntaktiska preferenser.” Folkmålsstudier 51: 49–85.
Louw, B.
1993 “Irony in the text or insincerity in the writer? The diagnostic potential of semantic prosodies.” In Text and Technology: In Honour of John Sinclair, ed. by M. Baker, G. Francis and E. Tognini-Bonelli, 157–176. Philadelphia/Amsterdam: John Benjamins.
Manning, C.D. and H. Schütze
1999 Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press.
Mauranen, A.
2000 “Strange strings in translated language: A study on corpora.” In Intercultural Faultlines. Research Models in Translation Studies, ed. by M. Olohan, 119–141. Manchester: St. Jerome Publishing.
McEnery, T., R. Xiao and Y. Tono
2006 Corpus-based Language Studies: An Advanced Resource Book. London: Routledge.
Mel’čuk, I.
1998 “Collocations and lexical functions.” In Phraseology: Theory, Analysis and Applications, ed. by A. Cowie, 23–53. Oxford: Clarendon Press.
Newman, J. and S. Rice
2006 “Transitivity schemas of English EAT and DRINK in the BNC.” In Corpora in Cognitive Linguistics: Corpus-Based Approaches to Syntax and Lexis, ed. by S.T. Gries and A. Stefanowitsch, 225–260. Berlin/New York: Mouton de Gruyter.
Östman, J.
2005 “Persuasion as implicit anchoring: The case of collocations.” In Persuasion across Genres, ed. by H. Halmari and T. Virtanen, 183–212. Amsterdam: John Benjamins.
Östman, J. and A. Simon-Vandenbergen
2005 “Firthian linguistics.” In Handbook of Pragmatics Online, ed. by J. Östman and J. Verschueren. Amsterdam: John Benjamins. [10.10.2014]
Palmer, F.R.
1976 Semantics: A New Outline. Cambridge: Cambridge University Press.
Partington, A.
1998 Patterns and Meanings: Using Corpora for English Language Research and Teaching. Amsterdam: John Benjamins.
Renouf, A. and J. Sinclair
1991 “Collocational frameworks in English.” In English Corpus Linguistics: Studies in Honour of Jan Svartvik, ed. by K. Aijmer and B. Altenberg, 128–143. London: Longman.
Ruppenhofer, J., C.J. Fillmore and C.F. Baker
2002 “Collocational information in the FrameNet database.” In Proceedings of the Tenth Euralex International Congress, Vol. 1, ed. by A. Braasch and C. Povlsen, 359–369. Copenhagen.
Sag, I.A., T. Baldwin, F. Bond, A. Copestake and D. Flickinger
2002 “Multi-word expressions: A pain in the neck for NLP.” In Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics, 1–15. Stanford, CA: Stanford University.
Shore, S.
2010 “J. R. Firth.” In Handbook of Pragmatics Online, ed. by J. Östman and J. Verschueren. Amsterdam: John Benjamins. [10.10.2014]
Siepmann, D.
2005 “Collocation, colligation and encoding dictionaries. Part I: Lexicological aspects.” International Journal of Lexicography 18(4): 409–443.
Sinclair, J.
2004 Trust the Text: Language, Corpus and Discourse. London: Routledge.
1998 “The lexical item.” In Contrastive Lexical Semantics, ed. by E. Weigand, 1–24. Amsterdam: John Benjamins.
1996 “The Search for Units of Meaning.” Textus IX(1): 75–106.
1991 Corpus, Concordance, Collocation. Oxford: Oxford University Press.
1987 “Collocation: A progress report.” In Language Topics: Essays in Honour of Michael Halliday, ed. by R. Steele and T. Threadgold, 319–331. Amsterdam: John Benjamins.
1966 “Beginning the study of lexis.” In In Memory of J. R. Firth, ed. by C.E. Bazell, J.C. Catford, M.A.K. Halliday and R.H. Robins, 410–430. London: Longman.
(ed.) 1995 Collins COBUILD English Dictionary. London: Harper Collins.
Smadja, F.
1993 “Retrieving collocations from text: Xtract.” Computational Linguistics 19(1): 143–177.
Stefanowitsch, A. and S.T. Gries
2009 “Corpora and grammar.” In Corpus Linguistics: An International Handbook, Vol. 2, ed. by A. Lüdeling and M. Kytö, 933–951. Berlin/New York: Mouton de Gruyter.
2003 “Collostructions: Investigating the interaction of words and constructions.” International Journal of Corpus Linguistics 8(2): 209–243.
Stubbs, M.
2001a “On inference theories and code theories: Corpus evidence for semantic schemas.” Text 21(3): 437–465.
2001b Words and Phrases: Corpus Studies of Lexical Semantics. Oxford: Blackwell.
1996 Text and Corpus Analysis: Computer-assisted Studies of Language and Culture. Oxford: Blackwell.
1995a “Collocations and semantic profiles: On the cause of the trouble with quantitative studies.” Functions of Language 2(1): 23–55.
1995b “Corpus evidence for norms of lexical collocation.” In Principle and Practice in Applied Linguistics, ed. by G. Cook and B. Seidlhofer, 245–256. Oxford: Oxford University Press.
Taylor, J.R.
2003 “Near synonyms as co-extensive categories: ‘high’ and ‘tall’ revisited.” Language Sciences 25(3): 263–284.
Tognini-Bonelli, E.
2001 Corpus Linguistics at Work. Amsterdam: John Benjamins.
Wiechmann, D.
2008 “On the computation of collostruction strength: Testing measures of association as expressions of lexical bias.” Corpus Linguistics and Linguistic Theory 4(2): 253–290.
Zipf, G.K.
1935 The Psycho-biology of Language. Boston: Houghton Mifflin.