We present an experiment where natural language processing tools are used to automatically identify potential constructions in a corpus. The experiment was conducted as part of the ongoing efforts to develop a Swedish constructicon. Using an automatic method to suggest constructions has advantages not only for efficiency but also methodologically: it forces the analyst to look more objectively at the constructions actually occurring in corpora, as opposed to focusing on “interesting” constructions only. As a heuristic for identifying potential constructions, the method has proved successful, yielding about 200 (out of 1,200) highly relevant construction candidates.
2011) Distributional semantics and compositionality 2011: Shared task description and results. In Proceedings of the workshop on distributional semantics and compositionality (pp. 21–28). Portland: ACL.
Boas, H. C. (
in press). Zur architektur einer konstruktionsbasierten grammatik der deutchen. In A. Ziem & A. Lasch (Eds.), Grammatik als Netzwerk von konstruktionen? Sprachliches wissen im fokus der konstruktionsgrammatik. Berlin/New York: de Gruyter.
Borin, L., Dannélls, D., Forsberg, M., Gronostaj, M.T., & Kokkinakis, D. (
2010) The past meets the present in Swedish FrameNet++. In 14th EURALEX international congress (pp. 269–281). Leeuwarden: EURALEX.
Bybee, J. (
2013) Usage-based theory and exemplar representations. In Hoffmann & Trousdale (Eds.), pp. 49–69.
Bäckström, L., Borin, L., Forsberg, M., Lyngfelt, B., Prentice, J., & Sköldberg, E. (
2013) Automatic identification of construction candidates for a Swedish constructicon. In Proceedings of the workshop on lexical semantic resources for NLP at NODALIDA 2013 (pp. 2–12 ). NEALT Proceedings Series 19.
Bäckström, L., Lyngfelt, B., & Sköldberg, E. (
this issue). Towards interlingual constructicography. On correspondence between constructicon resources for English and Swedish.
Church, K. W., & Hanks, P. (
1990) Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.
Croft, W. (
2003) Lexical rules vs. constructions: A false dichotomy. In H. Cuyckens, T. Berg, R. Dirven & K.-U. Panther (Eds.), Motivation in language: Studies in honour of Günter Radden (pp. 49–68). Amsterdam: John Benjamins.
Culicover, P. W., & Jackendoff, R. (
2005) Simpler syntax. Oxford: Oxford University Press.
Evert, S. (
2005) The statistics of word cooccurrences: Word pairs and collocations. Dissertation, Institut für maschinelle Sprachverarbeitung, University of Stuttgart.
Fano, R. M. (
1961) Transmission of information: A statistical theory of communications. New York: MIT Press.
Fillmore, C. J. (
2008) Border conflicts: FrameNet meets construction grammar. In E. Bernal & J. DeCesaris (Eds.), Proceedings of the XIII EURALEX international congress (pp. 49–68). Barcelona: Universitat Pompeu Fabra.
Fillmore, C. J., Lee-Goldman, R., & Rhomieux, R. (
2012) The FrameNet constructicon. In H. Boas & I. Sag (Eds.), Sign-based construction grammar (pp. 309–372). Stanford: CSLI.
Friberg Heppin, K., & Toporowska Gronostaj, M. (
this issue). Exploiting FrameNet for Swedish: Mismatch?
Goldberg, A. E. (
2006) Constructions at work. The nature of generalization in language. Oxford & New York: Oxford University Press.
Goldberg, A. E. (
2013) Constructionist approaches. In Hoffmann & Trousdale (Eds.), pp. 15–31.
Gustafson-Čapková, S., & Hartmann, B. (
2006) Manual of the Stockholm Umeå corpus version 2.0. Stockholm University.
Halácsy, P., Kornai, A., & Oravecz, C. (
2007) HunPos – an open source trigram tagger. In Proceedings of the 45th annual meeting of the Association for Computational Linguistics companion volume: Proceedings of the demo and poster sessions (pp. 209–212). Prague: ACL.
Hilpert, M. (
2013) Constructional change in English. developments in allomorphy, word formation, and syntax. Cambridge & New York: Cambridge University Press.
Hilpert, M. (
to appear). Construction grammar and its application to English. Edinburgh: Edinburgh University Press.
Hoffmann, T., & Trousdale, G. (
Eds.) (2013) The Oxford handbook of construction grammar. Oxford & New York: Oxford University Press.
Hutchinson, L. G. (
1974) Grammar as theory. In D. Cohen (Ed.), Explaining linguistic phenomena (pp. 43–73). New York, etc.: Wiley.
Jackendoff, R. (
2002) Foundations of language. Brain, meaning, grammar, evolution. Oxford & New York: Oxford University Press.
Jiang, C., Coenen, F., & Zito, M. (
2013) A survey of frequent subgraph mining algorithms. The Knowledge Engineering Review 281, 75–105.
Kay, P. (
2013) The limits of (construction) grammar. In Hoffmann & Trousdale (Eds.), pp. 32–48.
Kilgarriff, A., & Tugwell, D. (
2002) Sketching words. In M.-H. Corréard (Ed.), Lexicography and natural language processing: A Festschrift in honour of B. T. S. Atkins (pp. 125–137). EURALEX.
Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryiğit, G., Kübler, S., Marinov, S., & Marsi, E. (
2007) MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(2), 95–135.
2008) Cultivating a Swedish treebank. In J. Nivre, M. Dahllöf & B. Megyesi (Eds.), Resourceful language technology: Festschrift in honor of Anna Sågvall Hein (pp. 111–120). Acta Universitatis Upsaliensis: Studia Linguistica Upsaliensia 7.
Ohara, K. (
2013) Toward constructicon building for Japanese in Japanese FrameNet. Veredas, 17(1), 11–27.
Pecina, P. (
2010) Lexical association measures and collocation extraction. Language Resources and Evaluation, 441, 137–158.
Svanlund, J. (
2002) Lexikalisering [Lexicalization]. Språk och stil, 121, 7–45.
Torrent, T. T., Lage, L. M., Sampaio, T. F., Tavares, T., & Matos, E. (
this issue). Revisiting border conflicts between FrameNet and construction grammar: Annotation policies for the Brazilian Portuguese Constructicon.
Van de Cruys, T. (
2011) Two multivariate generalizations of pointwise mutual information. In Proceedings of the workshop on distributional semantics and compositionality (pp. 16–20). Portland: ACL.
Wible, D., & Tsao, N.-L. (
2010) StringNet as a computational resource for discovering and investigating linguistic constructions. In Proceedings of the NAACL HLT workshop on extracting and using constructions in computational linguistics (pp. 25–31). Los Angeles: ACL.
Wray, A. (
2008) Formulaic language: Pushing the boundaries. Oxford: Oxford University Press.
Ziem A., Boas. H. C., & Ruppenhofer J. (
in press). Semantische frames und grammatische konstruktionen für die textanalyse. In J. Hagemann & S. Staffeld (Eds.), Syntaxtheorien. Vergleichende analysen. Tübingen: Staffenburg.
Cited by (16)
Cited by 16 other publications
Cappelle, Bert
2024. Can Construction Grammar Be Proven Wrong?,
Yan, Hengbin & Yinghui Li
2024. Constraction: a tool for the automatic extraction and interactive exploration of linguistic constructions. Linguistics Vanguard 9:1 ► pp. 215 ff.
Coussé, Evie, Steffen Höder, Benjamin Lyngfelt & Julia Prentice
Antònia Martí, Maria, Mariona Taulé, Venelin Kovatchev & Maria Salamó
2021. DISCOver: DIStributional approach based on syntactic dependencies for discovering COnstructions. Corpus Linguistics and Linguistic Theory 17:2 ► pp. 491 ff.
This list is based on CrossRef data as of 1 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.