We present an experiment where natural language processing tools are used to automatically identify potential constructions in a corpus. The experiment was conducted as part of the ongoing efforts to develop a Swedish constructicon. Using an automatic method to suggest constructions has advantages not only for efficiency but also methodologically: it forces the analyst to look more objectively at the constructions actually occurring in corpora, as opposed to focusing on “interesting” constructions only. As a heuristic for identifying potential constructions, the method has proved successful, yielding about 200 (out of 1,200) highly relevant construction candidates.
2011) Distributional semantics and compositionality 2011: Shared task description and results. In Proceedings of the workshop on distributional semantics and compositionality (pp. 21–28). Portland: ACL.
Boas, H. C. (
in press). Zur architektur einer konstruktionsbasierten grammatik der deutchen. In A. Ziem & A. Lasch (Eds.), Grammatik als Netzwerk von konstruktionen? Sprachliches wissen im fokus der konstruktionsgrammatik. Berlin/New York: de Gruyter.
Borin, L., Dannélls, D., Forsberg, M., Gronostaj, M.T., & Kokkinakis, D. (
2010) The past meets the present in Swedish FrameNet++. In 14th EURALEX international congress (pp. 269–281). Leeuwarden: EURALEX.
Bybee, J. (
2013) Usage-based theory and exemplar representations. In Hoffmann & Trousdale (Eds.), pp. 49–69.
Bäckström, L., Borin, L., Forsberg, M., Lyngfelt, B., Prentice, J., & Sköldberg, E. (
2013) Automatic identification of construction candidates for a Swedish constructicon. In Proceedings of the workshop on lexical semantic resources for NLP at NODALIDA 2013 (pp. 2–12 ). NEALT Proceedings Series 19.
Bäckström, L., Lyngfelt, B., & Sköldberg, E. (
this issue). Towards interlingual constructicography. On correspondence between constructicon resources for English and Swedish.
Church, K. W., & Hanks, P. (
1990) Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.
Croft, W. (
2003) Lexical rules vs. constructions: A false dichotomy. In H. Cuyckens, T. Berg, R. Dirven & K.-U. Panther (Eds.), Motivation in language: Studies in honour of Günter Radden (pp. 49–68). Amsterdam: John Benjamins.
Culicover, P. W., & Jackendoff, R. (
2005) Simpler syntax. Oxford: Oxford University Press.
Evert, S. (
2005) The statistics of word cooccurrences: Word pairs and collocations. Dissertation, Institut für maschinelle Sprachverarbeitung, University of Stuttgart.
Fano, R. M. (
1961) Transmission of information: A statistical theory of communications. New York: MIT Press.
Fillmore, C. J. (
2008) Border conflicts: FrameNet meets construction grammar. In E. Bernal & J. DeCesaris (Eds.), Proceedings of the XIII EURALEX international congress (pp. 49–68). Barcelona: Universitat Pompeu Fabra.
Fillmore, C. J., Lee-Goldman, R., & Rhomieux, R. (
2012) The FrameNet constructicon. In H. Boas & I. Sag (Eds.), Sign-based construction grammar (pp. 309–372). Stanford: CSLI.
Friberg Heppin, K., & Toporowska Gronostaj, M. (
this issue). Exploiting FrameNet for Swedish: Mismatch?
Goldberg, A. E. (
2006) Constructions at work. The nature of generalization in language. Oxford & New York: Oxford University Press.
Goldberg, A. E. (
2013) Constructionist approaches. In Hoffmann & Trousdale (Eds.), pp. 15–31.
Gustafson-Čapková, S., & Hartmann, B. (
2006) Manual of the Stockholm Umeå corpus version 2.0. Stockholm University.
Halácsy, P., Kornai, A., & Oravecz, C. (
2007) HunPos – an open source trigram tagger. In Proceedings of the 45th annual meeting of the Association for Computational Linguistics companion volume: Proceedings of the demo and poster sessions (pp. 209–212). Prague: ACL.
Hilpert, M. (
2013) Constructional change in English. developments in allomorphy, word formation, and syntax. Cambridge & New York: Cambridge University Press.
Hilpert, M. (
to appear). Construction grammar and its application to English. Edinburgh: Edinburgh University Press.
Hoffmann, T., & Trousdale, G. (
Eds.) (2013) The Oxford handbook of construction grammar. Oxford & New York: Oxford University Press.
Hutchinson, L. G. (
1974) Grammar as theory. In D. Cohen (Ed.), Explaining linguistic phenomena (pp. 43–73). New York, etc.: Wiley.
Jackendoff, R. (
2002) Foundations of language. Brain, meaning, grammar, evolution. Oxford & New York: Oxford University Press.
Jiang, C., Coenen, F., & Zito, M. (
2013) A survey of frequent subgraph mining algorithms. The Knowledge Engineering Review 281, 75–105.
Kay, P. (
2013) The limits of (construction) grammar. In Hoffmann & Trousdale (Eds.), pp. 32–48.
Kilgarriff, A., & Tugwell, D. (
2002) Sketching words. In M.-H. Corréard (Ed.), Lexicography and natural language processing: A Festschrift in honour of B. T. S. Atkins (pp. 125–137). EURALEX.
Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryiğit, G., Kübler, S., Marinov, S., & Marsi, E. (
2007) MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(2), 95–135.
2008) Cultivating a Swedish treebank. In J. Nivre, M. Dahllöf & B. Megyesi (Eds.), Resourceful language technology: Festschrift in honor of Anna Sågvall Hein (pp. 111–120). Acta Universitatis Upsaliensis: Studia Linguistica Upsaliensia 7.
Ohara, K. (
2013) Toward constructicon building for Japanese in Japanese FrameNet. Veredas, 17(1), 11–27.
Pecina, P. (
2010) Lexical association measures and collocation extraction. Language Resources and Evaluation, 441, 137–158.
Svanlund, J. (
2002) Lexikalisering [Lexicalization]. Språk och stil, 121, 7–45.
Torrent, T. T., Lage, L. M., Sampaio, T. F., Tavares, T., & Matos, E. (
this issue). Revisiting border conflicts between FrameNet and construction grammar: Annotation policies for the Brazilian Portuguese Constructicon.
Van de Cruys, T. (
2011) Two multivariate generalizations of pointwise mutual information. In Proceedings of the workshop on distributional semantics and compositionality (pp. 16–20). Portland: ACL.
Wible, D., & Tsao, N.-L. (
2010) StringNet as a computational resource for discovering and investigating linguistic constructions. In Proceedings of the NAACL HLT workshop on extracting and using constructions in computational linguistics (pp. 25–31). Los Angeles: ACL.
Wray, A. (
2008) Formulaic language: Pushing the boundaries. Oxford: Oxford University Press.
Ziem A., Boas. H. C., & Ruppenhofer J. (
in press). Semantische frames und grammatische konstruktionen für die textanalyse. In J. Hagemann & S. Staffeld (Eds.), Syntaxtheorien. Vergleichende analysen. Tübingen: Staffenburg.
Cited by
Cited by 16 other publications
Antònia Martí, Maria, Mariona Taulé, Venelin Kovatchev & Maria Salamó
2021. DISCOver: DIStributional approach based on syntactic dependencies for discovering COnstructions. Corpus Linguistics and Linguistic Theory 17:2 ► pp. 491 ff.
2024. Constraction: a tool for the automatic extraction and interactive exploration of linguistic constructions. Linguistics Vanguard 9:1 ► pp. 215 ff.
This list is based on CrossRef data as of 27 march 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.