Automatic expansion of the Swedish FrameNet lexicon
Comparing and combining lexicon-based and corpus-based methods
We evaluate several lexicon-based and corpus-based methods for automatically inducing new lexical units for the Swedish FrameNet, and find that the best-performing setup combines both types of methods. A particular challenge for Swedish is the absence of a lexical resource such as WordNet; however, we show that the semantic network SALDO, which is organized according to lexicographical principles quite different from those of WordNet, is very useful for our purposes.