6027522 03 01 01 JB code JB John Benjamins Publishing Company 01 JB code NLP 14 Eb 15 9789027258489 06 10.1075/nlp.14 13 2021042005 00 EA E107 Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 01 http://creativecommons.org/licenses/by-nc-nd/4.0/ 10 01 JB code NLP 02 1567-8202 02 14.00 01 02 Natural Language Processing Natural Language Processing 11 01 JB code jbe-openaccess 01 02 Open Access Books (ca. 70 titles) 11 01 JB code jbe-all 01 02 Full EBA collection (ca. 4,200 titles) 11 01 JB code jbe-eba-2023 01 02 Compact EBA Collection 2023 (ca. 700 titles, starting 2018) 11 01 JB code jbe-eba-2024 01 02 Compact EBA Collection 2024 (ca. 600 titles, starting 2019) 11 01 JB code jbe-2021 01 02 2021 collection (118 titles) 11 01 JB code jbe.2021.all 01 01 The Swedish FrameNet++ Harmonization, integration, method development and practical language technology applications The Swedish FrameNet++: Harmonization, integration, method development and practical language technology applications 1 B01 01 JB code 609426403 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/609426403 2 B01 01 JB code 723426404 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/723426404 3 B01 01 JB code 421426405 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin HeppiLing AB 07 https://benjamins.com/catalog/persons/421426405 01 eng 11 347 03 03 xiv 03 00 333 03 01 23/eng/20211018 439.73028 03 2021 PD5611 04 Swedish language--Lexicography. 04 Lexicography--Data processing. 04 Corpora (Linguistics) 04 Computational linguistics. 10 LAN009000 12 CFX 24 JB code LIN.COMPUT Computational & corpus linguistics 24 JB code LIN.CORP Corpus linguistics 24 JB code LIN.GERM Germanic linguistics 01 06 02 00 Large computational lexicons are central NLP resources. Swedish FrameNet++ aims to be a versatile full-scale lexical resource for NLP containing many kinds of linguistic information. 03 00 Large computational lexicons are central NLP resources. Swedish FrameNet++ aims to be a versatile full-scale lexical resource for NLP containing many kinds of linguistic information. Although focused on Swedish, this ongoing effort, which includes building a new Swedish framenet and recycling existing lexicons, has offered valuable insights into general aspects of lexical-resource building for NLP, which are discussed in this book: computational and linguistic problems of lexical semantics and lexical typology, the nature of lexical items (words and multiword expressions), achieving interoperability among heterogeneous lexical content, NLP methods for extending and interlinking existing lexicons, and deploying the new resource in practical NLP applications. This book is targeted at everyone with an interest in lexicography, computational lexicography, lexical typology, lexical semantics, linguistics, computational linguistics and related fields. We believe it should be of particular interest to those who are or have been involved in language resource creation, development and evaluation. 01 00 03 01 01 D503 https://benjamins.com/covers/475/nlp.14.png 01 01 D502 https://benjamins.com/covers/475_jpg/9789027209900.jpg 01 01 D504 https://benjamins.com/covers/475_tif/9789027209900.tif 01 01 D503 https://benjamins.com/covers/1200_front/nlp.14.hb.png 01 01 D503 https://benjamins.com/covers/125/nlp.14.png 02 00 03 01 01 D503 https://benjamins.com/covers/1200_back/nlp.14.hb.png 03 00 03 01 01 D503 https://benjamins.com/covers/3d_web/nlp.14.hb.png 01 01 JB code nlp.14.loa 06 10.1075/nlp.14.loa vii viii 2 Miscellaneous 1 01 04 Acronyms Acronyms 01 eng 01 01 JB code nlp.14.glossary 06 10.1075/nlp.14.glossary ix x 2 Miscellaneous 2 01 04 Abbreviations Abbreviations 01 eng 01 01 JB code nlp.14.pre 06 10.1075/nlp.14.pre xi xiv 4 Miscellaneous 3 01 04 Preface Preface 01 eng 01 01 JB code nlp.14.p1 06 10.1075/nlp.14.p1 4 65 62 Section header 4 01 04 Part I. Introduction and background Part I. Introduction and background 01 eng 01 01 JB code nlp.14.01bor 06 10.1075/nlp.14.01bor 3 36 34 Chapter 5 01 04 Chapter 1. Introduction Chapter 1. Introduction 01 04 Swedish FrameNet++ Swedish FrameNet++ 1 A01 01 JB code 889433613 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/889433613 2 A01 01 JB code 46433614 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/46433614 3 A01 01 JB code 275433615 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/275433615 01 eng 30 00

The Swedish FrameNet++ was designed to be several things. As a digital artifact, it is an integrated panchronic lexical macroresource, primarily for Swedish, but including several other languages, intended as a basic infrastructural component in Swedish language technology research and for developing natural language processing applications. As an activity, it is a long-term R&D initiative, initially aimed at bringing about this macroresource, and now at maintaining and extending it, at promoting its use in language technology research and application development, as well as ensuring that the results of this research and development in their turn are incorporated in the macroresource. As a product of research, it reflects both computational and linguistic approaches to lexicology, lexical semantics, and lexical typology.

01 01 JB code nlp.14.02dan 06 10.1075/nlp.14.02dan 37 66 30 Chapter 6 01 04 Chapter 2. Swedish FrameNet Chapter 2. Swedish FrameNet 1 A01 01 JB code 979433616 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/979433616 2 A01 01 JB code 156433617 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/156433617 3 A01 01 JB code 630433618 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/630433618 4 A01 01 JB code 843433619 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/843433619 5 A01 01 JB code 3433620 Maria Toporowska Gronostaj Gronostaj, Maria Toporowska Maria Toporowska Gronostaj University of Gothenburg 07 https://benjamins.com/catalog/persons/3433620 01 eng 30 00

This chapter describes the development of Swedish FrameNet. A new framenet project often follows one of two methodological approaches: (1) extension, through translation of a different-language – often English – framenet into the target language, and (2) merging, where the resource is built from scratch in the target language. Both approaches have their pros and cons, which have been extensively discussed in the literature. Swedish FrameNet is mainly developed through the extension approach, although balanced with the merging approach. Drawing on the two approaches simultaneously, we describe how integrated language resources and tools have been exploited to create and develop Swedish FrameNet: how it was constructed, what it contains, and the basic assumptions underlying the annotation of its contents.

01 01 JB code nlp.14.p2 06 10.1075/nlp.14.p2 70 165 96 Section header 7 01 04 Part II. Harmonization and integration Part II. Harmonization and integration 01 eng 01 01 JB code nlp.14.03bor 06 10.1075/nlp.14.03bor 69 96 28 Chapter 8 01 04 Chapter 3. Swedish FrameNet++ - lexical samsara Chapter 3. Swedish FrameNet++ – lexical samsara 1 A01 01 JB code 723433621 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/723433621 2 A01 01 JB code 932433622 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/932433622 3 A01 01 JB code 345433623 Lennart Lönngren Lönngren, Lennart Lennart Lönngren Arctic University of Norway 07 https://benjamins.com/catalog/persons/345433623 4 A01 01 JB code 564433624 Niklas Zechner Zechner, Niklas Niklas Zechner University of Gothenburg 07 https://benjamins.com/catalog/persons/564433624 01 eng 30 00

One of the main goals of the Swedish FrameNet++ initiative is to recycle and include as many existing modern Swedish lexical resources as possible into one unified lexical macroresource useful for automatic language processing. In this chapter we describe the structure of Saldo, the central resource of Swedish FrameNet++, the design of the formal interlinking mechanism keeping the lexical macroresource together, and our work on Swesaurus, a Swedish wordnet, and a Swedish Roget-style thesaurus as components of Swedish FrameNet++.

01 01 JB code nlp.14.04ade 06 10.1075/nlp.14.04ade 97 122 26 Chapter 9 01 04 Chapter 4. A lexical resource for computational historical linguistics Chapter 4. A lexical resource for computational historical linguistics 1 A01 01 JB code 239433625 Yvonne Adesam Adesam, Yvonne Yvonne Adesam University of Gothenburg 07 https://benjamins.com/catalog/persons/239433625 2 A01 01 JB code 435433626 Peter Andersson Lilja Lilja, Peter Andersson Peter Andersson Lilja University of Gothenburg/University of Borås 07 https://benjamins.com/catalog/persons/435433626 3 A01 01 JB code 667433627 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/667433627 4 A01 01 JB code 103433628 Gerlof Bouma Bouma, Gerlof Gerlof Bouma University of Gothenburg 07 https://benjamins.com/catalog/persons/103433628 01 eng 30 00

In this chapter we present the diachronic dimension of Swedish FrameNet++. We describe the historical lexical resources currently available for Swedish, linked to the Contemporary Swedish lexicon Saldo. We present a case study of how interlinking the dictionaries simultaneously allows us to study lexical change. We also present a method of linking text words to lexicon entries, facilitating interactive exploration of historical texts. Diachronical language resources present both a high-variation challenge from a wider language technology perspective, and an interesting object of linguistic study. While a number of improvements of the parts of the diachronic lexical macroresource are still needed, this resource is invaluable for analysing and accessing historical texts, as well as for both synchronic historical and diachronic lexical studies.

01 01 JB code nlp.14.05lin 06 10.1075/nlp.14.05lin 123 138 16 Chapter 10 01 04 Chapter 5. A multilingual net of lexical resources Chapter 5. A multilingual net of lexical resources 1 A01 01 JB code 571433629 Krister Lindén Lindén, Krister Krister Lindén University of Helsinki 07 https://benjamins.com/catalog/persons/571433629 2 A01 01 JB code 776433630 Jyrki Niemi Niemi, Jyrki Jyrki Niemi University of Helsinki 07 https://benjamins.com/catalog/persons/776433630 3 A01 01 JB code 205433631 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/205433631 4 A01 01 JB code 421433632 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/421433632 5 A01 01 JB code 896433633 Bolette S. Pedersen Pedersen, Bolette S. Bolette S. Pedersen University of Copenhagen 07 https://benjamins.com/catalog/persons/896433633 6 A01 01 JB code 67433634 Sanni Nimb Nimb, Sanni Sanni Nimb The Society for Danish Language and Literature 07 https://benjamins.com/catalog/persons/67433634 7 A01 01 JB code 292433635 Heili Orav Orav, Heili Heili Orav University of Tartu 07 https://benjamins.com/catalog/persons/292433635 8 A01 01 JB code 767433636 Neeme Kahusk Kahusk, Neeme Neeme Kahusk University of Tartu 07 https://benjamins.com/catalog/persons/767433636 9 A01 01 JB code 992433637 Kadri Vider Vider, Kadri Kadri Vider University of Tartu 07 https://benjamins.com/catalog/persons/992433637 01 eng 30 00

In this chapter, we explore how to develop and encode the relationship between wordnets for different languages using some Nordic and Baltic wordnets to illustrate the variety of approaches. We also briefly touch on how these wordnets have been enhanced or augmented with various types of lexical information, such as framenet frames as well as syntagmatic and sentiment information.

01 01 JB code nlp.14.06bor 06 10.1075/nlp.14.06bor 139 166 28 Chapter 11 01 04 Chapter 6. Swedish FrameNet++ and comparative linguistics Chapter 6. Swedish FrameNet++ and comparative linguistics 1 A01 01 JB code 610433638 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/610433638 2 A01 01 JB code 843433639 Anju Saxena Saxena, Anju Anju Saxena Uppsala University 07 https://benjamins.com/catalog/persons/843433639 3 A01 01 JB code 20433640 Shafqat Mumtaz Virk Virk, Shafqat Mumtaz Shafqat Mumtaz Virk University of Gothenburg 07 https://benjamins.com/catalog/persons/20433640 4 A01 01 JB code 501433641 Bernard Comrie Comrie, Bernard Bernard Comrie University of California 07 https://benjamins.com/catalog/persons/501433641 01 eng 30 00

In this chapter we describe a multilingual extension of Swedish FrameNet++, intended to address research questions of a broad comparative nature, in genealogical, areal and typological linguistics, focusing on the integration into Swedish FrameNet++ of so-called core vocabularies, used in several linguistic subfields in order to conduct massive comparative studies involving large numbers of languages. Specifically, we describe the inclusion of two such lexical databases covering several hundred South Asian languages, with the aim of investigating areal and genealogical connections among these languages.

01 01 JB code nlp.14.p3 06 10.1075/nlp.14.p3 170 259 90 Section header 12 01 04 Part III. Method development Part III. Method development 01 eng 01 01 JB code nlp.14.07joh 06 10.1075/nlp.14.07joh 169 190 22 Chapter 13 01 04 Chapter 7. NLP for resource building Chapter 7. NLP for resource building 1 A01 01 JB code 959433642 Richard Johansson Johansson, Richard Richard Johansson University of Gothenburg 07 https://benjamins.com/catalog/persons/959433642 01 eng 30 00

We evaluate several lexicon-based and corpus-based methods to automatically induce new lexical units for Swedish FrameNet, and we see that the best-performing setup uses a combination of both types of methods. A particular challenge for Swedish is the absence of a lexical resource such as WordNet; however, we show that the semantic network Saldo, which is organized according to lexicographical principles quite different from those of WordNet, is very useful for our purposes.

01 01 JB code nlp.14.08fri 06 10.1075/nlp.14.08fri 191 220 30 Chapter 14 01 04 Chapter 8. Differing design decisions - comparing Swedish FrameNet to FrameNet Chapter 8. Differing design decisions – comparing Swedish FrameNet to FrameNet 1 A01 01 JB code 613433643 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent Researcher 07 https://benjamins.com/catalog/persons/613433643 01 eng 30 00

Creation of framenets for languages other than English based on Berkeley FrameNet has tested the hypothesis that semantic frames, to a certain extent, are language independent. This working hypothesis facilitated reuse of frames for new framenets, defining language specific frame evoking lemmas and annotating language specific sentences. The caveat is the bias towards creating what is possible, rather than typical, in a language. The reuse of frames allowed developing SweFN in a relatively short period of time. However, the goal to build a typical, not a possible Swedish framenet, necessitated some frame modifications.

This chapter provides a comparison between the English and Swedish framenets regarding semantic annotation and representation, and socio-cultural factors, including how differences forced modification of the original structure.

01 01 JB code nlp.14.09bor 06 10.1075/nlp.14.09bor 221 260 40 Chapter 15 01 04 Chapter 9. Multiword expressions - a tough typological nut for Swedish FrameNet++ Chapter 9. Multiword expressions – a tough typological nut for Swedish FrameNet++ 1 A01 01 JB code 26433644 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/26433644 01 eng 30 00

Multiword expressions have attracted much attention in language technology over the last two decades or so, and in general linguistics, the interest in phraseology – which includes the linguistic study of multiword expressions – goes back much further. In our work on the multilingual components of Swedish FrameNet++, we have strived to adopt a typologically informed view on multiword expressions. This raises a number of theoretical and methodological questions, some of which are discussed in this chapter.

01 01 JB code nlp.14.p4 06 10.1075/nlp.14.p4 264 329 66 Section header 16 01 04 Part IV. Natural language processing applications Part IV. Natural language processing applications 01 eng 01 01 JB code nlp.14.10joh 06 10.1075/nlp.14.10joh 263 280 18 Chapter 17 01 04 Chapter 10. Semantic role labeling Chapter 10. Semantic role labeling 1 A01 01 JB code 760433645 Richard Johansson Johansson, Richard Richard Johansson University of Gothenburg 07 https://benjamins.com/catalog/persons/760433645 2 A01 01 JB code 992433646 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/992433646 3 A01 01 JB code 159433647 Dimitrios Kokkinakis Kokkinakis, Dimitrios Dimitrios Kokkinakis University of Gothenburg 07 https://benjamins.com/catalog/persons/159433647 01 eng 30 00

We investigate the feasibility of automatic semantic role labeling (SRL) using Swedish FrameNet (SweFN). In the first part of the chapter, we describe a baseline system using a traditional division into segmentation and labeling steps. These subsystems are implemented as separate machine learning models, and we explore a wide range of syntactic and lexical features for these models. In the second part, we turn to the question of how the frame-to-frame relations defined in FrameNet allow us to use the annotated examples more effectively. The cross-frame generalization methods reduce the number of errors made by the labeling classifier by 27%. For previously unseen frames, the reduction is even more significant: 50%.

01 01 JB code nlp.14.11dan 06 10.1075/nlp.14.11dan 281 302 22 Chapter 18 01 04 Chapter 11. Computational representation of FrameNet for multilingual natural language generation Chapter 11. Computational representation of FrameNet for multilingual natural language generation 1 A01 01 JB code 883433648 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/883433648 2 A01 01 JB code 60433649 Normunds Grūzītis Grūzītis, Normunds Normunds Grūzītis IMCS, University of Latvia 07 https://benjamins.com/catalog/persons/60433649 01 eng 30 00

Multilingual natural language generation, the process of producing written or spoken utterances in parallel languages from either structured or unstructured representations requires large amounts of syntactic and semantic information to generate an expression that is tailored to the target audience. This information is offered by FrameNet-like resources, which have been developed for a number of languages. In this chapter, we present a computational FrameNet grammar resource for multilingual natural language generation. We compare between English and Swedish framenets to illustrate how these can be unified under a shared computational representation using Grammatical Framework. We demonstrate how the grammar was exploited in two practical multilingual natural language generation applications to facilitate tourist communication and empower museum users with coherent artwork descriptions.

01 01 JB code nlp.14.12pre 06 10.1075/nlp.14.12pre 303 330 28 Chapter 19 01 04 Chapter 12. Language learning and teaching with Swedish FrameNet++ Chapter 12. Language learning and teaching with Swedish FrameNet++ 01 04 Two examples Two examples 1 A01 01 JB code 720433650 Julia Prentice Prentice, Julia Julia Prentice University of Gothenburg 07 https://benjamins.com/catalog/persons/720433650 2 A01 01 JB code 938433651 Camilla Håkansson Håkansson, Camilla Camilla Håkansson University of Gothenburg 07 https://benjamins.com/catalog/persons/938433651 3 A01 01 JB code 365433652 Therese Lindström Tiedemann Lindström Tiedemann, Therese Therese Lindström Tiedemann University of Helsinki 07 https://benjamins.com/catalog/persons/365433652 4 A01 01 JB code 577433653 Ildikó Pilán Pilán, Ildikó Ildikó Pilán Norwegian Computing Center 07 https://benjamins.com/catalog/persons/577433653 5 A01 01 JB code 789433654 Elena Volodina Volodina, Elena Elena Volodina University of Gothenburg 07 https://benjamins.com/catalog/persons/789433654 01 eng 30 00

This chapter describes and discusses the use of resources connected to Swedish FrameNet++ (SweFN++) in the context of the teaching and learning of language proficiency and grammatical analysis in Swedish. We illustrate the way in which different resources in the SweFN++ context can be useful for language pedagogy, by employing two examples, the Swedish Constructicon and a semantic role exercise on the intelligent computer assisted language learning (ICALL) platform Lärka. These resources make use of the infrastructure developed within SweFN++ in fundamentally different ways, which are discussed and compared. In addition, we discuss the possibilities for further development of the language pedagogical potential of SweFN++, both in relation to ICALL and to other types of resources and descriptive databases, like corpora, constructicons and framenets.

01 01 JB code nlp.14.ind 06 10.1075/nlp.14.ind 331 333 3 Miscellaneous 20 01 04 Index Index 01 eng
01 JB code JBENJAMINS John Benjamins Publishing Company 01 01 JB code JB John Benjamins Publishing Company 01 https://benjamins.com 02 https://benjamins.com/catalog/nlp.14 Amsterdam NL 00 John Benjamins Publishing Company Marketing Department / Karin Plijnaar, Pieter Lamers onix@benjamins.nl 04 01 00 20211126 C 2021 John Benjamins D 2021 John Benjamins 02 WORLD 13 15 9789027209900 WORLD 09 01 JB 3 John Benjamins e-Platform 03 https://jbe-platform.com 29 https://jbe-platform.com/content/books/9789027258489 21 01
738027521 03 01 01 JB code JB John Benjamins Publishing Company 01 JB code NLP 14 Hb 15 9789027209900 06 10.1075/nlp.14 13 2021042004 00 BB 08 765 gr 10 01 JB code NLP 02 1567-8202 02 14.00 01 02 Natural Language Processing Natural Language Processing 01 01 The Swedish FrameNet++ Harmonization, integration, method development and practical language technology applications The Swedish FrameNet++: Harmonization, integration, method development and practical language technology applications 1 B01 01 JB code 609426403 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/609426403 2 B01 01 JB code 723426404 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/723426404 3 B01 01 JB code 421426405 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin HeppiLing AB 07 https://benjamins.com/catalog/persons/421426405 01 eng 11 347 03 03 xiv 03 00 333 03 01 23/eng/20211018 439.73028 03 2021 PD5611 04 Swedish language--Lexicography. 04 Lexicography--Data processing. 04 Corpora (Linguistics) 04 Computational linguistics. 10 LAN009000 12 CFX 24 JB code LIN.COMPUT Computational & corpus linguistics 24 JB code LIN.CORP Corpus linguistics 24 JB code LIN.GERM Germanic linguistics 01 06 02 00 Large computational lexicons are central NLP resources. Swedish FrameNet++ aims to be a versatile full-scale lexical resource for NLP containing many kinds of linguistic information. 03 00 Large computational lexicons are central NLP resources. Swedish FrameNet++ aims to be a versatile full-scale lexical resource for NLP containing many kinds of linguistic information. Although focused on Swedish, this ongoing effort, which includes building a new Swedish framenet and recycling existing lexicons, has offered valuable insights into general aspects of lexical-resource building for NLP, which are discussed in this book: computational and linguistic problems of lexical semantics and lexical typology, the nature of lexical items (words and multiword expressions), achieving interoperability among heterogeneous lexical content, NLP methods for extending and interlinking existing lexicons, and deploying the new resource in practical NLP applications. This book is targeted at everyone with an interest in lexicography, computational lexicography, lexical typology, lexical semantics, linguistics, computational linguistics and related fields. We believe it should be of particular interest to those who are or have been involved in language resource creation, development and evaluation. 01 00 03 01 01 D503 https://benjamins.com/covers/475/nlp.14.png 01 01 D502 https://benjamins.com/covers/475_jpg/9789027209900.jpg 01 01 D504 https://benjamins.com/covers/475_tif/9789027209900.tif 01 01 D503 https://benjamins.com/covers/1200_front/nlp.14.hb.png 01 01 D503 https://benjamins.com/covers/125/nlp.14.png 02 00 03 01 01 D503 https://benjamins.com/covers/1200_back/nlp.14.hb.png 03 00 03 01 01 D503 https://benjamins.com/covers/3d_web/nlp.14.hb.png 01 01 JB code nlp.14.loa 06 10.1075/nlp.14.loa vii viii 2 Miscellaneous 1 01 04 Acronyms Acronyms 01 eng 01 01 JB code nlp.14.glossary 06 10.1075/nlp.14.glossary ix x 2 Miscellaneous 2 01 04 Abbreviations Abbreviations 01 eng 01 01 JB code nlp.14.pre 06 10.1075/nlp.14.pre xi xiv 4 Miscellaneous 3 01 04 Preface Preface 01 eng 01 01 JB code nlp.14.p1 06 10.1075/nlp.14.p1 4 65 62 Section header 4 01 04 Part I. Introduction and background Part I. Introduction and background 01 eng 01 01 JB code nlp.14.01bor 06 10.1075/nlp.14.01bor 3 36 34 Chapter 5 01 04 Chapter 1. Introduction Chapter 1. Introduction 01 04 Swedish FrameNet++ Swedish FrameNet++ 1 A01 01 JB code 889433613 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/889433613 2 A01 01 JB code 46433614 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/46433614 3 A01 01 JB code 275433615 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/275433615 01 eng 30 00

The Swedish FrameNet++ was designed to be several things. As a digital artifact, it is an integrated panchronic lexical macroresource, primarily for Swedish, but including several other languages, intended as a basic infrastructural component in Swedish language technology research and for developing natural language processing applications. As an activity, it is a long-term R&D initiative, initially aimed at bringing about this macroresource, and now at maintaining and extending it, at promoting its use in language technology research and application development, as well as ensuring that the results of this research and development in their turn are incorporated in the macroresource. As a product of research, it reflects both computational and linguistic approaches to lexicology, lexical semantics, and lexical typology.

01 01 JB code nlp.14.02dan 06 10.1075/nlp.14.02dan 37 66 30 Chapter 6 01 04 Chapter 2. Swedish FrameNet Chapter 2. Swedish FrameNet 1 A01 01 JB code 979433616 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/979433616 2 A01 01 JB code 156433617 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/156433617 3 A01 01 JB code 630433618 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/630433618 4 A01 01 JB code 843433619 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/843433619 5 A01 01 JB code 3433620 Maria Toporowska Gronostaj Gronostaj, Maria Toporowska Maria Toporowska Gronostaj University of Gothenburg 07 https://benjamins.com/catalog/persons/3433620 01 eng 30 00

This chapter describes the development of Swedish FrameNet. A new framenet project often follows one of two methodological approaches: (1) extension, through translation of a different-language – often English – framenet into the target language, and (2) merging, where the resource is built from scratch in the target language. Both approaches have their pros and cons, which have been extensively discussed in the literature. Swedish FrameNet is mainly developed through the extension approach, although balanced with the merging approach. Drawing on the two approaches simultaneously, we describe how integrated language resources and tools have been exploited to create and develop Swedish FrameNet: how it was constructed, what it contains, and the basic assumptions underlying the annotation of its contents.

01 01 JB code nlp.14.p2 06 10.1075/nlp.14.p2 70 165 96 Section header 7 01 04 Part II. Harmonization and integration Part II. Harmonization and integration 01 eng 01 01 JB code nlp.14.03bor 06 10.1075/nlp.14.03bor 69 96 28 Chapter 8 01 04 Chapter 3. Swedish FrameNet++ - lexical samsara Chapter 3. Swedish FrameNet++ – lexical samsara 1 A01 01 JB code 723433621 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/723433621 2 A01 01 JB code 932433622 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/932433622 3 A01 01 JB code 345433623 Lennart Lönngren Lönngren, Lennart Lennart Lönngren Arctic University of Norway 07 https://benjamins.com/catalog/persons/345433623 4 A01 01 JB code 564433624 Niklas Zechner Zechner, Niklas Niklas Zechner University of Gothenburg 07 https://benjamins.com/catalog/persons/564433624 01 eng 30 00

One of the main goals of the Swedish FrameNet++ initiative is to recycle and include as many existing modern Swedish lexical resources as possible into one unified lexical macroresource useful for automatic language processing. In this chapter we describe the structure of Saldo, the central resource of Swedish FrameNet++, the design of the formal interlinking mechanism keeping the lexical macroresource together, and our work on Swesaurus, a Swedish wordnet, and a Swedish Roget-style thesaurus as components of Swedish FrameNet++.

01 01 JB code nlp.14.04ade 06 10.1075/nlp.14.04ade 97 122 26 Chapter 9 01 04 Chapter 4. A lexical resource for computational historical linguistics Chapter 4. A lexical resource for computational historical linguistics 1 A01 01 JB code 239433625 Yvonne Adesam Adesam, Yvonne Yvonne Adesam University of Gothenburg 07 https://benjamins.com/catalog/persons/239433625 2 A01 01 JB code 435433626 Peter Andersson Lilja Lilja, Peter Andersson Peter Andersson Lilja University of Gothenburg/University of Borås 07 https://benjamins.com/catalog/persons/435433626 3 A01 01 JB code 667433627 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/667433627 4 A01 01 JB code 103433628 Gerlof Bouma Bouma, Gerlof Gerlof Bouma University of Gothenburg 07 https://benjamins.com/catalog/persons/103433628 01 eng 30 00

In this chapter we present the diachronic dimension of Swedish FrameNet++. We describe the historical lexical resources currently available for Swedish, linked to the Contemporary Swedish lexicon Saldo. We present a case study of how interlinking the dictionaries simultaneously allows us to study lexical change. We also present a method of linking text words to lexicon entries, facilitating interactive exploration of historical texts. Diachronical language resources present both a high-variation challenge from a wider language technology perspective, and an interesting object of linguistic study. While a number of improvements of the parts of the diachronic lexical macroresource are still needed, this resource is invaluable for analysing and accessing historical texts, as well as for both synchronic historical and diachronic lexical studies.

01 01 JB code nlp.14.05lin 06 10.1075/nlp.14.05lin 123 138 16 Chapter 10 01 04 Chapter 5. A multilingual net of lexical resources Chapter 5. A multilingual net of lexical resources 1 A01 01 JB code 571433629 Krister Lindén Lindén, Krister Krister Lindén University of Helsinki 07 https://benjamins.com/catalog/persons/571433629 2 A01 01 JB code 776433630 Jyrki Niemi Niemi, Jyrki Jyrki Niemi University of Helsinki 07 https://benjamins.com/catalog/persons/776433630 3 A01 01 JB code 205433631 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/205433631 4 A01 01 JB code 421433632 Markus Forsberg Forsberg, Markus Markus Forsberg University of Gothenburg 07 https://benjamins.com/catalog/persons/421433632 5 A01 01 JB code 896433633 Bolette S. Pedersen Pedersen, Bolette S. Bolette S. Pedersen University of Copenhagen 07 https://benjamins.com/catalog/persons/896433633 6 A01 01 JB code 67433634 Sanni Nimb Nimb, Sanni Sanni Nimb The Society for Danish Language and Literature 07 https://benjamins.com/catalog/persons/67433634 7 A01 01 JB code 292433635 Heili Orav Orav, Heili Heili Orav University of Tartu 07 https://benjamins.com/catalog/persons/292433635 8 A01 01 JB code 767433636 Neeme Kahusk Kahusk, Neeme Neeme Kahusk University of Tartu 07 https://benjamins.com/catalog/persons/767433636 9 A01 01 JB code 992433637 Kadri Vider Vider, Kadri Kadri Vider University of Tartu 07 https://benjamins.com/catalog/persons/992433637 01 eng 30 00

In this chapter, we explore how to develop and encode the relationship between wordnets for different languages using some Nordic and Baltic wordnets to illustrate the variety of approaches. We also briefly touch on how these wordnets have been enhanced or augmented with various types of lexical information, such as framenet frames as well as syntagmatic and sentiment information.

01 01 JB code nlp.14.06bor 06 10.1075/nlp.14.06bor 139 166 28 Chapter 11 01 04 Chapter 6. Swedish FrameNet++ and comparative linguistics Chapter 6. Swedish FrameNet++ and comparative linguistics 1 A01 01 JB code 610433638 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/610433638 2 A01 01 JB code 843433639 Anju Saxena Saxena, Anju Anju Saxena Uppsala University 07 https://benjamins.com/catalog/persons/843433639 3 A01 01 JB code 20433640 Shafqat Mumtaz Virk Virk, Shafqat Mumtaz Shafqat Mumtaz Virk University of Gothenburg 07 https://benjamins.com/catalog/persons/20433640 4 A01 01 JB code 501433641 Bernard Comrie Comrie, Bernard Bernard Comrie University of California 07 https://benjamins.com/catalog/persons/501433641 01 eng 30 00

In this chapter we describe a multilingual extension of Swedish FrameNet++, intended to address research questions of a broad comparative nature, in genealogical, areal and typological linguistics, focusing on the integration into Swedish FrameNet++ of so-called core vocabularies, used in several linguistic subfields in order to conduct massive comparative studies involving large numbers of languages. Specifically, we describe the inclusion of two such lexical databases covering several hundred South Asian languages, with the aim of investigating areal and genealogical connections among these languages.

01 01 JB code nlp.14.p3 06 10.1075/nlp.14.p3 170 259 90 Section header 12 01 04 Part III. Method development Part III. Method development 01 eng 01 01 JB code nlp.14.07joh 06 10.1075/nlp.14.07joh 169 190 22 Chapter 13 01 04 Chapter 7. NLP for resource building Chapter 7. NLP for resource building 1 A01 01 JB code 959433642 Richard Johansson Johansson, Richard Richard Johansson University of Gothenburg 07 https://benjamins.com/catalog/persons/959433642 01 eng 30 00

We evaluate several lexicon-based and corpus-based methods to automatically induce new lexical units for Swedish FrameNet, and we see that the best-performing setup uses a combination of both types of methods. A particular challenge for Swedish is the absence of a lexical resource such as WordNet; however, we show that the semantic network Saldo, which is organized according to lexicographical principles quite different from those of WordNet, is very useful for our purposes.

01 01 JB code nlp.14.08fri 06 10.1075/nlp.14.08fri 191 220 30 Chapter 14 01 04 Chapter 8. Differing design decisions - comparing Swedish FrameNet to FrameNet Chapter 8. Differing design decisions – comparing Swedish FrameNet to FrameNet 1 A01 01 JB code 613433643 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent Researcher 07 https://benjamins.com/catalog/persons/613433643 01 eng 30 00

Creation of framenets for languages other than English based on Berkeley FrameNet has tested the hypothesis that semantic frames, to a certain extent, are language independent. This working hypothesis facilitated reuse of frames for new framenets, defining language specific frame evoking lemmas and annotating language specific sentences. The caveat is the bias towards creating what is possible, rather than typical, in a language. The reuse of frames allowed developing SweFN in a relatively short period of time. However, the goal to build a typical, not a possible Swedish framenet, necessitated some frame modifications.

This chapter provides a comparison between the English and Swedish framenets regarding semantic annotation and representation, and socio-cultural factors, including how differences forced modification of the original structure.

01 01 JB code nlp.14.09bor 06 10.1075/nlp.14.09bor 221 260 40 Chapter 15 01 04 Chapter 9. Multiword expressions - a tough typological nut for Swedish FrameNet++ Chapter 9. Multiword expressions – a tough typological nut for Swedish FrameNet++ 1 A01 01 JB code 26433644 Lars Borin Borin, Lars Lars Borin University of Gothenburg 07 https://benjamins.com/catalog/persons/26433644 01 eng 30 00

Multiword expressions have attracted much attention in language technology over the last two decades or so, and in general linguistics, the interest in phraseology – which includes the linguistic study of multiword expressions – goes back much further. In our work on the multilingual components of Swedish FrameNet++, we have strived to adopt a typologically informed view on multiword expressions. This raises a number of theoretical and methodological questions, some of which are discussed in this chapter.

01 01 JB code nlp.14.p4 06 10.1075/nlp.14.p4 264 329 66 Section header 16 01 04 Part IV. Natural language processing applications Part IV. Natural language processing applications 01 eng 01 01 JB code nlp.14.10joh 06 10.1075/nlp.14.10joh 263 280 18 Chapter 17 01 04 Chapter 10. Semantic role labeling Chapter 10. Semantic role labeling 1 A01 01 JB code 760433645 Richard Johansson Johansson, Richard Richard Johansson University of Gothenburg 07 https://benjamins.com/catalog/persons/760433645 2 A01 01 JB code 992433646 Karin Friberg Heppin Friberg Heppin, Karin Karin Friberg Heppin Independent researcher 07 https://benjamins.com/catalog/persons/992433646 3 A01 01 JB code 159433647 Dimitrios Kokkinakis Kokkinakis, Dimitrios Dimitrios Kokkinakis University of Gothenburg 07 https://benjamins.com/catalog/persons/159433647 01 eng 30 00

We investigate the feasibility of automatic semantic role labeling (SRL) using Swedish FrameNet (SweFN). In the first part of the chapter, we describe a baseline system using a traditional division into segmentation and labeling steps. These subsystems are implemented as separate machine learning models, and we explore a wide range of syntactic and lexical features for these models. In the second part, we turn to the question of how the frame-to-frame relations defined in FrameNet allow us to use the annotated examples more effectively. The cross-frame generalization methods reduce the number of errors made by the labeling classifier by 27%. For previously unseen frames, the reduction is even more significant: 50%.

01 01 JB code nlp.14.11dan 06 10.1075/nlp.14.11dan 281 302 22 Chapter 18 01 04 Chapter 11. Computational representation of FrameNet for multilingual natural language generation Chapter 11. Computational representation of FrameNet for multilingual natural language generation 1 A01 01 JB code 883433648 Dana Dannélls Dannélls, Dana Dana Dannélls University of Gothenburg 07 https://benjamins.com/catalog/persons/883433648 2 A01 01 JB code 60433649 Normunds Grūzītis Grūzītis, Normunds Normunds Grūzītis IMCS, University of Latvia 07 https://benjamins.com/catalog/persons/60433649 01 eng 30 00

Multilingual natural language generation, the process of producing written or spoken utterances in parallel languages from either structured or unstructured representations requires large amounts of syntactic and semantic information to generate an expression that is tailored to the target audience. This information is offered by FrameNet-like resources, which have been developed for a number of languages. In this chapter, we present a computational FrameNet grammar resource for multilingual natural language generation. We compare between English and Swedish framenets to illustrate how these can be unified under a shared computational representation using Grammatical Framework. We demonstrate how the grammar was exploited in two practical multilingual natural language generation applications to facilitate tourist communication and empower museum users with coherent artwork descriptions.

01 01 JB code nlp.14.12pre 06 10.1075/nlp.14.12pre 303 330 28 Chapter 19 01 04 Chapter 12. Language learning and teaching with Swedish FrameNet++ Chapter 12. Language learning and teaching with Swedish FrameNet++ 01 04 Two examples Two examples 1 A01 01 JB code 720433650 Julia Prentice Prentice, Julia Julia Prentice University of Gothenburg 07 https://benjamins.com/catalog/persons/720433650 2 A01 01 JB code 938433651 Camilla Håkansson Håkansson, Camilla Camilla Håkansson University of Gothenburg 07 https://benjamins.com/catalog/persons/938433651 3 A01 01 JB code 365433652 Therese Lindström Tiedemann Lindström Tiedemann, Therese Therese Lindström Tiedemann University of Helsinki 07 https://benjamins.com/catalog/persons/365433652 4 A01 01 JB code 577433653 Ildikó Pilán Pilán, Ildikó Ildikó Pilán Norwegian Computing Center 07 https://benjamins.com/catalog/persons/577433653 5 A01 01 JB code 789433654 Elena Volodina Volodina, Elena Elena Volodina University of Gothenburg 07 https://benjamins.com/catalog/persons/789433654 01 eng 30 00

This chapter describes and discusses the use of resources connected to Swedish FrameNet++ (SweFN++) in the context of the teaching and learning of language proficiency and grammatical analysis in Swedish. We illustrate the way in which different resources in the SweFN++ context can be useful for language pedagogy, by employing two examples, the Swedish Constructicon and a semantic role exercise on the intelligent computer assisted language learning (ICALL) platform Lärka. These resources make use of the infrastructure developed within SweFN++ in fundamentally different ways, which are discussed and compared. In addition, we discuss the possibilities for further development of the language pedagogical potential of SweFN++, both in relation to ICALL and to other types of resources and descriptive databases, like corpora, constructicons and framenets.

01 01 JB code nlp.14.ind 06 10.1075/nlp.14.ind 331 333 3 Miscellaneous 20 01 04 Index Index 01 eng
01 JB code JBENJAMINS John Benjamins Publishing Company 01 01 JB code JB John Benjamins Publishing Company 01 https://benjamins.com 02 https://benjamins.com/catalog/nlp.14 Amsterdam NL 00 John Benjamins Publishing Company Marketing Department / Karin Plijnaar, Pieter Lamers onix@benjamins.nl 04 01 00 20211126 C 2021 John Benjamins D 2021 John Benjamins 02 WORLD WORLD US CA MX 09 01 JB 1 John Benjamins Publishing Company +31 20 6304747 +31 20 6739773 bookorder@benjamins.nl 01 https://benjamins.com 21 77 18 01 00 Unqualified price 02 JB 1 02 99.00 EUR 02 00 Unqualified price 02 83.00 01 Z 0 GBP GB US CA MX 01 01 JB 2 John Benjamins Publishing Company +1 800 562-5666 +1 703 661-1501 benjamins@presswarehouse.com 01 https://benjamins.com 21 77 18 01 00 Unqualified price 02 JB 1 02 149.00 USD