Exploring Newspaper Language
Using the web to create and investigate a large corpus of modern Norwegian
Editor
| Norwegian School of Economics, Bergen
This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus, a large monitor corpus of contemporary Norwegian compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools for tasks such as text harvesting, annotation, topic classification, and the extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system flexible enough to handle very large corpora efficiently. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.
[Studies in Corpus Linguistics, 49] 2012. vi, 356 pp.
Publishing status: Available
© John Benjamins Publishing Company
Table of Contents
1–28
Part I. Exploiting the web as a corpus – Methods and tools
29–50
51–66
67–78
79–110
111–130
131–154
Part II. Corpus-based case studies
155–192
193–220
221–240
241–256
257–284
285–306
307–330
331–350
351–352
Subject index, 353–356
Cited by
Cited by 4 other publications
Abdumanapovna, Sharipova Aziza
Andersen, Gisle & Anne-Line Graedler
Andersen, Gisle & Daniel Hardt
Andersen, Gisle
This list is based on CrossRef data as of 11 April 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
Subjects
BIC Subject: CFX – Computational linguistics
BISAC Subject: LAN009000 – LANGUAGE ARTS & DISCIPLINES / Linguistics / General