Exploring Newspaper Language

Using the web to create and investigate a large corpus of modern Norwegian

Editor

Gisle Andersen | Norwegian School of Economics, Bergen

Hardbound – Available

ISBN 9789027203540 | EUR 99.00 | USD 149.00

e-Book

ISBN 9789027274991 | EUR 99.00 | USD 149.00

This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus, a large monitor corpus of contemporary Norwegian compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification, and the extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system flexible enough to handle very large corpora efficiently. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.

[Studies in Corpus Linguistics, 49] 2012. vi, 356 pp.

Publishing status: Available

https://doi.org/10.1075/scl.49

Table of Contents

Building a large corpus based on newspapers from the web

Gisle Andersen and Knut Hofland | pp. 1–28

Part I. Exploiting the web as a corpus – Methods and tools

Corpuscle – a new corpus management platform for annotated corpora

Paul Meurer | pp. 29–50

OBT+stat: A combined rule-based and statistical tagger

Janne Bondi Johannessen, Kristin Hagen, André Lynum and Anders Nøklestad | pp. 51–66

Exploring corpora through syntactic annotation

Victoria Rosén | pp. 67–78

Collocations and statistical analysis of n-grams: Multiword expressions in newspaper text

Gunn Inger Lyse and Gisle Andersen | pp. 79–110

Automatic topic classification of a large newspaper corpus

Thomas M. Hagen | pp. 111–130

A data-driven approach to anglicism identification in Norwegian

Gyri Smørdal Losnegaard and Gunn Inger Lyse | pp. 131–154

Part II. Corpus-based case studies

A corpus-based study of the adaptation of English import words in Norwegian

Gisle Andersen | pp. 155–192

Norm clusters in written Norwegian

Helge Dyvik | pp. 193–220

Lexical neography in modern Norwegian

Ruth Vatvedt Fjeld and Lars Nygaard | pp. 221–240

Ash compound frenzy: A case study in the Norwegian Newspaper Corpus

Koenraad De Smedt | pp. 241–256

Financial jargon in a general newspaper corpus

Marita Kristiansen | pp. 257–284

Metonymic extension and vagueness: Schengen and Kyoto in Norwegian newspaper language

Sandra L. Halverson | pp. 285–306

Spatial metaphors in present-day Norwegian newspaper language

Leiv Egil Breivik and Toril Swan | pp. 307–330

Doing historical linguistics using contemporary data

Øivin Andersen | pp. 331–350

| pp. 351–352

Subject index | pp. 353–356

Cited by

Cited by 8 other publications

Abdumanapovna, Sharipova Aziza

2018. Proceedings of the 2nd International Conference on Digital Technology in Education, ► pp. 82 ff.

Andersen, Gisle

2014. Relevance. In Corpus Pragmatics, ► pp. 143 ff.

Andersen, Gisle

2022. Utilising heterogeneous language resources for term extraction in maritime domains. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 28:1 ► pp. 1 ff.

Andersen, Gisle & Anne-Line Graedler

2020. Morphological borrowing from English to Norwegian: The enigmatic non-possessive -s. Nordic Journal of Linguistics 43:1 ► pp. 3 ff.

Andersen, Gisle & Daniel Hardt

2014. Introduction: Corpus linguistics and the Nordic languages. Nordic Journal of Linguistics 37:2 ► pp. 135 ff.

Birnie, Claire Emma, Jennifer Sampson, Eivind Sjaastad, Bjarte Johansen, Lars Egil Obrestad, Ronny Larsen & Ahmed Khamassi

2019. Day 2 Wed, September 04, 2019,

Andersen, Gisle

2022. Phraseology in a cross-linguistic perspective: A diachronic and corpus-based account. Corpus Linguistics and Linguistic Theory 18:2 ► pp. 365 ff.

Matharaarachchi, Surani, Mike Domaratzki, Alan Katz & Saman Muthukumarana

2022. Discovering Long COVID Symptom Patterns: Association Rule Mining and Sentiment Analysis in Social Media Tweets. JMIR Formative Research 6:9 ► pp. e37984 ff.

This list is based on CrossRef data as of 16 April 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
