Chapter published in:
Applications of Pattern-driven Methods in Corpus Linguistics
Edited by Joanna Kopaczyk and Jukka Tyrkkö
[Studies in Corpus Linguistics 82] 2018
pp. 277–310
Chapter 11
Blogging around the world
Universal and localised patterns in Online Englishes
Joanna Kopaczyk | University of Glasgow
Jukka Tyrkkö | Linnaeus University Växjö
The borderless nature of blogging raises the question of whether the traditional, regionally defined varieties of English continue to hold true (see Crystal 2011). To establish how similar language published online without external intervention is around the world, this chapter examines repetitive patterns, or 3-grams, found in blogs in the 583-million-word GloWbE corpus (Davies 2013). The data shows two types of repetitive word sequences: universal ones, which are frequent in all or most of the nineteen geographic locations represented in the corpus, and localised ones, which are unique to specific regions. We explore several ways of approaching the regional distribution of universal and localised 3-grams, including statistical similarity measures (Jaccard coefficient and hierarchical clustering) and network visualisations. The study addresses three interrelated research issues: (1) the ratio of 3-grams in blogs from various World Englishes, which will shed light on the degree of formulaicity in Web Englishes around the world; (2) the overlaps between locations in terms of preferred sequences, which may point to local or global standardisation hubs at the level of sentence and text construction; and (3) the status of model-providing varieties for internet communication, especially American English, in view of the most frequent 3-grams from other locations (cf. Mair 2013).
Keywords: World Englishes, blogs, GloWbE, hierarchical clustering, Gephi plot
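As a rough illustration of the binary similarity measure named in the abstract, the following minimal sketch extracts 3-grams from a few toy texts and computes the Jaccard coefficient between the sets of most frequent 3-grams for each pair. The region labels, the texts, the cut-off of five 3-grams and the helper names (`trigrams`, `top_trigrams`, `jaccard`) are all invented for illustration; this is not the chapter's actual procedure or GloWbE data.

```python
from collections import Counter
from itertools import combinations


def trigrams(text):
    """Return overlapping 3-word sequences from lowercased, whitespace-split text."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)]


def top_trigrams(text, n):
    """Return the set of the n most frequent 3-grams in a text."""
    return {gram for gram, _ in Counter(trigrams(text)).most_common(n)}


def jaccard(a, b):
    """Jaccard coefficient |A & B| / |A | B|, taken as 0.0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Invented toy "blogs"; the region labels are placeholders, not corpus data.
toy_blogs = {
    "region_a": "i do not know what to do i do not know",
    "region_b": "i do not know why you do not know",
    "region_c": "one of the best things one of the",
}

# Assumed cut-off: compare only the five most frequent 3-grams per region.
top_sets = {region: top_trigrams(text, 5) for region, text in toy_blogs.items()}

# Pairwise similarity between regions, keyed by the sorted pair of labels.
similarity = {
    (r1, r2): jaccard(top_sets[r1], top_sets[r2])
    for r1, r2 in combinations(sorted(top_sets), 2)
}

for pair, score in similarity.items():
    print(pair, round(score, 2))
```

A pairwise table of this kind could in principle serve as input to hierarchical clustering or a network plot, with regions as nodes and similarity scores as edge weights.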
Article outline
- 1. Introduction
- 2. Background
- 2.1 World Englishes vs Online English(es)
- 2.1.1 Kachru’s (1982) concentric circles
- 2.1.2 Schneider’s (2007) Dynamic-Evolutionary model
- 2.1.3 Mair’s (2013) World System of Englishes
- 2.2 Blogs as an Internet genre
- 3. Material
- 3.1 The GloWbE corpus
- 3.2 Retrieving patterns: N-grams and lexical bundles
- 4. Methods and findings
- 4.1 Finding similarities
- 4.1.1 Regional binary similarities: Jaccard coefficient
- 4.1.2 Hierarchical clustering
- 4.1.3 Digging deeper, exploring further: Network visualisation
- 4.2 N-grams in World Englishes
- 4.2.1 Corpus inquiries into linguistic areas
- 4.2.2 Universal and localised types
- 4.2.3 Zooming in on the Inner Circle
- 5. Back to World Englishes and Online Englishes
- Notes
- References
Published online: 13 March 2018
https://doi.org/10.1075/scl.82.11kop
References
Ädel, Annelie & Erman, Britt
Baroni, Marco
Biber, Douglas & Barbieri, Federica
Biber, Douglas, Johansson, Stig, Leech, Geoffrey, Conrad, Susan & Finegan, Edward
British National Corpus (BNC XML Edition)
2007 Distributed by Oxford University Computing Services on behalf of the BNC Consortium. http://www.natcorp.ox.ac.uk/
Burridge, Kate
Davies, Mark
2014 Corpus of Global Web-Based English: 1.9 billion words from speakers in 20 countries. http://corpus.byu.edu/glowbe/
Davies, Mark & Fuchs, Robert
de Swaan, Abram
Eckert, Penelope & McConnell-Ginet, Sally
Fernback, Jan
Fuster-Márquez, Miguel
Gries, Stefan T. & Mukherjee, Joybrato
Grieve, Jack, Biber, Douglas, Friginal, Eric & Nekrasova, Tatiana
Görlach, Manfred
Goźdź-Roszkowski, Stanisław
Gupta, Anthea Fraser
Hickey, Raymond
Hundt, Marianne & Gut, Ulrike
Hyland, Ken
Internet World Stats. Usage and Population Statistics
Internet Live Stats
Jacomy, Mathieu, Venturini, Tommaso, Heymann, Sebastien & Bastian, Mathieu
Jacquemet, Marco
Jucker, Andreas H. & Kopaczyk, Joanna
Kopaczyk, Joanna
Mair, Christian
Mauranen, Anna
Mesthrie, Rajend
Mukherjee, Joybrato & Gries, Stefan T.
Oliveros, J. C.
2007–2015 Venny. An interactive tool for comparing lists with Venn’s diagrams. http://bioinfogp.cnb.csic.es/tools/venny/index.html
Omoniyi, Tope
Pennycook, Alastair
Richardson, Kay, Parry, Katy & Corner, John
Salazar, Danica
Säily, Tanja
Schneider, Gerold & Hundt, Marianne
Traugott, Elisabeth
2008 Grammaticalization, constructions and the incremental development of language: Suggestions from the development of degree modifiers in English. In Variation, Selection, Development. Probing the Evolutionary Model of Language Change, Regine Eckardt, Gerhard Jäger, and Tonjes Veenstra (eds), 219–250. Berlin: Mouton de Gruyter.
Tyrkkö, Jukka, Hickey, Raymond & Marttila, Ville
Warschauer, Mark, Black, Rebecca & Chou, Yen-Lin