Building and Using the Siarad Corpus

Bilingual conversations in Welsh and English

Authors
Margaret Deuchar | University of Cambridge & Bangor University
Peredur Webb-Davies | Bangor University
Kevin Donnelly | Independent Researcher, Llanfairpwll
Hardbound | Available
ISBN 9789027200112 | EUR 95.00 | USD 143.00
 
e-Book
ISBN 9789027264589 | EUR 95.00 | USD 143.00
 
This book is a research monograph divided into two parts. The first part describes the methods used to build the first sizeable corpus of informal conversational data collected from bilingual speakers of Welsh and English: Siarad. The second part describes the linguistic analysis of data from this corpus (available at bangortalk.org.uk). The information in Part One will be useful as a ‘how to’ manual on building a bilingual spoken corpus, including methods of data collection, transcription, glossing and analysis. The findings reported in Part Two throw new light on the debate regarding code-switching vs. borrowing, the application of the Matrix Language Framework (MLF) to the grammar of Welsh-English code-switching, the extralinguistic factors influencing variation in quantity of code-switching, and the extent to which the grammar of Welsh is changing in contact with English. Additional findings by other researchers using the corpus are also reported, and possible future directions are discussed.
[Studies in Corpus Linguistics, 81] 2018. vii, 199 pp.
Publishing status: Available

Cited by 11 other publications

Bellamy, Kate & Jesse Wichers Schreur
2022. When semantics and phonology collide: Gender assignment in mixed Tsova-Tush–Georgian nominal constructions. International Journal of Bilingualism 26:3 pp. 257 ff.
Knight, Dawn, Steve Morris, Laura Arman, Jennifer Needs & Mair Rees
2021. (Meta)Data Collection. In Building a National Corpus, pp. 75 ff.
Knight, Dawn, Steve Morris, Laura Arman, Jennifer Needs & Mair Rees
2021. Understanding the Language Context. In Building a National Corpus, pp. 1 ff.
Knight, Dawn, Steve Morris, Laura Arman, Jennifer Needs & Mair Rees
2021. Processing and (Re)presenting Corpora. In Building a National Corpus, pp. 105 ff.
Knight, Dawn, Steve Morris & Tess Fitzpatrick
2021. 2.2 Corpws Cenedlaethol Cymraeg Cyfoes: Cyd-destun a Gweledigaeth. In Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig, pp. 101 ff.
Knight, Dawn, Steve Morris & Tess Fitzpatrick
2021. 1.2 A National Corpus of Contemporary Welsh: Context and Vision. In Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig, pp. 19 ff.
Knight, Dawn, Steve Morris & Tess Fitzpatrick
2021. 2.3 Cynllunio Corpws Cenedlaethol mewn Iaith Leiafrifoledig. In Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig, pp. 117 ff.
Knight, Dawn, Steve Morris & Tess Fitzpatrick
2021. 1.3 Designing a National Corpus in a Minoritised Language. In Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig, pp. 35 ff.
Nguyen, Li, Oliver Mayeux & Zheng Yuan
2023. Code-switching input for machine translation: a case study of Vietnamese–English data. International Journal of Multilingualism pp. 1 ff.
Vaughan-Evans, Awel, Maria Carmen Parafita Couto, Bastien Boutonnet, Noriko Hoshino, Peredur Webb-Davies, Margaret Deuchar & Guillaume Thierry
2020. Switchmate! An Electrophysiological Attempt to Adjudicate Between Competing Accounts of Adjective-Noun Code-Switching. Frontiers in Psychology 11
Wigdorowitz, Mandy, Ana I. Pérez & Ianthi M. Tsimpli
2023. A holistic measure of contextual and individual linguistic diversity. International Journal of Multilingualism 20:2 pp. 469 ff.

This list is based on CrossRef data as of 20 February 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects

Main BIC Subject

CFDM: Bilingualism & multilingualism

Main BISAC Subject

LAN009000: LANGUAGE ARTS & DISCIPLINES / Linguistics / General
U.S. Library of Congress Control Number: 2017045523