Table of contents
List of tables
ix
List of figures
xi
Abbreviations
xiii
Chapter 1.Introduction
1
1.1Previous approaches to pragmatics and discourse
2
1.2Speech acts
5
1.3Approaches to corpus-/computer-based pragmatics
8
1.4Outline of the book
14
1.5Conventions used in this book
15
Chapter 2.Computer-based data in pragmatics
17
2.1Linguistic corpora and pragmatics
17
2.2Issues and standards in text representation and annotation
20
2.2.1General computer-based representation
27
2.2.2Text vs. meta-information
34
2.2.3General linguistic annotation
35
2.3Problems and specifics in dealing with spoken language
transcription
39
2.3.1Issues concerning orthographic representation
39
2.3.2Issues concerning prosody
44
2.3.3Issues concerning segmental and other features
47
2.3.4Issues concerning sequential integrity
52
2.3.5Issues concerning multi-modality
54
Chapter 3.Data, tools and resources
57
3.1Corpus data used in the research
57
3.1.1The SPAADIA trainline corpus
57
3.1.2The selection from trains 93
58
3.1.3The selection from the switchboard annotated dialogue corpus
59
3.1.4Discarded data
61
3.1.5Supplementary data
62
3.2The DART implementation and its use in handling dialogue data
62
3.2.1The DART functionality
63
3.2.2The DART XML format
67
3.3Morpho-syntactic resources required for pragmatic analysis
69
3.3.1The generic lexicon concept
72
3.3.2The DART tagset
78
3.3.3Morphology and morpho-syntax
83
3.3.4‘Synthesising’ domain-specific lexica
84
Chapter 4.The syntax of spoken language units
89
4.1Sentence vs. syntactic types (C-Units)
90
4.2Units of analysis and frequency norming for pragmatic purposes
96
4.3Unit types and basic pragmatic functions
97
4.3.1Yes-Units
100
4.3.2No-Units
108
4.3.3Discourse markers
117
4.3.4Forms of address
134
4.3.5Wh-Questions
134
4.3.6Yes/No- and alternative questions
138
4.3.7Declaratives
141
4.3.8Imperatives
146
4.3.9Fragments and exclamatives
149
Chapter 5.Semantics and semantico-pragmatics
155
5.1The DAMSL annotation scheme
156
5.2Modes
162
5.2.1Grammatical modes
165
5.2.2Interactional modes
166
5.2.3Point-of-view modes
171
5.2.4Volition and personal stance modes
174
5.2.5Social modes
176
5.2.6Syntax-indicating modes
176
5.3Topics
177
5.3.1Generic topics
179
5.3.2Domain-specific topics
184
Chapter 6.The annotation process
187
6.1Issues concerning the general processing of spoken dialogues
187
6.1.1Pre-processing – manual and automated unit determination
187
6.1.2Fillers, pauses, backchannels, overlap, etc
190
6.1.3Handling initial connectors, prepositions & adverbs
192
6.1.4Dealing with disfluent starts
193
6.1.5Parsing & chunking for syntactic purposes
193
6.2Identifying and annotating the individual unit types
automatically
194
6.2.1Splitting off and annotating shorter units
194
6.2.2Tagging Wh-Questions
196
6.2.3Tagging Yes/No-questions
199
6.2.4Tagging fragments, imperatives and declaratives
201
6.3Levels above the C-Unit
205
6.3.1Answers & other responses
205
6.3.2Echoes
207
6.4Identifying topics and modes
208
6.5Inferencing and determining or correcting speech acts
209
Chapter 7.Speech acts: Types, functions, and distributions across the corpora
213
7.1Information-seeking speech acts
214
7.2(Non-)Cohesive speech acts
220
7.3Information-providing and referring speech acts
231
7.4Negotiative speech acts
250
7.5Suggesting or commitment-indicating speech acts
256
7.6Evaluating or attitudinal speech acts
261
7.7Reinforcing speech acts
266
7.8Social, conventionalised speech acts
269
7.9Residual speech acts
274
Chapter 8.Conclusion
277
Appendix A.The DART speech-act taxonomy (version 2.0)
281
References
285
Index
293
This article is available free of charge.