The discourse markers well and so and their equivalents in the Portuguese and Turkish subparts of the TED-MDB corpus
Amália Mendes | CLUL – Centre of Linguistics, School of Arts and Humanities, University of Lisbon
Deniz Zeyrek | Graduate School of Informatics, Cognitive Science Department, Middle East Technical University
The present research describes how the PDTB style of discourse annotation has been applied to transcribed TED Talks in the TED-MDB corpus, which include multifunctional elements such as well and so. The occurrence of these two markers is analyzed in English in comparison to the way they are conveyed in Portuguese and Turkish translations of TED-MDB. Their presence in discourse relations, as well as in question-response pairs found in TED Talks, are investigated. The implicitation of well and so in translation is shown to be related to their function in the context. Results also point to differences in the target languages regarding implicitation, especially when the discourse marker doesn’t fulfil a connective function.
2011 “Modelling Discourse Relations for Arabic.” In Proceedings of the Conference on Empirical Methods in Natural Language Proceedings of EMNLP ’11, Stroudsburg, PA, USA, 736–747. ACL.
Asher, Nicholas
2012Reference to Abstract Objects in Discourse. Volume 50. Springer Science & Business Media.
Asher, Nicholas, Julie Hunter, and Kate Thompson
2019 “Comparing Discourse Structures between Purely Linguistic and Situated Messages in an Annotated Corpus.” Dialogue & Discourse 11(1): 89–121.
Asher, Nicolas, and Alex Lascarides
1988 “The Semantics and Pragmatics of Presupposition.” Journal of Semantics 15(2): 239–299.
Asher, Nicolas, Philippe Muller, Myriam Bras, Lydia Mai Ho-Dac, Farah Benamara, Stergos Afantenos, and Laure Vieu
2017 “ANNODIS and Related Projects: Case Studies on the Annotation of Discourse Structure.” In Handbook of Linguistic Annotation, ed. by Nancy Ide, and James Pustejovsky, 1241–1264. Springer.
2016 “Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus.” In The Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2721–2727. ELDA.
Brinton, Laurel
1996Pragmatic markers in English. Grammaticalization and discourse function. Mouton de Gruyter.
Buysse, Lieven
2015‘Well it’s not very ideal …’ The pragmatic marker well in learner English. Intercultural Pragmatics 12(1): 59–89.
Cettolo, Mauro, Christian Girardi, and Marcello Federico
2012 “WIT3: Web Inventory of Transcribed and Translated Talks”. In Proceedings of EAMT, Trento, Italy, 261–268.
Crible, Ludivine
2017Discourse Markers and (Dis)fluency across Registers: A Contrastive Usage-based Study in English and French. Ph.D. Dissertation. [URL]
Crible, Ludivine, and Liesbeth Degand
2017 “Reliability vs. Granularity in Discourse Annotation: What is the Trade-off?” Corpus Linguistics and Linguistic Theory 15(1): 71–99.
Crible, Ludivine, Ágnes Abuczki, Nijolė Burkšaitienė, Péter Furkó, Anna Nedoluzhko, Giedre Valunaite Oleskeviciene, Sigita Rackevičienė, and Šárka Zikánová
2019 “Functions and Translations of Underspecified Discourse Markers in TED Talks: A Parallel Corpus Study on five Languages.” Journal of Pragmatics 142: 139–155.
Cuenca, Maria Josep
2008 “Pragmatic markers in contrast: The case of well.” Journal of Pragmatics 40(8): 1373–1391.
Cuenca, Maria Josep, and Maria Josep Marín
2009 “Co-occurrence of Discourse Markers in Catalan and Spanish Oral Narrative.” Journal of Pragmatics 41: 899–914.
Grésillon, A., J.-L. Lebrave
1984 “Qui interroge qui et pourquoi?”. In La langue au ras du texte. Lille: Presses Universitaires de Lille, 57–132.
Oleskeviciene, Giedre V., Deniz Zeyrek, Viktorija Mazeikiene, and Murathan Kurfalı
2018 “Observations on the Annotation of Discourse Relational Devices in TED Talk Transcripts in Lithuanian.” In Proceedings of the workshop on annotation in digital humanities co-located with ESSLLI, vol. 2155, 53–58.
Oza, Umangi, Rashmi Prasad, Sudheer Kolachina, Dipti M. Sharma, and Aravind Joshi
2009 “The Hindi Discourse Relation Bank.” In Proceedings of the third linguistic annotation workshop, 158–161. ACL.
Prasad, Rashmi, Eleni Miltsakaki, Nikhil Dinesh, Alan Lee, Aravind Joshi, Livio Robaldo, and Bonnie Webber
2007The Penn Discourse Treebank 2.0 Annotation Manual. [URL]
Prasad, Rashmi, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber
2008 “The Penn Discourse Treebank 2.0.” In Proceedings of LREC, 2961–2968. ELRA.
Schiffrin, Deborah
2001 “Discourse markers: Language, meaning, and context.” In Deborah Schiffrin, Deborah Tannen, and Heidi E. Hamilton (Eds.) The handbook of discourse analysis 1, 54–75. Blackwell Publishers.
Spooren, W., and Degand, L.
2010Coding coherence relations: Reliability and validity. Corpus Linguistics and Linguistic Theory, 6(2):241–266.
Tonelli, Sara, Giuseppe Riccardi, Rashmi Prasad, and Aravind Joshi
2010 “Annotation of Discourse Relations for Conversational Spoken Dialogs.” In Proceedings of LREC, 2084–2090. ELRA.
Webber, Bonnie, Rashmi Prasad, Alan Lee, and Aravind Joshi
2019The Penn Discourse Treebank 3.0 Annotation Manual. Technical Report, Institute for Research in Cognitive Science. University of Pennsylvania.
Zeyrek, Deniz, Işın Demirsahin, Ayışığı Sevdik Callı, and Ruket Çakıcı
2013 “Turkish Discourse Bank: Porting a Discourse Annotation Style to a Morphologically Rich Language.” Dialogue and Discourse 4(2): 174–184.
Zeyrek, Deniz, Amália Mendes, and Murathan Kurfalı
2018 “Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank.” In Proceedings of the 11th Language Resources and Evaluation Conference – LREC’20187–12May 2018, Miyazaki, Japan, 1913–1919.
Zeyrek, Deniz, Amália Mendes, Yulia Grishina, Murathan Kurfalı, Samuel Gibbon, and Maciej Ogrodniczuk
2019 “TED Multilingual Discourse Bank (TED-MDB): A Parallel Corpus Annotated in the PDTB Style.” Language Resources and Evaluation.
Zhou, Yuping, and Nianwen Xue
2015 “The Chinese Discourse Treebank: a Chinese Corpus Annotated with Discourse Relations.” Language Resources and Evaluation, 49(2):397–431.
Cited by
Cited by 1 other publications
Özer, Sibel, Murathan Kurfalı, Deniz Zeyrek, Amália Mendes, Giedrė Valūnaitė Oleškevičienė, Julia Bosque-Gil, Milan Dojchinovski, Philipp Cimiano, Julia Bosque-Gil, Philipp Cimiano & Milan Dojchinovski
2022. Linking discourse-level information and the induction of bilingual discourse connective lexicons. Semantic Web 13:6 ► pp. 1081 ff.
This list is based on CrossRef data as of 2 january 2023. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.