Article published in: Dependency Linguistics: Recent advances in linguistic theory using dependency structures
Edited by Kim Gerdes, Eva Hajičová and Leo Wanner
[Linguistik Aktuell/Linguistics Today 215] 2014
pp. 53–74
Sentence structure and discourse structure
The present contribution is a first step in comparing the syntactico-semantic relations present in sentence structure with their equivalents in discourse structure. The study is based on manually annotated Czech material collected in the Prague Dependency Treebank (PDT). Following the PDT's analysis of the underlying syntactic structure of a sentence (tectogrammatics), we distinguish various types of relations that can be expressed both within a single sentence (i.e. in a tree) and in a larger text, beyond the sentence boundary (between trees). We suggest that, on the one hand, the semantic nature of each type of relation is the same within a sentence and in a larger text (i.e. a causal relation remains a causal relation) but, on the other hand, depending on the semantic properties of the relations, their distribution within a sentence or between sentences is very diverse. In this study, this observation is analyzed in detail for three cases (relations of condition, specification and opposition) and further supported by the similar behaviour of English data from the Penn Discourse Treebank.
Published online: 01 October 2014