Article published in:
Dependency Linguistics: Recent advances in linguistic theory using dependency structuresEdited by Kim Gerdes, Eva Hajičová and Leo Wanner
[Linguistik Aktuell/Linguistics Today 215] 2014
► pp. 75–98
The Copenhagen Dependency Treebank (CDT)
Extending syntactic annotation to other linguistic levels
Henrik Høeg Müller | Copenhagen Business School
Iørn Korzen | Copenhagen Business School
The objective of this paper is to provide an overview of the CDT annotation design with special emphasis on the modelling of the interface between the syntactic level and two other linguistic levels, viz. morphology and discourse. In connection with the description of NP annotation we present the fundamentals of how CDT is marked up with semantic relations in accordance with the dependency principles governing the annotation on the other levels of CDT. Specifically, focus will be on how Generative Lexicon (GL) theory has been incorporated into the unitary theoretical dependency framework of CDT. An annotation scheme for lexical semantics has been designed so as to account for the lexico-semantic structure of complex NPs, and the four GL qualia also appear in some of the CDT discourse relation labels as a description of parallel semantic relations at this level.
Published online: 01 October 2014
https://doi.org/10.1075/la.215.04mul
https://doi.org/10.1075/la.215.04mul
References
References
Böhmová, A., Hajič, J., Hajičová, E. & Hladká, B.
Buch-Kromann, M.
2006 Discontinuous Grammar. A Dependency-based Model of Human Parsing and Language Learning. Doctoral. dissertation, Copenhagen Business School.
Buch-Kromann, M., Gylling, M., Knudsen, L.J., Korzen, I. & Müller, H.H.
2010 The inventory of linguistic relations used in the Copenhagen Dependency Treebanks. Technical report. Copenhagen: Copenhagen Business School. <http://code.google.com/p/copenhagen-dependency-treebank/>.
Buch-Kromann, M., Hardt, D. & Korzen, I.
2011 Syntax-centered and semantics-centered views of discourse. Can they be reconciled? In Beyond Semantics. Corpus-based Investigations of Pragmatic and Discourse Phenomena, S. Dipper & H. Zinsmeister (eds), 17–30. Bochum: Ruhr-Universität Bochum, Sprachwissenschaftliches Institut. [Bochumer Linguistische Arbeitsberichte, vol. 3].
Buch-Kromann, M., Korzen, I. & Müller, H.H.
Carlson, L., Marcu, D. & Okurowski, M.E.
2001 Building a discourse-tagged corpus in the framework of rhetorical structure theory. In
Proceedings of the 2nd SIGdial Workshop on Discourse and Dialogue
.
Dinesh, N., Lee, A., Miltsakaki, E., Prasad, R., Joshi, A. & Webber, B.
2005 Attribution and the (non-)alignment of syntactic and discourse arguments of connectives. In
Proceedings of the Workshop on Frontiers in Corpus Annotation, II: Pie in the Sky
, 29–36.
Hardt, D.
Hinrichs, E., Kubler, S., Naumann, K., Telljohann, H. & Trushkina, J.
2004 Recent developments in linguistic annotations of the TuBa-D/Z Treebank. In
Proceedings of the Third Workshop on Treebanks and Linguistic Theories
, 51–62. Tübingen, Germany.
Johnston, M. & Busa, F.
Korzen, I.
Korzen, I. & Buch-Kromann, M.
2011 Anaphoric relations in the Copenhagen Dependency Treebanks. In Beyond Semantics. Corpus-based Investigations of Pragmatic and Discourse Phenomena, S. Dipper & H. Zinsmeister (eds), 83–98. Bochum: Ruhr-Universität Bochum, Sprachwissenschaftliches Institut. [Bochumer Linguistische Arbeitsberichte, vol. 3].
Kromann, M.T.
2003 The Danish Dependency Treebank and the DTAG treebank tool. In
Proceedings of the Second Workshop on Treebanks and Linguistic Theories (TLT 2003)
, 14–15 November, Växjö, 217–220.
Lundquist, L.
Mann, W.C. & Thompson, S.A.
Marcu, D.
2003 Discourse Structures: Trees or Graphs? <www.isi.edu/~marcu/discourse/Discourse%20structures.htm>.
Marcus, M.P., Marcinkiewicz, M.A. & Santorini, B.
Meyers, A., Reeves, R., Macleod, C., Szekely, R., Zielinska, V., Young, B. & Grishman, R.
2004a The NomBank Project: An interim report. In
Proceedings of the HLTNAACL Workshop on Frontiers in Corpus Annotation, 24–31. Boston MA.
Meyers, A.. et al.
2004b Annotating noun argument structure for NomBank. In
Proceedings of the 4th International Conference on Language Resources and Evaluation (LCREC 2004)
. Lisbon, Portugal.
Mladová, L., Zikánová, Š. & Hajičová, E.
2008 From sentence to discourse: Building an annotation scheme for discourse based on Prague Dependency Treebank. In
Proceedings of the 6th International Conference on Language Resources and Evaluation (LCREC 2008)
, 2564–2570. Marrakesh, Morocco.
Müller, H.H.
Palmer, M., Gildea, D. & Kingsbury, P.
Poesio, M.
2004 Discourse annotation and semantic annotation in the GNOME corpus. In
Proceedings of the ACL Workshop on Discourse Annotation
. Barcelona, Spain.
Prasad, R., Dinesh, N., Lee, A., Joshi, A. & Webber, B.
Prasad, R., Miltsakaki, E., Dinesh, A., Lee, A., Joshi, A., Robaldo, L. & Webber, B.
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A. & Webber, B.
2008b The Penn Discourse TreeBank 2.0. In
Proceedings of the Sixth International Language Resources and Evaluation (LREC’08)
. Marrakesh, Morocco.
Rainer, F.
Ramm, W. & Fabricius-Hansen, C.
Ruppenhofer, J., Ellsworth, M., Petruck, M., Johnson, C. & Scheffczyk, J.
2006 FrameNet, II: Extended Theory and Practice. <http://framenet2.icsi.berkeley.edu/docs/r1.5/book.pdf>
Stede, M.
Taboada, M. & Mann, W.C.
Varela, S. & Martín García, J.
Cited by
Cited by other publications
Høeg Müller, Henrik
This list is based on CrossRef data as of 26 december 2020. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.