Punctuation has usually been ignored by researchers in computational linguistics over the years. Recently, it has been realized that a true understanding of written language will be impossible if punctuation marks are not taken into account. This paper contains the details of a computer-aided exercise to investigate English punctuation practice for the special case of comma (the most significant punctuation mark) in a parsed corpus. The study classifies the various "structural" uses of the comma according to the syntax-patterns in which a comma occurs. The corpus (Penn Treebank) consists of syntactically annotated sentences with no part-of-speech tag information about the individual words.
Lin, Jason, Xing Wang, Zelun Wang, Donald Beyette & Jyh-Charn Liu
2019. Proceedings of the ACM Symposium on Document Engineering 2019, ► pp. 1 ff.
Cook, Vivian
2014. Standard Punctuation and the Punctuation of the Street. In Essential Topics in Applied Linguistics and Multilingualism [Second Language Learning and Teaching, ], ► pp. 267 ff.
2011. Comparing methods for the syntactic simplification of sentences in information extraction. Literary and Linguistic Computing 26:4 ► pp. 371 ff.
정연우 & Yoo Isaiah Wonho
2011. 대학교 신입생들의 영어쉼표 오류분석. English Teaching 66:4 ► pp. 261 ff.
Favre, Benoit, Dilek Hakkani-Tur & Elizabeth Shriberg
2009. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, ► pp. 4697 ff.
Garat, Diego
2006. Shallow Parsing Based on Comma Values. In Advances in Artificial Intelligence - IBERAMIA-SBIA 2006 [Lecture Notes in Computer Science, 4140], ► pp. 492 ff.
Zhou, L. & D. Zhang
2005. A Heuristic Approach to Establishing Punctuation Convention in Instant Messaging. IEEE Transactions on Professional Communication 48:4 ► pp. 391 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.