Article published In: International Journal of Corpus Linguistics
Vol. 3:1 (1998) ► pp.33–57
An Analysis of English Punctuation
The Special Case of Comma
Published online: 1 January 1998
https://doi.org/10.1075/ijcl.3.1.03bay
https://doi.org/10.1075/ijcl.3.1.03bay
Punctuation has usually been ignored by researchers in computational linguistics over the years. Recently, it has been realized that a true understanding of written language will be impossible if punctuation marks are not taken into account. This paper contains the details of a computer-aided exercise to investigate English punctuation practice for the special case of comma (the most significant punctuation mark) in a parsed corpus. The study classifies the various "structural" uses of the comma according to the syntax-patterns in which a comma occurs. The corpus (Penn Treebank) consists of syntactically annotated sentences with no part-of-speech tag information about the individual words.
Keywords: Comma, Structural Punctuation Marks, The Penn Treebank, Punctuation
Cited by (9)
Cited by nine other publications
Sharipov, Maksud S., Hushnudbek S. Adinaev & Elmurod R. Kuriyozov
Lin, Jason, Xing Wang, Zelun Wang, Donald Beyette & Jyh-Charn Liu
Cook, Vivian
Kirchhoff, Frank & Beatrice Primus
Evans, R. J.
Favre, Benoit, Dilek Hakkani-Tur & Elizabeth Shriberg
Garat, Diego
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
