Article published in: Compilation, transcription, markup and annotation of spoken corpora
Edited by John M. Kirk and Gisle Andersen
[International Journal of Corpus Linguistics 21:3] 2016
pp. 299–322
The Pragmatic Annotation Scheme of the SPICE-Ireland Corpus
Published online: 29 September 2016
https://doi.org/10.1075/ijcl.21.3.01kir
This paper builds on a paradox: transcriptions are the object of study for investigations into spoken language, yet they omit much of “what is heard” when an utterance is made. Transcriptions are agnostic with regard to Searlean notions of illocutionary force and perlocutionary effect. The paper proposes that enhancing transcriptions with pragmatic and prosodic annotation overcomes that paradox and captures the original utterance more objectively. It argues that annotation is part of transcription and presents, with examples, a brief summary of each part of the Pragmatic Annotation Scheme developed for the SPICE-Ireland Corpus: speech acts, tone movements, discourse markers, utterance tags and quotatives.
Keywords: markup, SPICE-Ireland Corpus, prosody, pragmatics, annotation
