Article published in: International Journal of Corpus Linguistics
Vol. 20:2 (2015) ► pp. 232–259
A quantitative approach to the grammaticalization of discourse markers
Evidence from their sequencing behavior
Published online: 17 August 2015
https://doi.org/10.1075/ijcl.20.2.04koo
This article takes a quantitative approach to the grammar of English two-part discourse marker sequences like oh well, you know I mean, etc. We investigate the internal ordering preferences of such sequences in spoken American English corpus data from the perspective of grammaticalization. From this perspective, the development of many discourse markers can be understood as involving a process of increasing syntactic de-categorialization (Hopper 1991) as the grammaticalizing element loses its original grammatical constraints and comes to function as a marker at the level of discourse. We test the hypothesis that discourse marker grammaticalization results in largely unconstrained ordering possibilities. Our analysis shows that, on the contrary, discourse marker sequencing is highly constrained. We interpret these constraints in terms of Auer’s (1996) model of discourse marker grammaticalization. Discourse marker sequencing is characterized by strong persistence of a marker’s original syntactic category and reflects its specific grammaticalization trajectory.
References (40)
Aijmer, K. (2002). English Discourse Particles: Evidence from a Corpus. Amsterdam, Netherlands: Benjamins.
Aijmer, K. (2013). Understanding Pragmatic Markers: A Variational Pragmatic Approach. Edinburgh, UK: Edinburgh University Press.
Andersen, G. (2001). Pragmatic Markers and Sociolinguistic Variation. Amsterdam, Netherlands: Benjamins.
Auer, P. (1996). The pre-front field in spoken German and its relevance as a grammaticalization position. Pragmatics, 6(3), 295–322.
Beckman, M., Hirschberg, J., & Shattuck-Hufnagel, S. (2005). The original ToBI system and the evolution of the ToBI framework. In S.-A. Jun (Ed.), Prosodic Typology: The Phonology of Intonation and Phrasing (pp. 9–54). Oxford, UK: Oxford University Press.
Biber, D., Conrad, S., & Leech, G. (2002). Longman Student Grammar of Spoken and Written English. London, UK: Longman.
Boersma, P., & Weenink, D. (2014). Praat: Doing phonetics by computer [Computer software]. Retrieved from [URL] (last accessed July 2014).
Brinton, L. (1996). Pragmatic Markers in English: Grammaticalization and Discourse Functions. Berlin, Germany: Mouton de Gruyter.
Brinton, L. (2008). The Comment Clause in English: Syntactic Origins and Pragmatic Development. Cambridge, UK: Cambridge University Press.
Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. (2004a). Fisher English Training Speech Part 1, Transcripts. Philadelphia, PA: Linguistic Data Consortium.
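Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. (2004b). Fisher English Training Speech Part 1, Speech. Philadelphia, PA: Linguistic Data Consortium.
Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. (2005a). Fisher English Training Speech Part 2, Transcripts. Philadelphia, PA: Linguistic Data Consortium.
Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. (2005b). Fisher English Training Speech Part 2, Speech. Philadelphia, PA: Linguistic Data Consortium.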
Conrad, S., & Biber, D. (2004). The frequency and use of lexical bundles in conversation and academic prose. Lexicographica, 201, 56–71.
Dehé, N., & Wichmann, A. (2010). Sentence-initial I think (that) and I believe (that): Prosodic evidence for use as main clause, comment clause and discourse marker. Studies in Language, 34(1), 36–74.
Du Bois, J., Schuetze-Coburn, S., Cumming, S., & Paolino, D. (1993). Outline of discourse transcription. In J. Edwards & M. Lampert (Eds.), Talking Data: Transcription and Coding in Discourse Research (pp. 45–89). Hillsdale, NJ: Erlbaum.
Du Bois, J., Chafe, W., Meyer, C., & Thompson, S. (2000). Santa Barbara Corpus of Spoken American English. Philadelphia, PA: Linguistic Data Consortium.
Fitzmaurice, S. (2004). Subjectivity, intersubjectivity and the historical construction of interlocutor stance: From stance markers to discourse markers. Discourse Studies, 6(4), 427–448.
Fraser, B. (2011). The sequencing of contrastive discourse markers in English. Baltic Journal of English Language, Literature, and Culture, 11, 29–35.
Hirschberg, J., & Litman, D. (1993). Empirical studies on the disambiguation of cue phrases. Computational Linguistics, 19(3), 501–530.
Hopper, P. (1991). On some principles of grammaticalization. In E. Traugott & B. Heine (Eds.), Approaches to Grammaticalization (Vol. 1, pp. 17–35). Amsterdam, Netherlands: Benjamins.
Jucker, A. (1997). The discourse marker well in the history of English. English Language and Linguistics, 1(1), 91–110.
Knott, A. (1996). A data-driven methodology for motivating a set of coherence relations. (Unpublished doctoral dissertation). University of Edinburgh, Edinburgh, UK.
Koops, C., & Lohmann, A. (in press). Discourse marker sequencing and grammaticalization. In Baier, N., Donnelly, E., Faytak, M., Giroux, J., Goss, M., Heath, J., Merrill, J., Neely, K., & Redeye, M. (Eds.), Proceedings of the Thirty-Ninth Annual Meeting of the Berkeley Linguistics Society. Berkeley, CA: Berkeley Linguistics Society.
Levelt, W., & Cutler, A. (1983). Prosodic marking in speech repair. Journal of Semantics, 2(2), 205–217.
Lutzky, U. (2012). Discourse Markers in Early Modern English. Amsterdam, Netherlands: Benjamins.
Müller, S. (2005). Discourse Markers in Native and Non-native English Discourse. Amsterdam, Netherlands: Benjamins.
Oates, S. (2000). Multiple discourse marker occurrence: Creating hierarchies for natural language generation. In Kilgarriff, A., Pearce, D., & Tiberius, C. (Eds.), Proceedings of the Third Computational Linguistics UK (CLUK) Colloquium (pp. 41–45). University of Brighton and University of Sussex, UK.
Pierrehumbert, J. (1980). The Phonology and Phonetics of English Intonation. (Unpublished doctoral dissertation). Massachusetts Institute of Technology, Cambridge, MA.
R Core Team. (2014). R: A Language and Environment for Statistical Computing [Computer software]. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from [URL] (last accessed July 2014).
Romaine, S., & Lange, D. (1991). The use of like as a marker of reported speech and thought: A case of grammaticalization in progress. American Speech, 66(3), 227–279.
Schiffrin, D. (2001). Discourse markers: Language, meaning, and context. In D. Schiffrin, D. Tannen & H. Hamilton (Eds.), The Handbook of Discourse Analysis (pp. 54–75). Malden, MA: Blackwell.
Cited by (31)
Choi, Inji
Klumm, Matthias
Klumm, Matthias & Augustin Speyer
Tajeddin, Zia & Maryam Bolouri
Tajeddin, Zia & Maryam Bolouri
Yang, Guoping & Mian Jia
Zuo, Shan & Fuyin Li
2025. Review of Traugott (2022): Ten Lectures on a Diachronic Constructionalist Approach to Discourse Structuring Markers. Language and Linguistics. 語言暨語言學 26:1 ► pp. 190 ff.
Salih, Sana’Khalifa
Bourgeois, Samuel
2022. “Oh yeah, one more thing: It’s gonna be huge.”. In Broadening the Spectrum of Corpus Linguistics [Studies in Corpus Linguistics, 105], ► pp. 197 ff.
Koops, Christian & Arne Lohmann
Blanchard, Meaghan & Lieven Buysse
Crible, Ludivine & Liesbeth Degand
Izutsu, Katsunobu & Mitsuko Narita Izutsu
2021. Presentation followed by negotiation. In Pragmatic Markers and Peripheries [Pragmatics & Beyond New Series, 325], ► pp. 77 ff.
Izutsu, Mitsuko Narita & Katsunobu Izutsu
Mycock, Louise & Chi Lun Pang
Shirtz, Shahar
Van Olmen, Daniël & Jolanta Šinkūnienė
2021. Pragmatic markers and peripheries. In Pragmatic Markers and Peripheries [Pragmatics & Beyond New Series, 325], ► pp. 1 ff.
Faller, Martina
Pinto, Derrin & Donny Vigil
Cuenca, Maria Josep & Ludivine Crible
Haselow, Alexander
Haselow, Alexander
2020. Local and global structures in discourse and interaction. In Grammar and Cognition [Human Cognitive Processing, 70], ► pp. 267 ff.
Mohammadi, Ariana N.
Pons Bordería, Salvador
2018. The combination of discourse markers in spontaneous conversations. Revue Romane. Langue et littérature. International Journal of Romance Languages and Literatures 53:1 ► pp. 121 ff.
Dobrovoljc, Kaja
2017. Multi-word discourse markers and their corpus-driven identification. International Journal of Corpus Linguistics 22:4 ► pp. 551 ff.
Macário Lopes, Ana Cristina & Conceição Carapinha
Lohmann, Arne & Christian Koops
2016. Aspects of discourse marker sequencing. In Outside the Clause [Studies in Language Companion Series, 178], ► pp. 417 ff.
Lohmann, Arne & Christian Koops
[no author supplied]
2020. Dualistic approaches to the analysis of forms and structures in languages. In Grammar and Cognition [Human Cognitive Processing, 70], ► pp. 157 ff.
This list is based on CrossRef data as of 12 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
