Article published In: Trends in Second Language Pragmatics Research Part I:
[Applied Pragmatics 4:2] 2022
► pp. 178–198
Corpus linguistics in L2 pragmatics research
Published online: 13 April 2022
https://doi.org/10.1075/ap.00008.fer
https://doi.org/10.1075/ap.00008.fer
Abstract
In second language acquisition, corpus linguistics (CL) enjoys prominence as a methodology valuable for its
descriptive power in the textual analysis of patterns. Within second language (L2) pragmatics, CL has been increasingly
influential in shaping our understanding of learners’ pragmatic capacities and articulating new insights and possibilities. In
this narrative review, I take a broad view of corpora in L2 pragmatics research with a focus on automatization, size,
representativeness, temporal design, and annotation. This is followed by a review of representative research highlighting language
proficiency, prosodic pragmatics, pragmatic particles and oral fluency, and pragmatics pedagogy. I conclude with future directions
where the discussion centers on bringing CL and other methodologies together for pragmatic analysis and extending the use of these
methodologies to more L2 pragmatic learning contexts.
Article outline
- 1.Introduction
- 2.The use of corpora in L2 pragmatics research
- 3.Review of representative research in learner corpus pragmatics
- 3.1Comparison of L1 and L2 pragmatic performance
- 3.2Language proficiency
- 3.3Prosodic pragmatics
- 3.4Pragmatic particles and spoken fluency
- 3.5Pragmatics instruction
- 4.Future directions
- 4.1Combination of corpus linguistics with other methodologies for pragmatic analysis
- 4.2Extension of contexts for L2 pragmatic learning
- 5.Conclusion
References
References (86)
Adolphs, S. (2008). Corpus and context: Investigating pragmatic functions in spoken discourse. John Benjamins.
Aijmer, K. (1996). Conversational routines in English: Convention and creativity. Studies in language and linguistics. Longman.
(2002). Modality in advanced Swedish learners’ written interlanguage. In S. Granger, J. Hung, & S. Petch-Tyson (Eds.), Computer learner corpora, second language acquisition and foreign language teaching (pp. 55–76). John Benjamins.
(2011). Well I’m not sure I think… The use of well by non-native speakers. International Journal of Corpus Linguistics, 16(2), 231–254.
(2015). General extenders in learner language. In N. Groom, M. Charles, & S. John (Eds.), Corpora, grammar and discourse. In honour of Susan Hunston (pp. 211–233). John Benjamins.
Andersen, G. (2018). Corpus construction. In A. H. Jucker, K. P. Schneider, & W. Bublitz (Eds.), Methods in pragmatics (pp. 467–494). De Gruyter.
Anthony, L., & Baker, P. (2015). ProtAnt: A tool for analyzing the prototypicality of texts. International Journal of Corpus Linguistics, 20(3), 273–293.
Archer, D., & Culpeper, J. (2018). Pragmatic annotation. In A. H. Jucker, K. P. Schneider & W. Bublitz (Eds.), Methods in Pragmatics (pp. 493–525). De Gruyter.
Bardovi-Harlig, K. (2019). Formulaic language in second language pragmatics research. In A. Siyanova-Chanturia & A. Pellicer-Sanchez (Eds.), Understanding formulaic language. A second language acquisition perspective (pp. 97–114). Routledge.
Bardovi-Harlig, K., & Bastos, M. T. (2011). Proficiency, length of stay, and intensity of interaction and the acquisition of conventional expressions in L2 pragmatics. Intercultural Pragmatics, 8(3), 347–384.
Bardovi-Harlig, K., Mossman, S., & Su, Y. (2017). The effect of corpus-based instruction on pragmatic routines. Language Learning & Technology, 21(3), 76–103.
Barron, A. (2019). Using corpus-linguistics methods to track longitudinal development: Routine apologies in the study abroad context. Journal of Pragmatics, 1461, 87–105.
Belz, J. A., & Vyatkina, N. (2005). Learner corpus analysis and the development of L2 pragmatic competence in networked intercultural language study: The case of German modal particles. Canadian Modern Language Review, 62(1), 17–48.
(2008). The pedagogical mediation of a developmental learner corpus for classroom-based language instruction. Language Learning and Technology, 12(3), 33–52.
Biber, D., Conrad, S., & Reppen, R. (1998). Corpus Linguistics: Investigating language structure and use. Cambridge University Press.
Buysse, L. (2014). ‘We went to the restroom or something’ General extenders and stuff in the speech of Dutch learners of English. In J. Romero-Trillo (Ed.), The yearbook of corpus linguistics and pragmatics 2014: New empirical and theoretical paradigms (pp. 213–237). Springer.
Chen, H.-I. (2010). Contrastive learner corpus analysis of epistemic modality and interlanguage pragmatic competence in L2 writing. Arizona Working Papers in Second Language Acquisition and Teaching, 171, 27–51.
Clancy, B., & O’Keeffe, A. (2015). Chapter 13. Pragmatics. In D. Biber, & R. Reppen (Eds.). The Cambridge handbook of English corpus linguistics (pp. 235–251). Cambridge University Press.
Conrad, S. (2005). Corpus linguistics and L2 teaching. In E. Hinkel (Ed.), Handbook of research in second language teaching and learning (pp. 393–410). Lawrence Erlbaum.
Culpeper, J., Mackey, A., & Taguchi, N. (2018). Second language pragmatics. From theory to research. Routledge.
Czerwionka, L., & Olson, D. J. (2020). Pragmatic development during study abroad. L2 intensifiers in spoken Spanish. International Journal of Learner Corpus Research, 6(2), 125–162.
De Cock, S. (2004). Preferred sequences of words in NS and NNS speech. Belgian Journal of English Language and Literatures (BELL) New Series 2, 225–246. [URL]
De Felice, R., & Deane, P. (2012). Identifying speech acts in e-mails: Toward automated scoring of the TOEIC® E-Mail Task. ETS research report no. RR-12-16. Educational Testing Service.
Du Bois, J. W., Chafe, W., Meyer, C., Thompson, S. A., Englebretson, R., & Martey, N. (2000–2005). Santa Barbara corpus of spoken American English (1–4). Linguistic Data Consortium.
Fernández, J. (2013). A corpus-based study of vague language use by learners of Spanish in a study abroad context. In C. Kinginger (Ed.), Social and cultural aspects of language learning in study abroad (pp. 299–332). John Benjamins.
Fernández, J., & Davis, T. (2021). Overview of available learner corpora. In N. Tracy-Ventura & M. Paquot (Eds.), Routledge handbook of second language acquisition and corpora (pp. 147–159). Routledge.
Fernández, J., & Staples, S. (2021). Pragmatic approaches. In N. Tracy-Ventura & M. Paquot (Eds.), Routledge handbook of second language acquisition and corpora (pp. 242–253). Routledge.
Fernández, J., & Yuldashev, A. (2011). Variation in the use of general extenders and stuff in instant messaging interactions. Journal of Pragmatics, 23(10), 2610–2626.
(2015). Using a corpus-informed pedagogical intervention to develop language awareness toward appropriate lexicogrammatical choices. L2 Journal, 7(4), 91–107.
Fernández, J., Gates Tapia, A., & Lu, X. (2014). Oral proficiency and pragmatic marker use in L2 spoken Spanish: The case of pues and bueno. Journal of Pragmatics, 741, 150–164.
Fung, L., & Carter, R. (2007). Discourse markers and spoken English: Native and learner use in pedagogic settings. Applied Linguistics, 28(3), 410–439.
Gablasova, D. & Brezina, V. (2015). Does speaker role affect the choice of epistemic adverbials in L2 speech? Evidence from the Trinity Lancaster Corpus. In J. Romero-Trillo (Ed.), Yearbook of corpus linguistics and pragmatics 2015 (pp. 117–136). Springer.
Gablasova, D., & Brezina, V. (2018). Disagreement in L2 spoken English: From learner corpus research to corpus-based teaching materials. In V. Brezina & L. Flowerdew (Eds.), Learner corpus research: New perspectives and applications (pp. 69–89). Bloomsbury Academic.
Gablasova, D., Brezina, V., Mcenery, T., Boyd, E. (2017). Epistemic stance in spoken L2 English: The effect of task and speaker style. Applied Linguistics, 38(5), 613–637.
Gilquin, G. (2008). Hesitation markers among EFL learners: Pragmatic deficiency or difference? In J. Romero-Trillo (Ed.), Corpus and pragmatics. A Mutualistic entente (pp. 119–149). De Gruyter.
(2015). From design to collection of learner corpora. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The Cambridge handbook of learner corpus research (pp. 9–34). Cambridge University Press.
Gilquin, G., & Gries, S. Th. (2009). Corpora and experimental methods: A state-of-the-art review. Corpus Linguistics and Linguistic Theory, 5(1), 1–26.
Gilquin, G., De Cock, S. & Granger, S. (2010). Louvain international database of spoken English interlanguage. Handbook and CD-ROM. Presses universitaires de Louvain.
Gilquin, G., Granger, S. & Paquot, M. (2007). Learner corpora: The missing link in EAP pedagogy. Journal of English for Academic Purposes, 6(4), 319–35.
Götz, S. (2013). Fluency in native and nonnative English speech. Studies in corpus linguistics 53. John Benjamins.
Götz, S., & Mukherjee, J. (2017). Investigating the effect of the study abroad variable on learner output: A pseudo-longitudinal study on spoken German learner English. In A. McEnery, D. Gablasova, V. Brezina, & L. Flowerdew (Eds.), Learner corpus research: New perspectives and applications (pp. 47–65). Bloomsbury.
Granger, S. (2008). Learner corpora. In A. Lüdeling & M. Kytö (Eds.), Corpus linguistics: An international handbook (pp. 259–275). Mouton de Gruyter.
(2017). Learner corpora in foreign language education. In S. Thorne, & S. May (Eds.), Language, education and technology. Encyclopedia of language and education (3rd ed., pp. 427–440). Springer.
Grice, M., Baumann, S., & Benzmüller, R. (2005). German intonation in autosegmental-metrical phonology. In S.-A. Jun (Ed.), Prosodic typology. The phonology of intonation and phrasing (pp. 55–83). Oxford University Press.
Gut, U. (2009). Non-native speech: A corpus-based analysis of phonological and phonetic properties of L2 English and German. Peter Lang.
Hasselgren, A. (1994). Lexical teddy bears and advanced learners: A study into the ways Norwegian students cope with English vocabulary. International Journal of Applied Linguistics, 4(2), 237–260.
(2002). Learner corpora and language testing: Smallwords as markers of learner fluency. In S. Granger, J. Hung, & S. Petch-Tyson (Eds.), Computer learner corpora, second language acquisition, and foreign language teaching (pp. 143–173). John Benjamins.
Hellermann, J., & Vergun, A. (2007). Language which is not taught: The discourse marker use of beginning adult learners of English. Journal of Pragmatics, 39(1), 157–179.
Hyland, K. & Tse, P. (2005). Hooking the reader: A corpus study of evaluative that in abstracts. English for Specific Purposes, 24(2), 123–139.
Hyland, K. (2005). Stance and engagement: A model of interaction in academic discourse. Discourse Studies, 7(2), 173–192.
Ishikawa, S. (2013). The ICNALE and sophisticated contrastive interlanguage analysis of Asian Learners of English. In S. Ishikawa (Ed.), Learner corpus studies in Asia and the world, 11 (pp. 91–118). Kobe University.
Jucker, A. H., Schneider, G., Taavitsainen, I., & Breustedt, B. (2008). Fishing for compliments: Precision and recall in corpus linguistic compliment research. In A. H. Jucker & I. Taavitsainen (Eds.), Speech acts in the history of English (pp. 273–294). John Benjamins.
Kang, O. & Kermad, A. (2019). Prosody in L2 pragmatics. In N. Taguchi (Ed.), The Routledge handbook of second language acquisition and pragmatics (78–92). Routledge.
Lee, J. J., & Deakin, L. (2016). Interactions in L1 and L2 undergraduate student writing: Interactional metadiscourse in successful and less-successful argumentative essays. Journal of Second Language Writing, 331, 21–34.
Levon, E. (2016). Qualitative analysis of stance. In P. Baker, & J. Egbert (Eds.), Triangulating methodological approaches in corpus linguistics research (pp. 152–166). Routledge.
Lin, Y. L. (2017). Co-occurrence of speech and gestures: A multimodal corpus linguistic approach to intercultural interaction. Journal of Pragmatics, 1171, 155–167.
Maden-Weinberger, U. (2008). Modality as indicator of L2 proficiency? A corpus-based investigation into advanced German interlanguage. In M. Walter & P. Grommes (Eds.), Fortgeschrittene Lernervarietäten: Korpuslinguistik und Zweitspracherwerbsforschung (pp. 141–64). Max Niemeyer Verlag.
Milà-Garcia, A. (2018). Pragmatic annotation for a multi-layered analysis of speech acts: A methodological proposal. Corpus Pragmatics, 21, 265–287.
Müller, S. (2005). Discourse markers in native and non-native English discourse. John Benjamins.
Paquot, M., & Plonsky, L. (2017). Quantitative research methods and study quality in learner corpus research. International Journal of Learner Corpus Research, 3(1), 61–94.
Pierrehumbert, J., & Hirschberg, J. (1990). The meaning of intonational contours in the interpretation of discourse. In P. R. Cohen, J. Morgan, & M. E. Pollack (Eds.), Intentions in communication (271–311). MIT Press.
Polat, B. (2011). Investigating acquisition of discourse markers through a developmental learner corpus. Journal of Pragmatics, 43(15), 3745–3756.
Ramírez-Verdugo, M. D. (2008). A cross-linguistic study on the pragmatics of intonation in directives. In J. Romero-Trillo (Ed.), Pragmatics and corpus linguistics: A mutualistic entente (205–233). De Gruyter.
Reinhardt, J. (2010). Directives in office hour consultations: A corpus-informed investigation of learner and expert usage. English for Specific Purposes, 29(2), 94–107.
Romero-Trillo, J. (2019). Prosodic pragmatics and feedback in intercultural communication. Journal of Pragmatics, 1511, 91–102.
Rosen, A. (2019). The use of smallwords in the speech of German learners of English. A corpus-based study of the factors of instruction and natural exposure. In S. Götz, & J. Mukherjee (Eds), Learner corpora and language teaching (pp. 219–241). John Benjamins.
Rühlemann, C. (2017). Integrating corpus-linguistic and conversation-analytic transcription in XML: The case of backchannels and overlap in storytelling interaction. Corpus Pragmatics, 11, 201–232.
Rühlemann, C., & O’Donnell, M. B. (2012). Introducing a corpus of conversational stories. Construction and annotation of the Narrative Corpus. Corpus Linguistics and Linguistics Theory, 8(2), 313–350.
Sato, S. (2020). A corpus-based analysis of so in written discourse: A comparison between L1 English speakers and Japanese EFL learners. Applied Pragmatics, 1(1), 26–45.
Shin, J.-Y., Velázquez, A. J., Swatek, A., Staples, S., & Partridge, R. S. (2018). Examining the effectiveness of corpus-informed instruction of reporting verbs in L2 first-year college writing. L2 Journal, 10(3), 31–46.
Staples, S. (2015). The discourse of nurse-patient interactions: Contrasting the communicative styles of U.S. and international nurses. John Benjamins.
Staples, S., & Fernández, J. (2019). Corpus linguistics approaches to L2 pragmatics research. In N. Taguchi (Ed.), The Routledge handbook of second language acquisition and pragmatics (pp. 241–254). Routledge.
Taguchi, N. (2013). Production of routines in L2 English: Effect of proficiency and study abroad experience. System, 41(1), 109–121.
Taguchi, N., & Collentine, J. (2018). Language learning in a study-abroad context: Research agenda. Language Teaching, 51(4), 553–566.
Tracy-Ventura, N., Mitchell, R., & McManus, K. (2016). The LANGSNAP longitudinal learner corpus. In M. Alonso-Ramos (Ed.), Spanish learner corpus research: Current trends and future perspectives (pp. 117–142). John Benjamins.
Tsuchiya, K. (2013). Listenership behaviors in intercultural encounters: A time aligned multimodal corpus analysis. John Benjamins.
Vaughan, E., & Clancy, B. (2013). Small corpora and pragmatics. In J. Romero-Trillo (Ed.), Yearbook of corpus linguistics and pragmatics 2013: New domains and methodologies (pp. 53–73). Springer.
Vyatkina, N., & Cunningham, D. J. (2015). Learner corpora and pragmatics. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The Cambridge handbook of learner corpus research (pp. 281–304). Cambridge University Press.
Weisser, M. (2014). Speech act annotation. In K. Aijmer & C. Rühlemann (Eds.), Corpus pragmatics: A handbook (pp. 84–113). Cambridge, UK: Cambridge University Press.
(2020). Speech acts in corpus pragmatics: Making the case for an extended taxonomy. International Journal of Corpus Linguistics, 25(4), 400–426.
Cited by (2)
Cited by two other publications
Czerwionka, Lori
This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
