Article published In: International Journal of Corpus Linguistics
Vol. 23:3 (2018) ► pp.335–369
A critical review of research and practice in data-driven learning (DDL) in the academic writing classroom
Published online: 29 October 2018
https://doi.org/10.1075/ijcl.16130.che
https://doi.org/10.1075/ijcl.16130.che
Abstract
Since the late 1980s, there has been a growing interest in the direct application of corpora, or data-driven learning (DDL), in
language education. This relatively novel teaching approach has been particularly applied in the teaching and learning of English
for Academic Purposes (EAP)/academic writing, especially since the turn of the century. This paper synthesizes and evaluates the
research progress in the field of EAP/academic writing since the year 2000 by critically reviewing 37 empirical studies focussing
on applications of DDL in this context. Based on the critical review and a discussion of some contentious issues, a set of five
recommendations for the way forward in DDL research and practice for EAP/academic writing is presented.
Article outline
- 1.Introduction
- 2.Search criteria, procedures, methods of analysis, and structure of the review
- 3.Overview of the 37 studies
- 4.Studies focussing on particular foci of teaching of DDL in the academic writing classroom
- 4.1Studies focussing on learner self-correction of errors
- 4.2More globally-focussed studies
- 4.3Studies focusing on particular sub-sections and rhetorical functions of research papers and theses
- 4.4Studies focussing on the teaching of academic vocabulary
- 5.Contentious issues
- 6.Conclusion
References
References (93)
Ädel, A. (2010). Using corpora to teach academic writing: Challenges for the direct approach. In M. C. Campoy-Cubillo, B. Bellés-Fortuño & M. L. Gea-Valor (Eds.), Corpus-based Approaches to ELT (pp. 39–55). London: Continuum.
Anthony, L. (2018). AntConc (Version 3.5.7) [Computer Software]. Tokyo: Waseda University. Available from [URL] (last accessed August 2018).
Bianchi, F., & Pazzaglia, R. (2007). Student writing of research articles in a foreign language: metacognition and corpora. In R. Facchinetti (Ed.), Corpus Linguistics 25 Years on (pp. 259–287). Amsterdam/Philadelphia: Rodopi.
Boulton, A. (2010). Data-driven learning: Taking the computer out of the equation. Language Learning, 60(3), 534–572.
(2011). Bringing corpora to the masses: Free and easy tools for language learning. In N. Kubler (Ed.), Corpora, Language, Teaching, and Resources: From Theory to Practice (pp. 69–95). Brussels: Peter Lang.
(2012). Corpus Consultation for ESP: A Review of Empirical Research. In A. Boulton, S. Carter-Thomas & E. Rowley-Jolivet (Eds.), Corpus-Informed Research and Learning in ESP: Issues and Applications (pp. 261–292). Amsterdam/Philadelphia: John Benjamins.
Boulton, A., & Cobb, T. (2017). Corpus use in language learning: A meta-analysis. Language Learning, 67(2), 348–393.
Cargill, M., & O’Connor, P. (2006). Developing Chinese scientists’ skills for publishing in English: Evaluating collaborating-colleague workshops based on genre analysis. Journal of English for Academic Purposes, 5(3), 207–221.
Caviglia, F. (2004). Students’ diverse appreciation of text corpora as writing aids. In T. Caudery (Ed.), Proceedings of the Ninth Nordic Conference for English Studies (NAES 2004) (pp. 27–29). Aarhus: University of Aarhus.
Chambers, A. (2007). Popularising corpus consultation by language learners and teachers. In E. Hidalgo, L. Quereda & J. Santana (Eds.), Corpora in the Foreign Language Classroom (pp. 3–16). Amsterdam/New York: Rodopi.
(2010). Combining text-based and corpus-based approaches in the learning and teaching of academic writing in French. In M. M. Jaén & M. C. Pérez (Eds.) Exploring New Paths in Language Pedagogy (pp. 145–156). London: Equinox Publishing.
(2011). Language learning as discourse analysis: Playing games in a corpus of French journalistic discourse. In N. Kübler (Ed.). Corpora, Language, Teaching, and Resources: From Theory to Practice (pp. 97–110). Frankfurt: Peter Lang.
Chambers, A. & O’Sullivan, Í. (2004). Corpus consultation and advanced learners’ writing skills in French. ReCALL, 16(1), 158–172.
Chang, J. -Y. (2014). The use of general and specialized corpora as reference sources for academic English writing: A case study. ReCALL, 26(2), 243–259.
Chang, P. (2012). Using a stance corpus to learn about effective authorial stance-taking: A textlinguistic approach. ReCALL, 24(2), 209–236.
Charles, M. (2007). Reconciling top-down and bottom-up approaches to graduate writing: Using a corpus to teach rhetorical functions. Journal of English for Academic Purposes, 6(4), 289–302.
(2011). Using hands-on concordancing to teach rhetorical functions: Evaluation and implications for EAP writing classes. In A. Frankenberg-Garcia, L. Flowerdew & G. Aston (Eds.), New Trends in Corpora and Language Learning (pp. 81–104). London/New York: Continuum.
(2012). ‘Proper vocabulary and juicy collocations’: EAP students evaluate do-it-yourself corpus building. English for Specific Purposes, 31(2), 93–102.
(2014). Getting the corpus habit: EAP students’ long-term use of personal corpora. English for Specific Purposes, 351, 30–40.
(2015). Same task, different corpus: The role of personal corpora in EAP classes. In A. Leńko-Szymańska & A. Boulton (Eds.), Multiple Affordances of Language Corpora for Data-driven Learning (pp. 129–154). Amsterdam/Philadelphia: John Benjamins.
Cobb, T. (1997). Is there any measurable learning from hands-on concordancing? System, 25(3), 301–315.
(n. d.). Concordancers [Computer software]. Available at [URL] (last accessed January 2018).
Cobb, T., & Boulton, A. (2015). Classroom applications of corpus analysis. In D. Biber & R. Reppen (Eds.), Cambridge Handbook of English Corpus Linguistics (pp. 478–497). Cambridge: Cambridge University Press.
Cortes, V. (2007). Exploring genre and corpora in the English for academic writing class. The ORTESOL Journal, 251, 8–14.
(2014). Genre analysis in the academic writing class: With or without corpora? Quaderns de Filologia-Estudis Lingüístics, 161, 65–80.
Cresswell, A. (2007). Getting to “know” connectors? Evaluating data-driven learning in a writing skills course. In E. Hidalgo, L. Quereda & J. Santana (Eds.), Corpora in the Foreign Language Classroom (pp. 267–288). Amsterdam/New York: Rodopi.
Crosthwaite, P. (2017). Retesting the limits of data-driven learning: Feedback and error correction. Computer Assisted Language Learning, 30(6) 1–27.
Davies, M. (2008). The Corpus of Contemporary American English (COCA): 560 million words, 1990-present. Retrieved from [URL] (last accessed August 2018).
Diani, G. (2012). Text and corpus work, EAP writing and language learners. In R. Tang (Ed.), Academic Writing in a Second or Foreign Language (pp. 45–66). London: Continuum.
Ebrahimi, A., & Faghih, E. (2017). Integrating corpus linguistics into online language teacher education programs. ReCALL, 29(1), 120–135–16.
Eriksson, A. (2012). Pedagogical perspectives on bundles: Teaching bundles to doctoral students of biochemistry. In J. Thomas & A. Boulton (Eds.), Input, Process and Products. Developments in Teaching and Language Corpora (pp. 195–211). Brno: Masaryk University Press.
Fan, M., & Xu, X. (2002). An evaluation of an online bilingual corpus for the self-learning of legal English. System, 30(1), 47–63.
Feng, H. -H. (2014). A pilot study: The use of corpus concordancing for ESL learner self error-correction. Journal of Interactive Learning Research, 251, 5–25.
Fenton-Smith, B., Humphries, P., & Walkinshaw, I. (Eds.) (2017). EMI: Issues and Challenges in Asia-Pacific Higher Education. Cham: Springer.
Flowerdew, J., & Wang, S. O. (2017). Teaching English for research writing purposes with a focus on genre, register, textual mentors, and language-reuse. In J. Flowerdew & T. Costley (Eds.), Discipline-specific Writing; Theory into Practice (pp. 144–161). London: Routledge.
Flowerdew, L. (2001). The exploitation of small learner corpora in EAP materials design. In M. Ghadessy, A. Henry & R. Robert (Eds.), Small Corpus Studies and ELT: Theory and Practice (pp. 363–379). Amsterdam/Philadelphia: John Benjamins.
(2004). The problem-solution pattern in apprentice vs. professional technical writing: An application of appraisal theory. In G. Aston, S. Bernardini & D. Stewart (Eds.), Corpora and Language Learners (pp. 125–35). Amsterdam/Philadelphia: John Benjamins.
(2005). Integrating traditional and critical approaches to syllabus design: The ‘what’, the ‘how’ and the ‘why?’. Journal of English for Academic Purposes, 4(2), 135–147.
(2008). Corpus linguistics for academic literacies mediated through discussion activities. In D. Belcher & A. Hirvela (Eds.), The Oral-Literate Connection: Perspectives on L2 Speaking, Writing and Other Media Interactions (pp. 268–287). Ann Arbor, MI: University of Michigan Press.
(2009). Applying corpus linguistics to pedagogy: A critical evaluation. International Journal of Corpus Linguistics. 14(3) 393–417.
(2015). Using corpus-based research and online academic corpora to inform writing of the discussion section of a thesis. Journal of English for Academic Purposes, 201, 58–68.
Francis, W. N., & Kucera, H. (1964). Computational Analysis of Present-day American English. Providence, R. I.: Brown University Press.
Frankenberg-Garcia, A. (2012). Raising teachers’ awareness of corpora. Language Teaching, 45(4), 475–489.
Gabel, S. (2001). Over-indulgence and under-representation in interlanguage: Reflections on the utilization of concordancers in self-directed foreign language learning. Computer Assisted Language Learning, 14(3–4), 269–288.
Gaskell, D., & Cobb, T. (2004). Can learners use concordance feedback for writing errors? System, 32(3), 301–319.
Gilmore, A. (2009). Using online corpora to develop students’ writing skills. ELT Journal, 63(4), 363–372.
Gosden, H. (1995). Success in research article writing and revision: A social-constructionist perspective. English for Specific Purposes, 14(1), 37–57.
Gilquin, G., Granger, S., & Paquot, M. (2007). Learner corpora: The missing link in EAP pedagogy. Journal of English for Academic Purposes, 6(4), 319–335.
Greaves, C. (2009). ConcGram 1.0: A Phraseological Search Engine. Amsterdam/Philadelphia: John Benjamins.
Granger, S. (2009). The contribution of learner corpora to second language acquisition and foreign language teaching: A critical evaluation. In K. Aijmer (Ed.), Corpora and Language Teaching (pp. 3–32). Amsterdam/Philadelphia: John Benjamins.
Hafner, C. A., & Candlin, C. N. (2007). Corpus tools as an affordance to learning in professional legal education. Journal of English for Academic Purposes, 6(4), 303–318.
Hegelheimer, V. (2006). Helping ESL writers through a multimodal, corpus-based, online grammar resource. Calico Journal, 24(1), 5–32.
Hsieh, W. M., & Liou, H. C. (2008). A case study of corpus-informed online academic writing for EFL graduate students. CALICO Journal, 26(1), 28–47.
Huang, Z. (2014). The effects of paper-based DDL on the acquisition of lexico-grammatical patterns in L2 writing. ReCALL, 26(2), 163–183.
Johns, T. (1990). From printout to handout: Grammar and vocabulary teaching in the context of data-driven learning. English Language Research Journal, 41, 27–45.
(1991). Should you be persuaded: Two samples of data-driven learning materials. English Language Research Journal, 41, 1–16.
Johns, A. M., & Snow, M. A. (2006). Academic English in secondary schools. Journal of English for Academic Purposes, 5(4), 251–253.
Kaszubuski, P. (2010). IFAConc: A pedagogic tool for online concordancing with EFL/EAP learners. In A. Frankenberg-Garcia, L. L. Flowerdew & G. Aston (Eds.), New Trends in Corpora and Language Learning (pp. 81–104). London/New York: Continuum.
Kaur, J., & Hegelheimer, V. (2005). ESL students use of concordance in the transfer of academic word knowledge: An exploratory study. Computer Assisted Language Learning, 18(4), 287–310.
Kennedy, C., & Miceli, T. (2000). The CWIC project: Developing and using a corpus for intermediate Italian students. In B. Kettemann & G. Marko (Eds.), Teaching and Learning by Doing Corpus Analysis. Proceedings of the Fourth International Conference on Teaching and Language Corpora (pp. 81–104). Amsterdam/New York: Rodopi.
(2001). An evaluation of intermediate students’ approaches to corpus investigation. Language Learning and Technology, 5(3), 77–90.
(2010). Corpus-assisted creative writing: Introducing intermediate Italian learners to a corpus as a reference resource. Language Learning & Technology, 14(1), 28–44.
Lee, D., & Swales, J. (2006). A corpus-based EAP course for NNS doctoral students: Moving from available specialized corpora to self-compiled corpora. English for Specific Purposes, 251, 56–75.
Leech, G. (1997). Teaching and language corpora: A convergence. In A. Wichmann, S. Fligelstone, T. McEnery & G. Knowles (Eds.), Teaching and Language Corpora (pp. 11–23). Harlow: Addison Wesley Longman.
Leńko-Szymańska, A. (2014). Is this enough? A qualitative evaluation of the effectiveness of a teacher-training course on the use of corpora in language education. ReCALL, 26(2), 260–278.
Liou, H. C., Yang, P. C., & Chang, J. S. (2012). Language supports for journal abstract writing across disciplines. Journal of Computer Assisted Learning, 28(4), 322–335.
McGarrell, H. M. (2015). Developing autonomy through awareness of textual features in academic texts. OLBI Working Papers, 71, 41–52.
Miceli, T., & Kennedy, C. (2002). An apprenticeship with the CWIC Corpus: A tool for learner writers in Italian. In C. Kennedy (Ed.), Proceedings of Workshop Innovations in Italian Teaching (pp. 83–94). Brisbane: Griffith University.
Nunan, D., & Richards, J. C. (Eds.). (2014). Language Learning Beyond the Classroom. New York, NY: Routledge.
O’Sullivan, Í., & Chambers, A. (2006). Learners’ writing skills in French: corpus consultation and learner evaluation. Journal of Second Language Writing, 15(1), 49–68.
Park, K., & Kinginger, C. (2010). Writing/thinking in real time: Digital video and corpus query analysis. Language Learning & Technology, 14(3), 31–50.
Poole, R. (2016). A corpus-aided approach for the teaching and learning of rhetoric in an undergraduate composition course for L2 writers. Journal of English for Academic Purposes, 211, 99–109.
Quinn, C. (2015). Training L2 writers to reference corpora as a self-correction tool. ELT Journal, 69(2), 165–177.
Scott, M. (2017). WordSmith Tools (Version 7) [Computer Software]. Stroud: Lexical Analysis Software.
Seidlhofer, B. (2002). Pedagogy and local learner corpora: Working with learning-driven data. In S. Granger, J. Hung & S. Petch-Tyson (Eds.), Computer Learner Corpora, Second Language Acquisition and Foreign Language Teaching (pp. 213–234). Amsterdam/Philadelphia: John Benjamins.
Stevens, V. (1991). Classroom concordancing: Vocabulary materials derived from relevant, authentic text. English for Specific Purposes, 10(1), 35–46.
Sun, Y. C. (2007). Learner perceptions of a concordancing tool for academic writing. Computer Assisted Language Learning, 20(4), 323–343.
Swales, J. M. (1990). Genre Analysis: English in Academic and Research Settings. Cambridge: Cambridge University Press.
Swales, J. (2002). Integrated and fragmented worlds: EAP materials and corpus linguistics. In J. Flowerdew (Ed.), Academic Discourse (pp. 150–164). Harlow: Longman.
Tardy, C. (2004). The role of English in scientific communication: Lingua franca or Tyrannosaurus rex? Journal of English for Academic Purposes, 3(3), 247–269.
Todd, R. W. (2001). Induction from self-selected concordances and self-correction. System, 29(1), 91–102.
Tono, Y., Satake, Y., & Miura, A. (2014). The effects of using corpora on revision tasks in L2 writing with coded error feedback. ReCALL, 26(2), 147–162.
Tribble, C., & Jones, G. (1990). Concordances in the Classroom: A Resource Book for Teachers. Harlow: Longman.
Tribble, C. (2015). Teaching and language corpora: Perspectives from a personal journey. In A. Lenko-Szymanska & A. Boulton (Eds.), Multiple Affordances of Language Corpora for Data-Driven Learning (pp. 37–62). Amsterdam/Philadelphia: John Benjamins.
Turnbull, J., & Burston, J. (1998). Towards independent concordance work for students: Lessons from a case study. On-Call, 12(2), 10–21.
Widdowson, H. G. (2000). On the limitations of linguistics applied. Applied Linguistics, 21(1), 3–25.
Yoon, C. (2011). Concordancing in L2 writing class: An overview of research and issues. Journal of English for Academic Purposes, 10(3), 130–139.
Yoon, H. (2008). More than a linguistic reference: The influence of corpus technology on EAP writing. Language Learning & Technology, 12(2), 31–49.
Cited by (68)
Cited by 68 other publications
Du, Yu & Junju Wang
Alamri, Basim & Assem Alqarni
Boulton, Alex & Luciana Forti
Curry, Niall & Tony McEnery
Dong, Jihua & Louisa Buckingham
Dong, Jihua & Hao Wang
Green, Clarence
Huang, Shiya & Qing Ma
Jablonkai, Reka R.
Liao, Baiqiu
Liu, Tanjun & Dana Gablasova
Ma, Qing, Jiahao Yan, Peter Crosthwaite & Chi-Kin John Lee
Róg, Tomasz
Sfeir, Maya
Sun, Amelie Xiaohan & Atsushi Mizumoto
Tono, Yukio
Wang, Qingran, Yaochen Deng & Dilin Liu
Wang, Wenyi & Ali Derakhshan
Wang, Yingmin, Wenjuan Qin, Tan Jin & Jing Chen
Wang, Yongliang & Xiaofang Yan
Wunderlich, Claudia Theresia
Zare, Javad, Azadeh Emadi, Mostafa Azari Noughabi & Fatemeh Ranjbaran Madiseh
Özbay, Ali Şükrü, Hatice Celep, Fatma Betül Küçük, Beceste Tamtürk & Dilek Cankur
Crosthwaite, Peter & Brett Steeples
Farr, Fiona & Elaine Riordan
Flowerdew, Lynne & Bojana Petrić
Gilquin, Gaëtanelle
He, Mengyu & Qin Xie
HU, X.I.A.O.J.I.N.G., L.U. SONG & X.I. CHEN
Ngo, Thuy Thi-Nhu & Howard Hao-Jan Chen
Staples, Shelley, Anh Dang & Hui Wang
Yan, Jiahao & Qing Ma
Yan, Jiahao & Qing Ma
Zhu, Jinyu, Yongle Yang & Zi Yan
Dong, Jihua, Yanan Zhao & Louisa Buckingham
Lusta, Amel, Özcan Demirel & Behbood Mohammadzadeh
Sun, Xiaoya & Guangwei Hu
Yu, Xiaoli & Veysel Altunel
Zare, Javad & Khadijeh Aqajani Delavar
Zhao, Ke & Xiaojing Bai
Zhao, Ke & Xiaojing Bai
Şahin Kızıl, Aysel
Bal-Gezegin, Betül, Erdem Akbaş & Ahmet Başal
Charles, Maggie
Charles, Maggie
Charles, Maggie
Charles, Maggie & Gregory Hadley
Du, Jianying, Weiping Li & Qiong Li
Frankenberg-Garcia, Ana, Paula Tavares Pinto, Ana Eliza Pereira Bocorny & Simone Sarmento
Poole, Robert
Raitskaya, Lilia & Elena Tikhonova
Yan, Hengbin
Zare, Javad & Sedigheh Karimpour
Zare, Javad, Sedigheh Karimpour & Khadijeh Aqajani Delavar
Zare, Javad, Sedigheh Karimpour & Khadijeh Aqajani Delavar
Boulton, Alex
2021. Research in data-driven learning. In Beyond Concordance Lines [Studies in Corpus Linguistics, 102], ► pp. 9 ff.
Li, Yongyan
Liao, Han-Teng & Tao Guo
Lin, Ming Huei
Liu, Tanjun
2021. Data-driven learning. In Beyond Concordance Lines [Studies in Corpus Linguistics, 102], ► pp. 177 ff.
Schaeffer‐Lacroix, Eva
Li, Yongyan & John Flowerdew
Liou, Hsien-Chin & Tzu-Wei Yang
Noguera-Díaz, Yolanda & Pascual Pérez-Paredes
Tatsenko, Nataliia, Vitalii Stepanov & Hanna Shcherbak
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
