In:Learner Corpora in Language Testing and Assessment
Edited by Marcus Callies and Sandra Götz
[Studies in Corpus Linguistics 70] 2015
► pp. 13–34
The Marburg Corpus of Intermediate Learner English (MILE)
Published online: 9 April 2015
https://doi.org/10.1075/scl.70.01kre
https://doi.org/10.1075/scl.70.01kre
Over the last few decades, corpus linguistic research and methodology have
had a huge impact on the foreign language classroom, mainly as reference tools
but also by introducing new ways of (data-driven) learning into the classroom.
A fairly recent trend is the use of corpus-linguistic methods for the study of the
development of learner language, which has led to the compilation of a fairly
large number of learner corpora. Among other things, this has opened up new
avenues for language testing and assessment, e.g. in the context of the Common
European Framework of Reference for Languages (CEFR). However, although
learner corpora seem to lend themselves easily to such purposes, the role they
have actually played so far is rather minor in comparison to the huge influence
on other aspects of foreign language teaching (at least with regard to German
learners of English). The present chapter explores possible reasons and introduces
the Marburg Corpus of Intermediate Learner English (MILE) as a potential
resource for learner language analysis and for corpus-informed, -based and
-driven contributions to language testing and assessment.
References (41)
Atwell, E., Howarth, P. & Souter, C. 2003. The ISLE corpus: Italian and German spoken learners’ English. ICAME Journal 27: 5–18.
Barlow, M. 2005. Computer-based analysis of learner language. In Analysing Learner Language, R. Ellis & G.P. Barkhuizen (eds), 335–357. Oxford: OUP.
Braun, S. 2005. From pedagogically relevant corpora to authentic language learning contents. ReCALL 17(1): 47–64.
Callies, M. and Zaytseva, E. 2013. The Corpus of Academic Learner English (CALE) – A new resource for the assessment of writing proficiency in the academic register. Dutch Journal of Applied Linguistics 2(1): 126–132.
Council of Europe. 2001. Common European Framework of Reference for Languages: Learning, teaching, assessment. Cambridge: CUP.
Dagneaux, E., Denness, S., Granger, S., Meunier, F., Neff, J. & Thewissen, J. 2008. Error Tagging Manual. Version 1.3. Louvain-La-Neuve: Centre for English Corpus Linguistics.
Díaz-Negrillo, A. & Domingo-Fernández, J. 2006. Error tagging systems for learner corpora. RESLA 19: 83–102.
Gilquin, G., De Cock, S. & Granger, S. 2010. The Louvain International Database of Spoken English Interlanguage. Handbook and CD-ROM. Louvain-la-Neuve: Presses Universitaires de Louvain.
Granger, S. 1998. The computerized learner corpus: A versatile new source of data for SLA research. In Learner English on Computer, S. Granger (ed.), 3–18. London: Longman.
. 2002. A bird’s-eye view of computer learner corpus research. In Computer Learner Corpora, Second Language Acquisition and Foreign Language Teaching [Language Learning & Language Teaching 6], S. Granger, J. Hung & S. Petch-Tyson (eds), 3–33. Amsterdam: John Benjamins.
. 2008. Learner corpora. In Corpus Linguistics. An International Handbook, A. Lüdeling (ed.), 259–275. Berlin: Walter de Gruyter.
Granger, S., Dagneaux, E., Meunier, F. & Paquot, M. 2009. The International Corpus of Learner English, Version 2. Handbook and CD-ROM. Louvain-la-Neuve: Presses Universitaires de Louvain.
Gut, U. 2012. The LeaP corpus. A multilingual corpus of spoken learner German and learner English. In Multilingual Corpora and Multilingual Corpus Analysis [Hamburg Studies on Multilingualism 14], T. Schmidt & K. Wörner (eds), 3–23. Amsterdam: John Benjamins.
Hardie, A. 2014. Modest XML for Corpora: Not a standard, but a suggestion. ICAME Journal 38: 73–103.
Hasko, V. 2013. Capturing the dynamics of second language development via learner corpus research: A very long engagement. The Modern Language Journal 97(1): 1–10.
Izumi, E., Uchimoto, K. & Isahara, H. 2004. The NICT JLE Corpus: Exploiting the language learners’ speech database for research and education. International Journal of the Computer, the Internet and Management 12: 119–125.
Jäkel, O. 2010. The Flensburg English Classroom Corpus (FLECC): Sammlung authentischer Unterrichtsgespräche aus dem aktuellen Englischunterricht auf verschiedenen Stufen an Grund-, Haupt-, Real- und Gesamtschulen Norddeutschlands. Flensburg: Flensburg University Press.
Kämmerer, S. 2012. Interference in advanced English interlanguage: Scope, detectability and dependency. In Input, Process and Product: Developments in Teaching and Language Corpora, J. Thomas & A. Boulton (eds), 284–297. Brno: Masaryk University Press.
Kreyer, R. 2011. Review of Jäkel, O. (2010): The Flensburg English Classroom Corpus (FLECC). Flensburg: Flensburg University Press. Fremdsprachen Lehren und Lernen 40: 136–138.
Leech, G. 1998. Learner corpora: What they are and what can be done with them. In Learner English on Computer, S. Granger (ed.), xiv–xx. London: Longman.
Lüdeling, A. 2008. Mehrdeutigkeiten und Kategorisierung: Probleme bei der Annotation von Lernerkorpora. In Fortgeschrittene Lernervarietäten, M. Walter & P. Grommes (eds), 119–140. Tübingen: Niemeyer.
Lüdeling, A., Walter, M., Kroymann, E. & Adolphs, P. 2005. Multi-level error annotation in learner corpora. Proceedings of the Corpus Linguistics Conference Series 1(1): 105–115.
MacWhinney, B. 2000. The CHILDES project: Tools for Analyzing Talk, 3rd edn. Mahwah, NJ: Lawrence Erlbaum Associates.
Meunier, F. 2002. The pedagogical value of native and learner corpora in EFL grammar teaching. In Computer Learner Corpora, Second Language Acquisition and Foreign Language Teaching [Language Learning & Language Teaching 6], S. Granger, J. Hung & S. Petch-Tyson (eds), 119–141. Amsterdam: John Benjamins.
Meunier, F. & Littre, D. 2013. Tracking learners’ progress: Adopting a dual ‘corpus cum experimental data’ approach. The Modern Language Journal 97: 61–76.
Mukherjee, J. 2002. Korpuslinguistik und Englischunterricht. Eine Einführung. Frankfurt: Peter Lang.
Nesselhauf, N. 2004. Learner corpora and their potential for language teaching. In How to Use Corpora in Language Teaching [Studies in Corpus Linguistics 12], J. Sinclair (ed.), 125–152. Amsterdam: John Benjamins.
Nicholls, D. 2003. The Cambridge Learner Corpus: Error coding and analysis for lexicography and ELT. In Proceedings of the Corpus Linguistics 2003 Conference, D. Archer, P. Rayson, A. Wilson & T. McEnery (eds), 572–581. Lancaster: University of Lancaster.
Ortega, L. & Iberri-Shea, G. 2005. Longitudinal research in second language acquisition: Recent trends and future directions. Annual Review of Applied Linguistics 25: 26–45.
Ortega, L. & Byrnes, H. (eds). 2008. The Longitudinal Study of L2 Capacities. New York, NY: Routledge.
Pérez-Paredes, P. 2010. Corpus Linguistics and Language Education in perspective: Appropriation and the possibilities scenario. In Corpus Linguistics in Language Teaching, T. Harris & M. Moreno Jaén (eds), 53–73. Bern: Peter Lang.
Reznicek, M., Lüdeling, A., Krummes, C., Schwantuschke, F., Walter, M., Schmidt, K., Hirschmann, H. & Andreas, T. 2012. Das Falko-Handbuch. Korpusaufbau und Annotationen. Version 2.01. Berlin: Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin.
Römer, U. 2006. Pedagogical applications of corpora: Some reflections on the current scope and a wish list for future developments. Zeitschrift für Anglistik und Amerikanistik 54: 121–134.
. 2007. Learner language and the norms in native corpora and EFL teaching materials: A case study of English conditionals. In Anglistentag 2006 Halle. Proceedings, S. Volk-Birke & J. Lippert (eds), 355–363. Trier: Wissenschaftlicher Verlag Trier.
Sinclair, J. 1996. EAGLES. Preliminary Recommendations on Corpus Typology, [URL] (25 July 2014).
Tono, Y. 2003. Learner corpora: Design, development and applications. In Proceedings of the Corpus Linguistics 2003 Conference, D. Archer, P. Rayson, A. Wilson & T. McEnery (eds), 800–809. Lancaster: University of Lancaster.
. 2009. Variability and invariability in learner language. A corpus-based approach. In Corpus Analysis and Variation in Linguistics (Vol. 1) [Tokyo University of Foreign Studies 1], Y. Kawaguchi, M. Minegishi & J. Durand (eds), 67–82. Amsterdam: John Benjamins.
. 2012.
International Corpus of Crosslinguistic Interlanguage: Project overview and a case study on the acquisition of new verb co-occurrence patterns. In Developmental and Crosslinguistic Perspectives in Learner Corpus Research [Tokyo University of Foreign Studies 4], Y. Tono, Y. Kawaguchi & M. Minegishi (eds), 27–46. Amsterdam: John Benjamins.
Cited by (6)
Cited by six other publications
Pauls, Tobias
Szudarski, Paweł
Hoffmann, Tim
2022. Measuring lexical accuracy. In Complexity, Accuracy and Fluency in Learner Corpus Research [Studies in Corpus Linguistics, 104], ► pp. 159 ff.
Kawecki, Regis
Kreyer, Rolf
2021. Collocations in learner English. In Beyond Concordance Lines [Studies in Corpus Linguistics, 102], ► pp. 97 ff.
Kreyer, Rolf & Steffen Schaub
2018. The development of phrasal complexity in German intermediate learners of English. International Journal of Learner Corpus Research 4:1 ► pp. 82 ff.
This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
