Article published In: International Journal of Corpus Linguistics
Vol. 22:4 (2017) ► pp.457–489
The English Grammar Profile of learner competence
Methodology and key findings
Available under the Creative Commons Attribution-NonCommercial (CC BY-NC) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Published online: 1 December 2017
https://doi.org/10.1075/ijcl.14086.oke
https://doi.org/10.1075/ijcl.14086.oke
Abstract
English Profile (EP) is an ongoing empirical exploration of learner English initiated by Cambridge University Press and Cambridge English, among others. EP aims to create a set of empirically-based descriptions of language competencies for English. ‘Reference Level Descriptors’ already exist as part of the Common European Framework of Reference (CEFR) but are intuitively derived and not designed for one specific language. The English Grammar Profile (EGP, www.englishprofile.org/english-grammar-profile) is a sub-project of EP which aims to profile learner competence in grammar. This paper details the rationale for the study and the methodology that was developed to investigate the Cambridge Learner Corpus to arrive at over 1,200 grammatical competence statements. Key findings which link to existing corpus-based second language acquisition work are also presented.
Article outline
- 1.Introduction
- 2.Corpus studies of learner language
- 3.Methodological issues in LCR
- 3.1Contrastiveness and representativeness
- 3.2Interspeaker variation and proficiency
- 4.The data
-
5.Methodology
- 5.1Establishing a grammar search inventory
-
5.2Developing an iterative process
- 5.2.1Establishing criteria
- 5.2.2Steps in the iterative process
- 5.3Exemplifying the iterative process using a criteria-based methodology
- 6.Discussion of initial findings
- 6.1Lexico-grammatical development and the ceiling effect
- 6.2Pragmatic development
- 7.Conclusion
- Appendices
- Appendix 1.Overview of “styles” and “formats” by level, as defined by the CLC customised version of Sketch Engine
- Appendix 2.Grammar items typically taught and tested in ELT syllabi
-
Appendix 3.ELT pedagogical grammars and courses surveyed as the basis for the list of “superordinate” categories and “subordinate” categories for investigation
- ELT Pedagogical grammar books
References
References (66)
Ädel, A. (2015). Variability in learner corpora. In Granger, S., Gilquin, G., & Meunier, F. (Eds.), The Cambridge Handbook of Learner Corpus Research (pp. 401–421). Cambridge: Cambridge University Press.
Anderson, C. (2007). The CEFR and the need for more research. Modern Language Journal, 91(4), 659–663.
Bardovi-Harlig, K., & Bastos, M. T. (2011). Proficiency, length of stay, and intensity of interaction, and the acquisition of conventional expressions in L2 pragmatics. Intercultural Pragmatics, 8(3), 347–384.
Burnard, L. (1995). Users’ Reference Guide to British National Corpus. Oxford: Oxford University Computing Services.
Burton, G. (2012). Corpora and coursebooks: Destined to be strangers forever? Corpora, 7(1), 91–108.
Buttery, P., & Caines, A. (2012). Normalising frequency counts to account for ‘opportunity of use’ in learner corpora. In Y. Tono, Y. Kawaguchi & M. Minegishi (Eds.), Developmental and Crosslinguistic Perspectives in Learner Corpus Research (pp. 187–204). Amsterdam/Philadelphia: John Benjamins.
Callies, M. (2015). Learner corpus methodology. In Granger, S., Gilquin, G., & Meunier, F. (Eds.), The Cambridge Handbook of Learner Corpus Research (pp. 35–55). Cambridge: Cambridge University Press.
Callies, M., & Zaytseva, E. (2013). The Corpus of Academic Learner English (CALE): A new resource for the study and assessment of advanced language proficiency. In S. Granger, G. Gilquin & F. Meunier (Eds.), Twenty Years of Learner Corpus Research: Looking back, Moving ahead. Corpora and Language in Use – Proceedings 1 (pp. 49–59). Louvain-la-Neuve: Presses Universitaires de Louvain.
Cambridge University Press (2015). English Grammar Profile. Cambridge: Cambridge University Press. Retrieved from [URL] (last accessed March 2016).
Capel, A. (2010). A1 – B2 Vocabulary: Insights and issues arising from the English Profile Wordlists project. English Grammar Profile Journal, 1(1), 1–11. Retrieved from [URL] (last accessed November 2014).
(2015). The English Vocabulary Profile. In J. Harrison & F. Barker (Eds.), English Profile in Practice, English Profile Studies, Vol. 51 (pp. 9–27). Cambridge: Cambridge University Press.
Carlsen, C. (2012). Proficiency level: A fuzzy variable in computer learner corpora. Applied Linguistics, 3(2), 161–183.
Carter, R., McCarthy, M., Mark, G., & O’Keeffe, A. (2011). English Grammar Today. Cambridge: Cambridge University Press.
Council of Europe (2001a). Common European Framework of Reference for Languages: Learning, Teaching, Assessment. Cambridge: Cambridge University Press.
(2001b). Common European Framework of Reference for Languages: Learning, Teaching, Assessment Structured Overview of all CEFR Scales. Strasbourg: Council of Europe. Retrieved from [URL] (last accessed November 2014).
Díez-Bedmar, M. B. (2012). The use of the Common European Framework of Reference for Languages to evaluate compositions in the English exam section of the University admission examination. Revista de Educación, 3571, 55–79.
Ellis, N. C., & Ferreira-Junior, F. (2009). Constructions and their acquisition: Islands and the distinctiveness of their occupancy. Annual Review of Cognitive Linguistics, 7(1), 188–221.
Ellis, N. C., O’Donnell, M., & Römer, U. (2015). Usage-based language learning. In B. MacWhinney & W. O’Grady (Eds.), The Handbook of Language Emergence (pp. 163–180). Hoboken, NJ: John Wiley & Sons, Inc.
Figueras, N., North, B., Takala, S., Verhelst, N., & Van Avermaet, P. (2005). Relating examinations to the Common European Framework: A manual. Language Testing, 22(3), 262–279.
Fulcher, G. (2004). Deluded by artifices? The Common European Framework and harmonization. Language Assessment Quarterly, 1(4), 253–266.
Gablasova, D., Brezina, V., McEnery, T., & Boyd, E. (2015). Epistemic stance in spoken L2 English: The effect of task and speaker style. Applied Linguistics, Advance online publication.
Gablasova, D., Brezina, V., & McEnery, T. (2017). Exploring learner language through corpora: Comparing and interpreting corpus frequency information. Language Learning, 67(s1), 130–154.
Granger, S. (1994). The learner corpus: A revolution in applied linguistics. English Today, 10(3), 25–33.
(2012). How to use foreign and second language learner corpora. In A. Mackey & S. Gass (Eds.), Research Methods in Second Language Acquisition: A Practical Guide (pp. 7–29). Malden: Blackwell.
Granger, S., Gilquin, G., & Meunier, F. (Eds.) (2015). The Cambridge Handbook of Learner Corpus Research. Cambridge: Cambridge University Press.
Green, A. (2012). English Profile Studies. Language Functions Revisited: Theoretical and Empirical Bases for Language Construct Definition across the Ability Range. Cambridge: UCLES/Cambridge University Press.
Harrison, J., & Barker, F. (Eds.) (2015). English Profile in Practice. English Profile Studies, Vol. 51. Cambridge: Cambridge University Press.
Hawkins, J. A., & Buttery, P. (2009). Using learner language from corpora to profile levels of proficiency: Insights from the English Profile Programme. In L. Taylor & C. J. Weir (Eds.), Language Testing Matters: Investigating the Wider Social and Educational Impact of Assessment (pp. 158–175). Cambridge: Cambridge University Press.
(2010). Criterial features in learner corpora: Theory and illustrations. English Profile Journal, 1(1).
Hawkins, J., & Filipović, L. (2012). Criterial Features in L2 English: Specifying the Reference Levels of the Common European Framework. Cambridge: Cambridge University Press.
Jones, N., & Saville, N. (2009). European language policy: Assessment, learning, and the CEFR. Annual Review of Applied Linguistics, 291, 51–63.
Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J. Rychlý, P., & Suchomel, V. (2014). The Sketch Engine: Ten years on. Lexicography, 1(1), 7–36.
Larsen-Freeman, D. (2006). The emergence of complexity, fluency, and accuracy in the oral and written production of five Chinese learners of English. Applied Linguistics, 27(4), 590–619.
(2014). Another step to be taken: Rethinking the end point of the interlanguage continuum. In Z. -H. Han & E. Tarone (Eds.), Interlanguage: Forty Years Later (pp. 203–220). Amsterdam/Philadelphia: John Benjamins.
(2015). Saying what we mean: Making a case for ‘language acquisition’ to become ‘language development’. Language Teaching, 48(4), 491–505.
Lave, J., & Wenger, E. (1991). Situated Learning: Legitimate Peripheral Participation. Cambridge: Cambridge University Press.
Littlejohn, A. (1992). Why are ELT coursebooks the way they are? (Unpublished doctoral dissertation). Lancaster University, Lancaster, UK.
McCarthy, M. J. (1998). Spoken Language and Applied Linguistics. Cambridge: Cambridge University Press.
(2013). Corpora and the advanced level: Problems and prospects. English Australia Journal, 29(1), 39–49.
McEnery, T., Xiao, R., & Tono, Y. (2006). Corpus-Based Language Studies: An Advanced Resource Book. London: Routledge
Meunier, F. (2015). Developmental patterns in learner corpora. In Granger, S., Gilquin, G., & Meunier, F. (Eds.), The Cambridge Handbook of Learner Corpus Research (pp. 379–400). Cambridge: Cambridge University Press.
Milanovic, M. (2009). Cambridge ESOL and the CEFR. University of Cambridge ESOL Examinations Research Notes, 371, 2–5. Retrieved from [URL] (last accessed November 2014).
Milton, J., & Meara, P. (1995). How periods abroad affect vocabulary growth in a foreign language. ITL Review of Applied Linguistics, (107–08), 17–34.
Murakami, A. (2013). Cross-linguistic influence on the accuracy order of L2 English grammatical morphemes. In S. Granger, S. Gaëtanelle & F. Meunier (Eds.), Twenty Years of Learner Corpus Research. Looking back, Moving ahead: Proceedings of the First Learner Corpus Research Conference (LCR 2011) (Vol. 11) (pp. 325–334). Louvain: Presses universitaires de Louvain.
Myles, F. (2004). From data to theory: The over‐representation of linguistic knowledge in SLA. Transactions of the Philological Society, 102(2), 139–168
(2015). Second language acquisition theory and learner corpus research. In S. Granger, G. Gilquin & F. Meunier (Eds.), The Cambridge Handbook of Learner Corpus Research (pp. 309–331). Cambridge: Cambridge University Press.
Negishi, M., Tono, Y., & Fujita, Y. (2013). A validation study of the CEFR levels of phrasal verbs in the English Vocabulary Profile. English Profile Journal, 31, 1–16. Retrieved from [URL] (last accessed November 2014).
Osbourne, J. (2014). Multiple assessments of oral proficiency: Evidence from a collaborative platform. In P. Leclercq, A. Edmonds & H. Hilton (Eds.), Measuring L2 Proficiency: Perspectives from SLA (pp. 54–70). Bristol: Multilingual Matters.
O’Sullivan, B. (2011). Language testing. In J. Simpson (Ed.), The Routledge Handbook of Applied Linguistics (pp. 259–273). London: Routledge.
Pienemann, M. (1998). Developmental dynamics in L1 and L2 acquisition: Processability Theory and generative entrenchment. Bilingualism, 1(1), 1–20.
Simpson, R. C., Briggs, S. L., Ovens, J., & Swales, J. M. (2002). The Michigan Corpus of Academic Spoken English. Ann Arbor, MI: The Regents of the University of Michigan.
Tavakoli, P. (2009). Assessing L2 task performance: Understanding effects of task design. System, 37(3), 482–495.
Thewissen, J. (2013). Capturing L2 accuracy developmental patterns: Insights from an error tagged learner corpus. The Modern Language Journal, 97(S1), 77–101.
Tono, Y., & Díez-Bedmar, M. B. (2014). Focus on learner writing at the beginning and intermediate stages: The ICCI corpus. International Journal of Corpus Linguistics, 19(2), 163–177.
Tracy-Ventura, N., & Myles, F. (2015). The importance of task variability in the design of learner corpora for SLA research. International Journal of Learner Corpus Research, 1(1), 58–95.
Van Ek, J. A., & Trim, J. L. M. (1991a). Waystage 1990. Cambridge: Council of Europe/Cambridge University Press.
Wang, X. (2013). Grammatical Development of Chinese among Non-native Speakers: From a Processability Account. Cambridge: Cambridge Scholars. Retrieved from [URL] (last accessed April 2015).
Cited by (14)
Cited by 14 other publications
Gao, Zhao-Ming & Maya Rubin
Romero Muñoz, Eloy & Remy Decorte
2025. Exploring methodological issues in Applied Cognitive Linguistics teaching materials. Cognitive Linguistic Studies 12:1 ► pp. 23 ff.
Rohani, Siti, Yan Watequlis Syaifudin, Moch Yusuf Hermawan, Pramana Yoga Saputra & Achmad Suyono
Gaillat, Thomas
2022. Investigating the scopes of textual metrics for learner level discrimination and learner analytics. In Complexity, Accuracy and Fluency in Learner Corpus Research [Studies in Corpus Linguistics, 104], ► pp. 21 ff.
Gaillat, Thomas, Andrew Simpkin, Nicolas Ballier, Bernardo Stearns, Annanda Sousa, Manon Bouyé & Manel Zarrouk
Romero Muñoz, Eloy
Tam, Hugo Wing-Yu
2021. 「粵語語法學習概覽」的建構. Chinese as a Second Language (漢語教學研究—美國中文教師學會學報). The journal of the Chinese Language Teachers Association, USA 56:3 ► pp. 209 ff.
Ballier, Nicolas, Stéphane Canu, Caroline Petitjean, Gilles Gasso, Carlos Balhana, Theodora Alexopoulou & Thomas Gaillat
2020. Machine learning for learner English. International Journal of Learner Corpus Research 6:1 ► pp. 72 ff.
Berger, Armin
Burton, Graham
Pérez-Paredes, Pascual & María Belén Díez-Bedmar
2019. Certainty adverbs in spoken learner language. International Journal of Learner Corpus Research 5:2 ► pp. 253 ff.
[no author supplied]
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
