Article published in: ITL - International Journal of Applied Linguistics
Vol. 166:1 (2015), pp. 94–126
How much collocation knowledge do L2 learners have?
The effects of frequency and amount of exposure
Published online: 8 June 2015
https://doi.org/10.1075/itl.166.1.03fer
Many scholars believe that collocations are difficult for L2 learners to learn and use. However, some research suggests that learners often know more collocations than is commonly thought. This study tested 108 Spanish learners of English to measure their productive knowledge of 50 collocations, which varied in corpus frequency, t-score, and MI score. The participants achieved a mean score of 56.6% correct, suggesting that our learners knew a substantial number of collocations. Knowledge of the collocations correlated moderately with corpus frequency (.45), but also with everyday engagement with English outside the classroom, in activities such as reading, watching movies/TV, and social networking (composite correlation = .56). Everyday engagement also had a stronger relationship with collocation knowledge than did years of English study (.45).
Keywords: collocations, productive knowledge, frequency, exposure, acquisition
