Article published In: Bridging the Methodological Divide: Linguistic and psycholinguistic approaches to formulaic language
Edited by Stefanie Wulff and Debra Titone
[The Mental Lexicon 9:3] 2014
► pp. 401–418
Words that go together
Measuring individual differences in native speakers’ knowledge of collocations
Published online: 23 January 2015
https://doi.org/10.1075/ml.9.3.02dab
https://doi.org/10.1075/ml.9.3.02dab
Although formulaic language has been studied extensively from both a linguistic and psycholinguistic perspective, little is known about the relationship between individual speakers’ knowledge of collocations and their linguistic experience, or between collocational knowledge and other aspects of linguistic knowledge. This is partly because work in these areas has been hampered by lack of an adequate instrument measuring speakers’ knowledge of collocations. This paper describes the development of such an instrument, the “Words that go together” (WGT) test, and some preliminary research using it. The instrument is a multiple choice test consisting of 40 items of varying frequency and collocation strength. The test was validated with a sample of 80 adult native speakers of English. Test-retest reliability was 0.80 and split-half reliability was 0.88. Convergent validity was established by comparing participants’ scores with measures expected to correlate with language experience (print exposure, education, and age) and other linguistic abilities (vocabulary size, grammatical comprehension); divergent validity was established by comparing test scores with nonverbal IQ. The results of the validation study are then used to compare speakers’ performance on the WGT with corpus-based measures of collocation strength (mutual information, z-score, t-score and simple frequency); however, no statistically reliable relationships were found.
References (45)
Acheson, D.J., Wells, J.B., & MacDonald, M.C. (2008). New and updated tests of print exposure and reading abilities in college students. Behavior Research Methods, 401, 278–289.
Arnon, I., & Snider, N. (2010). More than words: Frequency effects for multi-word phrases. Journal of Memory and Language, 621, 67–82.
Barlow, M., & Kemmer, S. (2000). Usage-based models of language. Cambridge: Cambridge University Press.
Bley-Vroman, R. (2002). Frequency in production, comprehension and acquisition. Studies in Second Language Acquisition, 241, 209–213.
Boers, F., & Lindstromberg, S. (2008). Structural elaboration by the sound (and feel) of it. In F. Boers & S. Lindstromberg (Eds.), Cognitive linguistic approaches to teaching vocabulary and phraseology (pp. 329–353). Berlin/New York: Mouton de Gruyter.
Bybee, J.L. (2013). Usage-based theory and exemplar representation. In T. Hoffman &
G. Trousdale (Eds.), The Oxford handbook of construction grammar (pp. 49–69). Oxford: Oxford University Press.
Chomsky, N. (1981/1993). Lectures on government and binding: The pisa lectures. Berlin: Walter de Gruyter.
Church, K.W., Gale, W., Hanks, P., & Hindle, D. (1991). Using statistics in lexical analysis. In
U. Zernik (Ed.), Lexical acquisition: Using on-line resources to build a lexicon (pp. 115–164.). Hillsdale: Lawrence Erlbaum Associates.
Church, K.W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 161, 22–29.
Conklin, K., & Schmitt, N. (2008). Formulaic sequences: Are they processed more quickly than nonformulaic language by native and nonnative speakers? Applied Linguistics, 291, 72–89.
Dąbrowska, E. (2009). Words as constructions. In V. Evans & S. Pourcel (Eds.), New directions in cognitive linguistics (pp. 201–223). Amsterdam: John Benjamins.
Dąbrowska, E. (in press). Individual differences in grammatical knowledge. In E. Dąbrowska & D. Divjak (Eds.), Handbook of cognitive linguistics. Berlin: De Gruyter Mouton.
. (in preparation). Grammar, vocabulary and collocations.
Dennis, S.F. (1965). The construction of a thesaurus automatically from a sample of text. In M. E. Stevens, V.E. Giuliano & L.B. Heilprin (Eds.), Proceedings of the symposium on statistical association methods for mechanized documentation (Vol. 2691, pp. 61–148).
Washington, DC: National Bureau of Standards Miscellaneous Pubcation.
Dunn, L.M., & Dunn, L.M. (1997). Peabody picture vocabulary test (3rd ed.). Circle Pines, MN: American Guidance Service.
Durrant, P., & Doherty, A. (2010). Are high-frequency collocations psychologically real? Investigating the thesis of collocational priming. Corpus Linguistics and Linguistic Theory, 61, 125–155.
Durrant, P.L. (2008). High frequency collocations and second language learning. (Ph. D.), University of Nottingham. Retrieved from [URL].
Ellis, N.C., & Simpson-Vlach, R. (2009). Formulaic language in native speakers: Triangulating psycholinguistics, corpus linguistics, and education. Corpus Linguistics and Linguistic Theory, 51, 61–78.
Erman, B., & Warren, B. (2000). The idiom principle and the open choice principle. Text, 201, 29–62.
Evert, S. (2004). Computational approaches to collocations. Retrieved 8 February 2014, from [URL].
. (2005). The statistics of word cooccurrences: Word pairs and collocations. (Ph.D.), University of Stuttgart. Retrieved from [URL].
Gries, S.T. (2011). Phonological similarity in multi-word symbolic units. Cognitive Linguistics, 221, 491–510.
Herbst, T. (1996). What are collocations: Sandy beaches or false teeth? English Studies, 771, 379–393.
Hodgson, J.M. (1991). Informational constraints on pre-lexical priming. Language and Cognitive Processes, 61, 169–205.
Langacker, R.W. (1988). A usage-based model. In B. Rudzka-Ostyn (Ed.), Topics in cognitive linguistics (pp. 127–161). Amsterdam: John Benjamins.
. (2000). A dynamic usage-based model. In M. Barlow & S. Kemmer (Eds.), Usage-based models of language (pp. 1–63). Stanford, CA: CSLI Publications.
Schmitt, N., Grandage, S., & Adolphs, S. (2004). Are corpus-derived recurrent clusters psycholinguistically valid? In N. Schmitt (Ed.), Formulaic sequences: acquisition, processing and use (pp. 127–151). Amsterdam: John Benjamins.
Shipley, W.C., Gruber, C.P., Martin, T.A., & Klein, A.M. (2009). Shipley-2 manual. Los Angeles: Western Psychological Services.
Siyanova-Chanturia, A., Conklin, K., & van Heuven, W.J.B. (2011). Seeing a phrase “time and again” matters: The role of phrasal frequency in the processing of multi-word sequences. Experimental Psychology: Learning, Memory, and Cognition, 371, 776–784.
Stubbs, M. (1995). Collocations and semantic profiles: On the cause of the trouble with quantitative methods. Functions of Language, 21, 1–33.
Tremblay, A., Derwing, B., Libben, G., & Westbury, C. (2011). Processing advantages of lexical bundles: Evidence from self-paced reading and sentence recall tasks. Language Learning, 611, 569–613.
Tremblay, A., & Tucker, B.V. (2011). The effects of N-gram probabilistic measures on the recognition and production of four-word sequences. The Mental Lexicon, 61, 302–324.
Ullman, M.T. (2006). The declarative/procedural model and the shallow structure hypothesis. Applied Psycholinguistics, 271, 97–105.
Verneau, M., van der Kamp, J., Savelsbergh, G.J., & de Loozem, M.P. (2014). Age and time effects on implicit and explicit learning. Experimental Aging Research, 401, 477–511.
Cited by (29)
Cited by 29 other publications
Johnson, Tamara Marie & Simone Eveline Pfenninger
McCarron, Sean P., Victoria A. Murphy & Kate Nation
Shadrova, Anna
Gedik, Tan Arda
Gedik, Tan Arda
Jiménez-Navarro, Eva Lucía & Isabel Durán-Muñoz
2024. Collocations of fictive motion verbs in adventure tourism. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 37:2 ► pp. 371 ff.
Schäfer, Roland & Ulrike Sayatz
Dąbrowska, Ewa & Tan Arda Gedik
Garibyan, Armine, Achim Schilling, Claudia Boehm, Alexandra Zankl & Patrick Krauss
Gibbs, Raymond W.
2022. Important challenges in the study of metaphors. In Figurative Thought and Language in Action [Figurative Thought and Language, 16], ► pp. 19 ff.
RICHES, Nick, Carolyn LETTS, Hadeel AWAD, Rachel RAMSEY & Ewa DĄBROWSKA
Vulchanova, Mila, Valentin Vulchanov, Antonella Sorace, Cristina Suarez-Gomez & Pedro Guijarro-Fuentes
McConnell, Kyla & Alice Blumenthal-Dramé
McConnell, Kyla & Alice Blumenthal-Dramé
Llompart, Miquel & Ewa Dąbrowska
Tachihara, Karina & Adele E. Goldberg
Liu, Yu
Rastelli, Stefano
Dunn, Jonathan
Dąbrowska, Ewa
Dąbrowska, Ewa
Herbst, Thomas
Schäfer, Roland
González-Márquez, Mónica, Michele I. Feist & Liane Ströbel
[no author supplied]
[no author supplied]
This list is based on CrossRef data as of 27 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
