Article published In: The Mental Lexicon
Vol. 16:2/3 (2021) ► pp.362–396
Word order effect in collocation processing
Published online: 8 March 2022
https://doi.org/10.1075/ml.20022.vil
https://doi.org/10.1075/ml.20022.vil
Abstract
Collocations are words associated because of their frequent co-occurrence, which makes them predictable and leads
to facilitated processing. While there have been suggestions that collocations are stored as unanalysed chunks, other researchers
disagree. One of the arguments against holistic storage is the fact that collocations are not fixed phrases, for example, their
word order can vary. To explore whether reversed collocations retain the processing advantage that they have in their canonical
form, we conducted two primed lexical decision experiments: Experiment 1 in English, and Experiment 2 in Lithuanian, an
understudied language. We presented both forward and backward collocations and compared them to matched control phrases. We also
explored which collocational measure (phrasal frequency, MI, t-score, or ΔP) worked as the best predictor of
processing speed. We found a clear priming effect for both languages when collocations were presented in their forward form, which
is in line with previous research. There was no priming for the backward condition in English, but a priming effect for it in
Lithuanian, where the reversed word order is acceptable albeit marked. These results are not easily explained by holistic storage.
As far as collocational measures are concerned, they all seem to perform reasonably well, with none of them being clearly better
than the others.
Keywords: collocations, collocation processing, mental lexicon, word order
Article outline
- Processing advantage for collocation
- Question of holistic storage
- Directionality
- The present study
- Experiment 1
- Participants
- Stimuli
- Procedure
- Analysis
- Results
- Discussion
- Experiment 2
- Participants
- Stimuli
- Analysis
- Results
- Comparing different collocational measures
- Discussion
- General discussion
- Holistic storage
- Collocational measures
- Limitations
- Conclusions
- Acknowledgements
- Note
References
References (59)
Altenberg, B. (1998). On
the phraseology of spoken English: The evidence of recurrent
word-combinations. In A. P. Cowie (Ed.), Phraseology:
Theory, Analysis, and
Applications (pp. 101–122). Clarendon Press.
Baayen, R. H. (2008). Analyzing
Linguistic Data: A Practical Introduction to Statistics using R. Cambridge University Press.
Barfield, A., & Gyllstad, H. (2009). Introduction:
Researching L2 Collocation Knowledge and Development. In A. Barfield & H. Gyllstad (Eds.), Researching
collocations in another language: Multiple
interpretations (pp. 1–18). Palgrave Macmillan.
Barton, K. (2015). MuMIn:
Multi-Model Inference. R package (1.13.4) [Computer
software]. [URL]
Bates, D., Maechler, M., Bolker, B., & Walker, S. (2014). Lme4:
Linear mixed-effects models using Eigen and S4_. R
package (1.1–7) [Computer software]. [URL]>
Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman
grammar of spoken and written English. Longman.
Bielinskienė, A., Boizou, L., Bumbulienė, I., Kovalevskaitė, J., Krilavičius, T., Mandravickaitė, J., Rimkutė, E., & Vilkaitė-Lozdienė, L. (2019). Lietuvių
kalbos kolokacijų žodynas [elektroninis išteklius].
Bonk, W., & Healy, A. F. (2005). Priming
effects without semantic or associative links through collocation. 46th Annual Meeting of the
Psychonomic Society, Toronto, Canada.
Brezina, V., McEnery, T., & Wattam, S. (2015). Collocations
in context: A new perspective on collocation networks. International Journal of Corpus
Linguistics,
20
(2), 139–173.
Caldwell-Harris, C. L., & Morris, A. L. (2008). Fast
Pairs: A visual word recognition paradigm for measuring entrenchment, top-down effects, and subjective
phenomenology. Consciousness and
Cognition,
17
(4), 1063–1081.
Carrol, G., & Conklin, K. (2020). Is
All Formulaic Language Created Equal? Unpacking the Processing Advantage for Different Types of Formulaic
Sequences. Language and
Speech,
63
(1), 95–122.
Dahlmann, I., & Adolphs, S. (2007). Pauses
as an indicator of psycholinguistically valid multi-word expressions (MWEs)? MWE ’07
Proceedings of the Workshop on a Broader Perspective on Multiword
Expressions, Pages 49–56.
Durrant, P., & Doherty, A. (2010). Are
high-frequency collocations psychologically real? Investigating the thesis of collocational
priming. Corpus Linguistics and Linguistic
Theory,
6
(2), 125–155.
Ellis, N. C. (2006). Language
acquisition as rational contingency learning. Applied
Linguistics,
27
(1), 1–24.
Ellis, N. C., & Simpson-Vlach, R. (2009). Formulaic
language in native speakers: Triangulating psycholinguistics, corpus linguistics, and
education. Corpus Linguistics and Linguistic
Theory,
5
(1), 61–78.
Ervin, S. M. (1961). Changes
with Age in the Verbal Determinants of Word-Association. The American Journal of
Psychology,
74
(3), 361–372.
Ferraro, G., Nazar, R., Ramos, M. A., & Wanner, L. (2014). Towards
advanced collocation error correction in Spanish learner corpora. Language Resources and
Evaluation,
48
(1), 45–64.
Gablasova, D., Brezina, V., & McEnery, T. (2017). Collocations
in Corpus-Based Language Learning Research: Identifying, Comparing, and Interpreting the
Evidence. Language
Learning,
67
(S1), 155–179.
González, A. O., & Ramos, M. A. (2013). A
comparative study of collocations in a native corpus and a learner corpus of
Spanish. Procedia-Social and Behavioral
Sciences,
95
1, 563–570.
Gries, S. Th. (2013). 50-something years of work on
collocations: What is or should be next …. International Journal of Corpus
Linguistics,
18
(1), 137–166.
Gries, S. Th., & Ellis, N. C. (2015). Statistical
Measures for Usage-Based Linguistics. Language
Learning,
65
(S1), 228–255.
Handl, S. (2008). Essential
collocations for learners of English. Phraseology in Foreign Language Learning and
Teaching, 43–66.
Hunston, S. (2002). Pattern
grammar, language teaching, and linguistic variation. In R. Reppen, S. M. Fitzmaurice, & D. Biber (Eds.), Using
Corpora to Explore Linguistic
Variation (pp. 167–186). John Benjamins Publishing.
Hutchison, K. A. (2003). Is
semantic priming due to association strength or feature overlap? A microanalytic
review. Psychonomic Bulletin &
Review,
10
(4), 785–813.
Jiang, N., & Nekrasova, T. M. (2007). The
processing of formulaic sequences by second language speakers. The Modern Language
Journal,
91
(3), 433–445.
Jiang, S., Jiang, X., & Siyanova-Chanturia, A. (2020). The
processing of multiword expressions in children and adults: An eye-tracking study of
Chinese. Applied
Psycholinguistics,
41
(4), 901–931.
Kahan, T. A., Neely, J. H., & Forsythe, W. J. (1999). Dissociated
backward priming effects in lexical decision and pronunciation tasks. Psychonomic Bulletin
&
Review,
6
(1), 105–110.
Kiss, G. R., Milroy, C., & Piper, J. (1973). An
associative thesaurus of English and its computer analysis. In A. J. Aitken, R. W. Bailey, & N. Hamilton-Smith (Eds.), The
Computer and Literary Studies. University Press. [URL]
Kuznetsova, A., Brockhoff, P. B., & Bojesen Christensen, B. (2015). lmerTest:
Tests in Linear mixed effects models (2.0–29) [Computer
software]. [URL]>
Lemhöfer, K., & Broersma, M. (2012). Introducing
LexTALE: A quick and valid Lexical Test for Advanced Learners of English. Behavior Research
Methods,
44
(2), 325–343.
McConnell, K., & Blumenthal-Dramé, A. (2019). Effects
of task and corpus-derived association scores on the online processing of collocations. Corpus
Linguistics and Linguistic
Theory,
1
1(ahead-of-print).
McDonald, S. A., & Shillcock, R. C. (2003a). Low-level
predictive inference in reading: The influence of transitional probabilities on eye
movements. Vision
Research,
43
(16), 1735–1751.
(2003b). Eye
movements reveal the on-line computation of lexical probabilities during reading. Psychological
Science,
14
(6), 648–652.
Molinaro, N., Canal, P., Vespignani, F., Pesciarelli, F., & Cacciari, C. (2013). Are
complex function words processed as semantically empty strings? A reading time and ERP study of collocational complex
prepositions. Language and Cognitive
Processes,
28
(6), 1–27.
Peirce, J. W. (2007). PsychoPy –
Psychophysics software in Python. Journal of Neuroscience
Methods,
162
(1), 8–13.
R Core Team. (2013). R: A language and
environment for statistical computing. R Foundation for Statistical Computing. [URL]
Rastle, K., Harrington, J., & Coltheart, M. (2002). 358,534
nonwords: The ARC Nonword Database. The Quarterly Journal of Experimental Psychology. A, Human
Experimental
Psychology,
55
(4), 1339–1362.
Romero, F. (2019). Philosophy
of science and the replicability crisis. Philosophy
Compass,
14
(11), e12633.
Schmid, H.-J. (2017). Linguistic
entrenchment and its psychological foundations. In Entrenchment and
the psychology of language learning: How we reorganize and adapt linguistic
knowledge (pp. 435–452). De Gruyter Mouton.
Siyanova-Chanturia, A. (2015a). On
the ‘holistic’ nature of formulaic language: Corpus linguistics and Linguistic theory. Corpus
Linguistics and Linguistic
Theory,
11
(2), 285–301.
Siyanova-Chanturia, A., & Martinez, R. (2015). The
idiom principle revisited. Applied
Linguistics, 549–569.
Sonbul, S. (2015). Fatal
mistake, awful mistake, or extreme mistake? Frequency effects on off-line/on-line collocational
processing. Bilingualism: Language and
Cognition,
18
(03), 419–437.
Spina, S. (2010). The
Dici Project: Towards a Dictionary of Italian Collocations integrated with an online language learning
platform. ELexicography in the 21st Century: New Challenges, New
Applications, 2731.
Thompson-Schill, S. L., Kurtz, K. J., & Gabrieli, J. D. (1998). Effects
of semantic and associative relatedness on automatic priming. Journal of Memory and
Language,
38
(4), 440–458.
Underwood, G., Schmitt, N., & Galpin, A. (2004). The
eyes have it. An eye-movement study into the processing of formulaic
sequences. In N. Schmitt (Ed.), Formulaic
Sequences: Acquisition, Processing, and
Use (pp. 153–172). John Benjamins Publishing.
Utka, A.; Rimkutė, E.; Kovalevskaitė, J.; Bielinskienė, A.; Petkevičius, M.; Petrauskaitė, R., & Mikelionienė, J. (2017). Corpus of the Contemporary Lithuanian Language, CLARIN-LT digital library in the Republic of Lithuania, [URL].
Vilkaitė, L. (2016). Are
non-adjacent collocations processed faster? Journal of Experimental Psychology: Learning,
Memory, and
Cognition,
42
(10), 1632–1642.
Vilkaitė-Lozdienė, L. (2019). First steps towards the Lithuanian word association database. Taikomoji kalbotyra, 12, 226–258.
Wolter, B., & Gyllstad, H. (2011). Collocational
links in the L2 mental lexicon and the influence of L1 intralexical knowledge. Applied
Linguistics,
32
(4), 430–449.
(2013). Frequency
of input and L2 collocational processing. Studies in Second Language
Acquisition,
35
(03), 451–482.
Cited by (8)
Cited by eight other publications
Hong, Wei & Le Ma
Johnson, Tamara Marie & Simone Eveline Pfenninger
Li, Wanyin, Bene Bassetti & Steven Frisson
Pérez Serrano, Mercedes & Matías Hidalgo Gallardo
Yamagata, Satoshi, Gareth Carrol & Crayton Walker
This list is based on CrossRef data as of 27 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
