Article published in: Graded Resources for Second and Foreign Language Learning
Edited by David Alfter and Thomas François
[ITL - International Journal of Applied Linguistics 175:1] 2024
pp. 46–76
Towards a graded lexical inventory of multi-word combinations
The distribution of collocations across Spanish CEFR levels
Published online: 26 February 2024
https://doi.org/10.1075/itl.22021.cub
Abstract
Collocations have become increasingly important in our understanding of foreign language learning. When it comes to setting vocabulary learning goals, concerns about how to address collocations still arise today. This article explores the distribution of collocations in L1 and L2 Spanish production with the ultimate goal of informing the design of graded lexical inventories of multi-word combinations. To do so, we examine three defining properties of collocations in L1 and L2 production data and across different levels of L2 proficiency: syntactic structure, semantic transparency, and strength of association. Results indicate an increase in collocational density and diversity, but isolated features of collocations fail to predict L2 proficiency. Findings suggest the need to evaluate collocation use at a high level of granularity.
Article outline
- On the notion of collocation
- The study
- Method
- Data
- Identification of collocations
- TIPUS: Test for the Identification of Phraseological Units
- Defining collocational features: Syntactic structure, semantic transparency, and strength of association
- Data analysis
- Results
- On the specific nature of collocations in L1 and L2 Spanish
- Syntactic structures of collocations in L1 and L2
- Semantic transparency of collocations in L1 and L2
- Strength of collocations in L1 and L2
- Additional insights on L2 Spanish preferred collocations across proficiency levels
- Discussion
- Acknowledgements
- Notes
