Article published In: International Journal of Corpus Linguistics
Vol. 21:2 (2016) ► pp.139–164
The shapes of collocation
Published online: 8 September 2016
https://doi.org/10.1075/ijcl.21.2.01bak
https://doi.org/10.1075/ijcl.21.2.01bak
The tool GraphColl (Brezina et al. 2015) allows collocational networks to be identified within corpora, enabling corpus analysis to go beyond two-way collocation. This paper aims to illustrate the types of linguistic relationships that can appear when more than two words are considered, using graph theory to account for the different types of collocational “shapes” that can be formed within GraphColl networks. Using the reference corpus, the BE06, examples of different types of graphs were obtained and analysed in order to form an understanding of the sorts of relationships between words that occur in particular shapes. The analysis indicates that concepts from graph theory can be usefully integrated into corpus analysis of collocation as well as showing the potential for a more sophisticated understanding of the company that words keep.
Keywords: methods, GraphColl, graph, network, collocation
References (25)
Alonso, A., Millon, C., & Williams, G. (2011). Collocational networks and their application to an E-Advanced Learner’s Dictionary of Verbs in Science (DicSci). In I. Kosem & K. Kosem (Eds.), Electronic Lexicography in the 21st Century: New Applications for New Users: Proceedings of eLex 2011, Bled, 10-12 November 2011 (pp. 12–22).
Anthony, L. (2014). AntConc (Version 3.4.3) [Computer Software]. Tokyo: Waseda University. Available from [URL] (last accessed June 2016).
. (2009). The BE06 Corpus of British English and recent language change. International Journal of Corpus Linguistics, 14(3), 312–337.
Biber, D. Conrad, S., & Cortes, V. (2004). If you look at…: Lexical bundles in University teaching and textbooks. Applied Linguistics, 25(3), 371–405.
Brezina, V., McEnery, T., & Wattam, S. (2015). Collocations in context: A new perspective on collocation networks. International Journal of Corpus Linguistics, 20(2), 139–173.
Church, K.W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.
Durrant, P., & Doherty, A. (2010). Are high frequency collocations psychologically real? Investigating the thesis of collocational priming. Corpus Linguistics and Linguistic Theory, 6 (2), 125–155.
Eeg-Olofsson, M., & Altenberg, B. (1994). Discontinuous recurrent word combinations in the London-Lund Corpus. In U. Fries, G. Tottie & P. Schneider (Eds.), Creating and Using English Language Corpora. Papers from the 14th ICAME Conference (pp. 63–77). Amsterdam: Rodopi.
Gries, S. Th. (2013). 50-something years of work on collocations: What is or should be next… International Journal of Corpus Linguistics, 18(1), 137–166.
McEnery, T. (2006). Swearing in English: Bad Language, Purity and Power from 1586 to the Present. Abington: Routledge.
Phillips, M.K. (1983). Lexical macrostructure in science text (Unpublished doctoral dissertation). University of Birmingham, Birmingham, UK.
Phillips, M. (1985). Aspects of Text Structure: An Investigation of the Lexical Organisation of Text. Amsterdam: North-Holland.
Stubbs, M. (1995). Collocations and semantic profiles. Functions of Language, 2(1), 23–55.
Williams, G. (1998). Collocational networks: Interlocking patterns of lexis in a corpus of plant biology research articles. International Journal of Corpus Linguistics, 3(1), 151–171.
Cited by (40)
Cited by 40 other publications
Akinseye, Tolulope
Babicova, Ivana & Frazer Heritage
Ben Ghozlen, Boutheina & Mounir Triki
Bonsu, Emmanuel Mensah
Busso, Lucia & Ottavia Tordini
2025. How do media talk about the COVID-19 pandemic?. In COVID-19 [Metaphor in Language, Cognition, and Communication, 11], ► pp. 10 ff.
Palayon, Raymund T., Regie P. Amamio, Yenying Chongchit & Naruethai Chanthap
Yeh, Aiden
Zaikovskii, Mikhail
Jiménez-Navarro, Eva Lucía & Isabel Durán-Muñoz
2024. Collocations of fictive motion verbs in adventure tourism. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 37:2 ► pp. 371 ff.
Wang, Xiaomei, Andrew South, Brett Hashimoto & Clifton Farnsworth
Ben Ghozlen, Boutheina
Fitzsimmons-Doolan, Shannon & Jennifer Beseres Pollack
Khachan, Victor
Xodabande, Ismail, Mahmood Reza Atai, Mohammad R. Hashemi & Paul Thompson
Giacomini, Laura
Heritage, Frazer & Paul Baker
Mehl, Seth
Pérez, María José Marín & Ángela Almela
Uchihara, Takumi, Masaki Eguchi, Jon Clenton, Kristopher Kyle & Kazuya Saito
Bouvier, Gwen & Zhonghua Wu
Cantos, Pascual & Moisés Almela-Sánchez
Gauthier, Michael
Liu, Tanjun
2021. Data-driven learning. In Beyond Concordance Lines [Studies in Corpus Linguistics, 102], ► pp. 177 ff.
McGlashan, Mark
2021. Networked discourses of bereavement in online COVID-19 memorials. International Journal of Corpus Linguistics 26:4 ► pp. 557 ff.
McGlashan, Mark
Regan, John
Al Fajri, Muchamad Sholakhuddin
Tatsenko, Nataliia, Vitalii Stepanov & Hanna Shcherbak
Lehto, Anu
2019. The representation of citizens and monarchy in Acts of Parliament in 1800 to 2000. In Corpus-based Research on Variation in English Legal Discourse [Studies in Corpus Linguistics, 91], ► pp. 235 ff.
Baker, Paul
2018. Language, sexuality and corpus linguistics. Journal of Language and Sexuality 7:2 ► pp. 263 ff.
Brezina, Vaclav
Sánchez-Berriel, Isabel, Octavio Santana Suárez, Virginia Gutiérrez Rodríguez & José Pérez Aguiar
Wang, Feng (Robin) & Philippe Humblé
Kopytowska, Monika & Łukasz Grabowski
Lugea, Jane
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
