Article published in: International Journal of Corpus Linguistics
Vol. 9:2 (2004), pp. 321–348
Gravity Counts for the boundaries of collocations
Published online: 30 November 2004
https://doi.org/10.1075/ijcl.9.2.08dau
This paper compares several methods (MI, T-score, Dice) for the extraction of collocations and presents a new method called Gravity Counts. The methods are evaluated and compared by measuring the combinability and collocability of each pair of words within a moving span of three words in the corpus of “The Times” newspaper for the year 1995. The collocability of words is the basis for detecting collocational chains, i.e. frequent, recurrent, uninterrupted strings of word-forms with clear-cut boundaries found in the corpus. The collocational chains obtained with the different methods are compared, and their lexical, grammatical and semantic features are discussed.
Keywords: collocation, collocation boundaries, collocational chains, MI, T-score, Dice, Gravity Counts
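For orientation, the three established measures named in the abstract can be sketched in their common textbook forms. This is a minimal illustration under stated assumptions, not the paper's implementation: it scores only adjacent word pairs rather than pairs within a moving span of three words, the function name `association_scores` and the toy sentence are hypothetical, and Gravity Counts is left out because the abstract does not define it.

```python
import math
from collections import Counter


def association_scores(tokens):
    """Score each adjacent word pair with MI, t-score and Dice.

    Textbook definitions (an assumption; the paper may differ in detail):
      expected = f(x) * f(y) / N
      MI       = log2(f(x, y) / expected)
      t-score  = (f(x, y) - expected) / sqrt(f(x, y))
      Dice     = 2 * f(x, y) / (f(x) + f(y))
    """
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))  # adjacent pairs only

    scores = {}
    for (w1, w2), f_xy in bigrams.items():
        f_x, f_y = unigrams[w1], unigrams[w2]
        expected = f_x * f_y / n
        scores[(w1, w2)] = {
            "mi": math.log2(f_xy / expected),
            "t_score": (f_xy - expected) / math.sqrt(f_xy),
            "dice": 2 * f_xy / (f_x + f_y),
        }
    return scores


# Toy input, invented for illustration only.
tokens = "the prime minister met the prime minister of spain".split()
scores = association_scores(tokens)
for pair, values in sorted(scores.items()):
    print(pair, {name: round(v, 3) for name, v in values.items()})
```

On such a small sample the scores are not meaningful in themselves; the point is only that each measure combines the same three counts (pair frequency, the two word frequencies, corpus size) in a different way, which is why the measures can rank the same pairs differently.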
