Article published In: International Journal of Corpus Linguistics
Vol. 23:2 (2018) ► pp.216–243
The academic English collocation list
A corpus-driven study
Published online: 5 October 2018
https://doi.org/10.1075/ijcl.16135.lei
https://doi.org/10.1075/ijcl.16135.lei
Abstract
The use of collocations plays an important role for the proficiency of ESL/EFL learners. Hence, educators and researchers have long
tried to identify collocations typical of either academic or general English and the challenges involved in learning them. This
paper proposes a comprehensive and type-balanced academic English collocation list (AECL). AECL is based on a large corpus of
academic English and was created to cover the types of collocations that will be most useful to ESL/EFL learners. AECL is the
result of an innovative research-based procedure that involves a five-step selection method. A comparison of the collocations on
AECL with those found in well-known collocation dictionaries of general English and on three existing academic English collocation
lists indicates that AECL indeed contains mainly academic rather than general English collocations. In addition, AECL is more
comprehensive with regard to the types of collocations that are relevant to learners.
Keywords: Academic English, collocations, general English, word list
Article outline
- 1.Introduction
- 2.Key issues in research on collocation definition and identification for language learning
- 2.1Collocation: Definition and challenges
- 2.2Collocation identification: Existing collocation dictionaries/lists
- 3.Methods
- 3.1Corpora
- 3.2The procedure of collocation extraction
- 3.2.1Node words for collocation extraction
- 3.2.2Types of collocation identified
- 3.2.3Syntactic dependency relation analysis
- 3.2.4Extraction and selection of collocations
- 3.2.5Manual checking
- 4.Results and discussion
- 4.1Descriptive statistics of the AECL
- 4.2Validation of the AECL
- 5.Conclusion: Pedagogical and research implications
- Acknowledgements
- Notes
References
References (49)
Ackermann, K., & Chen, Y. H. (2013). Developing the Academic Collocation List (ACL) – A corpus-driven and expert-judged approach. Journal of English for Academic Purposes, 12(4), 235–247.
Bartsch, S. (2004). Structural and Functional Properties of Collocations in English: A Corpus Study of Lexical and Pragmatic Constraints on Lexical Co-occurrence. Tübingen: Narr.
Benson, M., Benson, E., & Ilson, R. (1986). The BBI Combinatory Dictionary of English. Amsterdam/Philadelphia: John Benjamins.
(2009). The BBI Dictionary of English Word Combinations (3rd ed.). Amsterdam/Philadelphia: John Benjamins.
Biber, D., & Conrad, S. (1999). Lexical bundles in conversation and academic prose. In H. Hasselgard & S. Oksfjell (Eds.), Out of Corpora: Studies in Honor of Stig Johansson (pp. 181–190). Amsterdam: Rodopi.
Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman Grammar of Spoken and Written English. London: Longman.
Biskup, D. (1992). L1 influence on learners’ rendering of English collocations: A Polish/German empirical study. In P. J. L. Arnaud & H. Bejoint (Eds.), Vocabulary and Applied Linguistics (pp. 85–93). London: Macmillan.
Celce-Murcia, M. & Larsen-Freeman, D. (1999). The Grammar Book: An ESL/EFL Teacher’s Course. Boston, MA: Heinle & Heinle.
Chan, T. P., & Liou, H. C. (2005). Effects of web-based concordancing instruction on EFL students’ learning of verb-noun collocations. Computer Assisted Language Learning, 18(3), 231–251.
Chon, Y., & Shin, D. (2013). A corpus-driven analysis of spoken and written academic collocations. Multimedia-Assisted Language Learning, 16(3), 11–38.
Church, K. W., & Hanks, P. (1990). Word association, norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.
Church, K. W., Gale, W., Hanks, P., Hindle, R. & Moon, R. (1994). Lexical substitutability. In B. T. S. Atkins & A. Zampolli (Eds.), Computational Approaches to the Lexicon (pp. 153–177). Oxford: Oxford University Press.
Coffey, S. (2011). A new pedagogical dictionary of English collocations. International Journal of Lexicography, 24(3), 328–341.
Crossley, S., Salsbury, T., & McNamara, D. (2015). Assessing lexical proficiency using analytic ratings: A case for collocation accuracy. Applied Linguistics, 36(5), 570–590.
Davies, M. (2008-) The Corpus of Contemporary American English (1990–2012). Available online at [URL] (last accessed May 2018).
Diani, G. (2009). Reporting and evaluation in English book review articles: A cross-disciplinary study. In K. Hyland & G. Diani (Eds.), Academic Evaluation: Review Genres in University Settings (pp. 87–104). New York, NY: Palgrave.
Durrant, P. (2009). Investigating the viability of a collocation list for students of English for academic purposes. English for Specific Purposes, 28(3), 157–169.
Gardner, D., & Davies, M. (2014). A new academic vocabulary list. Applied Linguistics, 35(3), 305–327.
Granger, S. (1998). Prefabricated patterns in advanced EFL writing: Collocations and formulae. In A. Cowie (Ed.), Phraseology (pp. 145–160). Oxford: Oxford University Press.
Hausmann, F. J. (1984). Wortschatzlernen ist kollokationslernen. Praxis des Neusprachlichen Unterrichts, 31(1), 395–406.
Hill, J., & Lewis, M. (1997). LTP Dictionary of Selected Collocations. Hove: Language Teaching Publications.
Hyland, K. (2008). As can be seen: Lexical bundles and disciplinary variation. English for Specific Purposes, 27(1), 4–21.
Hyland, K., & Milton, J. (1997). Qualification and certainty in L1 and L2 students’ writing. Journal of Second Language Writing, 6(2), 183–205.
Laufer, B., & Waldman, T. (2011). Verb-noun collocations in second language writing: A corpus analysis of learners’ English. Language Learning, 61(2), 647–672.
Lea, D., Crowther, J., & Dignen, S. (2002). Oxford Collocations Dictionary for Students of English. Oxford: Oxford University Press.
Lewis, M. (Ed.) (2000). Teaching Collocation: Further Development in the Lexical Approach. Boston, MA: Heinle & Heinle.
Liu, D. (2010). Going beyond patterns: Involving cognitive analysis in the learning of collocations. TESOL Quarterly, 44(1), 4–30.
(2011). The most-frequently used English phrasal verbs in American and British English: A multi-corpus examination. TESOL Quarterly. 45(4), 661–688.
(2012). The most frequently-used multi-word constructions in academic written English: A multi-corpus study. English for Specific Purposes. 31(1), 25–35.
Liu, D., & Lei, L. (2018, preview published online). The appeal to political sentiment: An analysis of Donald Trump’s and Hilary Clinton’s speech themes and discourse strategies in the 2016 presidential election. Discourse, Context & Media.
Liu, D., & Zhong, S. (2016). L2 vs. L1 use of synonymy: An empirical study of synonym use/acquisition, Applied Linguistics, 37(2), 239–261.
Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., & McClosky, D. (2014). The Stanford CoreNLP Natural Language Processing Toolkit. Available at [URL] (last accessed July 2018).
Nesselhauf, N. (2003). The use of collocations by advanced learners of English and some implications for teaching. Applied Linguistics, 24(2), 223–242.
(2005). Collocations in a Learner Corpus. Amsterdam/Philadelphia: John Benjamins.
Pawley, A., & Syder, F. H. (1983). Two puzzles for linguistic theory: Nativelike selection and nativelike fluency. In J. C. Richards & R. W. Schmidt (Eds.), Language and Communication (pp. 191–226). London: Longman.
Schmitt, N. (2016). The PHRASal Expressions (PHRASE) List User’s Guide. Retrieved from [URL] (last accessed May 2016).
Simpson-Vlach, R., & Ellis, N. C. (2010). An academic formulas list: New methods in phraseology research. Applied Linguistics, 31(4), 487–512.
Siyanova, A., & Schmitt, N. (2008). L2 learner production and processing of collocation: A multi-study perspective. Canadian Modern Language Review, 64(3), 429–458.
Toutanova, K., Klein, D., Manning, C. D., & Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network (pp. 173–180). The 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (Volume 11), Association for Computational Linguistics, Edmonton, Canada.
Cited by (43)
Cited by 43 other publications
Takizawa, Kotaro & Shungo Suzuki
Chen, Fu
Dang, Thi Ngoc Yen & Stuart Webb
Ding, Chen, Barry Lee Reynolds, Csaba Z. Szabo & Griet Boone
González-Fernández, Beatriz
Guzzi, Eleonora
Huang, Ping-Yu
Li, Lingling, Hai Xu & Xian Zhang
Lu, Cailing & Averil Coxhead
Michta, Tomasz & Ana Frankenberg-Garcia
Yunjung (Yunie), Ku
Zhang, Yanyan & Ruoyan Cui
Arndt, Rebeca
He, Mengyu , Leng Hong ANG & Hajar Abdul Rahim
Su, Hang, Jun Ye & Naixing Wei
Su, Qi, Chen Gu & Pengyuan Liu
2024. Association measures for collocation extraction. International Journal of Corpus Linguistics 29:1 ► pp. 59 ff.
Söğüt, Sibel
Esfandiari, Rajab & Mohammad Ahmadi
He, Mengyu & Leng Hong Ang
Hu, Yiyang & Qingshun He
Nguyen, Thi My Hang & Averil Coxhead
2023. Evaluating multiword unit word lists for academic purposes. ITL - International Journal of Applied Linguistics 174:1 ► pp. 83 ff.
Xia, Detong, Yudi Chen & Hye K. Pae
Ballance, Oliver James
Deng, Yaochen & Dilin Liu
2022. A multi-dimensional comparison of the effectiveness and efficiency of association measures in collocation extraction. International Journal of Corpus Linguistics 27:2 ► pp. 191 ff.
Deng, Yaochen, Dilin Liu & Shuqiong Wu
Lin, Phoebe
Liu, Bin, Jing Wang & Le Sun
Meng, Fanqi, Yujie Zheng, Songbin Bao, Jingdong Wang & Shuaisong Yang
Wingrove, Peter
Albertson, Brendon
Rogers, James, Amanda Müller, Frank E. Daulton, Paul Dickinson, Cosmin Florescu, Gordon Reid & Tim Stoeckel
Siengsanoh, Boonyakorn
Skoufaki, Sophia & Bojana Petrić
Wongkhan, Prueksa & Atikhom Thienthong
Zhou, Qihong
Csomay, Eniko
2020. A corpus-based study of academic word use in EFL student
writing. In Advances in Corpus-based Research on Academic Writing [Studies in Corpus Linguistics, 95], ► pp. 9 ff.
Oakey, David
Zuppardi, Maria Carolina & Tony Berber Sardinha
2020. A multi-dimensional view of collocations in academic
writing. In Advances in Corpus-based Research on Academic Writing [Studies in Corpus Linguistics, 95], ► pp. 333 ff.
Green, Clarence
Green, Clarence
[no author supplied]
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
