In: Corpus-based Research in Applied Linguistics: Studies in Honor of Doug Biber
Edited by Viviana Cortes and Eniko Csomay
[Studies in Corpus Linguistics 66] 2015
pp. 123–146
The challenge of constructing a reliable word list: An exploratory corpus-based analysis of lexical variability in introductory Psychology textbooks
Published online: 14 January 2015
https://doi.org/10.1075/scl.66.06mil
This study highlights the methodological challenges inherent in reliably capturing meaningful sets of vocabulary for instructional focus. An analysis of a 3.1 million-word corpus of introductory psychology textbooks suggests that this corpus, although comparatively large and thus presumably representative of the lexical variability in the target domain, was unable to capture a stable list of “important” words. The findings point to an issue requiring further investigation in corpus-based vocabulary research: the extent to which corpora, and the word lists based on them, reliably represent the lexical variability of their target domains.
Keywords: Corpus representativeness; lexical diversity; word list reliability
