Article published in: International Journal of Corpus Linguistics
Vol. 5:2 (2000) ► pp. 147–178
Lexical Frequencies in a 300 Million Word Corpus of Australian Newspapers. Analysis and Interpretation
Published online: 30 May 2001
https://doi.org/10.1075/ijcl.5.2.04lei
Corpus linguistics, descriptive linguistics, sociolinguistics, and psycholinguistics all use corpora and generalise their findings beyond the samples contained in them. That raises the problem of the representativity of the database and of the methods applied in presenting findings. Although this paper originated in the context of the pluricentricity of English as seen in the lexis of mainstream Australian English (mAusE), it was inspired by the current debates about corpus methodology (Kretzschmar et al. 1987). It is based on a large newspaper corpus that extends over a period of six years. It studies the distribution patterns of a small set of lexical items that are derived from Aboriginal languages or relate to Aboriginal concerns. While there appears to be a fairly consistent, stable core, these items show significant differences in occurrence across the six-year period and across media outlets. That raises the questions of what a replicate study of these items (or of others) would find and whether a corpus can claim to be representative in the first place.
Cited by (5)
Qin, Melissa Xiaohui
LEITNER, GERHARD
HASHIM, AZIRAH & GERHARD LEITNER
Lee, EunJoo
Pauwels, Anne & Joanne Winter
2004. Generic pronouns and gender-inclusive language reform in the English of Singapore and the Philippines. Australian Review of Applied Linguistics 27:2 ► pp. 50 ff.
This list is based on CrossRef data as of 12 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
