Article published In: International Journal of Corpus Linguistics
Vol. 21:1 (2016) ► pp.105–115
WordSkew
Linking corpus data and discourse structure
Published online: 31 March 2016
https://doi.org/10.1075/ijcl.21.1.05bar
https://doi.org/10.1075/ijcl.21.1.05bar
In this article, I provide a brief introduction to the operation and motivation behind the text analysis tool WordSkew. This program, currently available for Windows, is a variant of a typical concordance program. The distinguishing feature of the software is that it allows the user to specify the units of discourse and apposite ways of segmenting the discourse. The results of a search query are then given with respect to each segment. For example, sentences might be divided into ten segments (based on word counts) and the frequency of the search term is then provided for each segment. This process is repeated as required for other textual units.
Keywords: tools, discourse structure, concordance, text analysis, distribution
References (13)
. (2004) Software for corpus access and analysis. In J. Sinclair (Ed), How to Use Corpora in Language Teaching (pp. 204–221). Amsterdam, Netherlands: John Benjamins.
Hoey, M., & O’Donnell, M.B. (2015). Examining associations between lexis and textual position in hard news stories, or according to a study by.... In Groom, N., Charles, M., & John, S. (Eds.), Corpora, Grammar and Discourse. In Honour of Susan Hunston (pp. 117–144). Amsterdam: John Benjamins.
Mahlberg, M. (2009). Local textual functions of move in newspaper story patterns. In U. Römer & R. Schulze (Eds.), Exploring the Lexis-Grammar Interface (pp. 265–287). Amserdam, Netherlands: John Benjamins.
. (2015). Corpus stylistics. In V. Sotirova (Ed.), The Bloomsbury Companion to Stylistics (pp. 139–156). London: Bloomsbury.
Mahlberg, M., & O’Donnell, M.B. (2008). A fresh view of the structure of hard news stories. In S. Neumann & E. Steiner (Eds.), Online Proceedings of the 19th European Systemic Functional Linguistics Conference and Workshop. Retrieved from [URL] (last accessed December 2015).
Mahlberg, M., & Smith, C. (2012). Dickens, the suspended quotation and the corpus. Language and Literature, 21(1), 51–65.
Rayson, P. (2009). Wmatrix: A web-based corpus processing environment. Computing Department, Lancaster University. Available at [URL] (last accessed December 2015).
Römer, U., & O’Donnell, M.B. (2010, May).
Positional variation of n-grams and phrase-frames in a new corpus of proficient student writing. Paper presented at
the ICAME 31 Conference
, Giessen, Germany.
Cited by (8)
Cited by eight other publications
王, 晨缘
Egbert, Jesse & Michaela Mahlberg
Dong, Jihua & Louisa Buckingham
2018. The textual colligation of stance phraseology in cross-disciplinary academic discourse. International Journal of Corpus Linguistics 23:4 ► pp. 408 ff.
Jeaco, Stephen
2017. Concordancing lexical primings. In Lexical Priming [Studies in Corpus Linguistics, 79], ► pp. 274 ff.
Mahlberg, Michaela, Peter Stockwell, Johan de Joode, Catherine Smith & Matthew Brook O'Donnell
[no author supplied]
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
