Article published In: International Journal of Corpus Linguistics
Vol. 15:4 (2010) ► pp.429–473
Choosing the best tools for comparative analyses of texts
Published online: 29 October 2010
https://doi.org/10.1075/ijcl.15.4.01mol
https://doi.org/10.1075/ijcl.15.4.01mol
What measurements should linguists use when comparing texts written by different writers? We report aspects of a systematic evaluation of 381 different language measures derived from 200 analytic tools, carried out during the pilot for a study exploring genetic contributions to language variation. The measures covered lexis, structure, meaning, and discourse features, and were evaluated with a focus on capturing numerically the qualitative features that linguists consider central to differentiating one text from another. We review principles for selecting analytic tools, and the choices faced by the researcher in processing and analysing data. We then identify and demonstrate five of the measures, which between them provide a useful profile of different linguistic features, and note correlations with psychometric measures taken for each writer. We conclude with some caveats regarding general issues of validity and some indications about potential links between our work and research into authorship attribution for forensic purposes
Keywords: profiling, text analysis, quantitative analysis, lexis, variation
Cited by (5)
Cited by five other publications
Dagogo-Jack, Sokiente W. & Tim R. Samples
Bruyèl-Olmedo, Antonio & Maria Juan-Garau
Celaya, M. Luz
Fitzpatrick, Tess, David Playfoot, Alison Wray & Margaret J. Wright
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
