Computational linguistics can be used to uncover mysteries in text which are not always obvious to visual inspection. For example, the computer analysis of writing style can show who might be the true author of a text in cases of disputed authorship or suspected plagiarism. The theoretical… read more
This is a comprehensive guidebook to the quantitative methods needed for Corpus-Based Translation Studies (CBTS). It provides a systematic description of the various statistical tests used in Corpus Linguistics which can be used in translation research. In Part 1, Theoretical Explorations, the… read more
Of the commonly-used measures of lexical association or collocation strength, only some directly relate to statistical significance: the t-score, chi-squared, log-likelihood, the z-score and Fisher’s exact test. We describe each of these tests, and also describe a computer simulation by which we… read more
This paper investigates the stylistic profiles of the early English translations of Cao Xueqin’s masterwork Hongloumeng (The Story of the Stone or Dream of the Red Chamber 1791), by Bowra (1868–70), Joly (1892–3), and Giles (1885). Through a detailed comparison of early English translations of the… read more
There are a number of different ways to describe a single corpus. We consider how the frequencies of linguistic features may be quantified, such as in terms of their “average” occurrence, dispersion among text segments, and whether they follow the familiar “bell curve” characteristic of a normal… read more
In this paper we describe term extraction from full length journal articles in the domain of crop husbandry for the purpose of producing abstracts automatically. Initially, candidate terms are extracted which occur in one of a number of fixed lexical environments, as found by a system of contextual… read more