Corpus analysis
Table of contents
Nowadays, when linguists speak of a corpus, they usually mean a collection of computer-readable texts. The design of the collection as well as the nature of the texts may vary considerably from one corpus to another, but the texts, whether spoken or written, must have been produced in an actual context of language use. The utterances constituting the texts are never artificial linguistic objects produced under laboratory conditions for the sole purpose of linguistic research. The fact that corpora are computationally accessible and that they are repositories of language use, largely determines the nature of the linguistic research they are used for. First, corpus analysis nowadays cannot be carried out without the availability of advanced computational tools; secondly, it is naturally oriented towards the study of language use and therefore biased towards the study of specific languages, genres and language varieties.