Article published in: International Journal of Corpus Linguistics
Vol. 7:2 (2002) ► pp.265–282
Today's corpus linguistics
Some open questions
Published online: 4 April 2003
https://doi.org/10.1075/ijcl.7.2.06cer
The paper is concerned with problems of methodology. Against this background, the state of today's corpora is discussed, and some areas are identified as being far from satisfactory. The place of corpora in linguistics is briefly considered, with the suggestion that the structuralist tradition is the only one to use them extensively. Problems of annotation are raised, and approaches to it, both less successful (statistical) and more successful (rule-based), are discussed. Some of the most serious shortcomings that computational linguists should address are listed, such as the treatment of multi-word units or the status of language units in general. More generally, the implications and status of paradigmatics and syntagmatics are also discussed, with considerable critical attention paid to ontologies.
Keywords: computational linguistics, taxonomy, thesaurus, corpora, corpus linguistics, linguistics
