Article published In: International Journal of Corpus Linguistics
Vol. 9:1 (2004) ► pp.69–81
The notion of a “lemma”
Headwords, roots and lexical sets
Published online: 29 April 2004
https://doi.org/10.1075/ijcl.9.1.04kno
https://doi.org/10.1075/ijcl.9.1.04kno
The notion of alemmais so familiar in corpus linguistics that it scarcely needs a formal definition. When a wordlist or a text is lemmatised, the process is apparently transparent, so that any observer can understand how the lemma relates to the original set or string of words. We shall argue in this paper that, on the contrary, the concept of lemma is not well defined, and is in need of a clear formal definition. The lemma is a fundamental concept in the processing of texts in at least some languages, a point we shall illustrate with respect to Arabic and Malay. It so happens that English lemmas are not typical of the general category, so that linguists who base their understanding of the lemma on English obtain a distorted view. It is essential to reverse the direction of argument, and to start with a general understanding of the lemma, and to consider English lemmas in the wider context.
Keywords: grammatical tags, headword, root, base form, Arabic, Malay, Asian languages, lemma
Cited by (15)
Cited by 15 other publications
Al-Otaibi, Ghuzayyil Mohammed
Weber, Natalie, Tyler Brown, Joshua Celli, McKenzie Denham, Hailey Dykstra, Rodrigo Hernandez-Merlin, Evan Hochstein, Pinyu Hwang, Nico Kidd, Diana Kulmizev, Hannah Morrison, Matty Norris & Lena Venkatraman
Perez-Cortes, Silvia & David Giancaspro
Ranfagni, Silvia, Monica Faraoni, Lamberto Zollo & Virginia Vannucci
Gagnon, Chantal, Pier-Pascale Boulanger & Esmaeil Kalantari
Gagnon, Chantal & Esmaeil Kalantari
Su, Hang
Kestemont, Mike, Guy de Pauw, Renske van Nie & Walter Daelemans
Wolff, Charlotte E., Halszka Jarodzka, Niek van den Bogert & Henny P. A. Boshuizen
Newman, John
2015. Low-level patterning of pronominal subjects and verb tenses in English. In Causation, Permission, and Transfer [Studies in Language Companion Series, 167], ► pp. 295 ff.
Brysbaert, Marc, Boris New & Emmanuel Keuleers
Kestemont, M., W. Daelemans & G. De Pauw
Dong-Young Lee
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
