Article published In: Japanese Term Extraction
Kyo Kageura and Teruo Koyama
[Terminology 6:2] 2000
► pp. 211–232
Extracting terms by a combination of term frequency and a measure of term representativeness
Published online: 1 October 2001
https://doi.org/10.1075/term.6.2.06his
https://doi.org/10.1075/term.6.2.06his
This article describes a method for extracting terms that combines term frequency with a novel measure of term representativeness (i.e., informativeness or domain specificity). The measure is defined as the normalized distance between the word distribution in the documents which contain the term and the word distribution in the whole corpus. The measure is particularly effective in discarding uninformative terms that frequently appear and has a well-defined threshold value for judging the representativeness of a term. We combined the new measure with term frequency and applied it to the extraction of terms from abstracts of artificial intelligence papers. This article introduces the measure and reports on its effectiveness in term extraction.
Cited by (8)
Cited by eight other publications
Suzuki, T., S. Kawamura, F. Yoshikane, K. Kageura & A. Aizawa
KOYAMA, Teruo
Suzuki, T.
Horyu, Daisuke & Seishi Ninomiya
Hattori, S., T. Tezuka & K. Tanaka
Wakaki, Hiromi, Tomonari Masada, Atsuhiro Takasu & Jun Adachi
Iwayama, Makoto & Yoshiki Niwa
This list is based on CrossRef data as of 6 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
