Term recognition using corpora from different fields

Uchimoto, Kiyotaka; Sekine, Satoshi; Murata, Masaki; Ozaku, Hiromi; Isahara, Hitoshi

doi:10.1075/term.6.2.07uch

Article published in: Japanese Term Extraction
Kyo Kageura and Teruo Koyama
[Terminology 6:2] 2000
► pp. 233–256

Published online: 1 October 2001

https://doi.org/10.1075/term.6.2.07uch

We present the system we used in the term recognition competition, one of the subtasks organized by the NTCIR tmrec group, and we evaluate its term recognition results. We regard terms as lexical items characteristic of a field, with the following three features: (1) they appear frequently in documents of the target field; (2) they are not common words in the target field; and (3) they appear less frequently in the corpora of other fields. Our system uses corpora from different fields and exploits these features to recognize terms.
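The three features above can be illustrated with a minimal sketch. It is not the system described in the article: the function names, the use of document frequency as the frequency measure, the add-one smoothing, the ratio used as a score, and the caller-supplied `common_words` list are all assumptions made for illustration.

```python
from collections import Counter


def document_frequency(docs):
    """Count, for each word, how many documents contain it."""
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each word once per document
    return df


def score_candidates(target_docs, other_docs, common_words=frozenset()):
    """Score words by the three features in the abstract (illustrative only).

    target_docs / other_docs: lists of tokenized documents (lists of words).
    common_words: hypothetical stop list standing in for feature (2).
    """
    target_df = document_frequency(target_docs)
    other_df = document_frequency(other_docs)
    n_target = len(target_docs)
    n_other = len(other_docs)

    scores = {}
    for word, df in target_df.items():
        if word in common_words:
            continue  # feature (2): skip common words
        target_rate = df / n_target  # feature (1): frequent in the target field
        # feature (3): rare in other fields; add-one smoothing is an assumption
        other_rate = (other_df[word] + 1) / (n_other + 1)
        scores[word] = target_rate / other_rate
    return scores
```

Sorting the returned dictionary by score in descending order gives a ranked candidate list; a word that occurs in many target-field documents and in few documents from other fields ranks highest under this assumed scoring.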

We then analyze the differences between our term list and the manual candidates list produced by the NTCIR tmrec group. In this article we identify features that are important for automatic term recognition. Furthermore, through comparative experiments based on manual candidates, we establish the importance of indices in extracting a term list.

Keywords: field, term recognition, corpora, document frequency, term frequency

Cited by five other publications

Ishida, Youichi, Toshiyuki Shimizu & Masatoshi Yoshikawa

2015. A Keyword Recommendation Method Using CorKeD Words and Its Application to Earth Science Data. In Information Retrieval Technology [Lecture Notes in Computer Science, 9460], ► pp. 96 ff.

Ittoo, Ashwin & Gosse Bouma

2013. Term extraction from sparse, ungrammatical domain-specific documents. Expert Systems with Applications 40:7 ► pp. 2530 ff.

Kubo, Junko, Keita Tsuji & Shigeo Sugimoto

2010. Automatic Term Recognition Using the Corpora of the Different Academic Areas. Joho Chishiki Gakkaishi 20:1 ► pp. 15 ff.

Liu, Xiao-Yue & Chunyu Kit

2009. 2009 International Conference on Machine Learning and Cybernetics, ► pp. 3499 ff.

Horyu, Daisuke & Seishi Ninomiya

2007. Additional Selection of Extracted Terms for a Specific Area. Agricultural Information Research 16:2 ► pp. 52 ff.

This list is based on CrossRef data as of 6 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.