Article published In: Terminology
Vol. 13:2 (2007) ► pp.225–248
Evaluation of terms and term extraction systems
A practical approach
Published online: 19 November 2007
https://doi.org/10.1075/term.13.2.06viv
https://doi.org/10.1075/term.13.2.06viv
Term extraction may be defined as a text mining activity whose main purpose is to obtain all the terms included in a text of a given domain. Since the eighties, and mainly due to the rapid scientific advances as well as the evolution of the communication systems, there has been a growing interest in obtaining the terms found in written documents. A number of techniques and strategies have been proposed for satisfying this requirement. At present it seems that term extraction has reached a maturity stage. Nevertheless, many of the systems proposed fail to qualitatively present their results, almost every system evaluates its abilities in an ad hoc manner (if any, many times). Often, the authors do not explain their evaluation methodology; therefore comparisons between different implementations are difficult to draw. In this paper, we review the state-of-the-art of term extraction systems evaluation in the framework of natural language systems evaluation. The main approaches are presented, with a focus on their limitations. As an instantiation of some ideas for overcoming these limitations, the evaluation framework is applied to YATE, a hybrid term extractor.
Keywords: term extraction, term extractor evaluation, evaluation
Cited by (20)
Cited by 20 other publications
Xu, Kang, Yifan Feng, Qiandi Li, Zhenjiang Dong & Jianxiang Wei
Malyuga, Elena N.
Gallego-Hernández, Daniel
2022. Extracción de fraseología especializada basada en corpus. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 35:1 ► pp. 294 ff.
Nugumanova, Aliya, Darkhan Akhmed-Zaki, Madina Mansurova, Yerzhan Baiburin & Almasbek Maulit
Kwong, Oi Yee
2021. User-driven assessment of commercial term extractors. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 27:2 ► pp. 179 ff.
Paletta, Francisco Carlos & José-Antonio Moreiro-González
Zhao, Ziyan, Li Zhang & Xiaoli Lian
Rigouts Terryn, Ayla, Véronique Hoste & Els Lefever
Pajić, Vesna, Staša Vujičić Stanković, Ranka Stanković & Miloš Pajić
PERIÑAN-PASCUAL, CARLOS
Oliver, Antoni
Heylen, Kris & Dirk De Hertog
2015. Automatic Term Extraction. In Handbook of Terminology [Handbook of Terminology, 1], ► pp. 203 ff.
Bernier-Colborne, Gabriel & Patrick Drouin
2014. Creating a test corpus for term extractors through term annotation. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 20:1 ► pp. 50 ff.
da Silva Conrado, Merley, Ariani Di Felippo, Thiago Alexandre Salgueiro Pardo & Solange Oliveira Rezende
Marín, María José
Vivaldi, Jorge & Horacio Rodriguez
Conrado, Merley S., Thiago A. S. Pardo & Solange O. Rezende
van der Plas, Lonneke, Jörg Tiedemann & Ismail Fahmi
[no author supplied]
[no author supplied]
This list is based on CrossRef data as of 6 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
