Article published In: Terminology
Vol. 9:2 (2003) ► pp.221–246
A corpus comparison approach for terminology extraction
Published online: 4 February 2004
https://doi.org/10.1075/term.9.2.05chu
https://doi.org/10.1075/term.9.2.05chu
This article examines one of the possible approaches to identifying technical terms: a corpus comparison approach using the range and frequency of word forms. In order to identify terms using the corpus comparison approach, a ratio is used as a tool based on the comparative range and frequency of word forms between a technical corpus and a comparison corpus. A rating scale approach is used as the basis for evaluating the corpus comparison approach.
The analysis shows that the corpus comparison approach works reasonably well with around 86% overlap with the results from the rating scale approach. It also shows that the corpus comparison approach using word types is a reasonably simple and practical way of identifying terms.
Cited by (47)
Cited by 47 other publications
Xie, Tong, Yuwei Wan, Haoran Wang, Ina Østrøm, Shaozhou Wang, Mingrui He, Rong Deng, Xinyuan Wu, Clara Grazian, Chunyu Kit & Bram Hoex
Biel, Łucja & Hendrik J. Kockaert
Abulaish, Muhammad, Mohd Fazil & Mohammed J. Zaki
Gleason, Kelly & Maria R. Dahm
Prayogo, Nicholas, Ehsan Amjadian, Serena McDonnell & Muhammad Rizwan Abid
Vo, Chau, Tru Cao, Ngoc Truong, Trung Ngo & Dai Bui
2022. Automatic medical term extraction from Vietnamese clinical texts. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 28:2 ► pp. 299 ff.
Amjadian, Ehsan, Nicholas Prayogo, Serena McDonnell, Cathal Smyth & Muhammad Rizwan Abid
Kováříková, Dominika
Kováříková, Dominika
Kwong, Oi Yee
2021. User-driven assessment of commercial term extractors. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 27:2 ► pp. 179 ff.
Lim, Hyo Jung, Ji-Hyun Park & Min Ju Young
Rieder-Bünemann, Angelika, Julia Hüttner & Ute Smit
2019. Capturing technical terms in spoken CLIL. Journal of Immersion and Content-Based Language Education 7:1 ► pp. 4 ff.
Dahm, Maria
Ghazzawi, Nizar, Benoît Robichaud, Patrick Drouin & Fatiha Sadat
2017. Automatic extraction of specialized verbal units. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 23:2 ► pp. 207 ff.
Kim, Young Kwang
Drouin, Patrick
Fage‐Butler, Antoinette M. & Matilde Nisbeth Jensen
Lopes, Lucelene, Paulo Fernandes & Renata Vieira
Lopes, Lucelene, Paulo Fernandes & Renata Vieira
Nation, I.S.P.
Pérez, María José Marín
2016. Measuring the degree of specialisation of sub-technical legal terms through corpus comparison. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 22:1 ► pp. 80 ff.
Drouin, Patrick, Natalia Grabar, Thierry Hamon & Kyo Kageura
2015. Introduction to the Special Issue. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 21:2 ► pp. 139 ff.
Gaizauskas, Robert, Monica Lestari Paramita, Emma Barker, Marcis Pinnis, Ahmet Aker & Marta Pahisa Solé
2015. Extracting bilingual terms from the Web. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 21:2 ► pp. 205 ff.
Heylen, Kris & Dirk De Hertog
2015. Automatic Term Extraction. In Handbook of Terminology [Handbook of Terminology, 1], ► pp. 203 ff.
Lopes, Lucelene, Paulo Fernandes, Roger Granada & Renata Vieira
Rizzo, Camino Rea & María José Marín Pérez
Marín, María José
Marín, María José
2023. Automatic term recognition and legal language. In Handbook of Terminology [Handbook of Terminology, 3], ► pp. 511 ff.
Ittoo, Ashwin & Gosse Bouma
Santamaría-Pérez, María Isabel & José Joaquín Martínez-Egido
Lopes, Lucelene & Renata Vieira
Lopes, Lucelene & Renata Vieira
Nation, Paul & Averil Coxhead
Dahm, Maria R.
신동광
Liu, Xiao-Yue & Chunyu Kit
Rea Rizzo, Camino
Brunzel, Marko & Myra Spiliopoulou
Kida, Mitsuhiro, Masatsugu Tonoike, Takehito Utsuro & Satoshi Sato
Utsuro, Takehito, Masatsugu Tonoike, Satoshi Sato & Sadao Kurohashi
Nation, I.
Utsuro, Takehito, Mitsuhiro Kida, Masatsugu Tonoike & Satoshi Sato
Utsuro, Takehito, Mitsuhiro Kida, Masatsugu Tonoike & Satoshi Sato
Panunzi, A., M. Fabbri & M. Moneglia
L'Homme, Marie-Claude
[no author supplied]
This list is based on CrossRef data as of 6 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
