LlamATE: Automated terminology extraction using large‑scale generative language models

Tran, Hanh Thi-Hong; González-Gallardo, Carlos-Emiliano; Doucet, Antoine; Pollak, Senja

doi:10.1075/term.00082.tra

Article published In: Computational Terminology
Edited by Ayla Rigouts Terryn and Patrick Drouin
[Terminology 31:1] 2025
► pp. 5–36

Get fulltext from our e-platform

Download PDF

Download EPUB

LlamATE

Automated terminology extraction using large‑scale generative language models

Hanh Thi-Hong Tran | ARKHN

Carlos-Emiliano González-Gallardo | University of Tours

Antoine Doucet | University of La Rochelle

Senja Pollak | Jožef Stefan International Postgraduate School | Jožef Stefan Institute

Published online: 23 May 2025

https://doi.org/10.1075/term.00082.tra

Abstract

Over the past decades, automatic term or terminology extraction (ATE), a natural language processing (NLP) task that aims to identify terms from specific domains by providing a list of candidate terms, has been challenging due to the strong influence of domain-specific differences on term definitions. Leveraging the advances of large-scale language models (LLMs), we propose LlamATE, a framework to verify the impact of domain specificity on ATE when using in-context learning prompts in open-sourced LLM-based chat models, namely Llama-2-Chat. We evaluate how well the LLM-based chat (e.g., using reinforcement learning with human feedback (RLHF)) models perform with different levels of domain-related information in the dominant language in NLP research (e.g., English) and other European languages (e.g., French, Slovene) from ACTER datasets, i.e., in-domain and cross-domain demonstrations with and without domain enunciation. Furthermore, we examine the potential of cross-lingual and cross-domain prompting to reduce the need for extensive data annotation of the target domain and language. The results demonstrate the potential of implicit in-domain learning where examples of the target domain are used as demonstrations for the prompts without specifying the domain of each example, and cross-lingual learning when knowledge is transferred from the dominant to lesser-represented European languages as for the data used to pre-train the LLMs. LlamATE also offers a valuable compromise by reducing the need for extensive data annotation, making it suitable for real-world applications where labeled corpora are scarce. The source code is publicly available at the following link: https://github.com/honghanhh/terminology2024.

Keywords: term extraction, LLMs, prompt engineering, in-context learning, Llama-2-chat, cross-domain, transfer learning, self-verification

Article outline

1.Introduction
2.Related work
- 2.1Machine learning approaches
- 2.2Neural approaches
3.Datasets
4.Methodology
- 4.1Large language models
- 4.2Architecture design
- 4.3Domain transfer
- 4.4Language transfer
- 4.5Postprocessing steps
- 4.6Self-verification
- 4.7Experiment settings
- 4.8Evaluation metrics
5.Results
- 5.1General observation
- 5.2Verification strategies comparison
- 5.3Monolingual vs. cross-lingual transfer comparison
- 5.4Environmental impact
6.Discussion
- 6.1The impact of term length
- 6.2Practical use of LLMs for lesser-represented languages
- 6.3Limitations
7.Ablations
- 7.1Model sizes and prompt’s output designs
- 7.2Optimal number of demonstrations
8.Conclusion
Notes
References

References (57)

References

Astrakhantsev, Nikita A., Denis G. Fedorenko, and D. Yu. Turdakov. 2015. “Methods for Automatic Term Recognition in Domain-Specific Text Collections: A Survey.” Programming and Computer Software 41 (6): 336–49.

Azé, Jérôme, Mathieu Roche, Yves Kodratoff, and Michèle Sebag. 2005. “Preference Learning in Terminology Extraction: A ROC-Based Approach.” arXiv preprint cs/0512050.

Bay, Matthias, Daniel Bruneß, Miriam Herold, Christian Schulze, Michael Guckert, and Mirjam Minor. 2021. “Term Extraction from Medical Documents Using Word Embeddings.” In 2020 6th IEEE CiSt, 328–33. IEEE.

Biemann, Chris, and Alexander Mehler. 2014. Text Mining: From Ontology Learning to Automated Text Processing Applications. Springer.

Bolshakova, Elena, Natalia Loukachevitch, and Michael Nokel. 2013. “Topic Models Can Improve Domain Term Extraction.” In European Conference on Information Retrieval, 684–87. Springer.

Cabré Castellví, M. Teresa, Rosa Estopa Bagot, and Jordi Vivaldi Palatresi. 2001. “Automatic Term Detection: A Review of Current Systems.” Recent Advances in Computational Terminology 21: 53–88.

Conrado, Merley, Thiago Pardo, and Solange Rezende. 2013. “A Machine Learning Approach to Automatic Term Extraction Using a Rich Feature Set.” In Proceedings of the 2013 NAACL HLT Student Research Workshop, 16–22. Atlanta, Georgia, June 2013. Association for Computational Linguistics. [URL]

Conrado, Merley da Silva, Ariani Di Felippo, Thiago Alexandre Salgueiro Pardo, and Solange Oliveira Rezende. 2014. “A Survey of Automatic Term Extraction for Brazilian Portuguese.” Journal of the Brazilian Computer Society 20 (1): 1–28.

Daille, Béatrice, Éric Gaussier, and Jean-Marc Langé. 1994. “Towards Automatic Extraction of Monolingual and Bilingual Terminology.” In COLING 1994 Volume 1: The 15th International Conference on Computational Linguistics.

Delaunay, Julien, Hanh Thi Hong Tran, Carlos-Emiliano González-Gallardo, Georgeta Bordea, Mathilde Ducos, Nicolas Sidere, Antoine Doucet, Senja Pollak, and Olivier De Viron. 2024. “CoastTerm: A Corpus for Multidisciplinary Term Extraction in Coastal Scientific Literature.” In International Conference on Text, Speech, and Dialogue, 97–109. Springer.

Dettmers, Tim, Artidoro Pagnoni, Ari Holtzman, and Luke Zettlemoyer. 2024. “QLoRA: Efficient Finetuning of Quantized LLMs.” Advances in Neural Information Processing Systems 361.

Ding, Ning, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie, Hai-Tao Zheng, and Zhiyuan Liu. 2021. “Few-NERD: A Few-Shot Named Entity Recognition Dataset.” arXiv preprint arXiv:2105.07464.

Drouin, Patrick. 2003. “Term Extraction Using Non-Technical Corpora as a Point of Leverage.” Terminology 9 (1): 99–115.

El-Kishky, Ahmed, Yanglei Song, Chi Wang, Clare R. Voss, and Jiawei Han. 2014. “Scalable Topical Phrase Mining from Text Corpora.” Proceedings of the VLDB Endowment 8 (3): 305–16.

Fedorenko, Denis, N. Astrakhantsev, and D. Turdakov. 2014. “Automatic Recognition of Domain-Specific Terms: An Experimental Evaluation.” Proceedings of the Institute for System Programming 26 (4): 55–72.

Foo, Jody, and Magnus Merkel. 2010. “Using Machine Learning to Perform Automatic Term Recognition.” In LREC 2010 Workshop on Methods for Automatic Acquisition of Language Resources and Their Evaluation Methods, 23 May 2010, Valletta, Malta, 49–54. European Language Resources Association.

Frantzi, Katerina T., Sophia Ananiadou, and Junichi Tsujii. 1998. “The C-Value/NC-Value Method of Automatic Recognition for Multi-Word Terms.” In International Conference on Theory and Practice of Digital Libraries, 585–604. Springer.

Guo, Biyang, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, and Yupeng Wu. 2023. “How Close Is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection.”

Han, Xiaowei, Lizhen Xu, and Feng Qiao. 2018. “CNN-BiLSTM-CRF Model for Term Extraction in Chinese Corpus.” In International Conference on Web Information Systems and Applications, 267–74. Springer.

Hazem, Amir, Mérieme Bouhandi, Florian Boudin, and Béatrice Daille. 2020. “TermEval 2020: TALN-LS2N System for Automatic Term Extraction.” In Proceedings of the 6th International Workshop on Computational Terminology, 95–100.

. 2022. “Cross-Lingual and Cross-Domain Transfer Learning for Automatic Term Extraction from Low-Resource Data.” In Proceedings of the Thirteenth Language Resources and Evaluation Conference, 648–662.

ISO. 2019. Terminology Work and Terminology Science–Vocabulary. ISO 1087.

Judea, Alex, Hinrich Schütze, and Sören Brügmann. 2014. “Unsupervised Training Set Generation for Automatic Acquisition of Technical Terminology in Patents.” In Proceedings of COLING 2014, The 25th International Conference on Computational Linguistics: Technical Papers, 290–300.

Kageura, Kyo, and Bin Umino. 1996. “Methods of Automatic Term Recognition: A Review.” Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication 3 (2): 259–89.

Karan, Mladen, Jan Šnajder, and Bojana Dalbelo Bašić. 2012. “Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian.” In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12), 657–62.

Kocón, Jan, Igor Cichecki, Oliwier Kaszyca, Mateusz Kochanek, Dominika Szydło, Joanna Baran, Julita Bielaniewicz, Marcin Gruza, Arkadiusz Janz, Kamil Kanclerz, Anna Kocón, Bartłomiej Koptyra, Wiktoria Mieleszczenko-Kowszewicz, Piotr Miłkowski, Marcin Oleksy, Maciej Piasecki, Łukasz Radlínski, Konrad Wojtasik, Stanisław Wóźniak, and Przemysław Kazienko. 2023. “ChatGPT: Jack of All Trades, Master of None.”

Kucza, Maren, Jan Niehues, Thomas Zenkel, Alex Waibel, and Sebastian Stüker. 2018. “Term Extraction via Neural Sequence Labeling: A Comparative Evaluation of Strategies Using Recurrent Neural Networks.” In INTERSPEECH, 2072–76.

Lang, Christian, Lennart Wachowiak, Barbara Heinisch, and Dagmar Gromann. 2021. “Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across Domains.” In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 3607–20.

Le, Ngoc Tan, and Fatiha Sadat. 2021. “Multilingual Automatic Term Extraction in Low-Resource Domains.” In The International FLAIRS Conference Proceedings 341.

Litvak, Marina, and Mark Last. 2008. “Graph-Based Keyword Extraction for Single-Document Summarization.” In Coling 2008: Proceedings of the Workshop Multi-Source Multilingual Information Extraction and Summarization, 17–24.

Ljubešić, Nikola, Tomaž Erjavec, and Darja Fišer. 2018. “KAS-Term and KAS-Biterm: Datasets and Baselines for Monolingual and Bilingual Terminology Extraction from Academic Writing.” Digital Humanities 71.

Maldonado, Alfredo, and David Lewis. 2016. “Self-Tuning Ongoing Terminology Extraction Retrained on Terminology Validation Decisions.” In Proceedings of The 12th International Conference on Terminology and Knowledge Engineering, 91–100.

Nugumanova, Aliya, Darkhan Akhmed-Zaki, Madina Mansurova, Yerzhan Baiburin, and Almasbek Maulit. 2022. “NMF-Based Approach to Automatic Term Extraction.” Expert Systems with Applications 1991: 117179.

Pavlopoulos, John, and Ion Androutsopoulos. 2014. “Aspect Term Extraction for Sentiment Analysis: New Datasets, New Evaluation Measures and an Improved Unsupervised Method.” In Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), 44–52.

Qasemizadeh, Behrang, and Siegfried Handschuh. 2014. “Evaluation of Technology Term Recognition with Random Indexing.” In Proceedings of the Ninth International Conference on Language Resources and Evaluation.

Radford, Alec, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, et al. 2019. “Language Models Are Unsupervised Multitask Learners.” OpenAI Blog 1 (8): 9.

Repar, Andraz, Vid Podpečan, Anže Vavpetič, Nada Lavrač, and Senja Pollak. 2019. “TermEnsembler: An Ensemble Learning Approach to Bilingual Term Extraction and Alignment.” Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 25 (1): 93–120.

Rigouts Terryn, Ayla, Veronique Hoste, Patrick Drouin, and Els Lefever. 2020. “TermEval 2020: Shared Task on Automatic Term Extraction Using the Annotated Corpora for Term Extraction Research (ACTER) Dataset.” In 6th International Workshop on Computational Terminology (COMPUTERM 2020), 85–94. European Language Resources Association (ELRA).

Rigouts Terryn, Ayla, Véronique Hoste, and Els Lefever. 2020a. “In No Uncertain Terms: A Dataset for Monolingual and Multilingual Automatic Term Extraction from Comparable Corpora.” Language Resources and Evaluation 54 (2): 385–418.

. 2020b. “HAMLET: Hybrid Adaptable Machine Learning Approach to Extract Terminology.” Terminology, 2021.

. 2022a. “D-terminer: Online Demo for Monolingual and Bilingual Automatic Term Extraction.” In Proceedings of the TERM21 Workshop, 33–40. Language Resources and Evaluation Conference (LREC 2022).

. 2022b. “Tagging Terms in Text: A Supervised Sequential Labelling Approach to Automatic Term Extraction.” Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication 28 (1): 157–89.

Tran, Hanh Thi Hong, Matej Martinc, Antoine Doucet, and Senja Pollak. 2022a. “Can Cross-Domain Term Extraction Benefit from Cross-Lingual Transfer?” In Discovery Science: 25th International Conference, DS 2022, Montpellier, France, October 10–12, 2022, Proceedings, 363–78. Springer.

Tran, Hanh Thi Hong, Matej Martinc, Andraz Pelicon, Antoine Doucet, and Senja Pollak. 2022b. “Ensembling Transformers for Cross-Domain Automatic Term Extraction.” In From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries: 24th International Conference on Asian Digital Libraries, ICADL 2022, Hanoi, Vietnam, November 30–December 2, 2022, Proceedings, 90–100. Springer.

Tran, Hanh Thi Hong, Matej Martinc, Jaya Caporusso, Antoine Doucet, and Senja Pollak. 2023. “The Recent Advances in Automatic Term Extraction: A Survey.” arXiv preprint arXiv:2301.06767.

Tran, Hanh Thi Hong, Carlos-Emiliano Gonzalez-Gallardo, Julien Delaunay, Antoine Doucet, and Senja Pollak. 2024a. “Is Prompting What Term Extraction Needs?” In Text, Speech, and Dialogue, edited by Elmar Nöth, Aleš Horák, and Petr Sojka, 17–29. Cham: Springer Nature Switzerland. ISBN 978-3-031-70563-2.

Tran, Hanh Thi Hong, Matej Martinc, Andraz Repar, Nikola Ljubešić, Antoine Doucet, and Senja Pollak. 2024b. “Can Cross-Domain Term Extraction Benefit from Cross-Lingual Transfer and Nested Term Labeling?” Machine Learning, 1–30.

Tran, Hanh Thi Hong, Matej Martinc, Antoine Doucet, and Senja Pollak. 2022c. “A Transformer-Based Sequence-Labeling Approach to the Slovenian Cross-Domain Automatic Term Extraction.” In Slovenian Conference on Language Technologies and Digital Humanities.

Utka, Andrius. 2020. “Automatic Extraction of Lithuanian Cybersecurity Terms Using Deep Learning Approaches.” In Human Language Technologies–The Baltic Perspective: Proceedings of the Ninth International Conference Baltic HLT 2020, vol. 328, 39. IOS Press.

Vintar, Špela. 2010. “Bilingual Term Recognition Revisited: The Bag-of-Equivalents Term Alignment Approach and Its Evaluation.” Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication 16 (2): 141–58.

Wang, Jiangyu, Chong Feng, Fang Liu, Xinyan Li, and Xiaomei Wang. 2023a. “Extract Then Adjust: A Two-Stage Approach for Automatic Term Extraction.” In CCF International Conference on Natural Language Processing and Chinese Computing, 236–47. Springer.

Wang, Rui, Wei Liu, and Chris McDonald. 2016. “Featureless Domain-Specific Term Extraction with Minimal Labelled Data.” In Proceedings of the Australasian Language Technology Association Workshop 2016, 103–12.

Wang, Xiao, Weikang Zhou, Can Zu, Han Xia, Tianze Chen, Yuansen Zhang, Rui Zheng, Junjie Ye, Qi Zhang, Tao Gui, et al. 2023. “InstructUIE: Multi-Task Instruction Tuning for Unified Information Extraction.” arXiv preprint arXiv:2304.08085.

Wolf, Petra, Ulrike Bernardi, Christian Federmann, and Sabine Hunsicker. 2011. “From Statistical Term Extraction to Hybrid Machine Translation.” In Proceedings of the 15th Annual Conference of the European Association for Machine Translation.

Yang, Lingpeng, Ji Donghong, Guodong Zhou, and Yu Nie. 2005. “Improving Retrieval Effectiveness by Using Key Terms in Top Retrieved Documents.” In European Conference on Information Retrieval, 169–84. Springer.

Yuan, Yu, Jie Gao, and Yue Zhang. 2017. “Supervised Learning for Robust Term Extraction.” In 2017 International Conference on Asian Language Processing (IALP), 302–5. IEEE.

Zhang, Ziqi, Jie Gao, and Fabio Ciravegna. 2018. “SemRE-Rank: Improving Automatic Term Extraction by Incorporating Semantic Relatedness with Personalised PageRank.” ACM Transactions on Knowledge Discovery from Data (TKDD) 12 (5): 1–41.

Cited by (1)

Cited by one other publication

Rakotomalala, Christiane, Jean-Marie Paillat, Frédéric Feder, Angel Avadí, Laurent Thuriès, Marie-Liesse Vermeire, Jean-Michel Médoc, Tom Wassenaar, Caroline Hottelart, Lilou Kieffer, Elisa Ndjie, Mathieu Picart, Jorel Tchamgoue, Alvin Tulle, Laurine Valade, Annie Boyer, Marie-Christine Duchamp & Mathieu Roche

2025. A lexicon obtained and validated by a data-driven approach for organic residues valorization in emerging and developing countries. Frontiers in Artificial Intelligence 8

This list is based on CrossRef data as of 20 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.