Article published In: Computational Terminology
Edited by Ayla Rigouts Terryn and Patrick Drouin
[Terminology 31:1] 2025
► pp. 37–71
AI as a resource for the clarification of medical terminology
An analysis of its advantages and limitations
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Open Access publication of this article was funded through a Transformative Agreement with Pompeu Fabra University.
Published online: 23 May 2025
https://doi.org/10.1075/term.00083.vid
https://doi.org/10.1075/term.00083.vid
Abstract
Medical terminology is perceived as an obstacle for patients and family members to understand the medical message.
In this context, plain language advocates for making specialised knowledge accessible to citizens. This article puts the synergy
between medical terminology, plain language, and computational linguistics (a branch of Artificial Intelligence) on the table. Our
purpose is to determine if Generative AI applications, like ChatGPT, can assist in creating glossaries of terms with their
corresponding simple variants. For this, a relevant sub-field of medicine, cardiology, was taken as a case study, even though it
could be extrapolated to other sub-fields. Next, a glossary of key cardiology terms was created following a classic methodological
approach. Then, the most relevant phases of the process (terminology extraction, search for synonyms, and selection of the
clearest synonym) were reproduced using ChatGPT. The results of the comparative study between both approaches give a glimpse of
the extent to which this technology can be a useful resource for creating glossaries that can be employed for writing using plain
language.
Article outline
- 1.Introduction
- 2.Theoretical framework
- 3.Terminology work, plain language, and artificial intelligence
- 4.Methodology
- 4.1Creation of a classic glossary (GS)
- 4.2Reproduction of TW phases using ChatGPT
- 4.3Comparison between both methodological approaches
- 5.Results
- 5.1Results of the GS development
- 5.2Outcomes of the phases reproduced by ChatGPT and comparative analysis
- 6.Conclusions and future work
- Acknowledgments
- Notes
References
References (69)
Abt-Sacks, Analía, Susana Pablo-Hernando, Pedro Serrano-Aguilar, Enrique Fernández-Vega, and Roberto Martín Fernández. 2013. “Necesidades de información y uso de Internet en pacientes con cáncer de mama en
España” [Information needs and Internet use in breast cancer patients in
Spain]. Gaceta
Sanitaria, 27(3): 241–247.
Ananiadou, Sophia. 1994. “A
Methodology for Automatic Term Recognition”. In COLING 1994 Volume 2:
The 15th International Conference on Computational Linguistics.
Anthropic, Claude. 2024. [URL]
Araújo, Sílvia, and Micaela Aguiar. 2023. “Simplifying
Specialized Texts with AI: A ChatGPT-Based Learning
Scenario”. In Smart Innovation, Systems and Technologies.
Perspectives and Trends in Education and Technology, ed. by Mesquita, Annabela, António Abreu, Joao Vidal Carvalho, Cleuciliz Santana, and Cristina Helena Pinto de Mello, 3661: 599–609. Springer.
Arratibel, Patricia, Rafael Bengoa, and Gabriel Rafael. 2022. La situación de las enfermedades cardiovasculares en España. Principales factores de riesgo e impacto en la
gestión de las enfermedades crónicas [The situation of cardiovascular diseases in
Spain. Main risk factors and impact on chronic disease management]. Fundación
Alternativas, 211. ISBN 978-84-18677-45-8
Basagoiti, Ignasio (coord.). 2012. Alfabetitzación en salut: de la información a la acción [Health
literacy: from information to
action]. Valencia, Spain: Itaca. ISBN 978-84-695-5267-4.
Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language
Models are Few-Shot Learners.” arXiv [Cs.CL]. [URL]
Cabré, Maria Teresa. 2009. “La Teoría
Comunicativa de la Terminología, una aproximación lingüística a los términos” [The Communicative Theory of Terminology, a linguistic approach to terms]. Revue
française de linguistique
appliquée, XIV1:9–15.
Cabré, Maria Teresa, Ona Domènech-Bagaria, and Rosa Estopà. 2018. La terminologia avui: termes, textos i aplicacions [Terminology
today: terms, texts and applications]. Barcelona, Spain: Universitat Oberta de Catalunya.
Cabré, Maria Teresa. 2004. “La
terminología en la traducción especializada” [Terminology in specialized
translation]. In Manual de documentación y terminología
para la traducción especializada [Documentation and Terminology Manual for
Specialized Translation], ed. by Gonzalo García, Consuelo, and Valentín García Yebra, 89–126. Madrid: Arco.
. 1999. La terminología:
representación y comunicación. Elementos para una teoría de base comunicativa y otros artículos [Terminology: representation and communication. Elements for a theory of communicative basis and other
articles]. Barcelona, Spain: Institut Universitari de Lingüística Aplicada.
Cabré, M. Teresa and Iria da Cunha. 2022. “El papel de los corpus en la terminología: una mirada específica a la terminología del
español” [The role of corpora in terminology: a specific look at Spanish
terminology]. In Lingüística de corpus en
español / The Routledge Handbook of Spanish Corpus
Linguistics, ed. by Parodi, Giovanni, Pascual Cantos, and Lewis C. Howe, 190–204. London: Routledge. ISBN 9780367350123.
Callava, Marta and Roberto Silva. “Automating
term management activities with AI: a CASE STUDY of generative AI for metadata generation and intelligent terminology
extraction”. Paper presented at EAFT Terminology
Summit, Barcelona, November 17,
2023.
Campillos-Llanos, Leonardo, Rocío Bartolomé, and Ana R. Reinares, Sofía Puig, Ana Valverde-Mateos, and Adrian Capllonch. 2024. “Enhancing
the understanding of clinical trials with a sentence-level simplification
dataset”. Procesamiento de Lenguaje
Natural, 741: 31–43.
Campillos-Llanos, Leonardo, Ana R. Reinares, Sofía Puig, Ana Valverde-Mateos, and Adrian Capllonch. 2022. “Building
a comparable corpus and a benchmark for Spanish medical text simplification”. Procesamiento de
Lenguaje
Natural, 691: 189–196.
Da Cunha, Iria. 2020. El discurso del ámbito de la Administración. Una perspectiva lingüística [The discourse of the Administration field. A linguistic
perspective]. Granada, Spain: Comares. ISBN 978-84-9045-955-3
. 2024 (in
press). “Inteligencia artificial y lenguaje claro en
español” [Artificial intelligence and plain language in
Spanish]. In Lenguaje claro en Iberoamérica. Principios y
prácticas [Plain Language in Ibero-America. Principles and
practices], ed. by Retegui, Alejandro R., and Fernando. Thomson Reuters — La Ley Argentina.
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. 2019. “BERT:
Pre-training of Deep Bidirectional Transformers for Language
Understanding”. In Proceedings of the Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language
Technologies.
De Schryver, Gilles-Maurice. 2023. “Generative
AI and Lexicography: The Current State of the Art Using ChatGPT”. International Journal of
Lexicography, 36(4): 355–387.
Di Nunzio, Giorgio Maria, Stefano Marchesin, and Gianmaria Silvello. 2023. “A
systematic review of Automatic Term Extraction: What happened in 2022?”. Digital Scholarship in
the
Humanities, 38(1): i41–i47.
Domènech-Bagaria, Ona, Rosa Estopà, and Laia Vidal-Sabanés. 2020. “La comprensión de los informes médicos” [Understanding medical
reports]. In Los informes médicos: estrategias
lingüísticas para favorecer su comprensión [Medical reports: linguistic
strategies to promote their comprehension], coord. by Estopà, Rosa. Buenos Aires, Argentina: Delhospital. ISBN 978-987-8349-12-1.
Eloundou, Tyna, Sam Manning, Pamela Mishkin, and Daniel Rock. 2023. “GPTs
are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models”. Papers
2303.10130, arXiv.org.
Estopà, Rosa, and Maria Amor Montané. 2020. “Terminology
in medical reports: Textual parameters and their lexical indicators that hinder patient
understanding”. Terminology: International Journal of Theoretical and Applied Issues in
Specialized
Communication, 261:213–236.
Estopà, Rosa. 1999. Extracció de terminologia: elements per a la construcció d’un SEACUSE (Sistema d’Extracció Automàtica de
Candidats a Unitats de Significació Especialitzada) [Terminology extraction:
elements for the construction of a SEACUSE (System for the Automatic Extraction of Candidates to Units of Specialized
Significance)]. Barcelona: Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada. [Doctoral
Dissertation].
Faber, Pamela. 2009. “The Cognitive Shift in Terminology and Specialized Translation”. MonTI. Monografías de Traducción e Interpretación, 11: 107–134.
Fernández Silva, Sabela and Maribel Tercedor. 2019. “El desarrollo de la competencia sobre variación terminológica en estudiantes de traducción e interpretación. Una
experiencia de colaboración interuniversitaria” [The development of
terminological variation competence in translation and interpreting students. An experience of inter-university
collaboration]. RLA: Revista de lingüística teórica y
aplicada, 57(2): 137–163.
Gao, Tianyu, Adam Fisch, and Danqi Chen. 2021. “Making
pre-trained language models better few-shot learners”. In Proceedings
of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on
Natural Language Processing (Volume 1: Long Papers), edited by Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli, 3816–3830. Stroudsburg, PA: Association for Computational Linguistics.
Garbers, Samantha, and Mary Ann Chiasson. 2004. “Inadequate
functional health literacy in Spanish as a barrier to cervical cancer screening among immigrant Latinas in New York
City”. Preventing chronic
disease, 1(4), A07.
Generalitat de Catalunya. Departament de Salut, and TERMCAT. Centre de
Terminologia. 2022. Terminologia de la salut
cardiovascular [Cardiovascular health
terminology]. Barcelona, Spain: TERMCAT. Centre de Terminologia. [URL]
Google, Gemini. 2024. [URL]
Heinisch, Barbara. “ChatGPT
prompting in a university course on Terminology”. Paper presented
at EAFT Terminology
Summit, Barcelona, November 17,
2023.
International Plain Language
Federation. 2024. (online). Plain Language
definitions. [URL]
Jänich, Anna. 2023. Exploring
Automatic Synonym Generation for Lexical Simplification of Swedish Electronic Health
Record. Sweden: Linköping University, Department of Health, Medicine and Caring Sciences [Master’s thesis]. [URL]
Jefford, Michael, and Martin H. N. Tattersall. 2002. “Informing
and involving cancer patients in their own care”. The Lancet.
Oncology, 3(10): 629–637.
Kadam, Suvarna, and Vinay Vaidya. 2020. “Review
and analysis of zero, one and few shot learning
approaches”. In Advances in Intelligent Systems and
Computing, 100–112. Cham: Springer International Publishing.
Kickbusch, Ilona, Jürgen M. Pelikan, Franklin Apfel, and Agis D. Tsouros (eds.). 2013. Health
literacy. The solid facts. [Informe]. Copenhaguen (Dinamarca): World Health Organization. [URL]
Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H., and Kang, J. 2020. “BioBERT:
a pre-trained biomedical language representation model for biomedical text
mining”. Bioinformatics, 36(4), 1234–1240.
Lew, Robert. “ChatGPT
as a COBUILD lexicographer”. 2023. Humanities and Social Sciences
Communications, 101: 704.
Linares, Oscar, David T. Daly, and Gertrude A. Daly. 2017. Plain
English for doctors and other medical scientists. New York: Oxford University Press.
Lommel, Arle. 2023. “Generative
AI makes terminology management more critical”. Paper presented
at EAFT Terminology
Summit, Barcelona, November 17,
2023.
Łukasik, Marek. 2023. “Corpus
linguistics and generative AI tools in term extraction: a case of Kashubian — a low-resource
language”. Applied Linguistics
Papers, 231: 34–45.
Lyu, Qing, Josh Tan, Michael E. Zapadka, Janardhana Ponnatapura, Chuang Niu, Kyle J. Myers, Ge Wang, and Christopher T. Whitlow. 2023. “Translating
radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and
potential”. Vis Comput Ind Biomed
Art, 6(1): 9.
Marrero, Yusney. 2016. SE-Term: sistema de extracción de términos híbrido independiente del
dominio [SE-Term: domain-independent hybrid term extraction
system]. Alicante, Spain: Universidad de Alicante. [Doctoral dissertation]. [URL]
McKean, Erin, and Will Fitzgerald. 2023. “The
ROI of AI in Lexicography”. In Proceedings of the 16th Conference of
the Asian Association for Lexicography: Lexicography, Artificial Intelligence, and Dictionary
Users, 10–20. Seoul: Yonsei University.
Montolío, Estrella, and Mario Tascón. 2020. El derecho a entender. La comunicación clara, la mejor defensa de la ciudadanía [The right to understand. Clear communication, the best defense of
citizenship]. Prodigioso Volcán, Catarata.
Muegge, Uwe. 2023. Terminology
extraction for translation and interpreting made
easy. ISBN: 979-8-9893043-1-8.
Nazar, Rogelio. 2016. “Distributional
analysis applied to terminology extraction: example in the domain of psychiatry in
Spanish”. Terminology: International Journal of Theoretical and Applied Issues in Specialized
Communication, 22(2): 142–170.
Nazar, Rogelio, and Maria Teresa Cabré. 2012. “Supervised
Learning Algorithms Applied to Terminology
Extraction”. In Proceedings of the 10th Terminology and Knowledge
Engineering Conference: New Frontiers in the Constructive Symbiosis of Terminology and Knowledge Engineering, TKE
2012, 209–217.
Nazar, Rogelio, Jorge Vivaldi, and Leo Wanner. 2012. “Automatic
Taxonomy Extraction for Specialized Domains Using Distributional Semantics”. Terminology:
International Journal of Theoretical and Applied Issues in Specialized
Communication, 18(2): 188–225.
OpenAI. ChatGPT. San Francisco: OpenAI, 2024. [URL]
OpenAI and Jessica Shieh. (2023). Best
practices for prompt engineering with OpenAI. [URL]
Paetzold, Gustavo, and Lucia Specia. 2017. “Lexical
Simplification with Neural Ranking”. In: Proceedings of the 15th
Conference of the European Chapter of the Association for Computa- tional
Linguistics, 21: 34–40. Valencia, Spain: Association for Computational Linguistics. [URL].
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., and Zettlemoyer, L. 2018. “Deep
contextualized word representations”. In Proceedings of the
Conference of the North American Chapter of the Association for Computational Linguistics: Human Language
Technologies.
Qiang, Jipeng, Yun Li, Yi Zhu, Yun-Hao Yuan, and Xindong Wu. 2021. “LSBert:
A Simple Framework for Lexical Simplification”. IEEE/ACM Transactions on Audio, Speech and
Language
Processing, 291: 3064–3076.
Real Academia Nacional de Medicina
Española (2021). Diccionario de términos médicos. [URL]
Ribera, Mireia, and Oliver Díaz (coords.). 2024. ChatGPT y educación universitaria [ChatGPT and university
education]. Barcelona: Ediciones Octaedro and Universitat de Barcelona. [URL]
Rigouts Terryn, Ayla, Véronique Hoste, and Els Lefever. 2021. “HAMLET:
Hybrid Adaptable Machine Learning Approach to Extract Terminology.” Terminology: International
Journal of Theoretical and Applied Issues in Specialized
Communication, 27(2): 254–293.
San Martín, Antonio. 2024. “What
Generative Artificial Intelligence Means for Terminological Definitions”.
Siu, Sai Cheong. 2023. “ChatGPT and GPT-4 for
Professional Translators: Exploring the Potential of Large Language Models in
Translation”. Computational Linguistics & Natural Language Processing
eJournal, 5(24).
Suñer, Rosa, and Manel Santiñà. 2014. “Alfabetització per a la salut: definicions, conceptes i la seva relació amb la
salut” [Health literacy: definitions, concepts and their relation to
health]. In Alfabetització per a la salut: estàndards i
recomanacions per als professionals de la salut [Health literacy: standards and
recommendations for health care professionals], ed. by Suñer, Rosa, and Manel Santiñà, 13–26. Girona, Spain: Documenta Universitaria.
Tercedor, Maribel (dir.). (nd). VARIMED
database. Universidad de Granada. [URL]
TERMCAT. Centre de Terminologia (2010). El Diccionari terminològic [Terminological
Dictionary]. Eumo Editorial / TERMCAT, Centre de Terminologia. ISBN 978-84-9766-394-6.
Tran, Hanh Thi Hong, Matej Martinc, Jaya Caporusso, Antoine Doucet, and Senja Pollak. 2023. “The
Recent Advances in Automatic Term Extraction: A Survey.” arXiv. [URL]
Vidal-Sabanés, Laia. 2021. “La Terminologia en els textos mèdics per a pacients: el cas d’una comunitat virtual de dones amb càncer de
mama” [Terminology in medical texts for patients: the case of a virtual
community of women with breast cancer]. Barcelona, Spaon: Universitat Pompeu Fabra. [Doctoral
dissertation] [URL]
Viñuales-Ferreiro, Susana, and Pistola, Sara. 2022. “Lenguaje claro y variación terminológica en textos de la Administración” [Clear language and terminology variation in government
texts]. In Lenguaje claro y tecnología en la
Administración [Clear language and technology in the
Administration], ed. by Da Cunha, Iria, 117–135. Granada, Spain: Comares. ISBN 978-84-1369-444-3.
Vivaldi, Jorge. 2002. “Extracción
de candidatos a término mediante la combinación de estrategias heterogéneas”. Procesamiento del
lenguaje natural, 281: 112.
Wang, Yaqing, Quanming Yao, James T. Kwok, and Lionel M. Ni. 2020. “Generalizing
from a Few Examples: A Survey on Few-Shot Learning”. ACM Computing
Surveys 53 (3): 1–34.
World Health
Organizacion. 2021. (online) Cardiovascular diseases
(CVDs). [URL]
Cited by (1)
Cited by one other publication
Rodríguez Salcedo, Eliana del Rocío , Doménica Jasiel Molina Hurtado, Yennifer Tatiana Morocho Sáez, Kerly Abigail Lema Vaca, Mathew Abraham Morales Alvarado, Martha Carolina Espinosa Rodríguez, Alex David Guangasi Lagua, Marlon Alejandro Centeno Chiguano & Bryan Henry Zamora Paredes
This list is based on CrossRef data as of 20 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
