AI as a resource for the clarification of medical terminology: An analysis of its advantages and limitations

Vidal Sabanés, Laia; da Cunha, Iria

doi:10.1075/term.00083.vid

Article published In: Computational Terminology
Edited by Ayla Rigouts Terryn and Patrick Drouin
[Terminology 31:1] 2025
► pp. 37–71

Get fulltext from our e-platform

Download PDF

Download EPUB

AI as a resource for the clarification of medical terminology

An analysis of its advantages and limitations

Laia Vidal Sabanés | Pompeu Fabra University (UPF) | National Distance Education University (UNED)

Iria da Cunha | National Distance Education University (UNED)

Available under the Creative Commons Attribution (CC BY) 4.0 license.

For any use beyond this license, please contact the publisher at rights@benjamins.nl.

Open Access publication of this article was funded through a Transformative Agreement with Pompeu Fabra University.

Published online: 23 May 2025

https://doi.org/10.1075/term.00083.vid

Abstract

Medical terminology is perceived as an obstacle for patients and family members to understand the medical message. In this context, plain language advocates for making specialised knowledge accessible to citizens. This article puts the synergy between medical terminology, plain language, and computational linguistics (a branch of Artificial Intelligence) on the table. Our purpose is to determine if Generative AI applications, like ChatGPT, can assist in creating glossaries of terms with their corresponding simple variants. For this, a relevant sub-field of medicine, cardiology, was taken as a case study, even though it could be extrapolated to other sub-fields. Next, a glossary of key cardiology terms was created following a classic methodological approach. Then, the most relevant phases of the process (terminology extraction, search for synonyms, and selection of the clearest synonym) were reproduced using ChatGPT. The results of the comparative study between both approaches give a glimpse of the extent to which this technology can be a useful resource for creating glossaries that can be employed for writing using plain language.

Keywords: medical terminology, terminography, plain language, doctor-patient communication, computational linguistics, generative AI

Article outline

1.Introduction
2.Theoretical framework
3.Terminology work, plain language, and artificial intelligence
4.Methodology
- 4.1Creation of a classic glossary (GS)
- 4.2Reproduction of TW phases using ChatGPT
- 4.3Comparison between both methodological approaches
5.Results
- 5.1Results of the GS development
- 5.2Outcomes of the phases reproduced by ChatGPT and comparative analysis
6.Conclusions and future work
Acknowledgments
Notes
References

References (69)

References

Abt-Sacks, Analía, Susana Pablo-Hernando, Pedro Serrano-Aguilar, Enrique Fernández-Vega, and Roberto Martín Fernández. 2013. “Necesidades de información y uso de Internet en pacientes con cáncer de mama en España” [Information needs and Internet use in breast cancer patients in Spain]. Gaceta Sanitaria, 27(3): 241–247.

Ananiadou, Sophia. 1994. “A Methodology for Automatic Term Recognition”. In COLING 1994 Volume 2: The 15th International Conference on Computational Linguistics.

Anguita, Manuel P. 2009. Diccionario Lid Cardiología. Madrid, Spain: LID Editorial Empresarial.

Anthropic, Claude. 2024. [URL]

Araújo, Sílvia, and Micaela Aguiar. 2023. “Simplifying Specialized Texts with AI: A ChatGPT-Based Learning Scenario”. In Smart Innovation, Systems and Technologies. Perspectives and Trends in Education and Technology, ed. by Mesquita, Annabela, António Abreu, Joao Vidal Carvalho, Cleuciliz Santana, and Cristina Helena Pinto de Mello, 3661: 599–609. Springer.

Arratibel, Patricia, Rafael Bengoa, and Gabriel Rafael. 2022. La situación de las enfermedades cardiovasculares en España. Principales factores de riesgo e impacto en la gestión de las enfermedades crónicas [The situation of cardiovascular diseases in Spain. Main risk factors and impact on chronic disease management]. Fundación Alternativas, 211. ISBN 978-84-18677-45-8

Basagoiti, Ignasio (coord.). 2012. Alfabetitzación en salut: de la información a la acción [Health literacy: from information to action]. Valencia, Spain: Itaca. ISBN 978-84-695-5267-4.

Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language Models are Few-Shot Learners.” arXiv [Cs.CL]. [URL]

Cabré, Maria Teresa. 2009. “La Teoría Comunicativa de la Terminología, una aproximación lingüística a los términos” [The Communicative Theory of Terminology, a linguistic approach to terms]. Revue française de linguistique appliquée, XIV1:9–15.

Cabré, Maria Teresa, Ona Domènech-Bagaria, and Rosa Estopà. 2018. La terminologia avui: termes, textos i aplicacions [Terminology today: terms, texts and applications]. Barcelona, Spain: Universitat Oberta de Catalunya.

Cabré, Maria Teresa. 2004. “La terminología en la traducción especializada” [Terminology in specialized translation]. In Manual de documentación y terminología para la traducción especializada [Documentation and Terminology Manual for Specialized Translation], ed. by Gonzalo García, Consuelo, and Valentín García Yebra, 89–126. Madrid: Arco.

. 1999. La terminología: representación y comunicación. Elementos para una teoría de base comunicativa y otros artículos [Terminology: representation and communication. Elements for a theory of communicative basis and other articles]. Barcelona, Spain: Institut Universitari de Lingüística Aplicada.

Cabré, M. Teresa and Iria da Cunha. 2022. “El papel de los corpus en la terminología: una mirada específica a la terminología del español” [The role of corpora in terminology: a specific look at Spanish terminology]. In Lingüística de corpus en español / The Routledge Handbook of Spanish Corpus Linguistics, ed. by Parodi, Giovanni, Pascual Cantos, and Lewis C. Howe, 190–204. London: Routledge. ISBN 9780367350123.

Callava, Marta and Roberto Silva. “Automating term management activities with AI: a CASE STUDY of generative AI for metadata generation and intelligent terminology extraction”. Paper presented at EAFT Terminology Summit, Barcelona, November 17, 2023.

Campillos-Llanos, Leonardo, Rocío Bartolomé, and Ana R. Reinares, Sofía Puig, Ana Valverde-Mateos, and Adrian Capllonch. 2024. “Enhancing the understanding of clinical trials with a sentence-level simplification dataset”. Procesamiento de Lenguaje Natural, 741: 31–43.

Campillos-Llanos, Leonardo, Ana R. Reinares, Sofía Puig, Ana Valverde-Mateos, and Adrian Capllonch. 2022. “Building a comparable corpus and a benchmark for Spanish medical text simplification”. Procesamiento de Lenguaje Natural, 691: 189–196.

Da Cunha, Iria. 2020. El discurso del ámbito de la Administración. Una perspectiva lingüística [The discourse of the Administration field. A linguistic perspective]. Granada, Spain: Comares. ISBN 978-84-9045-955-3

. 2024 (in press). “Inteligencia artificial y lenguaje claro en español” [Artificial intelligence and plain language in Spanish]. In Lenguaje claro en Iberoamérica. Principios y prácticas [Plain Language in Ibero-America. Principles and practices], ed. by Retegui, Alejandro R., and Fernando. Thomson Reuters — La Ley Argentina.

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. 2019. “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

De Schryver, Gilles-Maurice. 2023. “Generative AI and Lexicography: The Current State of the Art Using ChatGPT”. International Journal of Lexicography, 36(4): 355–387.

Di Nunzio, Giorgio Maria, Stefano Marchesin, and Gianmaria Silvello. 2023. “A systematic review of Automatic Term Extraction: What happened in 2022?”. Digital Scholarship in the Humanities, 38(1): i41–i47.

Domènech-Bagaria, Ona, Rosa Estopà, and Laia Vidal-Sabanés. 2020. “La comprensión de los informes médicos” [Understanding medical reports]. In Los informes médicos: estrategias lingüísticas para favorecer su comprensión [Medical reports: linguistic strategies to promote their comprehension], coord. by Estopà, Rosa. Buenos Aires, Argentina: Delhospital. ISBN 978-987-8349-12-1.

Eloundou, Tyna, Sam Manning, Pamela Mishkin, and Daniel Rock. 2023. “GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models”. Papers 2303.10130, arXiv.org.

Estopà, Rosa, and Maria Amor Montané. 2020. “Terminology in medical reports: Textual parameters and their lexical indicators that hinder patient understanding”. Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication, 261:213–236.

Estopà, Rosa. 1999. Extracció de terminologia: elements per a la construcció d’un SEACUSE (Sistema d’Extracció Automàtica de Candidats a Unitats de Significació Especialitzada) [Terminology extraction: elements for the construction of a SEACUSE (System for the Automatic Extraction of Candidates to Units of Specialized Significance)]. Barcelona: Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada. [Doctoral Dissertation].

Faber, Pamela. 2009. “The Cognitive Shift in Terminology and Specialized Translation”. MonTI. Monografías de Traducción e Interpretación, 11: 107–134.

Fernández Silva, Sabela and Maribel Tercedor. 2019. “El desarrollo de la competencia sobre variación terminológica en estudiantes de traducción e interpretación. Una experiencia de colaboración interuniversitaria” [The development of terminological variation competence in translation and interpreting students. An experience of inter-university collaboration]. RLA: Revista de lingüística teórica y aplicada, 57(2): 137–163.

Gao, Tianyu, Adam Fisch, and Danqi Chen. 2021. “Making pre-trained language models better few-shot learners”. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), edited by Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli, 3816–3830. Stroudsburg, PA: Association for Computational Linguistics.

Garbers, Samantha, and Mary Ann Chiasson. 2004. “Inadequate functional health literacy in Spanish as a barrier to cervical cancer screening among immigrant Latinas in New York City”. Preventing chronic disease, 1(4), A07.

Generalitat de Catalunya. Departament de Salut, and TERMCAT. Centre de Terminologia. 2022. Terminologia de la salut cardiovascular [Cardiovascular health terminology]. Barcelona, Spain: TERMCAT. Centre de Terminologia. [URL]

Google, Gemini. 2024. [URL]

Heinisch, Barbara. “ChatGPT prompting in a university course on Terminology”. Paper presented at EAFT Terminology Summit, Barcelona, November 17, 2023.

International Plain Language Federation. 2024. (online). Plain Language definitions. [URL]

Jänich, Anna. 2023. Exploring Automatic Synonym Generation for Lexical Simplification of Swedish Electronic Health Record. Sweden: Linköping University, Department of Health, Medicine and Caring Sciences [Master’s thesis]. [URL]

Jefford, Michael, and Martin H. N. Tattersall. 2002. “Informing and involving cancer patients in their own care”. The Lancet. Oncology, 3(10): 629–637.

Kadam, Suvarna, and Vinay Vaidya. 2020. “Review and analysis of zero, one and few shot learning approaches”. In Advances in Intelligent Systems and Computing, 100–112. Cham: Springer International Publishing.

Kickbusch, Ilona, Jürgen M. Pelikan, Franklin Apfel, and Agis D. Tsouros (eds.). 2013. Health literacy. The solid facts. [Informe]. Copenhaguen (Dinamarca): World Health Organization. [URL]

Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H., and Kang, J. 2020. “BioBERT: a pre-trained biomedical language representation model for biomedical text mining”. Bioinformatics, 36(4), 1234–1240.

Lew, Robert. “ChatGPT as a COBUILD lexicographer”. 2023. Humanities and Social Sciences Communications, 101: 704.

Linares, Oscar, David T. Daly, and Gertrude A. Daly. 2017. Plain English for doctors and other medical scientists. New York: Oxford University Press.

Lommel, Arle. 2023. “Generative AI makes terminology management more critical”. Paper presented at EAFT Terminology Summit, Barcelona, November 17, 2023.

Łukasik, Marek. 2023. “Corpus linguistics and generative AI tools in term extraction: a case of Kashubian — a low-resource language”. Applied Linguistics Papers, 231: 34–45.

Lyu, Qing, Josh Tan, Michael E. Zapadka, Janardhana Ponnatapura, Chuang Niu, Kyle J. Myers, Ge Wang, and Christopher T. Whitlow. 2023. “Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential”. Vis Comput Ind Biomed Art, 6(1): 9.

Marrero, Yusney. 2016. SE-Term: sistema de extracción de términos híbrido independiente del dominio [SE-Term: domain-independent hybrid term extraction system]. Alicante, Spain: Universidad de Alicante. [Doctoral dissertation]. [URL]

McKean, Erin, and Will Fitzgerald. 2023. “The ROI of AI in Lexicography”. In Proceedings of the 16th Conference of the Asian Association for Lexicography: Lexicography, Artificial Intelligence, and Dictionary Users, 10–20. Seoul: Yonsei University.

Montolío, Estrella, and Mario Tascón. 2020. El derecho a entender. La comunicación clara, la mejor defensa de la ciudadanía [The right to understand. Clear communication, the best defense of citizenship]. Prodigioso Volcán, Catarata.

Muegge, Uwe. 2023. Terminology extraction for translation and interpreting made easy. ISBN: 979-8-9893043-1-8.

Nazar, Rogelio. 2016. “Distributional analysis applied to terminology extraction: example in the domain of psychiatry in Spanish”. Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication, 22(2): 142–170.

Nazar, Rogelio, and Maria Teresa Cabré. 2012. “Supervised Learning Algorithms Applied to Terminology Extraction”. In Proceedings of the 10th Terminology and Knowledge Engineering Conference: New Frontiers in the Constructive Symbiosis of Terminology and Knowledge Engineering, TKE 2012, 209–217.

Nazar, Rogelio, Jorge Vivaldi, and Leo Wanner. 2012. “Automatic Taxonomy Extraction for Specialized Domains Using Distributional Semantics”. Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication, 18(2): 188–225.

OpenAI. ChatGPT. San Francisco: OpenAI, 2024. [URL]

OpenAI and Jessica Shieh. (2023). Best practices for prompt engineering with OpenAI. [URL]

Paetzold, Gustavo, and Lucia Specia. 2017. “Lexical Simplification with Neural Ranking”. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computa- tional Linguistics, 21: 34–40. Valencia, Spain: Association for Computational Linguistics. [URL].

Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., and Zettlemoyer, L. 2018. “Deep contextualized word representations”. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

Qiang, Jipeng, Yun Li, Yi Zhu, Yun-Hao Yuan, and Xindong Wu. 2021. “LSBert: A Simple Framework for Lexical Simplification”. IEEE/ACM Transactions on Audio, Speech and Language Processing, 291: 3064–3076.

Real Academia Nacional de Medicina Española (2021). Diccionario de términos médicos. [URL]

Ribera, Mireia, and Oliver Díaz (coords.). 2024. ChatGPT y educación universitaria [ChatGPT and university education]. Barcelona: Ediciones Octaedro and Universitat de Barcelona. [URL]

Rigouts Terryn, Ayla, Véronique Hoste, and Els Lefever. 2021. “HAMLET: Hybrid Adaptable Machine Learning Approach to Extract Terminology.” Terminology: International Journal of Theoretical and Applied Issues in Specialized Communication, 27(2): 254–293.

San Martín, Antonio. 2024. “What Generative Artificial Intelligence Means for Terminological Definitions”.

Siu, Sai Cheong. 2023. “ChatGPT and GPT-4 for Professional Translators: Exploring the Potential of Large Language Models in Translation”. Computational Linguistics & Natural Language Processing eJournal, 5(24).

Suñer, Rosa, and Manel Santiñà. 2014. “Alfabetització per a la salut: definicions, conceptes i la seva relació amb la salut” [Health literacy: definitions, concepts and their relation to health]. In Alfabetització per a la salut: estàndards i recomanacions per als professionals de la salut [Health literacy: standards and recommendations for health care professionals], ed. by Suñer, Rosa, and Manel Santiñà, 13–26. Girona, Spain: Documenta Universitaria.

Tercedor, Maribel (dir.). (nd). VARIMED database. Universidad de Granada. [URL]

TERMCAT. Centre de Terminologia (2010). El Diccionari terminològic [Terminological Dictionary]. Eumo Editorial / TERMCAT, Centre de Terminologia. ISBN 978-84-9766-394-6.

Tran, Hanh Thi Hong, Matej Martinc, Jaya Caporusso, Antoine Doucet, and Senja Pollak. 2023. “The Recent Advances in Automatic Term Extraction: A Survey.” arXiv. [URL]

Vidal-Sabanés, Laia. 2021. “La Terminologia en els textos mèdics per a pacients: el cas d’una comunitat virtual de dones amb càncer de mama” [Terminology in medical texts for patients: the case of a virtual community of women with breast cancer]. Barcelona, Spaon: Universitat Pompeu Fabra. [Doctoral dissertation] [URL]

Viñuales-Ferreiro, Susana, and Pistola, Sara. 2022. “Lenguaje claro y variación terminológica en textos de la Administración” [Clear language and terminology variation in government texts]. In Lenguaje claro y tecnología en la Administración [Clear language and technology in the Administration], ed. by Da Cunha, Iria, 117–135. Granada, Spain: Comares. ISBN 978-84-1369-444-3.

Vivaldi, Jorge. 2002. “Extracción de candidatos a término mediante la combinación de estrategias heterogéneas”. Procesamiento del lenguaje natural, 281: 112.

Wang, Yaqing, Quanming Yao, James T. Kwok, and Lionel M. Ni. 2020. “Generalizing from a Few Examples: A Survey on Few-Shot Learning”. ACM Computing Surveys 53 (3): 1–34.

World Health Organizacion. 2021. (online) Cardiovascular diseases (CVDs). [URL]

Cited by (1)

Cited by one other publication

Rodríguez Salcedo, Eliana del Rocío , Doménica Jasiel Molina Hurtado, Yennifer Tatiana Morocho Sáez, Kerly Abigail Lema Vaca, Mathew Abraham Morales Alvarado, Martha Carolina Espinosa Rodríguez, Alex David Guangasi Lagua, Marlon Alejandro Centeno Chiguano & Bryan Henry Zamora Paredes

2025. Ética de la IA generativa en la formación legal universitaria. Prohominum 7:3 ► pp. 360 ff.

This list is based on CrossRef data as of 20 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.