Article published In: Computational Terminology
Edited by Ayla Rigouts Terryn and Patrick Drouin
[Terminology 31:1] 2025
► pp. 72–109
When LMF and TMF meet
Towards a Unified Markup Framework (UMF)
Available under the Creative Commons Attribution-NonCommercial (CC BY-NC) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
This article was made Open Access under a CC BY-NC 4.0 license through payment of an APC by or on behalf of the authors.
Published online: 23 May 2025
https://doi.org/10.1075/term.00084.vez
https://doi.org/10.1075/term.00084.vez
Abstract
The interoperability of language resources is crucial for effective communication and data exchange across various
computational systems. In this context, the ISO/TC 37 standards, specifically the Lexical Markup Framework (LMF) and the
Terminological Markup Framework (TMF), play a vital role by providing a common framework for the modelling, representation, and
exchange of lexical and terminological data. The LMF has been deliberately aligned with TMF to facilitate close coordination
between the two standards. This paper explores the convergence between LMF and TMF, underscoring the need for a Unified Markup
Framework (UMF) that enhances interoperability and effective resource management. We propose a unified meta-model that integrates
these frameworks through comparative analysis and real-world examples, facilitating the development of advanced language
processing applications and multilingual lexicographic and terminology management. This study not only underscores the
opportunities and challenges of such coordination but also sets the groundwork for future research directions in the harmonisation
of lexicographic and terminology resources.
Article outline
- 1.Introduction
- 2.Background
- 2.1Overview of LMF
- 2.2Overview of TMF
- 3.Comparison between LMF and TMF
- 3.1Macrostructures
- 3.1.1LMF: Macrostructure of a Lexical Resource
- 3.1.2TMF: Macrostructure of a Termbase
- 3.2Microstructures
- 3.2.1LMF Microstructure of a Lexical Entry
- 3.2.2LMF: Microstructure of a Lexicographic Entry
- 3.2.3TMF: Microstructure of a Concept Entry
- 3.3Comparison: Lexicographic Entry vs Concept Entry
- 3.1Macrostructures
- 4.Proposal for a Unified Markup Framework (UMF)
- 5.Conclusions and future perspectives
- Notes
- Author statement
References
References (30)
Bellandi, Andrea, Giorgio Maria Di Nunzio, Silvia Piccini, and Federica Vezzani. 2023a. “From
TBX to Ontolex Lemon: Issues and Desiderata.” In Proceedings of the
2nd International Conference on Multilingual Digital Terminology Today (MDTT 2023), ed.
by Giorgio Maria Di Nunzio, Rute Costa, and Federica Vezzani, Lisboa, Portugal: CEUR (CEUR Workshop Proceedings). Available
at: [URL]
. 2023b. “The
Importance of Being Interoperable: Theoretical and Practical Implications in Converting TBX to
OntoLex-Lemon.” In Proceedings of the 4th Conference on Language,
Data and Knowledge. LDK 2023, ed. by Sara Carvalho et al., 646–651, Vienna, Austria: NOVA CLUNL, Portugal. Available
at: [URL]
Bosque-Gil, Julia, Jorge Gracia, Elena Montiel-Ponsoda, and Asunción Gómez-Pérez. 2018. “Models
to represent linguistic linked data.” Natural Language
Engineering 24(6): 811–859.
Branco, António, Maria Eskevich, Francesca Frontini, Jan Hajič, Erhard Hinrichs, Franciska de Jong, Pawel Kamocki, Alexander König, Krister Lindén, Alexander Constanza Navarretta, Maciej Piasecki, Stelios Piperidis, Olli Pitkänen, Kiril Simov, Inguna Skadiņa, Thorsten Trippel, andreas Witt, and Claus Zinn. 2023. “The
CLARIN infrastructure as an interoperable language technology platform for SSH and
beyond”. Language Resources and Evaluation. Available
at:
Caselli, Tommaso, and Johan Bos. 2023. “Investigating
interoperable event corpora: Limitations of reusability of resources and portability of
models”. Language Resources and
Evaluation 57(3): 1107–1137. Available
at:
Chen, Peter Pin-Shan. 1976. “The entity-relationship
model — toward a unified view of data.” ACM Transaction on Database
Systems 1(1): 9–36.
Cimiano, Philipp, Christian Chiarcos, John P. McCrae, and Jorge Gracia. 2020. Linguistic
Linked Data: Representation, Generation and Applications. Cham, Switzerland: Springer International Publishing. Available at:
Costa, Rute. 2013. “Terminology
and Specialised Lexicography: two complementary
domains”. Lexicographica 291: 29–42.
Eckle-Kohler, Judith, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, and Christian M. Meyer. 2012. “UBY-LMF
— A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in
ISO-LMF”. In Proceedings of the Eighth International Conference on
Language Resources and Evaluation (LREC’12), ed. by N. Calzolari, K. Choukri, T. Declerck, M. U. Doğan, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis, 275–282. European Language Resources Association (ELRA). Available at: [URL]
Francopoulo, Gil, Bel Nuria, George Monte, Calzolari Nicoletta, Monachini Monica, Pet Mandy, and Claudia Soria. 2006. “Lexical
markup framework (LMF) for NLP multilingual
resources.” In Proceedings of the Workshop on Multilingual Language
Resources and Interoperability, ed. by Witt, A., Sérasset, G., Armstrong, S., Breen, J., Heid, U., Sasaki, F., 1–8. Stroudsburg, PA: Association for Computational Linguistics.
Gromann, Dagmar, Elena-Simona Apostol, Christian Chiarcos, Marco Cremaschi, Jorge Gracia, Katerina Gkirtzou, Chaya Liebeskind, Liudmila Mockiene, Michael Rosner, Ineke Schuurman, Gilles Sérasset, Purificação Silvano, Blerina Spahiu, Ciprian-Octavian Truică, Andriusm Utka, and Giedreh Valunaite Oleskeviciene. 2024. “Multilinguality
and LLOD: A survey across linguistic description levels”, Semantic Web, Preprint
(Preprint), 1–44. Available at:
ISO 1087. 2019. Terminology Work and
Terminology Science — Vocabulary — Part 1: Theory and
application. Geneva: International Organization for Standardization.
ISO 16642. 2017. Computer Applications in
Terminology — Terminological markup
framework. Geneva: International Organization for Standardization.
ISO 24613–1. 2024. Language resource
management — Lexical Markup Framework (LMF) — Part 1: Core
model. Geneva: International Organization for Standardization.
ISO 24613–2. 2020. Language resource
management — Lexical Markup Framework (LMF) — Part 2: Machine Readable Dictionary (MRD)
model. Geneva: International Organization for Standardization.
ISO 24613-3. 2021. Language resource
management — Lexical Markup Framework (LMF) — Part 3: Etymological
Extension. Geneva: International Organization for Standardization.
ISO 24613-4. 2021. Language resource
management — Lexical Markup Framework (LMF) — Part 4: TEI
serialisation. Geneva: International Organization for Standardization.
ISO 24613-5. 2022. Language resource
management — Lexical Markup Framework (LMF) — Part 5: Lexical base exchange (LBX)
serialization. Geneva: International Organization for Standardization.
ISO 30042. 2019. Management of Terminology
Resources — Term-Base eXchange
(TBX). Geneva: International Organization for Standardization.
ISO 639. 2023. Code for individual
languages and language groups. Geneva: International Organization for Standardization.
Pernes, Stefan, Laurent Romary, and Kara Warburton. 2017. “TBX
in ODD: Schema-agnostic specification and documentation for TermBase
eXchange”, in LOTKS 2017-Workshop on Language, Ontology, Terminology
and Knowledge Structures. Available at: [URL]
Piccini, Silvia, Federica Vezzani, and Andrea Bellandi. 2023. “TBX
and “Lemon”: What perspectives in terminology?”. Digital Scholarship in the
Humanities, 38 (Supplement_1): i61–i72.
Romary, Luarent, Mohamed Khemakhem, Fahad Khan, Jack Bowers, Nicoletta Calzolari, George Monte, Mandy Pet, and Piotr Banski. 2019. “LMF
reloaded.” In Proceedings of the 13th International Conference of the
Asian Association for Lexicography, edited by Ahmet, M. G., Çiçekler, N. & Taşdemir, Y., 533–539. Istanbul: Istanbul University Department of Linguistics. [URL]
Silecchia, Sara, Federica Vezzani, and Giorgio Maria Di Nunzio. 2022. “Knowledge
Representation and Language Simplification of Human
Rights”, in Proceedings of the Workshop on Terminology in the 21st
century: many faces, many places. TERM
2022, 8–12, Marseille, France: European Language Resources Association. Available at: [URL]
Vezzani, Federica. 2022. Terminologie
numérique: conception, représentation et
gestion. Bern: Peter Lang International Academic Publishers.
Vezzani, Federica, and Giorgio Maria Di Nunzio. 2020. “Methodology
for the standardization of terminological resources: Design of TriMED database to support multi-register medical
communication”. Terminology. International Journal of Theoretical and Applied Issues in
Specialized
Communication 26(2): 265–297.
Vezzani, Federica, Giorgio Maria Di Nunzio, and Rute Costa. 2023. “ISO
standards for terminology resources management: Are they FAIR enough?”. Digital
Translation 10(2): 233–252.
Vezzani, Federica, Giorgio Maria Di Nunzio, and Sara Silecchia. 2022. “La
fraseologia dei trattati internazionali di disarmo: la risorsa terminologica DITTO”. Umanistica
Digitale (14): 91–117.
Wilkinson, Mark, Michel Dumontier, IJsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg, Jan-Willem Boiten, Luiz Bonino da Silva Santos, Philip E. Bourne, Jildau Bouwman, Anthony J. Brookes, Tim Clark, Mercè Crosas, Ingrid Dillo, Olivier Dumon, Scott Edmunds, Chris T. Evelo, Richard Finkers, Alejandra Gonzalez-Beltran, Alasdair J. G. Gray, Paul Groth, Carole Goble, Jeffrey S. Grethe, …Barend Mons. 2016. “The
FAIR Guiding Principles for scientific data management and stewardship”. Scientific
Data, 3(1).
