Article published in: Terminology and e-dictionaries
Edited by Amparo Alcina, Rute Costa and Christophe Roche
[Terminology 25:2] 2019
pp. 222–258
EcoLexicon and by-products
Integrating and reusing terminological resources
Published online: 26 November 2019
https://doi.org/10.1075/term.00037.leo
Abstract
Reutilization and interoperability are major issues in the fields of knowledge representation and extraction, as
reflected in initiatives such as the Semantic Web and the Linked Open Data Cloud. This paper shows how terminological resources
can be integrated and reused within different types of application. EcoLexicon is a multilingual terminological knowledge base
(TKB) on environmental science that integrates conceptual, linguistic and visual information. It has led to the following
by-products: (i) the EcoLexicon English Corpus; (ii) EcoLexiCAT, a terminology-enhanced translation tool; and (iii) Manzanilla, an
image annotation tool. This paper explains EcoLexicon and its by-products, and shows how the latter exploit and enhance the data
in the TKB.
Article outline
- 1. Introduction
- 2. EcoLexicon and Frame-based Terminology
  - 2.1 User interface
  - 2.2 Definitions
  - 2.3 Images
  - 2.4 Information filtering and recontextualization
  - 2.5 Access to corpus
- 3. EcoLexicon English corpus: An open corpus in Sketch Engine
  - 3.1 Word sketches and sketch diff
  - 3.2 Text types and CQL queries
- 4. EcoLexiCAT: A terminology-enhanced CAT tool
  - 4.1 EcoLexiCAT features
    - 4.1.1 The EcoLexicon and BabelNet boxes
    - 4.1.2 The Sketch Engine box
    - 4.1.3 The IATE box and other resources
    - 4.1.4 Editing log and revision panel
  - 4.2 EcoLexiCAT evaluation: A pilot study
- 5. Manzanilla: An image annotation tool
- 6. Conclusions and future work
- Acknowledgements
- Notes
