Article published In: Terminology and e-dictionaries
Edited by Amparo Alcina, Rute Costa and Christophe Roche
[Terminology 25:2] 2019
► pp. 175–197
Using open data to create the Catalan IATE e-dictionary
Published online: 26 November 2019
https://doi.org/10.1075/term.00035.vaz
https://doi.org/10.1075/term.00035.vaz
Abstract
Linguistic resources available in the form of open data are an essential source of information for creating
e-dictionaries, but access to these linguistic resources is still limited. This paper presents a method for maximising use of open
access linguistic resources and integrating them into specialised e-dictionaries. The method combines automatic compilation of
terminology data with the creation of specialised linguistic corpora to produce a Catalan version of the IATE (InterActive
Terminology for Europe) database. The paper presents new methodological advances applied here to the production of terminological
e-dictionaries, using open access linguistic resources. We observe that, as a result of this new methodology, the Catalan version
of the IATE will be able to include specialised economics, law and health dictionaries. In conclusion, the new methodology
presented here permits the creation of new models of specialised e-dictionaries, facilitates the compilation of terminology in any
language and unifies the access format for terminology data.
Article outline
- 1.Introduction
- 2.Background
- 3.Materials
- 3.1IATE database
- 3.2Terminologia Oberta
- 3.3Wikipedia
- 3.4The DOGC corpus
- 4.Methods
- 4.1Terminological dictionaries
- 4.2Parallel corpora
- 4.3Terminological dictionary creation process
- 5.Results
- 5.1Research results using terminological dictionaries
- 5.2Research results using parallel corpora
- 6.Discussion
- 7.Publication of terminological e-dictionaries
- 8.Conclusions and future work
- Notes
References
References (41)
Abel, Andrea, and Christian M. Meyer. 2013. “The Dynamics Outside the Paper: User Contributions to Online Dictionaries.’” In Proceedings of the eLex 2013 Conference, 179–194. Tallinn, Estonia.
Ball, Liezl. 2016. An Evaluative Study to Determine to What Extent Technology can be Used in e-Dictionaries to Provide Relevant Information on Demand. Doctoral thesis presented at the University of Pretoria.
Blancafort, Helena, Francis Bouvier, Béatrice Daille, Ulrich Heid, and Anita Ramm. 2013. “TTC Web Platform: from Corpus Compilation to Bilingual Terminologies for MT and CAT Tools.” In Futures in Technologies for Translation (TRALOGY II), 14–pages, Paris, France.
Declerck, Thierry, Eveline Wandl-Vogt, Simon Krek, and Carole Tiberius. 2015. “Towards Multilingual eLexicography by Means of Linked (Open) Data.” In Proceedings of the 4th Workshop on the Multilingual Semantic Web (MSW4), ed. by Jorge Gracia, John, P., Portoroz, Slovenia.
de Schryver, Gilles-Maurice. 2012. “Trends in Twenty-five Years of Academic Lexicography.” International Journal of Lexicography 25(4): 464–506.
Engelberg, Stefan, and Lothar Lemnitzer. 2009. Lexikographie und Wörterbuchbenutzung, 4th edition. Tübingen: Stauffenburg Verlag.
Fuertes-Olivera, Pedro A., and Henning Bergenholtz (eds.). 2011. e-Lexicography: The Internet, Digital Initiatives and Lexicography. London / New York: Continuum International Publishing Group.
Gelpí, Cristina. 2003. «El estado actual de la lexicografía: los nuevos diccionarios». In Lexicografía española, ed. by Antonia María Medina Guerra, 307–332. Barcelona: Ariel.
. 2004. “Diccionaris digitals especialitzats per la temàtica: estat actual i perspectives.” Scripta Nova: revista electrónica de geografía y ciencias sociales 8(170) ([URL]). Accessed 1st July 2018.
Gouws, Rufus H. 2014. “Article Structures: Moving from Printed to e-Dictionaries.” Lexikos 24(1): 155–177.
Granger, Sylviane, and Magali Paquot. 2010. “Customising a General EAP Dictionary to Meet Learner Needs.” In Proceedings of ELEX2009: eLexicography in the 21st century: New Challenges, New Applications, ed. by Sylviane Granger, and Magali Paquot, 87–96. Louvain-la-Neuve: Presses universitaires de Louvain.
Heid, Ulrich. 2014. “Natural Language Processing Techniques for Improved User-friendliness of Electronic Dictionaries.” In Proceedings of the XVI Euralex International Congress: The User in Focus: 47–62. Bolzano/Bozen, Italy.
Heid, Ulrich, Daan J. Prinsloo, and Teo J. D. Bothma. 2012. “Dictionary and Corpus Data in a Common Portal: State of the Art and Requirements for the Future.” Lexicographica 28(1): 269–291.
Hibbert, David Brynn, Pentti Minkkinen, Nicolas M. Faber, and Barry M. Wise. 2008. “IUPAC Project: A Glossary of Concepts and Terms in Chemometrics.” Analytica chimica acta 642(1–2): 3–5.
IATE: InterActive Terminology for Europe. Luxembourg: European Union, 2019 ([URL]). Accessed 30 July 2018.
Koehn, Philipp; Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. “Moses: Open Source Toolkit for Statistical Machine Translation.” In Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, 177–180. Prague, Czech Republic.
Lehmann, Jens, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, and Christian Bizer. 2015. “DBpedia–a Large-scale, Multilingual Knowledge Base Extracted from Wikipedia.” Semantic Web 6(2): 167–195.
Lew, Robert. 2013. “Online Dictionary Skills.” In Electronic Lexicography in the 21st Century: Thinking Outside the Paper. Proceedings of the eLex 2013 Conference, ed. by I. Kosem, J. Kallas, P. Gantar, S. Krek, M. Langemets and M. Tuulik, 16–31. Tallinn, Estonia. Ljubljana/Tallinn: Trojina, Institute for Applied Slovene Studies/Eesti Keele Instituut.
L’Homme, Marie-Claude, and Monique C. Cormier. 2014. “Dictionaries and the Digital Revolution: A Focus on Users and Lexical Databases.” International Journal of Lexicography 27(4): 331–340.
Logar, Nataša, and Iztok Kosem. 2013. “TERMIS: A Corpus-driven Approach to Compiling an e-Dictionary of Terminology.” In Proceedings of the eLex 2013 Conference, 164–178. Tallinn, Estonia.
Martin, Willy, and Hennie van der Vliet. 2003. “Design and Production of Terminological Dictionaries.” In A Practical Guide to Lexicography, ed. by P. van Sterkenburg, 333–365. Amsterdam/Philadelphia: John Benjamins.
Maslias, Rodolfo. 2014. “Combine EU Terminology with Communication and Ontology Research.” In Terminology and Knowledge Engineering, 49–57. Berlin, Germany: Sciencesconf.org.
Nesi, Hilary. 2000. “Electronic Dictionaries in Second Language Vocabulary Comprehension and Acquisition: The State of the Art.” In Proceedings of the Ninth Euralex International Congress, ed. by Ulrich Heid, Stefan Evert, Egbert Lehmann, and Christian Rohrer, 839–847. Stuttgart, Germany.
. 2013. “Researching Users and Uses of Dictionaries.” The Bloomsbury Companion to Lexicography: 62–74. London: Bloomsbury.
Nin, Anna. 2016. Catalan Terminology for IATE. Luxemburg: European Parliament. Terminology Coordination ([URL]). Accessed 1st July 2019.
Oliver, Antoni, and Mercè Vàzquez. 2015. “TBXTools: A Free, Fast and Flexible Tool for Automatic Terminology Extraction.” In International Conference on Recent Advances in Natural Language Processing (RANLP 2015), 473–479. Hissar, Bulgaria.
Oliver, Antoni. 2017. “El corpus paral· lel del Diari Oficial de la Generalitat de Catalunya: compilació, anàlisi i exemples d’ús.” Zeitschrift für Katalanistik, 301: 269–291.
Oliver, Antoni, Mercè Vàzquez, and Georgina Ubide. 2017. “Estudi de la fiabilitat de la Viquipèdia com a recurs terminològic.” Revista Tradumàtica. Tecnologies de la traducció, 151: 10–20.
Pastor, Verónica, and Amparo Alcina. 2010. “Search Techniques in Electronic Dictionaries: A Classification for Translators.” International Journal of Lexicography 23(3): 307–354.
Rundell, Michael, and Adam Kilgarriff. 2011. “Automating the Creation of Dictionaries: Where will it all End.’ In A Taste for Corpora. In honour of Sylviane Granger, ed. by Fanny Meunier, Sylvie de Cock, Gaëtanelle Gilquin, and Magali Paquot, 257–282. Amsterdam/Philadelphia: John Benjamins.
Serra, Montserrat. 2014. “Diccionaris en línia. Diccionaris oberts.” Revista Llengua i ús, 541. [URL]
Tarp, Sven. 2008. Lexicography in the Borderland between Knowledge and Non-knowledge: General Lexicographical Theory with Particular Focus on Learner’s Lexicography. Tübingen: Max Niemeyer Verlag.
. 2009. ”Beyond lexicography: New visions and challenges in the information age.” In Lexicography at a Crossroads: Dictionaries and Encyclopedias Today, Lexicographical Tools Tomorrow, ed. by Henning Bergenholtz, Sandro Nielsen, and Sven Tarp, 901: 17–32. Bern: Peter Lang.
. 2010. “Functions of Specialised Learners’ Dictionaries.” In Specialised Dictionaries for Learners, ed. by Fuertes-Olivera, Pedro A.: 39–53. Germany: De Gruyter.
. 2012. “Online Dictionaries: Today and Tomorrow.” Lexicographica: International Annual for Lexicography 28(1): 253–268.
TERMCAT, Centre de Terminologia, 2019. Terminologia oberta. Barcelona: TERMCAT, Centre de Terminologia, 2019 ([URL]). Accessed 30 July 2018.
. 2010. El diccionari terminològic. Vic: Eumo Editorial; Barcelona: TERMCAT, Centre de Terminologia. (En Primer Terme; 9. Criteris i Mètodes).
Trap-Jensen, Lars. 2010. “One, Two, Many: Customization and User Profiles in Internet Dictionaries.” In Proceedings of the XIV Euralex International Congress, ed. by Anne Dykstra, and Tanneke Schoonheim: 1133–1143. Ljouwert: Fryske Akademy.
Cited by (4)
Cited by four other publications
Wei, Tong, Christophe Roche, Maria Papadopoulou & Yangli Jia
Du, Jiali, Christina Alexandris, Yajun Pei, Yuming Lian & Pingfang Yu
Du, Jiali, Christina Alexantris & Pingfang Yu
This list is based on CrossRef data as of 5 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
