Article published in: Application-Driven Terminology Engineering
Edited by Fidelia Ibekwe-SanJuan, Anne Condamines and Teresa Cabré
[Terminology 11:1] 2005
► pp. 143–180
The first steps towards the automatic compilation of specialized collocation dictionaries
Published online: 17 June 2005
https://doi.org/10.1075/term.11.1.07wan
Collocation dictionaries are essential in specialized discourse for understanding, production, and translation. Translation in particular, which is often undertaken by professionals who are not specialists in the field, needs dictionaries with detailed syntactic and semantic information on the lexical and semantic links between terms. However, collocation dictionaries are scarcely available for general discourse, let alone specialized discourse. The manual compilation of collocation dictionaries from large corpora is a time-consuming and cost-intensive procedure. A (partial) automation of this procedure has recently become a high-priority topic in computational lexicography. In this article, we discuss how collocations can be acquired from specialized corpora and labeled with semantic tags using machine-learning techniques. As semantic tags, we use lexical functions from Explanatory Combinatorial Lexicology. We explore the performance of two machine-learning techniques, Nearest Neighbor Classification and Tree Augmented Bayesian Classification, testing them on a Spanish law corpus.
Cited by two other publications
Gallego-Hernández, Daniel
2022. Extracción de fraseología especializada basada en corpus. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 35:1 ► pp. 294 ff.
This list is based on CrossRef data as of 6 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
