Article published In: Frames, constructions, and computation
Edited by Lars Borin, Gerard de Melo, Karin Friberg Heppin and Tiago Timponi Torrent
[Constructions and Frames 6:1] 2014
► pp. 92–113
Automatic expansion of the Swedish FrameNet lexicon
Comparing and combining lexicon-based and corpus-based methods
Published online: 19 August 2014
https://doi.org/10.1075/cf.6.1.06joh
https://doi.org/10.1075/cf.6.1.06joh
We evaluate several lexicon-based and corpus-based methods to automatically induce new lexical units for the Swedish FrameNet, and we see that the best-performing setup uses a combination of both types of methods. A particular challenge for Swedish is the absence of a lexical resource such as WordNet; however, we show that the semantic network SALDO, which is organized according to lexicographical principles quite different from those of WordNet, is very useful for our purposes.
References (32)
Blanchard, E., Harzallah, M., Briand, H., & Kuntz, P. (2005). A typology of ontology-based semantic measures. In M. Missikoff & A. De Nicola. (Eds.), Proceedings of the open Interop workshop on enterprise modelling and ontologies for interoperability, CEUR Workshop Proceedings, vol. 1601. Aachen: CEUR-WS.org.
Boas, H. (Ed.) (2009). Multilingual FrameNets in computational lexicography: Methods and applications. Berlin, Boston: Mouton De Gruyter.
Borin, L., Dannélls, D., Forsberg, M., Toporowska Gronostaj, M., & Kokkinakis, D. (2010). The past meets the present in Swedish FrameNet++. In A. Dykstra & T. Schoonheim (Eds.), Proceedings of the 14th EURALEX international congress (pp. 269–281). Leeuwarden/Ljouwert: Fryske Akademy.
Borin, L., Forsberg, M., & Lönngren, L. (2013). SALDO: A touch of yin to WordNet’s yang. Language Resources and Evaluation, 471, 1191–1211.
Brown, P., deSouza, P., Mercer, R., Della Pietra, V., & Jenifer C. Lai (1992). Class-based
n
-gram models of natural language. Computational Linguistics, 18(4), 467–479.
Burchardt, A., Pennacchiotti, M., Thater, S., & Pinkal, M. (2009). Assessing the impact of frame semantics on textual entailment. Natural Language Engineering, 151, 527–550.
Das, D., & Smith, N. A. (2011). Semi-supervised frame-semantic parsing for unknown predicates. In Y. Matsumoto & R. Mihalcea (Eds.), Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies – Volume 1 (pp. 1435–1444). Stroudsburg, PA: Association for Computational Linguistics.
2012). Graph-based lexicon expansion with sparsity-inducing penalties. In E. Fosler-Lussier, E. Riloff & S. Bangalore (Eds.), Proceedings of the 2012 conference of the North American chapter of the Association for Computational Linguistics: human language technologies (pp. 677–687). Stroudsburg, PA: Association for Computational Linguistics.
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., & Lin, C.-J. (2008). LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 91, 1871–1874.
Fillmore, C. J., & Baker, C. (2009). A frames approach to semantic analysis. In B. Heine, & H. Narrog (Eds.), The Oxford handbook of linguistic analysis (pp. 313–340). Oxford: OUP.
Friberg Heppin, K., & Toporowska Gronostaj, M. (2012). The rocky road towards a Swedish FrameNet – creating SweFN. In N. Calzolari, K. Choukri, T. Declerck, M. U. Doğan, B. Maegaard, J. Mariani, J Odijk & S. Piperidis (Eds.), Proceedings of the eighth conference on international language resources and evaluation (pp. 256–261). European Language Resources Association.
Gärdenfors, P. (2000). Conceptual spaces: The geometry of thoughts. Cambridge, MA: Bradford Books, MIT Press.
Johansson, R., & Nugues, P. (2006). A FrameNet-based semantic role labeler for Swedish. In N. Calzolari, C. Cardie & P. Isabelle (Eds.), Proceedings of the COLING/ACL 2006 main conference poster sessions (pp. 436–443).Stroudsburg, PA: Association for Computational Linguistics.
2007a). LTH: Semantic structure extraction using nonprojective dependency trees. In E. Agirre, L. Màrquez & R. Wicentowski (Eds.), Proceedings of the fourth international workshop on semantic evaluations (pp. 227–230). Stroudsburg, PA: Association for Computational Linguistics.
2007b). Using WordNet to extend FrameNet coverage. In P. Nugues & R. Johansson (Eds.) Proceedings of the workshop on building frame-semantic resources for Scandinavian and Baltic languages (pp. 27–30).Lund: LU-CS-TR 2007-240.
Johansson, R. (2012). Non-atomic classification to improve a semantic role labeler for a low-resource language. In E. Agirre, J. Bos, M. Diab, S. Manandhar, Y. Marton & D. Yuret (Eds.), Proceedings of the first joint conference on lexical and computational semantics (pp. 95–99). Stroudsburg, PA: Association for Computational Linguistics.
Johansson, R., Friberg Heppin, K., & Kokkinakis, D. (2012). Semantic role labeling with the Swedish FrameNet. In N. Calzolari, K. Choukri, T. Declerck, M. U. Doğan, B. Maegaard, J. Mariani, J Odijk & S. Piperidis (Eds.), Proceedings of the eighth conference on international language resources and evaluation (pp. 3697–3700). European Language Resources Association.
Kanerva, P., Kristoffersson, J., & Holst, A. (2000). Random indexing of text samples for latent semantic analysis. In L. R. Gleitman & A. Joshi (Eds.), Proceedings of the 22nd annual conference of the Cognitive Science Society (p. 1036). Mahwah, NJ: Erlbaum.
Koo, T., Carreras, X., & Collins, M. (2008). Simple semi-supervised dependency parsing. In J. D. Moore, S. Teufel, J. Allan & S. Furui (Eds.), Proceedings of ACL-08: HLT (pp. 595–603).Stroudsburg, PA: Association for Computational Linguistics.
Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge: Cambridge University Press.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013a). Efficient estimation of word representations in vector space. In A. Courville, R. Fergus & C. Manning (Eds.), International conference on learning representations, workshop track. CoRR abs/1301.3781.
Mikolov, T., Yih, W.-t., & Zweig, G. (2013b). Linguistic regularities in continuous space word representations. In L. Vanderwende, H. Daumé III & K. Kirchhoff (Eds.) Proceedings of the 2013 conference of the North American chapter of the Association for Computational Linguistics: human language technologies (pp. 746–751). Stroudsburg, PA: Association for Computational Linguistics.
Mohammad, S. & Hirst, G. (2012). Distributional measures of semantic distance: A survey. CoRR abs/1203.1858.
Padó, S. (2007) Cross-lingual annotation projection models for role-semantic information. Doctoral dissertation. Saarbrücken: Saarland University.
Palmer, A., & Sporleder, C. (2010). Evaluating FrameNet-style semantic parsing: the role of coverage gaps in FrameNet. In C.-R. Huang & D. Jurafsky (Eds.), Proceedings of the 23rd international conference on computational linguistics: posters (pp. 928–936). Stroudsburg, PA: Association for Computational Linguistics.
Pennacchiotti, M., De Cao, D., Basili, R., Croce, D., & Roth, M. (2008). Automatic induction of FrameNet lexical units. In M. Lapata & H. T. Ng (Eds.) Proceedings of the 2008 conference on empirical methods in natural language processing (pp. 457–465). Stroudsburg, PA: Association for Computational Linguistics.
Rada, R., Mili, H., Bicknell, E., & Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man, and Cybernetics, 19(1), 17–30.
Tonelli, S., Giuliano, C., & Tymoshenko, K. (2013). Wikipedia-based WSD for multilingual frame annotation. Artificial Intelligence, 1941, 203–221.
Turian, J., Ratinov, L.-A., & Bengio, Y. (2010). Word representations: A simple and general method for semi-supervised learning. In J. Hajič, S. Carberry, S. Clark & J. Nivre (Eds.) Proceedings of the 48th annual meeting of the Association for Computational Linguistics (pp. 384–394). Stroudsburg, PA: Association for Computational Linguistics.
Cited by (5)
Cited by five other publications
Borin, Lars, Dana Dannélls & Karin Friberg Heppin
Dannélls, Dana, Lars Borin, Markus Forsberg, Karin Friberg Heppin & Maria Toporowska Gronostaj
Johansson, Richard
2021. NLP for resource building. In The Swedish FrameNet++ [Natural Language Processing, 14], ► pp. 169 ff.
Brambilla, Silvia, Danilo Croce, Fabio Tamburini & Roberto Basili
Borin, Lars, Dana Dannélls & Normunds Grūzītis
2018. Linguistics vs. language technology in constructicon building and use. In Constructicography [Constructional Approaches to Language, 22], ► pp. 229 ff.
This list is based on CrossRef data as of 5 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
