In:Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
Edited by Julia Lavid-López, Carmen Maíz-Arévalo and Juan Rafael Zamorano-Mansilla
[Benjamins Translation Library 158] 2021
► pp. 325–342
Chapter 13Binomial adverbs in Germanic and Romance Languages
A corpus-based study
Published online: 8 December 2021
https://doi.org/10.1075/btl.158.13gra
https://doi.org/10.1075/btl.158.13gra
Abstract
As a special type of multiword expressions, binomials are a frequent phenomenon in many languages. We focus on binomial adverbs that are coordinations of two adverbial constituents. Their syntactic contribution to a sentence is adverbial as well and their semantic contribution is idiomatic. They have many uses, such as to intensify (first and foremost), express tendency (more and more), frequency (over and over [again]), vagueness (more or less), determination (sooner or later) etc.
In this work, we describe our approaches to identify binomial adverbs in a large multiparallel corpus. Alongside the well-known measure of reversibility, we also calculate measures of statistical association and look for single-word translation equivalents in other languages. Combining these features facilitates the identification of binomial adverbs.
Article outline
- 1.Introduction
- 2.Related work
- 3.Corpus preparation
- 4.Identification of candidates
- 5.Candidate ranking
- 6.Expression boundaries
- 7.Translation variants
- 8.Conclusions
Acknowledgements Note References Appendix
References (24)
Callegaro, Elena. 2017. “Parallel Corpora for the Investigation of (Variable) Article Use in English: A Construction Grammar Approach.” PhD diss., University of Zurich.
Constant, Matthieu, and Joakim Nivre. 2016. “A Transition-Based System for Joint Lexical and Syntactic Analysis.” In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), 161–171.
Copestake, Ann, and Aurélie Herbelot. 2011. “Exciting and interesting: issues in the generation of binomials.” In Proceedings of the UCNLG+Eval: Language Generation and Evaluation Workshop, 45–53.
Evert, Stefan. 2008. “Corpora and collocations.” In Corpus Linguistics. An International Handbook, edited by A. Lüdeling and M. Kytö, 2:1212–1248. Berlin: Walter de Gruyter.
Graën, Johannes. 2018. “Exploiting Alignment in Multiparallel Corpora for Applications in Linguistics and Language Learning.” PhD diss., University of Zurich.
Graën, Johannes, Dolores Batinic, and Martin Volk. 2014. “Cleaning the Europarl Corpus for Linguistic Applications.” In Proceedings of the Conference on Natural Language Processing (KONVENS), 222–227. Stiftung Universität Hildesheim, October.
Graën, Johannes, Mara Bertamini, and Martin Volk. 2018. “Cutter – a Universal Multilingual Tokenizer.” In Proceedings of the 3rd Swiss Text Analytics Conference, edited by Mark Cieliebak, Don Tuggener, and Fernando Benites, 75–81. CEUR Workshop Proceedings 2226. CEUR-WS, June.
Graën, Johannes, Tannon Kew, Anastassia Shaitarova, and Martin Volk. 2019. “Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection.” In Proceedings of the 7th Workshop on Challenges in the Management of Large Corpora (CMLC), edited by Piotr Bański et al.
Koehn, Philipp. 2005. “Europarl: A parallel corpus for statistical machine translation.” In Proceedings of the 10th Machine Translation Summit, 5:79–86. Asia-Pacific Association for Machine Translation.
Kopaczyk, Joanna, and Hans Sauer, eds. 2017a. Binomials in the History of English: Fixed and Flexible. Cambridge University Press.
. 2017b. “Defining and exploring binomials.” Binomials in the History of English: Fixed and Flexible: 1–23.
Lambrecht, Knud. 1984. “Formulaicity, frame semantics, and pragmatics in German binomial expressions.” Language 60 (4): 753–796.
Masini, Francesca. 2006. “Binomi coordinati in italiano.” In Prospettive nello studio del lessico italiano: Atti del 9. congresso SILFI, 2:563–571.
Mollin, Sandra. 2014. The (Ir)reversibility of English Binomials. Corpus, constraints, developments. Vol. 64. Studies in Corpus Linguistics. John Benjamins.
Müller, Gereon. 1997. “Beschränkungen für Binomialbildungen im Deutschen.” Zeitschrift für Sprachwissenschaft 16 (1): 25–51.
Östling, Robert. 2012. “Stagger: A modern POS tagger for Swedish.” In Proceedings of the 4th Swedish Language Technology Conference (SLTC).
Petrov, Slav, Dipanjan Das, and Ryan McDonald. 2012. “A Universal Part-of-Speech Tagset.” In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), edited by Nicoletta Calzolari et al. Istanbul: European Language Resources Association (ELRA). isbn: 978-2-9517408-7-7.
Schmid, Helmut. 1994. “Probabilistic part-of-speech tagging using decision trees.” In Proceedings of International Conference on New Methods in Natural Language Processing, 12:44–49.
Shannon, Claude Elwood. 1948. “A Mathematical Theory of Communication.” The Bell System Technical Journal 27:379–423.
Voghera, Miriam. 2004. “Polirematiche.” In La formazione delle parole in italiano, 56–69. Max Niemeyer Verlag.
Cited by (1)
Cited by one other publication
This list is based on CrossRef data as of 3 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
