Article published In: Contrasting contrastive approaches
Guest-edited by Bart Defrancq
[Languages in Contrast 15:1] 2015
► pp. 4–33
On the use of translation corpora in contrastive linguistics
A case study of impersonalization in English and German
Published online: 3 April 2015
https://doi.org/10.1075/lic.15.1.02gas
https://doi.org/10.1075/lic.15.1.02gas
This article argues for a type of corpus-based contrastive research that is item-specific, predictive and hypothesis-driven. It reports on a programmatic study of the ways in which impersonalization is expressed in English and German. Impersonalization is taken to be epitomized by human impersonal pronouns like German man (e.g. Man lebt nur einmal ‘You/one only live(s) once’). English does not have a specialized impersonal pronoun like Germ. man and uses a variety of strategies instead. The question arises what determines the choice of a given impersonalization strategy in English. Drawing on relevant theoretical work and using data from a translation corpus (Europarl), variables potentially affecting the distribution of impersonalization strategies in English are identified, and their influence on the choice of a strategy is determined. By testing hypotheses derived from theoretical work and using multivariate quantitative methods of analysis, the study is intended to illustrate how bridges can be built between fine-grained semantic analyses, on the one hand, and more coarse-grained, but empirically valid, corpus research, on the other.
Keywords: impersonal, non-veridical, modal, generalizing, veridical, English/German
References (22)
Alonso-Ovalle, Luis. 2002. Arbitrary Pronouns are not that Indefinite. In C. Beyssade, R. Bok-Bennema, F. Drijkoningen, and P. Monachesi (eds.), Romance Languages and Linguistic Theory 2000, 1–14. Amsterdam: John Benjamins.
van der Auwera, Johan, Volker Gast and Jeroen Vanderbiesen. 2012. Human Impersonal Pronouns in English, Dutch and German. Leuvense Bijdragen 981: 27–64.
Cartoni, Bruno and Thomas Meyer. 2012. Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies. In Proceedings 8th International Conference on Language Resources and Evaluation (LREC). Istanbul, Turkey.
Cartoni, Bruno, Sandrine Zufferey and Thomas Meyer. 2013. Using the Europarl Corpus for Cross-linguistic Research. Belgian Journal of Linguistics 271. 23–42.
Cohen, A. 1980. On the Graphical Display of the Significant Components in a Two-way Contingency Table. Communications in Statistics – Theory and Methods A. 1025–1041.
Friendly, M. 1992. Graphical Methods for Categorical Data. In
SAS User Group International Conference Proceedings
. 190–200.
Gast, Volker. 2012. Contrastive Linguistics: Theories and Methods. In B. Kortmann (ed.), Dictionary of Linguistics and Communication Science: Linguistics Theory and Methodology. Berlin: de Gruyter Mouton.
Gast, Volker and Johan van der Auwera. 2013. Towards a Distributional Typology of Human Impersonal Pronouns, Based on Data from European Languages. In D. Bakker and M. Haspelmath (eds), Languages across Boundaries. Studies in Memory of Anna Siewierska, 31–56. Berlin: de Gruyter Mouton.
Giannakidou, Anastasia. 1998. Polarity Sensitivity as (Non)Veridical Dependency. Amsterdam: John Benjamins.
. 2011. Negative and Positive Polarity Items. In Klaus von Heusinger and Claudia Maienborn (eds), Semantics: An International Handbook, volume 33.2 of Handbücher der Sprach- und Kommunikationswissenschaften, 1660–1712. Berlin: de Gruyter Mouton.
Granger, Sylviane, Jacques Lerot and Stephanie Petch-Tyson, (eds), 2003. Corpus-Based Approaches to Contrastive Linguistics and Translation Studies. Amsterdam: Rodopi.
Johansson, Stig. 1998. On the Role of Corpora in Cross-linguistic Research. In S. Johansson and S. Oksefjell (eds.), Corpora and Cross-linguistic Research: Theory, Method, and Case Studies, 3–24. Amsterdam/Atlanta: Rodopi.
. 2000. Contrastive Linguistics and Corpora. SPRIKreports, Reports from the project ‘Languages in Contrast’. University of Oslo. Available at [URL] [last accessed December 2014].
. 2004. Viewing Languages through Multilingual Corpora, with Special Reference to the Generic Person in English, German and Norwegian. Languages in Contrast 41. 261–280.
Koehn, Phillip. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. Pukhet: MT Summit X.
Lewis, David. 1975. Adverbs of Quantification. In E. Keenan (ed.), Formal Semantics of Natural Language, 3–15. Cambridge: Cambridge University Press.
Moltmann, Friederike. 2006. Generic one, Arbitrary PRO, and the First Person. Natural Language Semantics 141. 257–81.
R Core Team. 2013. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Available at [URL] [last accessed December 2014].
Schmied, Josef. 2008. Contrastive Corpus Studies. In A. Lüdeling and M. Kytö (eds), Corpus Linguistics: An International Handbook, 1140–1159. Berlin: de Gruyter Mouton.
Cited by (20)
Cited by 20 other publications
Breed, Adri & Daniel Van Olmen
Breed, Adri & Daniël Van Olmen
Вилинбахова, Е.Л. & Чуйкова, О.Ю.
Fastrich, Bridgit
Плунгян, В. А. & А. Д. Подгорная
Shang, Xin
2023. When Contrastive Analysis meets Translation Studies. Target. International Journal of Translation Studies 35:2 ► pp. 186 ff.
Gast, Volker
Rădulescu, Valentin & Daniël Van Olmen
2022. A questionnaire-based study of impersonalization in Romanian and English. Languages in Contrast 22:1 ► pp. 1 ff.
Breed, Adri, Jo-Ann Chan & Daniël van Olmen
Poiret, Rafaël, Simon Mille & Haitao Liu
2021. Paraphrase and parallel treebank for the comparison of French and Chinese syntax. Languages in Contrast 21:2 ► pp. 298 ff.
Manzella, Pietro
Bisiada, Mario
Malá, Markéta & Gabriela Brůhová
2019. English presentative semantic patterns as seen through a parallel translation corpus. Languages in Contrast 19:2 ► pp. 232 ff.
Enghels, Renata
Van Olmen, Daniël & Adri Breed
Van Olmen, Daniël & Adri Breed
Aijmer, Karin & Diana Lewis
Brůhová, Gabriela & Markéta Malá
[no author supplied]
This list is based on CrossRef data as of 25 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
