Re-opening the case
A corpus-based study of translational patterns using non-agentive constructions in translations from English to German
Published online: 10 October 2025
https://doi.org/10.1075/target.24072.fre
https://doi.org/10.1075/target.24072.fre
Abstract
How does the variability of the language system affect translational language use? Despite inconclusive results of
earlier corpus studies, machine learning approaches reach high accuracies in distinguishing translated from non-translated texts.
Translations must therefore involve linguistic patterns easy to spot by computers, but harder to spot by the human analyst.
Consequently, the differences have been characterized as small, but systematic. This article adopts a quantitative corpus-based
approach to examining shifts in probability between translated and non-translated language. To this end, we investigate how
translators handle non-agentive constructions, a phenomenon displaying contrastive usage differences in English and German. A
multifactorial analysis of 7441 instances shows that differences are indeed small and that they stem from a multitude of factors.
Accounting for these factors turns out to be challenging even for a sophisticated statistical procedure. Therefore, the case of
how subtle the effects of translation are cannot be settled just yet.
Article outline
- 1.Introduction
- 2.Method
- 2.1The corpus and the data
- 2.1.1Linguistic feature
- 2.1.2Annotation
- 2.1.3Statistical analysis
- 2.1The corpus and the data
- 3.Results
- 4.Discussion
- 5.Conclusion
- Acknowledgements
- Notes
References
References (45)
Bates, Douglas, Martin Mächler, Ben Bolker, and Steve Walker. 2015. “Fitting
Linear Mixed-Effects Models Using Lme4.” Journal of Statistical
Software 67 (1): 1–48.
Beckner, Clay, Richard Blythe, Joan Bybee, Morten H. Christiansen, William Croft, Nick C. Ellis, John Holland, Jinyun Ke, Diane Larsen-Freeman, and Tom Schoenemann. 2009. “Language
Is a Complex Adaptive System: Position Paper.” Language
Learning 59 (s1): 1–26.
Burchardt, Aljoscha, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Padó, and Manfred Pinkal. 2009. “8.
Using FrameNet for the Semantic Analysis of German: Annotation, Representation, and
Automation.” In Multilingual FrameNets in Computational
Lexicography, edited by Hans C. Boas, 209–244. Berlin: De Gruyter Mouton.
Chater, Nick, Joshua B. Tenenbaum, and Alan Yuille. 2006. “Probabilistic
Models of Cognition: Conceptual Foundations.” Probabilistic Models of
Cognition, edited by Nick Chater, Joshua B. Tenenbaum, and Alan Yuille, special
issue of Trends in Cognitive
Sciences, 10 (7): 287–291.
De Sutter, Gert, and Marie-Aude Lefer. 2020. “On
the Need for a New Research Agenda for Corpus-Based Translation Studies: A Multi-Methodological, Multifactorial and
Interdisciplinary
Approach.” Perspectives 28 (1): 1–23.
De Swart, Peter. 2014. “Prepositional
Inanimates in Dutch: A Paradigmatic Case of Differential Object Marking.” Linguistics: An
Interdisciplinary Journal of the Language
Sciences 52 (2): 445–468.
Doms, Steven, Bernard de Clerck, and Sonia Vandepitte. 2016. “Non-Human
Agents as Subjects in English and Dutch: A Corpus-Based Translation
Study.” In Atypical Predicate-Argument
Relations, edited by Thierry Ruchot and Pascale Van Praet, 87–112. Lingvisticæ
Investigationes Supplementa 33. Amsterdam: John Benjamins.
Ellis, Nick C. 2019. “Essentials of a Theory of
Language Cognition.” The Modern Language
Journal 103 (S1): 39–60.
Evert, Stefan, and Stella Neumann. 2017. “The
Impact of Translation Direction on Characteristics of Translated Texts: A Multivariate Analysis for English and
German.” In Empirical Translation Studies: New Theoretical and
Methodological Traditions, edited by Gert de Sutter, Marie-Aude Lefer, and Isabelle Delaere, 47–80. Berlin: De Gruyter.
Fillmore, Charles J., Christopher R. Johnson, and Miriam R. L. Petruck. 2003. “Background
to FrameNet.” International Journal of
Lexicography 16 (3): 235–250.
Freiwald, Jonas. 2016. “You
Say Theme, I Say Thema: A Corpus-Based Approach to Theme in English and German from an SFL
Perspective.” MA thesis. RWTH Aachen University.
. 2023. Theme
in English and German: A Corpus-Based Contrastive Analysis of Clause Openings in Original and Translated
Texts. Amsterdam: John Benjamins.
García García, Marco, Beatrice Primus, and Nikolaus P. Himmelmann. 2018. “Shifting
from Animacy to Agentivity.” Theoretical
Linguistics 44 (1–2): 25–39.
Grafmiller, Jason, Benedikt Szmrecsanyi, Melanie Röthlisberger, and Benedikt Heller. 2018. “General
Introduction: A Comparative Perspective on Probabilistic Variation in Grammar.” Glossa: A
Journal of General
Linguistics 3 (1): 94.
Granger, Sylviane, and Marie-Aude Lefer. 2020. “The
Multilingual Student Translation Corpus: A Resource for Translation Teaching and
Research.” Language Resources and
Evaluation 541: 1183–1199.
Gries, Stefan Th., and Stefanie Wulff. 2021. “Examining
Individual Variation in Learner Production Data: A Few Programmatic Pointers for Corpus-Based Analyses Using the Example of
Adverbial Clause Ordering.” Applied
Psycholinguistics 42 (2): 279–299.
Halliday, M. A. K. 2005. “Corpus
Studies and Probabilistic Grammar (1991).” In Computational and
Quantitative Studies, vol. 61 Collected Works of
M.A.K. Halliday, edited by Jonathan J. Webster, 63–75. London: Continuum.
Halliday, M. A. K., and Christian M. I. M. Matthiessen. 2014. Halliday’s
Introduction to Functional Grammar. 4th
ed. London: Routledge.
Halverson, Sandra L., and Haidee Kotze. 2022. “Sociocognitive
Constructs in Translation and Interpreting Studies (TIS): Do We Really Need Concepts Like Norms and Risk When We Have a
Comprehensive Usage-Based Theory of Language?” In Contesting
Epistemologies in Cognitive Translation and Interpreting Studies, edited by Sandra L. Halverson and Álvaro Marín García, 51–79. Abingdon: Routledge.
Hansen-Schirra, Silvia, Stella Neumann, and Erich Steiner. 2012. Cross-Linguistic
Corpora for the Study of Translations: Insights from the Language Pair
English–German. Berlin: De Gruyter Mouton.
Hansen-Schirra, Silvia, and Erich Steiner. 2012. “Towards
a Typology of Translation Properties.” In Cross-Linguistic Corpora
for the Study of Translations, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 255–279. Berlin: De Gruyter.
Hawkins, John A. 1986. A Comparative Typology of English and
German: Unifying the Contrasts. London: Croom Helm.
Heilmann, Arndt, Tatiana Serbina, Jonas Freiwald, and Stella Neumann. 2021. “Animacy
and Agentivity of Subject Themes in English–German
Translation.” In Dynamicity and Contrast in Systemic Functional
Linguistics, edited by Izaskun Elorza, Jorge Arús-Hita, and Tom Bartlett, special
issue
of Lingua 261 (October): 102813.
Hothorn, Torsten, Kurt Hornik, and Achim Zeileis. 2006. “Unbiased
Recursive Partitioning: A Conditional Inference Framework.” Journal of Computational and
Graphical
Statistics 15 (3): 651–674.
Kast, Marlene. 2012. “Variation
within the Grammatical Function ‘Subject’ in English–German and German–English
Translations.” In Cross-Linguistic Corpora for the Study of
Translations: Insights from the Language Pair English–German, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 147–160. Berlin: De Gruyter.
König, Ekkehard, and Volker Gast. 2018. Understanding
English–German Contrasts. 4th
ed. Berlin: Erich Schmidt Verlag.
Königs, Karin. 2011. Übersetzen Englisch–Deutsch: Lernen Mit System [Translation English–German: Learning systematically]. 3rd
ed. München: Oldenbourg.
Kotze, Haidee. 2022. “Translation
as Constrained Communication: Principles, Concepts and
Methods.” In Extending the Scope of Corpus-Based Translation
Studies, edited by Sylviane Granger and Marie-Aude Lefer, 67–97. London: Bloomsbury Academic.
Levshina, Natalia. 2015. How
to Do Linguistics with R: Data Exploration and Statistical
Analysis. Amsterdam: John Benjamins.
. 2020. “Conditional
Inference Trees and Random Forests.” In A Practical Handbook of
Corpus Linguistics, edited by Magali Paquot and Stefan Th. Gries, 611–643. Cham: Springer.
Lødrup, Helge. 1993. “Subjects
and Thematic Roles in English and Norwegian.” Norsk Lingvistisk
Tidsskrift 11 (2): 105–124.
Macken, Lieve, Orphée de Clercq, and Hans Paulussen. 2011. “Dutch
Parallel Corpus: A Balanced Copyright-Cleared Parallel Corpus.” Meta: Journal des
traducteurs 56 (2): 374–390.
Neumann, Stella. 2014. Contrastive
Register Variation: A Quantitative Approach to the Comparison of English and
German. Berlin: De Gruyter Mouton.
Neumann, Stella, Gert de Sutter, and Stefan Evert. 2017. “Register-Specific
Interference in Translation.” In Conference Booklet of the 39th
Annual Conference of the German Linguistic Society. Saarbrücken, Germany.
Neumann, Stella, and Stefan Evert. 2021. “A
Register Variation Perspective on Varieties of
English.” In Corpus-Based Approaches to Register
Variation, edited by Elena Seoane and Douglas Biber, 143–178. Amsterdam: John Benjamins.
Neumann, Stella, and Silvia Hansen-Schirra. 2012. “Corpus
Methodology and Design.” In Cross-Linguistic Corpora for the Study of
Translations: Insights from the Language Pair English–German, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 21–34. Berlin: De Gruyter.
R Core Team. 2018. R: A Language and
Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. [URL]
Reynaert, Ryan, Lieve Macken, Arda Tezcan, and Gert de Sutter. 2021. “Building
a New-Generation Corpus for Empirical Translation Studies: The Dutch Parallel Corpus
2.0.” In New Perspectives on Corpus Translation
Studies, edited by Vincent X. Wang, Lily Lim, and Defeng Li, 75–100. Singapore: Springer.
Serbina, Tatiana. 2015. A
Construction Grammar Approach to the Analysis of Translation Shifts: A Corpus-Based Study. PhD
thesis. RWTH Aachen University.
Szmrecsanyi, Benedikt, Jason Grafmiller, Benedikt Heller, and Melanie Röthlisberger. 2016. “Around
the World in Three Alternations: Modeling Syntactic Variation in Varieties of English.” English
World-Wide 37 (2): 109–137.
Tagliamonte, Sali A., and R. Harald Baayen. 2012. “Models,
Forests, and Trees of York English: Was/Were Variation as a Case Study for Statistical
Practice.” Language Variation and
Change 24 (2): 135–178.
Teich, Elke, Peter Fankhauser, Stefania Degaetano-Ortlieb, and Yuri Bizzoni. 2021. “Less
Is More/More Diverse: On The Communicative Utility of Linguistic
Conventionalization.” Frontiers in
Communication 51.
Toury, Gideon. 2004. “Probabilistic
Explanations in Translation Studies: Welcome as They Are, Would They Qualify as
Universals?” In Translation Universals: Do They
Exist?, edited by Anna Mauranen and Pekka Kujamäki, 15–32. Amsterdam: John Benjamins.
Volansky, Vered, Noam Ordan, and Shuly Wintner. 2015. “On
the Features of Translationese.” Digital Scholarship in the
Humanities 30 (1): 98–118.
Zaenen, Annie, Jean Carletta, Gregory Garretson, Joan Bresnan, Andrew Koontz-Garboden, Tatiana Nikitina, M. Catherine O’Connor, and Tom Wasow. 2004. “Animacy
Encoding in English: Why and How.” In Proceedings of the Workshop on
Discourse
Annotation, 118–125. Barcelona: Association for Computational Linguistics. [URL].
