Re-opening the case: A corpus-based study of translational patterns using non-agentive constructions in translations from English to
German
JonasFreiwald and StellaNeumann
RWTH Aachen University
How does the variability of the language system affect translational language use? Despite inconclusive results of
earlier corpus studies, machine learning approaches reach high accuracies in distinguishing translated from non-translated texts.
Translations must therefore involve linguistic patterns easy to spot by computers, but harder to spot by the human analyst.
Consequently, the differences have been characterized as small, but systematic. This article adopts a quantitative corpus-based
approach to examining shifts in probability between translated and non-translated language. To this end, we investigate how
translators handle non-agentive constructions, a phenomenon displaying contrastive usage differences in English and German. A
multifactorial analysis of 7441 instances shows that differences are indeed small and that they stem from a multitude of factors.
Accounting for these factors turns out to be challenging even for a sophisticated statistical procedure. Therefore, the case of
how subtle the effects of translation are cannot be settled just yet.
Recent empirically oriented linguistics has emphasized the idea that language is probabilistic in nature and evolves through
usage both as a system as well as in the development of the individual (see Beckner et al.
2009). According to this view, a language remains alive by being spoken by millions of language users in overlapping
generations who interact with each other in speech communities. Linguistic features are chosen by language users based on their
continued observations of how others use them — in slightly different ways (Grafmiller et al.
2018, 2). In this process, language users also extract information about the specific contexts in which these features are
used in a particular way and in a particular combination with other features. From a cognitive point of view, language users make
inferences by judging the probabilities of an event to happen. They might, for instance, infer the meaning of an item based on the
(recurring) input they receive through the use by others (Chater, Tenenbaum, and Yuille
2006). More specifically, language users continuously expand their linguistic experience through use (Ellis 2019, 48), thus becoming highly skilled at choosing features that are likely to occur in a given
context. The fact that language users choose between different possibilities of expressing a given state or event, also known as
construal, foregrounds the paradigmatic nature of language. The result of language users’ likely choices has been observed in corpora
as (co-)occurrence patterns that can be modelled as probabilistic choices. In light of such a probabilistic conception of the
distribution of features in language, a rigid notion of ‘rule’ can be meaningfully replaced by ‘regularity’, thus making room for the
possibility that a language user might still opt for an improbable choice. In this sense, each individual instance of using language
“both maintains and perturbs the system” (Halliday 2005, 66).
References
Bates, Douglas, Martin Mächler, Ben Bolker, and Steve Walker
2015 “Fitting
Linear Mixed-Effects Models Using Lme4.” Journal of Statistical
Software 67 (1): 1–48.
Beckner, Clay, Richard Blythe, Joan Bybee, Morten
H. Christiansen, William Croft, Nick
C. Ellis, John Holland, Jinyun Ke, Diane Larsen-Freeman, and Tom Schoenemann
2009 “Language
Is a Complex Adaptive System: Position Paper.” Language
Learning 59 (s1): 1–26.
Burchardt, Aljoscha, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Padó, and Manfred Pinkal
2009 “8.
Using FrameNet for the Semantic Analysis of German: Annotation, Representation, and
Automation.” In Multilingual FrameNets in Computational
Lexicography, edited by Hans
C. Boas, 209–244. Berlin: De
Gruyter Mouton.
Chater, Nick, Joshua
B. Tenenbaum, and Alan Yuille
2006 “Probabilistic
Models of Cognition: Conceptual Foundations.” Probabilistic Models of
Cognition, edited by Nick Chater, Joshua
B. Tenenbaum, and Alan Yuille, special
issue of Trends in Cognitive
Sciences, 10 (7): 287–291.
De
Sutter, Gert, and Marie-Aude Lefer
2020 “On
the Need for a New Research Agenda for Corpus-Based Translation Studies: A Multi-Methodological, Multifactorial and
Interdisciplinary
Approach.” Perspectives 28 (1): 1–23.
De
Swart, Peter
2014 “Prepositional
Inanimates in Dutch: A Paradigmatic Case of Differential Object Marking.” Linguistics: An
Interdisciplinary Journal of the Language
Sciences 52 (2): 445–468.
Doms, Steven, Bernard
de Clerck, and Sonia Vandepitte
2016 “Non-Human
Agents as Subjects in English and Dutch: A Corpus-Based Translation
Study.” In Atypical Predicate-Argument
Relations, edited by Thierry Ruchot and Pascale
Van Praet, 87–112. Lingvisticæ
Investigationes Supplementa 33. Amsterdam: John
Benjamins.
Ellis, Nick
C.
2019 “Essentials of a Theory of
Language Cognition.” The Modern Language
Journal 103 (S1): 39–60.
Evert, Stefan, and Stella Neumann
2017 “The
Impact of Translation Direction on Characteristics of Translated Texts: A Multivariate Analysis for English and
German.” In Empirical Translation Studies: New Theoretical and
Methodological Traditions, edited by Gert
de Sutter, Marie-Aude Lefer, and Isabelle Delaere, 47–80. Berlin: De
Gruyter.
Fillmore, Charles
J., Christopher
R. Johnson, and Miriam
R. L. Petruck
2003 “Background
to FrameNet.” International Journal of
Lexicography 16 (3): 235–250.
Freiwald, Jonas
2016 “You
Say Theme, I Say Thema: A Corpus-Based Approach to Theme in English and German from an SFL
Perspective.” MA thesis. RWTH Aachen
University.
Freiwald, Jonas
2023Theme
in English and German: A Corpus-Based Contrastive Analysis of Clause Openings in Original and Translated
Texts. Amsterdam: John
Benjamins.
García
García, Marco, Beatrice Primus, and Nikolaus
P. Himmelmann
2018 “Shifting
from Animacy to Agentivity.” Theoretical
Linguistics 44 (1–2): 25–39.
Grafmiller, Jason, Benedikt Szmrecsanyi, Melanie Röthlisberger, and Benedikt Heller
2018 “General
Introduction: A Comparative Perspective on Probabilistic Variation in Grammar.” Glossa: A
Journal of General
Linguistics 3 (1): 94.
Granger, Sylviane, and Marie-Aude Lefer
2020 “The
Multilingual Student Translation Corpus: A Resource for Translation Teaching and
Research.” Language Resources and
Evaluation 54: 1183–1199.
Gries, Stefan
Th., and Stefanie Wulff
2021 “Examining
Individual Variation in Learner Production Data: A Few Programmatic Pointers for Corpus-Based Analyses Using the Example of
Adverbial Clause Ordering.” Applied
Psycholinguistics 42 (2): 279–299.
Halliday, M. A. K.
2005 “Corpus
Studies and Probabilistic Grammar (1991).” In Computational and
Quantitative Studies, vol. 6 Collected Works of
M.A.K. Halliday, edited by Jonathan
J. Webster, 63–75. London: Continuum.
Halliday, M. A. K., and Christian
M. I. M. Matthiessen
2014Halliday’s
Introduction to Functional Grammar. 4th
ed. London: Routledge.
Halverson, Sandra
L., and Haidee Kotze
2022 “Sociocognitive
Constructs in Translation and Interpreting Studies (TIS): Do We Really Need Concepts Like Norms and Risk When We Have a
Comprehensive Usage-Based Theory of Language?” In Contesting
Epistemologies in Cognitive Translation and Interpreting Studies, edited by Sandra
L. Halverson and Álvaro
Marín García, 51–79. Abingdon: Routledge.
Hansen-Schirra, Silvia, Stella Neumann, and Erich Steiner
2012Cross-Linguistic
Corpora for the Study of Translations: Insights from the Language Pair
English–German. Berlin: De Gruyter
Mouton.
Hansen-Schirra, Silvia, and Erich Steiner
2012 “Towards
a Typology of Translation Properties.” In Cross-Linguistic Corpora
for the Study of Translations, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 255–279. Berlin: De
Gruyter.
Hawkins, John
A.
1986A Comparative Typology of English and
German: Unifying the Contrasts. London: Croom
Helm.
Heilmann, Arndt, Tatiana Serbina, Jonas Freiwald, and Stella Neumann
2021 “Animacy
and Agentivity of Subject Themes in English–German
Translation.” In Dynamicity and Contrast in Systemic Functional
Linguistics, edited by Izaskun Elorza, Jorge Arús-Hita, and Tom Bartlett, special
issue
of Lingua 261 (October): 102813.
Hothorn, Torsten, Kurt Hornik, and Achim Zeileis
2006 “Unbiased
Recursive Partitioning: A Conditional Inference Framework.” Journal of Computational and
Graphical
Statistics 15 (3): 651–674.
Kast, Marlene
2012 “Variation
within the Grammatical Function ‘Subject’ in English–German and German–English
Translations.” In Cross-Linguistic Corpora for the Study of
Translations: Insights from the Language Pair English–German, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 147–160. Berlin: De
Gruyter.
König, Ekkehard, and Volker Gast
2018Understanding
English–German Contrasts. 4th
ed. Berlin: Erich Schmidt
Verlag.
Königs, Karin
2011Übersetzen Englisch–Deutsch: Lernen Mit System [Translation English–German: Learning systematically]. 3rd
ed. München: Oldenbourg.
Kotze, Haidee
2022 “Translation
as Constrained Communication: Principles, Concepts and
Methods.” In Extending the Scope of Corpus-Based Translation
Studies, edited by Sylviane Granger and Marie-Aude Lefer, 67–97. London: Bloomsbury
Academic.
Levshina, Natalia
2015How
to Do Linguistics with R: Data Exploration and Statistical
Analysis. Amsterdam: John
Benjamins.
Levshina, Natalia
2020 “Conditional
Inference Trees and Random Forests.” In A Practical Handbook of
Corpus Linguistics, edited by Magali Paquot and Stefan
Th. Gries, 611–643. Cham: Springer.
Lødrup, Helge
1993 “Subjects
and Thematic Roles in English and Norwegian.” Norsk Lingvistisk
Tidsskrift 11 (2): 105–124.
Macken, Lieve, Orphée
de Clercq, and Hans Paulussen
2011 “Dutch
Parallel Corpus: A Balanced Copyright-Cleared Parallel Corpus.” Meta: Journal des
traducteurs 56 (2): 374–390.
Neumann, Stella
2014Contrastive
Register Variation: A Quantitative Approach to the Comparison of English and
German. Berlin: De Gruyter
Mouton.
Neumann, Stella, Gert
de Sutter, and Stefan Evert
2017 “Register-Specific
Interference in Translation.” In Conference Booklet of the 39th
Annual Conference of the German Linguistic Society. Saarbrücken,
Germany.
Neumann, Stella, and Stefan Evert
2021 “A
Register Variation Perspective on Varieties of
English.” In Corpus-Based Approaches to Register
Variation, edited by Elena Seoane and Douglas Biber, 143–178. Amsterdam: John
Benjamins.
Neumann, Stella, and Silvia Hansen-Schirra
2012 “Corpus
Methodology and Design.” In Cross-Linguistic Corpora for the Study of
Translations: Insights from the Language Pair English–German, by Silvia Hansen-Schirra, Stella Neumann, and Erich Steiner, 21–34. Berlin: De
Gruyter.
R Core Team
2018R: A Language and
Environment for Statistical Computing. Vienna: R
Foundation for Statistical Computing. https://www.R-project.org
Reynaert, Ryan, Lieve Macken, Arda Tezcan, and Gert
de Sutter
2021 “Building
a New-Generation Corpus for Empirical Translation Studies: The Dutch Parallel Corpus
2.0.” In New Perspectives on Corpus Translation
Studies, edited by Vincent
X. Wang, Lily Lim, and Defeng Li, 75–100. Singapore: Springer.
Serbina, Tatiana
2015A
Construction Grammar Approach to the Analysis of Translation Shifts: A Corpus-Based Study. PhD
thesis. RWTH Aachen University.
Szmrecsanyi, Benedikt, Jason Grafmiller, Benedikt Heller, and Melanie Röthlisberger
2016 “Around
the World in Three Alternations: Modeling Syntactic Variation in Varieties of English.” English
World-Wide 37 (2): 109–137.
Tagliamonte, Sali
A., and R.
Harald Baayen
2012 “Models,
Forests, and Trees of York English: Was/Were Variation as a Case Study for Statistical
Practice.” Language Variation and
Change 24 (2): 135–178.
Teich, Elke, Peter Fankhauser, Stefania Degaetano-Ortlieb, and Yuri Bizzoni
2021 “Less
Is More/More Diverse: On The Communicative Utility of Linguistic
Conventionalization.” Frontiers in
Communication 5.
Toury, Gideon
2004 “Probabilistic
Explanations in Translation Studies: Welcome as They Are, Would They Qualify as
Universals?” In Translation Universals: Do They
Exist?, edited by Anna Mauranen and Pekka Kujamäki, 15–32. Amsterdam: John
Benjamins.
Volansky, Vered, Noam Ordan, and Shuly Wintner
2015 “On
the Features of Translationese.” Digital Scholarship in the
Humanities 30 (1): 98–118.
Zaenen, Annie, Jean Carletta, Gregory Garretson, Joan Bresnan, Andrew Koontz-Garboden, Tatiana Nikitina, M.
Catherine O’Connor, and Tom Wasow
2004 “Animacy
Encoding in English: Why and How.” In Proceedings of the Workshop on
Discourse
Annotation, 118–125. Barcelona: Association
for Computational Linguistics. https://aclanthology.org/W04-0216.