Article published In: Comparing Crosslinguistic Complexity
Edited by Jenny Ström Herold and Magnus Levin
[Languages in Contrast 24:1] 2024
► pp. 133–163
Cross-linguistic Dependency Length Minimization in scientific language
Syntactic complexity reduction in English and German in the Late Modern period
Published online: 16 February 2024
https://doi.org/10.1075/lic.00038.kri
https://doi.org/10.1075/lic.00038.kri
Abstract
We use Universal Dependencies (UD) for the study of cross-linguistic diachronic syntactic complexity reduction.
Specifically, we look at whether and how scientific English and German minimize the length of syntactic dependency relations in
the Late Modern period (ca. 1650–1900). Our linguistic analysis follows the assumption that over time, scientific discourse
cross-linguistically develops towards an increasingly efficient syntactic code by minimizing Dependency Length (DL) as a factor of
syntactic complexity. For each language, we analyse a large UD-annotated scientific and general language corpus for comparison.
While on a macro level, our analysis suggests that there is an overall diachronic cross-linguistic and cross-register reduction in
Average Dependency Length (ADL), on the micro level we find that only scientific language shows a sentence length independent
reduction of ADL, while general language shows an overall decrease of ADL due to sentence length reduction. We further analyse the
syntactic constructions responsible for this reduction in both languages, showing that both scientific English and German
increasingly make use of short, intra-phrasal dependency relations while long dependency relations such as clausal embeddings
become rather disfavoured over time.
Article outline
- 1.Introduction
- 2.Related work
- 3.Data and methods
- 3.1Corpora
- 3.2Average Dependency Length
- 4.Analyses and results
- 4.1Interplay of Average Dependency Length and Sentence Length
- 4.1.1English
- 4.1.2German
- 4.1.3English vs. German
- 4.2Inspecting different sentence lengths for DLM
- 4.2.1English
- 4.2.2German
- 4.2.3English vs. German
- 4.3Analyzing SL30
- 4.1Interplay of Average Dependency Length and Sentence Length
- 5.Summary and conclusions
- Acknowledgements
- Notes
References
References (40)
Aarts, B., López-Couso, M. and Méndez-Naya, B. 2012. Late Modern English Syntax. In English Historical Linguistics: An International Handbook, A. Bergs and L. J. Brinton (eds), 869–887. Berlin: De Gruyter Mouton.
Atkinson, D. 1996. The “Philosophical Transactions of the Royal Society of London,” 1675–1975: A Sociohistorical Discourse Analysis. Language in Society, 25(3), 333–371.
Banks, D. 2008. The Development of Scientific Writing. Linguistic Features and Historical Context. London: Equinox.
Romero-Barranco, J. 2020. Linguistic Complexity across two Early Modern English Scientific Text Types. Atlantis 42(2): 50–71.
Beneš, E. 1981. Die formale Struktur der wissenschaftlichen Fachsprachen aus syntaktischer Hinsicht. In Wissenschaftssprache, T. Bungarten (ed.), 185–212. München: Fink.
Biber, D. 2006. University Language: A Corpus-Based Study of Spoken and Written Registers. Amsterdam: John Benjamins.
Biber, D. and Clark, V. 2002. Historical Shifts in Modification Patterns with Complex Noun Phrase Structures. In English Historical Morphology, T. Fanego, M. López-Couso and J. Pérez-Guerra (eds). 43–66. Amsterdam: John Benjamins.
Biber, D. and Gray, B. 2011. The Historical Shift of Scientific Academic Prose in English towards less Explicit Styles of Expression: Writing without Verbs. In Researching Specialized Languages, V. Bathia, P. Sánchez, and P. Pérez-Paredes (eds), 11–24. Amsterdam: John Benjamins.
2016. Grammatical Complexity in Academic English: Linguistic Change in Writing. Studies in English Language. Cambridge: Cambridge University Press.
De Marneffe, M., Manning, C., Nivre, J. and Zeman, D. 2021. Universal Dependencies. Computational Linguistics 47(2): 255–308.
Degaetano-Ortlieb, S., Kermes, H., Khamis, A. and Teich, E. 2018. An Information-Theoretic Approach to Modelling Diachronic Change in Scientific English. In From Data to Evidence in English Language Research, C. Suhr, T. Nevalainen and I. Taavitsainen (eds), 258–281. Leiden: Brill.
Fischer, S., Menzel, K., Knappen, J. and Teich, E. 2020. The Royal Society Corpus 6.0 providing 300+ Years of Scientific Writing for Humanistic Study. Proceedings of the Twelfth Language Resources and Evaluation Conference (LREC 2020). Marseille, France, 11–16 May 2020. European Language Resources Association. 794–802.
Futrell, R., Mahowald, K. and Gibson, E. 2015. Large-Scale Evidence of Dependency Length Minimization in 37 languages. Proceedings of the National Academy of Sciences 112(33): 10336–10341.
Gerdes, K. and Kahane, S. 2001. Word Order in German: A Formal Dependency Grammar using a Topological Hierarchy. Proceedings of the Thirty-Nineth Annual Meeting of the Association for Computational Linguistics. Toulouse, France, 6–11 July. Association for Computational Linguistics. 220–227.
Geyken, A., Boenig, M., Haaf, S., Jurish, B., Thomas, C. and Wiegand, F. 2018. Das Deutsche Textarchiv als Forschungsplattform für historische Daten in CLARIN. In Germanistische Sprachwissenschaft um 2020, H. Lobin, R. Schneider and A. Witt (eds), 219–248. Berlin: De Gruyter.
Gibson, E., Futrell, R., Piantadosi, S., Dautriche, V., Mahowald, K., Bergen, L. and Levy, R. 2019. How Efficiency Shapes Human Language. Trends in Cognitive Sciences 23(5): 389–407.
Gibson, E. 2000. Dependency Locality Theory: A Distance-Based Theory of Linguistic Complexity. In Image, Language, Brain: Papers from the first Mind Articulation Project Symposium, A. Miyashita, Y. Marantz and W. O’Neil (eds), 95–126. Cambridge: MIT Press.
Gildea, D. and Temperley, D. 2010. Do Grammars minimize Dependency Length? Cognitive Science 34(2): 286–310.
Gonnerman, L. and Hayes, C. 2005. The Professor Chewed the Students… out: Effects of Dependency, Length, and Adjacency on Word Order Preferences in Sentences with Verb Particle Constructions. Proceedings of the Twenty-Seventh Annual Meeting of the Cognitive Science Society (CogSci 05). Stresa, Italy, July 21–23. 785–790.
Gulordava, K., Merlo, P. and Crabbé, B. 2015. Dependency Length Minimisation Effects in Short Spans: A Large-Scale Analysis of Adjective Placement in Complex Noun Phrases. Proceedings of the Fifty-Third Annual Meeting of the Association for Computational Linguistics and the Seventh International Joint Conference on Natural Language Processing. Beijing, China, July 26–31. 477–482.
Habermann, M. 2011. Deutsche Fachtexte der Neuzeit. Naturkundlich-medizinische Wissensvermittlung im Spannungsfeld von Latein und Volkssprache. Berlin: De Gruyter.
Halliday, M. A. K. and Martin, J. 1993. Writing Science: Literacy and Discursive Power. London: Falmer Press.
Halliday, M. A. K. 1988. On the Language of Physical Science. In Registers of Written English: Situational Factors and Linguistic Features, M. Ghadessy (ed.), 162–177. London: Pinter.
Heringer, H., Strecker, B. and Wimmer, R. 1980. Syntax: Fragen, Lösungen, Alternativen. München: Fink.
Hundt, M., Denison, D. and Schneider, G. 2012. Relative Complexity in Scientific Discourse. English Language & Linguistics 16(2): 209–240.
Juzek, T. S., Krielke, M.-P. and Teich, E. 2020. Exploring Diachronic Syntactic Shifts with Dependency Length: The Case of Scientific English. Proceedings of the Fourth Workshop on Universal Dependencies. Barcelona, Spain, 13 December 2020. Association for Computational Linguistics. 109–119.
Krielke, M.-P. 2021. Relativizers as Markers of Grammatical Complexity: A Diachronic, Cross-Register Study of English and German. Bergen Language and Linguistics Studies 11(1): 91–120.
Krielke, M.-P., Talamo, L., Fawzi, M. and Knappen, J. 2022. Tracing Syntactic Change in the Scientific Genre: Two Universal Dependency-Parsed Diachronic Corpora of Scientific English and German. Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC 2022). Marseille, France, 20–25 June 2022. European Language Resources Association. 4808–4816.
Lei, L. and Wen, J. 2020. Is Dependency Distance Experiencing a Process of Minimization? A Diachronic Study Based on the State of the Union Addresses. Lingua 2391: 102762.
Liu, H., Xu, C. and Liang, J. 2017. Dependency Distance: A New Perspective on Syntactic Patterns in Natural Languages. Physics of Life Reviews 211: 171–193.
Möslein, K. 1974. Einige Entwicklungstendenzen in der Syntax der wissenschaftlich-technischen Literatur seit dem Ende des 18. Jahrhunderts. Zur Geschichte der deutschen Sprache und Literatur 941: 156–198.
Nicenboim, B., Vasishth, S., Gattei, C., Sigman, M., and Kliegl, R. 2015. Working Memory Differences in Long-Distance Dependency Resolution. Frontiers in Psychology 61: 312.
Rudnicka, K. 2018. Variation of Sentence Length across Time and Genre: Influence on Syntactic Usage in English. In Diachronic Corpora, Genre, and Language Change, R. J. Whitt (ed.), 219–240. Amsterdam: John Benjamins.
Straka, M. and Straková, J. 2017. Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPIPE. Proceedings of the Conference on Computational Natural Language Learning (CoNLL) 2017. Vancouver, Canada, 3–4 August 2017. 88–99.
Tily, H. 2010. The Role of Processing Complexity in Word Order Variation and Change. PhD Thesis, Stanford University.
