In:Mathematical Modelling in Linguistics and Text Analysis: Theory and applications
Edited by Adam Pawłowski, Sheila Embleton, Jan Mačutek and Aris Xanthos
[Current Issues in Linguistic Theory 370] 2025
► pp. 191–206
Measuring language complexity about European politics in Swiss parliamentary debates
Published online: 13 October 2025
https://doi.org/10.1075/cilt.370.16rev
https://doi.org/10.1075/cilt.370.16rev
Abstract
This study investigates changes in the political discourse surrounding Europe and European integration in
Swiss parliamentary debates from 1995 to 2022. It relies on text analysis methods for measuring linguistic complexity and
benchmarks the observed trends against topicality, party affiliation and other information sources. Analysis of variance
assesses how linguistic complexity indicators vary across these factors, principal component analysis identifies the
multidimensionality of complexity in parliamentary speeches, and an experiment is conducted with ChatGPT to reduce the
complexity of the most complex parliamentary speeches about Europe. Results highlight a decline in complexity over time in
speeches related to the EU debate and overall. Furthermore, the study identifies four key dimensions of complexity
(technicality, clarity, accessibility and familiarity), and shows the intervention of ChatGPT to reduce technicality and
improve familiarity of complex political speeches.
Article outline
- 1.Introduction
- 2.Theoretical background
- 2.1Measurement of complexity
- 2.1.1Exploring language complexity in sociolinguistics and political discourse
- 2.1.2Assessing complexity in political discourse
- 2.2Strategic incentives of complexity
- 2.1Measurement of complexity
- 3.Data and measures
- 3.1Data sources
- 3.2Measures
- 3.3Analytical strategy
- 4.Results
- 4.1Benchmarking trends in linguistic complexity indicators
- 4.2Extracting relevant political text complexity dimensions
- 4.3Experiment: Potential of ChatGPT to reduce complex speeches
- 5.Discussion of the main findings
Note References Appendix
References (39)
Ayre, Julia, Olivia Mac, Kirsten McCaffery, Brad McKay, Mingyi Liu, Yi Shi, Atria Rezwan & Adam Dunn. 2023. New
frontiers in health literacy: using ChatGPT to simplify health information for people in the
community. Journal of general internal
medicine 39(4). 573–577.
Benoit, Kenneth, Kevin Munger & Arthur Spirling. 2019. Measuring
and explaining political sophistication through textual complexity. American Journal of
Political
Science 63(2). 491–508.
Biber, Douglas, Susan Conrad & Randi Reppen. 1998. Corpus
linguistics: Investigating language structure and
use. Cambridge: Cambridge University Press.
Biber, Douglas & Edward Finegan. 1989. Styles
of stance in English: Lexical and grammatical marking of evidentiality and
affect. Text 9(1). 93–124.
Biber, Douglas, Stig Johansson, Geoffrey Leech, Susan Conrad & Edward Finegan. 1999. Longman
grammar of spoken and written
English. London: Longman.
Biber, Douglas & Bethany Gray. 2016. Grammatical
complexity in academic English. Linguistic change in writing. studies in English
language. Cambridge: Cambridge University Press.
Bischof, Daniel & Roman Senninger. 2018. Simple
politics for the people? Complexity in campaign messages and political
knowledge. European Journal of Political
Research 57(2). 473–495.
Ehret, Katharina & Maite Taboada. 2021. The
interplay of complexity and subjectivity in opinionated discourse. Discourse
Studies 23(2). 141–165.
Fairclough, Norman. 2003. Analysing
discourse: Textual analysis for social
research. London: Routledge.
Fournier-Tombs, Eleonore & Michael K. MacKenzie. 2021. Big
data and democratic speech: Predicting deliberative quality using machine learning
techniques. Methodological
Innovations 14(2).
Fowler, Roger. 1979. Preliminaries
to a sociolinguistic theory of literary
discourse. Poetics 8(6). 531–556.
Hutter, Swen & Edgar Grande. 2014. Politicizing
Europe in the national electoral arena: A comparative analysis of five West European countries,
1970–2010. Journal of Common Market
Studies 52(5). 1002–1018.
Hyland, Ken. 1998. Hedging
in scientific research
articles. Amsterdam: Benjamins.
. 2004. Disciplinary
interactions: Metadiscourse in L2 postgraduate writing. Journal of Second Language
Writing 13(2). 133–151.
Jagaiah, T., Olinghouse, N. G. & Kearns, D. M. 2020. Syntactic
complexity measures: variation by genre, grade-level, students’ writing abilities, and writing
quality. Read
Writ 33, 2577–2638.
Kincaid, J. Peter, Robert P. Fishburne Jr., Richard L. Rogers & Brad S. Chissom. 1975. Derivation
of new readability formulas (automated readability index, fog count and Flesch reading ease formula) for navy enlisted
personnel. Naval Technical Training Command Millington TN Research
Branch. [URL].
Levy, Roger & T. Florian Jaeger. 2006. Speakers
optimize information density through syntactic
reduction. In Advances in neural information processing
systems 19, (NIPS 2006). [URL]
Malvern, David & Brian Richards. 2002. Investigating
accommodation in language proficiency interviews using a new measure of lexical
diversity. Language
Testing, 19(1). 85–104.
McWhorter, John H. 2001. The world’s simplest
grammars are creole grammars. Linguistic
Typology 5:2(3). 125–166.
Michel, Jean-Baptiste et al. 2011. Quantitative
analysis of culture using millions of digitized
books. Science 331(6014). 176–182.
Pitler, Emily & Ani Nenkova. 2008. Revisiting
readability: A unified framework for predicting text
quality. In Proceedings of the 2008 conference on empirical
methods in natural language
processing, 186–195. ACL.
Rauh, Christian. 2022. Clear
messages to the European public? The language of European Commission press releases
1985–2020. Journal of European
Integration 45(4). 683–701.
Rauh, Christian, Bart Joachim Bes & Martijn Schoonvelde. 2020. Undermining,
defusing or defending European integration? Assessing public communication of European executives in times of EU
politicisation. European Journal of Political
Research 59(2). 397–423.
Rauh, Christian & Pieter De Wilde. 2018. The
opposition deficit in EU accountability: Evidence from over 20 years of plenary debate in four member
states. European Journal of Political
Research 57(1). 194–216.
Reveilhac, Maud & Davide Morselli. 2022. Populism
in an identity framework: A feedback model. Communication
Theory 32(1). 88–115.
Schoonvelde, Martijn, Anna Brosius, Gijs Schumacher & Bert N. Bakker. 2019. Liberals
lecture, conservatives communicate: Analyzing complexity and ideology in 381,609 political
speeches. PloS
one 14(2). e0208450.
Steenbergen, Marco, André Bächtiger, Markus Spörndli & Jürg Steiner. 2003. Measuring
political deliberation: A discourse quality index. Comparative European
Politics 1. 21–48.
Suedfeld, Peter & Philip Tetlock. 1977. Integrative
complexity of communications in international crises. Journal of conflict
resolution 21(1). 169–184.
Szmrecsanyi, Benedikt. 2009. Typological
parameters of intralingual variability: Grammatical analyticity versus syntheticity in varieties of
English. Language Variation and
Change 21(3). 319–353.
Tolochko, Petro & Hajo Boomgaarden. 2019. Determining
political text complexity: Conceptualizations, measurements, and
application. International Journal of
Communication 13(21).
Tucker, Ethan C., Colton J. Capps & Lior Shamir. 2020. A
data science approach to 138 years of congressional
speeches. Heliyon 6(8). e04417.
Tyrkkö, Jukka. 2016. Looking
for rhetorical thresholds: Pronoun frequencies in political speeches. Studies in
Variation, Contacts and Change in English, Vol. 17. [URL]
Vliegenthart, Rens, Andreas R. Schuck, Hajo G. Boomgaarden & Claes H. De Vreese. 2008. News
coverage and support for European integration, 1990–2006. International Journal of
Public Opinion
Research 20(4). 415–439.
Wiebe, Janyce, Theresa Wilson, Rebecca Bruce, Matthew Bell & Melanie Martin. 2004. Learning
subjective language. Computational
Linguistics 30(3). 277–308.
