Get fulltext from our e-platform

Mathematical Modelling in Linguistics and Text Analysis
Theory and applications
This book is a panorama of contemporary quantitative linguistics, as developed over decades. It highlights the main topics of QL: statistical laws of language, taxonomy of linguistic phenomena, authorial attribution, quantitative analysis of syntax (e.g., dependency grammar), measurement of text difficulty, and other phenomena at the intersection of linguistics, literary studies, semiotics, and information science. It also reflects on the relevance of these time-honoured approaches in our new reality increasingly dominated by AI – both in terms of text material and methodology. Before our very eyes, computers are achieving human-level linguistic competence. The era of LLMs and the growing dominance of machine-generated text is becoming reality. The scale of these changes, initiated by the replacement of print with the digital universe, is enormous. Today, linguistics is closer than ever to mathematics and computer science, and thus quantitatively-oriented linguists are particularly well-suited to address questions about the boundary between humans and machines in scientific research.
[Current Issues in Linguistic Theory, 370] 2025. vi, 241 pp.
Publishing status: Available
Published online on 13 October 2025
Published online on 13 October 2025
© John Benjamins
Table of Contents
- IntroductionAdam Pawłowski, Sheila Embleton, Jan Mačutek and Aris Xanthos | pp. 1–5
- Testing diachronic measures of productivity using the Zipf-Mandelbrot lawQuentin Feltgen | pp. 6–16
- Not just frequency: Keyness should integrate frequency, association, and dispersionStefan Th. Gries | pp. 17–26
- Noun declension in Slavic languages: Animacy has a stronger influence than genderJán Mačutek, Emmerich Kelih, Michaela Koščová and Mirjam Sepesy Maučec | pp. 27–42
- Simple stochastic processes behind Menzerath’s LawJiří Milička | pp. 43–59
- Distributional properties of linear dependency segmentsMichaela Nogolová, Ján Mačutek and Radek Čech | pp. 60–69
- The length and order of grammatical elements in clauses in JapaneseHaruko Sanada | pp. 70–80
- Analyzing Japanese texts with evaluation of randomness in binary expressionYosuke Takubo, Masayuki Asahara and Makoto Yamazaki | pp. 81–89
- Distribution of dependency tags in different text types in CzechXinying Chen and Miroslav Kubát | pp. 90–103
- Discourse markers’ role in syntactic complexity of sentence structure: A distance-driven case study based on TED talksZheyuan Dai and Jianwei Yan | pp. 104–117
- Development of mean dependency distance in Czech L2 textsMichaela Hanušková, Michaela Nogolová and Miroslav Kubát | pp. 118–127
- A comparative analysis of stylometry and authorship attribution in a creole and non-creole languagePatrick Juola and Alejandro J. Napolitano Jawerbaum | pp. 128–137
- Long time no Joe: Trends of post-WWII proper names development in CzechiaTereza Klemensová, Michal Místecký and Peter Zörnig | pp. 138–148
- Probabilistic regularity in translation: A quantitative description of dependency treebank of translated academic abstractsYan Liang | pp. 149–160
- Linguistic correlates of semantic knowledge ontologies: An example of UDC and DDC classificationAdam Pawłowski, Tomasz Walkowiak and Lars G. Johnsen | pp. 161–172
- CapekDraCor database and some aspects of quantitative linguistic analysis of the Čapek brothers’ playsPetr Pořízka | pp. 173–190
- Measuring language complexity about European politics in Swiss parliamentary debatesMaud Reveilhac and Gerold Schneider | pp. 191–206
- Corpus-driven analysis using Convolutional Neural Networks with Multi-Head AttentionLaurent Vanni, Sofiane Haris and Damon Mayaffre | pp. 207–216
- Visualizing character profile shifts in English texts over the centuriesEric S. Wheeler and Sheila Embleton | pp. 217–225
- Quantitative assessment of paralinguistic accommodation in WhatsApp chats: A user-centered case studyAris Xanthos | pp. 226–235
- Subject index | pp. 237–239
- Language index | p. 241