Cover not available

Mathematical Modelling in Linguistics and Text Analysis

Theory and applications

Editor

Adam Pawłowski | University of Wroclaw

Sheila Embleton | York University

Jan Mačutek | Slovak Academy of Sciences | Constantine the Philosopher University in Nitra

Aris Xanthos | University of Lausanne

Hardbound – Available

ISBN 9789027228376 | EUR 125.00 | USD 163.00

e-Book –

ISBN 9789027244482 | EUR 125.00 | USD 163.00

Get fulltext from our e-platform

Download Book PDF

Download Book EPUB

This book is a panorama of contemporary quantitative linguistics, as developed over decades. It highlights the main topics of QL: statistical laws of language, taxonomy of linguistic phenomena, authorial attribution, quantitative analysis of syntax (e.g., dependency grammar), measurement of text difficulty, and other phenomena at the intersection of linguistics, literary studies, semiotics, and information science. It also reflects on the relevance of these time-honoured approaches in our new reality increasingly dominated by AI – both in terms of text material and methodology. Before our very eyes, computers are achieving human-level linguistic competence. The era of LLMs and the growing dominance of machine-generated text is becoming reality. The scale of these changes, initiated by the replacement of print with the digital universe, is enormous. Today, linguistics is closer than ever to mathematics and computer science, and thus quantitatively-oriented linguists are particularly well-suited to address questions about the boundary between humans and machines in scientific research.

[Current Issues in Linguistic Theory, 370] 2025. vi, 241 pp.

Publishing status: Available
Published online on 13 October 2025

© John Benjamins

https://doi.org/10.1075/cilt.370

Table of Contents

Adam Pawłowski, Sheila Embleton, Jan Mačutek and Aris Xanthos | pp. 1–5

Testing diachronic measures of productivity using the Zipf-Mandelbrot law

Quentin Feltgen | pp. 6–16

Not just frequency: Keyness should integrate frequency, association, and dispersion

Stefan Th. Gries | pp. 17–26

Noun declension in Slavic languages: Animacy has a stronger influence than gender

Ján Mačutek, Emmerich Kelih, Michaela Koščová and Mirjam Sepesy Maučec | pp. 27–42

Simple stochastic processes behind Menzerath’s Law

Jiří Milička | pp. 43–59

Distributional properties of linear dependency segments

Michaela Nogolová, Ján Mačutek and Radek Čech | pp. 60–69

The length and order of grammatical elements in clauses in Japanese

Haruko Sanada | pp. 70–80

Analyzing Japanese texts with evaluation of randomness in binary expression

Yosuke Takubo, Masayuki Asahara and Makoto Yamazaki | pp. 81–89

Distribution of dependency tags in different text types in Czech

Xinying Chen and Miroslav Kubát | pp. 90–103

Discourse markers’ role in syntactic complexity of sentence structure: A distance-driven case study based on TED talks

Zheyuan Dai and Jianwei Yan | pp. 104–117

Development of mean dependency distance in Czech L2 texts

Michaela Hanušková, Michaela Nogolová and Miroslav Kubát | pp. 118–127

A comparative analysis of stylometry and authorship attribution in a creole and non-creole language

Patrick Juola and Alejandro J. Napolitano Jawerbaum | pp. 128–137

Long time no Joe: Trends of post-WWII proper names development in Czechia

Tereza Klemensová, Michal Místecký and Peter Zörnig | pp. 138–148

Probabilistic regularity in translation: A quantitative description of dependency treebank of translated academic abstracts

Yan Liang | pp. 149–160

Linguistic correlates of semantic knowledge ontologies: An example of UDC and DDC classification

Adam Pawłowski, Tomasz Walkowiak and Lars G. Johnsen | pp. 161–172

CapekDraCor database and some aspects of quantitative linguistic analysis of the Čapek brothers’ plays

Petr Pořízka | pp. 173–190

Measuring language complexity about European politics in Swiss parliamentary debates

Maud Reveilhac and Gerold Schneider | pp. 191–206

Corpus-driven analysis using Convolutional Neural Networks with Multi-Head Attention

Laurent Vanni, Sofiane Haris and Damon Mayaffre | pp. 207–216

Visualizing character profile shifts in English texts over the centuries

Eric S. Wheeler and Sheila Embleton | pp. 217–225

Quantitative assessment of paralinguistic accommodation in WhatsApp chats: A user-centered case study

Aris Xanthos | pp. 226–235

Subject index | pp. 237–239

Language index | p. 241

Subjects and metadata

Linguistics

Theoretical linguistics

Computational & corpus linguistics

Main BIC Subject

CFX: Computational linguistics

Main BISAC Subject

LAN009000: LANGUAGE ARTS & DISCIPLINES / Linguistics / General

ONIX Metadata

VPAT

ePub Accessibility Conformance Report (VPAT)

LoC, MARC XML

U.S. Library of Congress Control Number: 2025029878 | Marc record