In: Mathematical Modelling in Linguistics and Text Analysis: Theory and applications
Edited by Adam Pawłowski, Sheila Embleton, Ján Mačutek and Aris Xanthos
[Current Issues in Linguistic Theory 370] 2025
pp. 27–42
Noun declension in Slavic languages
Animacy has a stronger influence than gender
Published online: 13 October 2025
https://doi.org/10.1075/cilt.370.03mac
Abstract
Some quantitative properties of the inflexional morphology of nouns in four Slavic languages (Czech, Russian,
Slovak, and Slovene) are presented. We analyse the frequency behaviour of grammatical cases and the variability of noun word
forms. The difference between a word form and its lemma is expressed by Levenshtein distance. Across the four languages under
study, word forms more similar to the nominative form occur more often. We observe that the category of animacy has a decisive
influence on the properties under study, with grammatical gender being another important factor.
Keywords: Slavic languages, noun declension, animacy, gender, case, frequency
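The abstract measures the difference between a word form and its lemma by Levenshtein distance. The following is a minimal sketch of that measure, assuming a plain character-level edit distance with unit costs for insertion, deletion, and substitution; the chapter's exact procedure may differ. The function name `levenshtein` and the Czech forms of "žena" are illustrative choices and are not taken from the chapter's data.

```python
def levenshtein(a: str, b: str) -> int:
    """Character-level edit distance with unit costs (an assumed variant)."""
    # previous[j] is the distance between the prefix of a processed so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty prefix of b
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep on a match
            ))
        previous = current
    return previous[-1]


# Illustrative example only: some inflected forms of the Czech noun "žena"
lemma = "žena"
forms = ["žena", "ženy", "ženu", "žen", "ženami"]
distances = {form: levenshtein(form, lemma) for form in forms}
print(distances)
```

Under these assumptions the nominative form has distance 0 from the lemma, and forms with longer or more divergent endings receive larger values. The sketch compares Unicode code points, so the input strings should share one normalization form.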
Article outline
- 1. Introduction
- 2. Language material and methodology
- 2.1 Texts used
- 2.2 Methodological aspects
- 3. Results
- 3.1 Frequency of cases
- 3.2 Frequency of Levenshtein distances
- 4. Conclusion
