In:Current Issues in Morphological Theory: (Ir)regularity, analogy and frequency
Edited by Ferenc Kiefer †, Mária Ladányi and Péter Siptár
[Current Issues in Linguistic Theory 322] 2012
► pp. 135–162
Morphological complexity and unsupervised learning
Validating Russian inflectional classes using high frequency data
Published online: 30 May 2012
https://doi.org/10.1075/cilt.322.07bro
https://doi.org/10.1075/cilt.322.07bro
This paper addresses the question of whether it is possible to use machine learning techniques on linguistic data to validate linguistic theory. We determine how readily inflectional classes recognized by linguists can be inferred by an unsupervised learning method when it is presented with the paradigms of a small number (80) of high frequency Russian noun lexemes. We interpret this as a measure of the validity of the linguistic theory. Inflectional classes are of particular interest, because they constitute a kind of autonomous morphological complexity that has no direct relationship to other levels of linguistic description, and hence there is no other objective way of assessing a theoretical characterization of them. Using the same method, we also examine the status of principal parts and defaults in inflectional classes, and the relationship between inflectional classes and stress in Russian nominal morphology. Our experiments suggest that this is an effective and interesting technique for shedding additional light on theoretical claims.
Cited by (4)
Cited by four other publications
Bonami, Olivier & Benoît Sagot
Goldsmith, John A., Jackson L. Lee & Aris Xanthos
CRYSMANN, BERTHOLD & OLIVIER BONAMI
This list is based on CrossRef data as of 6 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
