Article published In: The Mental Lexicon: Online-First Articles
Morpho-phonology is not independent of semantics
The case of German nominal number marking
Published online: 12 September 2025
https://doi.org/10.1075/ml.24008.pla
https://doi.org/10.1075/ml.24008.pla
Abstract
Morpho-phonological alternations in inflectional paradigms are commonly analyzed as purely formal phenomena, in which the mapping of phonological structure and morpho-syntactic categories is organized without recourse to semantic properties of the words involved. The present paper explores the role of semantics using the Discriminative Lexicon approach (Baayen, R. H., Chuang, Y.-Y., & Heitmeier, M. (2019). Wpmwithldl:
Implementation of word and paradigm morphology with linear discriminative learning [Computer software
manual]. (R package version 1.4.6, available at [URL])). The test case explored in this paper is German nominal number, a system involving complex morpho-phonological variation (e.g. Köpcke, K.-M., Schimke, S., & Wecker, V. (2021). Processing
of german noun plurals: Evidence for first- and second-order schemata. Word
Structure, 14 (1), 1–24. ; Heitmeier, M., Chuang, Y.-Y., & Baayen, R. H. (2021). Modeling
morphology with linear discriminative learning: Considerations and design choices. Frontiers in
Psychology, 4929. ; Plag, I., Heitmeier, M., & Domahs, F. (2024). German
nominal number interpretation in an impaired mental lexicon: A naive discriminative learning
perspective. The Mental
Lexicon. 18(3), 417–445.; McCurdy, K. (2024). Rules,
frequency, and predictability in morphological generalization: Behavioral and computational evidence from the german plural
system. Edinburgh Research Archive. [URL].). Using word2vec vectors as semantic representations, and triphones as form representations, we created two-layer linear discriminative learning (LDL) networks that map form representations directly onto semantic representations (modeling comprehension), and semantic representations onto form representations (modeling production). The LDL mappings successfully predict the forms and the meanings of the singular and plural nouns taken from a pertinent study (Domahs, F., Bartha-Doering, L., Domahs, U., & Delazer, M. (2017). Wie
muss ein “guter” Deutscher Plural klingen? In N. Fuhrhop, R. Szczepaniak, & K. Schmidt (Eds.), Sichtbare
und hörbare
Morphologie (pp. 205–237). Berlin & Boston: De Gruyter Mouton. ). A number of semantic and phonological measures derived from the LDL network also very successfully distinguished between singular and plural forms. Our results demonstrate that semantics, in addition to formal and grammatical properties, may play a decisive role in the representation and processing of German nominal number. The system of German nominal number can be understood as emerging from the distributional properties of words on the one hand, and basic principles of discriminative human learning on the other.
Article outline
- 1.Introduction
- 2.Modeling German nominal number
- 2.1Previous approaches
- 2.2Modeling nominal number with discriminative learning
- 3.Methodology
- 3.1The data
- 3.2The baseline models
- 3.3The LDL model
- 4.Results
- 4.1Predicting form and meaning using LDL
- 4.2Predicting number
- 4.2.1The baseline models: Predicting number based on structural-phonological properties
- 4.2.2Predicting number using the LDL model: T-SNE and LDA analysis
- 4.3Inspecting individual LDL measures: PCA analysis
- 5.Discussion and conclusion
- Notes
References
References (85)
Arnold, D., Tomaschek, F., Lopez, F., Sering, K., & Baayen, R. H. (2017). Words
from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that
discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition
unit. PLOS
ONE, 12 (4), e0174623. Retrieved
from [URL].
Arseniev-Koehler, A. (2024). Theoretical
foundations and limits of word embeddings: What types of meaning can they capture? Sociological
Methods &
Research, 53 (4), 1753–1793.
Auguste, J., Rey, A., & Favre, B. (2017). Evaluation
of word embeddings against cognitive processes: Primed reaction times in lexical decision and naming
tasks. In Proceedings of the 2nd workshop on evaluating vector space
representations for
NLP (pp. 21–26).
Baayen, R. H. (2008). Analyzing
linguistic data. A practical introduction to statistics using
R. Cambridge: Cambridge University Press.
Baayen, R. H., Chuang, Y.-Y., & Blevins, J. P. (2018). Inflectional
morphology with linear mappings. The Mental
Lexicon, 13 (2), 230–268.
Baayen, R. H., Chuang, Y.-Y., & Heitmeier, M. (2019). Wpmwithldl:
Implementation of word and paradigm morphology with linear discriminative learning [Computer software
manual]. (R package version 1.4.6, available at [URL])
Baayen, R. H., Chuang, Y.-Y., Shafaei-Bajestan, E., & Blevins, J. P. (2019). The
discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production
grounded not in (de) composition but in linear discriminative
learning. Complexity, 2019 (1), 1–39.
Baayen, R. H., Milin, P., Filipović Durdević, D., Hendrix, P., & Marelli, M. (2011). An
amorphous model for morphological processing in visual comprehension based on naive discriminative
learning. Psychological
Review, 118 (3), 438–481.
Baayen, R. H., & Moscoso del Prado Martín, F. (2005). Semantic
density and past-tense formation in three germanic
languages. Language, 811, 666–698.
Behrens, H. (2009). Usage-based
and emergentist approaches to language
acquisition. Linguistics, 47 (2), 383–411.
Berent, I., Pinker, S., Tzelgov, J., Bibi, U., & Goldfarb, L. (2005). Computation
of semantic number from morphological information. Journal of Memory and
Language, 531, 342–358.
Beser, D. (2021). Falling
through the gaps: Neural architectures as models of morphological rule learning. Retrieved
from [URL]
Boleda, G. (2020). Distributional
semantics and linguistic theory. Annual Review of
Linguistics, 6 (1), 213–234.
Buch, A. (2011). Linguistic
spaces: Kernel-based models of natural language (PhD
Dissertation). Universität Tübingen, Tübingen.
Carlson, M. T., & Crosson, A. C. (2025). The
synchronic status of historical bound roots in the mental lexicon: A dynamic, psychocentric
perspective. The Mental
Lexicon. 19(2), 224–252.
Charlesworth, T. E., Yang, V., Mann, T. C., Kurdi, B., & Banaji, M. R. (2021). Gender
stereotypes in natural language: Word embeddings show robust consistency across child and adult language corpora of more than
65 million words. Psychological
Science, 32 (2), 218–240.
Chuang, Y.-Y., & Baayen, R. H. (2021). Discriminative
learning and the lexicon: Ndl and ldl. In Oxford research
encyclopedia of linguistics. Oxford: Oxford University Press.
Chuang, Y.-Y., Brown, D., Baayen, H., & Evans, R. (2023). Paradigm
gaps are associated with weird “distributional semantics” properties. The Mental
Lexicon. 17(3), 395–421.
Daelemans, W. (2002). A
comparison of analogical modeling of language to memory-based language
processing. In R. Skousen, D. Lonsdale, & D. B. Parkinson (Eds.), Analogical
modeling: An exemplar-based approach to
language (pp. 157–179). Amsterdam: John Benjamins.
Daelemans, W., Zavrel, J., van der Sloot, K., & van den Bosch, A. (2007). TiMBL:
Tilburg Memory Based Learner, version 6.0, Reference Guide: LK Technical Report
04–02. Tilburg: ILK.
Dankers, V., Langedijk, A., McCurdy, K., Williams, A., & Hupkes, D. (2021). Generalising
to german plural noun classes, from the perspective of a recurrent neural
network. In Proceedings of the 25th conference on computational
natural language
learning (pp. 94–108). Retrieved
from [URL].
Domahs, F., Bartha-Doering, L., Domahs, U., & Delazer, M. (2017). Wie
muss ein “guter” Deutscher Plural klingen? In N. Fuhrhop, R. Szczepaniak, & K. Schmidt (Eds.), Sichtbare
und hörbare
Morphologie (pp. 205–237). Berlin & Boston: De Gruyter Mouton.
Eisenberg, P., & Fuhrhop, N. (2020). Grundriss
der Deutschen Grammatik -- Das Wort (5.Auflage ed.). Stuttgart & Weimar: J.B. Metzler.
Fábregas, A. (2018). Defectiveness
in morphology. In Oxford research encyclopedia of
linguistics. Retrieved from [URL].
Günther, F., Petilli, M. A., & Marelli, M. (2020). Semantic
transparency is not invisibility: A computational model of perceptually-grounded conceptual combination in word
processing. Journal of Memory and
Language, 1121, 104104.
Hahn, U., & Nakisa, R. C. (2000). German
inflection: Single route or dual route? Cognitive
Psychology, 41 (4), 313–360.
Heitmeier, M. (2022). Judilingmeasures.jl. [URL]
Heitmeier, M., & Baayen, R. H. (2020). Simulating
phonological and semantic impairment of english tense inflection with linear discriminative
learning. The Mental
Lexicon, 15 (3), 385–421.
Heitmeier, M., Chuang, Y.-Y., & Baayen, H. (2025). The
Discriminative Lexicon: Theory, implementation in the Julia package JudiLing, and
applications. Manuscript, University of Tübingen.
Heitmeier, M., Chuang, Y.-Y., & Baayen, R. H. (2021). Modeling
morphology with linear discriminative learning: Considerations and design choices. Frontiers in
Psychology, 4929.
Hilpert, M. (2019). Higher-order
schemas in morphology: What they are, how they work, and where to find them. Word
Structure, 12 (3), 261–273.
Hothorn, T., & Zeileis, A. (2015). partykit:
A modular toolkit for recursive partytioning in r. The Journal of Machine Learning
Research, 16 (1), 3905–3909.
Kamin, L. J. (1969). Predictability,
surprise, attention, and conditioning. In B. A. Campbell & R. M. Church (Eds.), Punishment
and aversive
behavior (pp. 276–296). New York: Appleton-Century-Crofts.
Kassambara, A., & Mundt, F. (2020). Extract
and visualize the results of multivariate data analyses [r package factoextra version
1.0.7]. Retrieved from [URL]
(1993). Schemata
bei der pluralbildung im deutschen: Versuch einer kognitiven
morphologie (Vol. 471). Tübingen: G. Narr.
(1998). The
acquisition of plural marking in english and german revisited: Schemata versus rules. Journal
of Child
Language, 25 (2), 293–319.
Köpcke, K.-M., Schimke, S., & Wecker, V. (2021). Processing
of german noun plurals: Evidence for first- and second-order schemata. Word
Structure, 14 (1), 1–24.
Köpcke, K.-M., & Wecker, V. (2017). Source-
and product-oriented strategies in l2 acquisition of plural marking in
german. Morphology, 27 (1), 77–103. Retrieved
from [URL]
Luo, X. (2021). Judiling:
An implementation for discriminative learning in julia (Master
Thesis, Eberhard Karls University of Tübingen, Tübingen). Retrieved
from [URL]
MacLeod, B. (2015). A
critical evaluation of two approaches to defining perceptual
salience. Ampersand, 21, 83–92.
MacWhinney, B., Pléh, C., & Bates, E. (1985). The
development of sentence interpretation in hungarian. Cognitive
psychology, 17(2), 178–209.
Marcus, G. F., Brinkmann, U., Clahsen, H., Wiese, R., & Pinker, S. (1995). German
inflection: The exception that proves the rule. Cognitive
Psychology, 29 (3), 189–256.
McCurdy, K. (2024). Rules,
frequency, and predictability in morphological generalization: Behavioral and computational evidence from the german plural
system. Edinburgh Research Archive. [URL].
McCurdy, K., Goldwater, S., & Lopez, A. (2020). Inflecting
when there’s no majority: Limitations of encoder-decoder neural networks as cognitive models for german
plurals. Retrieved from [URL]
McCurdy, K., Lopez, A., & Goldwater, S. (2020). Conditioning,
but on which distribution? Grammatical gender in german plural
inflection. In Proceedings of the workshop on cognitive modeling and
computational
linguistics (pp. 59–65). Retrieved
from [URL].
Mujezinović, E., Kapatsinski, V., & van de Vijver, R. (2024). One
cue’s loss is another cue’s gain — learning morphophonology through unlearning. Cognitive
Science, 48 (5), e13450.
Nieder, J., Chuang, Y.-Y., van de Vijver, R., & Baayen, R. H. (2022). A
Discriminative Lexicon approach to word comprehension, production and processing: Maltese
plurals. Language, 99(2), 242–274.
Pearce, J. M., & Bouton, M. E. (2001). Theories
of associative learning in animals. Annual Review of
Psychology, 52 (1), 111–139.
Penke, M., & Krause, M. (2002). German
noun plurals: A challenge to the dual-mechanism model. Brain and
Language, 81 (1–3), 303–311.
Penke, M., Wimmer, E., Hennies, J., Hess, M., & Rothweiler, M. (2016). Inflectional
morphology in german hearing-impaired children. Logopedics Phoniatrics
Vocology, 41 (1), 9–26.
Pescuma, V. N., Zanini, C., Crepaldi, D., & Franzon, F. (2021). Form
and function: A study on the distribution of the inflectional endings in italian nouns and
adjectives. Frontiers in
Psychology, 121, 720228.
Plag, I., Heitmeier, M., & Domahs, F. (2024). German
nominal number interpretation in an impaired mental lexicon: A naive discriminative learning
perspective. The Mental
Lexicon. 18(3), 417–445.
Polišenská, D. (2010). Dutch
children’s acquisition of verbal and adjectival inflection (PhD
Dissertation). Universiteit van Amsterdam, Amsterdam.
Ramscar, M., Yarlett, D., Dye, M., Denny, K., & Thorpe, K. (2010). The
effects of feature-label-order and their implications for symbolic learning. Cognitive
Science, 34 (6), 909–957.
Rescorla, R., & Wagner, A. (1972). A
theory of pavlovian conditioning: Variations in the effectiveness of reinforcement and
nonreinforcement. In A. Black & W. Prokasy (Eds.), Classical
conditioning ii: Current research and
theory (p. 64–99). New York: Appleton-Century-Crofts.
Rescorla, R. A. (1988a). Behavioral
studies of pavlovian conditioning. Annual Review of
Neuroscience, 11 (1), 329–352.
(1988b). Pavlovian
conditioning. it’s not what you think it is. American
Psychologist, 43(3), 151–160.
Rosen, E. (2022). Modeling
human-like morphological prediction. In Proceedings of the Society
for Computation in
Linguistics 20221 (pp. 133–142). Retrieved
from [URL]
Saito, M., Tomaschek, F., & Baayen, R. H. (2020). Relative
functional load determines co-articulatory movements of the tongue
tip. In Proceedings of the 12th international seminar on speech
production. New Haven, CT. Retrieved from [URL] (217)
Schakel, A. M., & Wilson, B. J. (2015). Measuring
word significance using distributed representations of words. arXiv preprint
arXiv:1508.02297.
Schmid, H.-J., & Günther, F. (2016). Toward
a unified socio-cognitive framework for salience in language. Frontiers in
Psychology, 71, 1110.
Schmitz, D., Plag, I., Baer-Henney, D., & Stein, S. (2021). Durational
differences of word-final /s/ emerge from the lexicon: Modeling morpho-phonetic effects in pseudowords with linear
discriminative learning. Frontiers in
Psychology, 12. (680889)
Schäfer, M. (2025). The
role of meaning in the rivalry of -ity and -ness: Evidence from distributional
semantics. English Language and Linguistics.
Shafaei-Bajestan, E., Moradipour-Tari, M., Uhrig, P., & Baayen, R. H. (2021). LDL-auris:
A computational model, grounded in error-driven learning, for the comprehension of single spoken
words. Language, Cognition and
Neuroscience, 1–28.
(2024). The
pluralization palette: unveiling semantic clusters in english nominal pluralization through distributional
semantics. Morphology, 34 (4), 369–413.
Simoens, H., Housen, A., & De Cuypere, L. (2017). The
effect of perceptual salience on processing l2 inflectional
morphology. In S. M. Gass, P. Spinner, & J. Behney (Eds.), Salience
in second language
acquisition (pp. 107–130). New York: Routledge.
Stein, S. D., & Plag, I. (2021). Morpho-phonetic
effects in speech production: Modeling the acoustic duration of english derived words with linear discriminative
learning. Frontiers in
Psychology, 12. (678712)
Taatgen, N. A. (2001). Extending
the past-tense debate: A model of the german plural. In Proceedings
of the annual meeting of the Cognitive Science
Society (Vol. 231, pp. 94–108).
Tomaschek, F., Plag, I., Ernestus, M., & Baayen, R. H. (2021). Phonetic
effects of morphology and context: Modeling the duration of word-final S in English with naïve discriminative
learning. Journal of
Linguistics, 57 (1), 123–161.
Tomaschek, F., & Ramscar, M. (2022). Understanding
the phonetic characteristics of speech under uncertainty-implications of the representation of linguistic knowledge in
learning and processing. Frontiers in
Psychology, 131, 754395.
Trommer, J. (2021). The
subsegmental structure of german plural allomorphy. Natural Language & Linguistic
Theory, 39 (2), 601–656.
Van der Maaten, L., & Hinton, G. (2008). Visualizing
data using t-sne. Journal of Machine Learning
Research, 9 (11).
van de Vijver, R., & Uwambayinema, E. (2022). A
word-based account of comprehension and production of Kinyarwanda nouns in the Discriminative
Lexicon. Linguistics
Vanguard. 8(1), 197–207.
(2009). The
grammar and typology of plural noun inflection in varieties of German. Journal of Comparative
Germanic
Linguistics, 12 (2), 137–173.
Wulf, D. J. (2002). Applying
analogical modeling to the german plural. In R. Skousen, D. Lonsdale, & D. B. Parkinson (Eds.), Analogical
modeling: An exemplar-based approach to
language (pp. 109–122). Amsterdam: John Benjamins.
Yamada, I., Asai, A., Sakuma, J., Shindo, H., Takeda, H., Takefuji, Y., & Matsumoto, Y. (2020, October). Wikipedia2Vec:
An efficient toolkit for learning and visualizing the embeddings of words and entities from
Wikipedia. In Q. Liu & D. Schlangen (Eds.), Proceedings
of the 2020 conference on empirical methods in natural language processing: System
demonstrations (pp. 23–30). Online:
Association for Computational Linguistics. Retrieved from [URL].