Morpho-phonology is not independent of semantics: The case of German nominal number marking

Plag, Ingo; Heitmeier, Maria; Domahs, Frank

doi:10.1075/ml.24008.pla

Article published In: The Mental Lexicon: Online-First Articles

Get fulltext from our e-platform

Download EPUB

Morpho-phonology is not independent of semantics

The case of German nominal number marking

Ingo Plag | Heinrich-Heine-Universität Düsseldorf

Maria Heitmeier | Universität Tübingen

Frank Domahs | Universität Erfurt

Published online: 12 September 2025

https://doi.org/10.1075/ml.24008.pla

Abstract

Morpho-phonological alternations in inflectional paradigms are commonly analyzed as purely formal phenomena, in which the mapping of phonological structure and morpho-syntactic categories is organized without recourse to semantic properties of the words involved. The present paper explores the role of semantics using the Discriminative Lexicon approach (). The test case explored in this paper is German nominal number, a system involving complex morpho-phonological variation (e.g. ; ; ; ). Using word2vec vectors as semantic representations, and triphones as form representations, we created two-layer linear discriminative learning (LDL) networks that map form representations directly onto semantic representations (modeling comprehension), and semantic representations onto form representations (modeling production). The LDL mappings successfully predict the forms and the meanings of the singular and plural nouns taken from a pertinent study (). A number of semantic and phonological measures derived from the LDL network also very successfully distinguished between singular and plural forms. Our results demonstrate that semantics, in addition to formal and grammatical properties, may play a decisive role in the representation and processing of German nominal number. The system of German nominal number can be understood as emerging from the distributional properties of words on the one hand, and basic principles of discriminative human learning on the other.

Keywords: number inflection, discriminative learning, semantic vectors, morpho-phonology, German

Article outline

1.Introduction
2.Modeling German nominal number
- 2.1Previous approaches
- 2.2Modeling nominal number with discriminative learning
3.Methodology
- 3.1The data
- 3.2The baseline models
- 3.3The LDL model
4.Results
- 4.1Predicting form and meaning using LDL
- 4.2Predicting number
  - 4.2.1The baseline models: Predicting number based on structural-phonological properties
  - 4.2.2Predicting number using the LDL model: T-SNE and LDA analysis
- 4.3Inspecting individual LDL measures: PCA analysis
5.Discussion and conclusion
Notes
References

References (85)

References

Anderson, J. R., & Lebiere, C. J. (1998). The atomic components of thought. Mahwah, NJ: Erlbaum.

Arnold, D., Tomaschek, F., Lopez, F., Sering, K., & Baayen, R. H. (2017). Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition unit. PLOS ONE, 12 (4), e0174623. Retrieved from [URL].

Arseniev-Koehler, A. (2024). Theoretical foundations and limits of word embeddings: What types of meaning can they capture? Sociological Methods & Research, 53 (4), 1753–1793.

Auguste, J., Rey, A., & Favre, B. (2017). Evaluation of word embeddings against cognitive processes: Primed reaction times in lexical decision and naming tasks. In Proceedings of the 2nd workshop on evaluating vector space representations for NLP (pp. 21–26).

Baayen, R. H. (2008). Analyzing linguistic data. A practical introduction to statistics using R. Cambridge: Cambridge University Press.

Baayen, R. H., Chuang, Y.-Y., & Blevins, J. P. (2018). Inflectional morphology with linear mappings. The Mental Lexicon, 13 (2), 230–268.

Baayen, R. H., Chuang, Y.-Y., & Heitmeier, M. (2019). Wpmwithldl: Implementation of word and paradigm morphology with linear discriminative learning [Computer software manual]. (R package version 1.4.6, available at [URL])

Baayen, R. H., Chuang, Y.-Y., Shafaei-Bajestan, E., & Blevins, J. P. (2019). The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de) composition but in linear discriminative learning. Complexity, 2019 (1), 1–39.

Baayen, R. H., Milin, P., Filipović Durdević, D., Hendrix, P., & Marelli, M. (2011). An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. Psychological Review, 118 (3), 438–481.

Baayen, R. H., & Moscoso del Prado Martín, F. (2005). Semantic density and past-tense formation in three germanic languages. Language, 811, 666–698.

Behrens, H. (2009). Usage-based and emergentist approaches to language acquisition. Linguistics, 47 (2), 383–411.

Berent, I., Pinker, S., Tzelgov, J., Bibi, U., & Goldfarb, L. (2005). Computation of semantic number from morphological information. Journal of Memory and Language, 531, 342–358.

Beser, D. (2021). Falling through the gaps: Neural architectures as models of morphological rule learning. Retrieved from [URL]

Boleda, G. (2020). Distributional semantics and linguistic theory. Annual Review of Linguistics, 6 (1), 213–234.

Boswijk, V., & Coler, M. (2020). What is salience? Open Linguistics, 61, 713–722.

Buch, A. (2011). Linguistic spaces: Kernel-based models of natural language (PhD Dissertation). Universität Tübingen, Tübingen.

Carlson, M. T., & Crosson, A. C. (2025). The synchronic status of historical bound roots in the mental lexicon: A dynamic, psychocentric perspective. The Mental Lexicon. 19(2), 224–252.

Charlesworth, T. E., Yang, V., Mann, T. C., Kurdi, B., & Banaji, M. R. (2021). Gender stereotypes in natural language: Word embeddings show robust consistency across child and adult language corpora of more than 65 million words. Psychological Science, 32 (2), 218–240.

Chuang, Y.-Y., & Baayen, R. H. (2021). Discriminative learning and the lexicon: Ndl and ldl. In Oxford research encyclopedia of linguistics. Oxford: Oxford University Press.

Chuang, Y.-Y., Brown, D., Baayen, H., & Evans, R. (2023). Paradigm gaps are associated with weird “distributional semantics” properties. The Mental Lexicon. 17(3), 395–421.

Daelemans, W. (2002). A comparison of analogical modeling of language to memory-based language processing. In R. Skousen, D. Lonsdale, & D. B. Parkinson (Eds.), Analogical modeling: An exemplar-based approach to language (pp. 157–179). Amsterdam: John Benjamins.

Daelemans, W., Zavrel, J., van der Sloot, K., & van den Bosch, A. (2007). TiMBL: Tilburg Memory Based Learner, version 6.0, Reference Guide: LK Technical Report 04–02. Tilburg: ILK.

Dankers, V., Langedijk, A., McCurdy, K., Williams, A., & Hupkes, D. (2021). Generalising to german plural noun classes, from the perspective of a recurrent neural network. In Proceedings of the 25th conference on computational natural language learning (pp. 94–108). Retrieved from [URL].

Diessel, H. (Ed.). (2019). The grammar network. Cambridge: Cambridge University Press.

Domahs, F., Bartha-Doering, L., Domahs, U., & Delazer, M. (2017). Wie muss ein “guter” Deutscher Plural klingen? In N. Fuhrhop, R. Szczepaniak, & K. Schmidt (Eds.), Sichtbare und hörbare Morphologie (pp. 205–237). Berlin & Boston: De Gruyter Mouton.

Eisenberg, P., & Fuhrhop, N. (2020). Grundriss der Deutschen Grammatik -- Das Wort (5.Auflage ed.). Stuttgart & Weimar: J.B. Metzler.

Fábregas, A. (2018). Defectiveness in morphology. In Oxford research encyclopedia of linguistics. Retrieved from [URL].

Firth, J. (1957). A synopsis of linguistic theory 1930–1955. Studies in Linguistic Analysis, 1–32.

Günther, F., Petilli, M. A., & Marelli, M. (2020). Semantic transparency is not invisibility: A computational model of perceptually-grounded conceptual combination in word processing. Journal of Memory and Language, 1121, 104104.

Hahn, U., & Nakisa, R. C. (2000). German inflection: Single route or dual route? Cognitive Psychology, 41 (4), 313–360.

Harris, Z. (1954). Distributional structure. Word, 10 (2–3), 146–162.

Heitmeier, M. (2022). Judilingmeasures.jl. [URL]

Heitmeier, M., & Baayen, R. H. (2020). Simulating phonological and semantic impairment of english tense inflection with linear discriminative learning. The Mental Lexicon, 15 (3), 385–421.

Heitmeier, M., Chuang, Y.-Y., & Baayen, H. (2025). The Discriminative Lexicon: Theory, implementation in the Julia package JudiLing, and applications. Manuscript, University of Tübingen.

Heitmeier, M., Chuang, Y.-Y., & Baayen, R. H. (2021). Modeling morphology with linear discriminative learning: Considerations and design choices. Frontiers in Psychology, 4929.

Hilpert, M. (2019). Higher-order schemas in morphology: What they are, how they work, and where to find them. Word Structure, 12 (3), 261–273.

Hothorn, T., & Zeileis, A. (2015). partykit: A modular toolkit for recursive partytioning in r. The Journal of Machine Learning Research, 16 (1), 3905–3909.

Kamin, L. J. (1969). Predictability, surprise, attention, and conditioning. In B. A. Campbell & R. M. Church (Eds.), Punishment and aversive behavior (pp. 276–296). New York: Appleton-Century-Crofts.

Kassambara, A., & Mundt, F. (2020). Extract and visualize the results of multivariate data analyses [r package factoextra version 1.0.7]. Retrieved from [URL]

Köpcke, K.-M. (1988). Schemas in german plural formation. Lingua, 74 (4), 303–335.

(1993). Schemata bei der pluralbildung im deutschen: Versuch einer kognitiven morphologie (Vol. 471). Tübingen: G. Narr.

(1998). The acquisition of plural marking in english and german revisited: Schemata versus rules. Journal of Child Language, 25 (2), 293–319.

Köpcke, K.-M., Schimke, S., & Wecker, V. (2021). Processing of german noun plurals: Evidence for first- and second-order schemata. Word Structure, 14 (1), 1–24.

Köpcke, K.-M., & Wecker, V. (2017). Source- and product-oriented strategies in l2 acquisition of plural marking in german. Morphology, 27 (1), 77–103. Retrieved from [URL]

Lieber, R. (2021). Introducing morphology. Cambridge: Cambridge University Press.

Luo, X. (2021). Judiling: An implementation for discriminative learning in julia (Master Thesis, Eberhard Karls University of Tübingen, Tübingen). Retrieved from [URL]

MacLeod, B. (2015). A critical evaluation of two approaches to defining perceptual salience. Ampersand, 21, 83–92.

MacWhinney, B., Pléh, C., & Bates, E. (1985). The development of sentence interpretation in hungarian. Cognitive psychology, 17(2), 178–209.

Marcus, G. F., Brinkmann, U., Clahsen, H., Wiese, R., & Pinker, S. (1995). German inflection: The exception that proves the rule. Cognitive Psychology, 29 (3), 189–256.

McCurdy, K. (2024). Rules, frequency, and predictability in morphological generalization: Behavioral and computational evidence from the german plural system. Edinburgh Research Archive. [URL].

McCurdy, K., Goldwater, S., & Lopez, A. (2020). Inflecting when there’s no majority: Limitations of encoder-decoder neural networks as cognitive models for german plurals. Retrieved from [URL]

McCurdy, K., Lopez, A., & Goldwater, S. (2020). Conditioning, but on which distribution? Grammatical gender in german plural inflection. In Proceedings of the workshop on cognitive modeling and computational linguistics (pp. 59–65). Retrieved from [URL].

Mujezinović, E., Kapatsinski, V., & van de Vijver, R. (2024). One cue’s loss is another cue’s gain — learning morphophonology through unlearning. Cognitive Science, 48 (5), e13450.

Nieder, J., Chuang, Y.-Y., van de Vijver, R., & Baayen, R. H. (2022). A Discriminative Lexicon approach to word comprehension, production and processing: Maltese plurals. Language, 99(2), 242–274.

Pearce, J. M., & Bouton, M. E. (2001). Theories of associative learning in animals. Annual Review of Psychology, 52 (1), 111–139.

Penke, M., & Krause, M. (2002). German noun plurals: A challenge to the dual-mechanism model. Brain and Language, 81 (1–3), 303–311.

Penke, M., Wimmer, E., Hennies, J., Hess, M., & Rothweiler, M. (2016). Inflectional morphology in german hearing-impaired children. Logopedics Phoniatrics Vocology, 41 (1), 9–26.

Pescuma, V. N., Zanini, C., Crepaldi, D., & Franzon, F. (2021). Form and function: A study on the distribution of the inflectional endings in italian nouns and adjectives. Frontiers in Psychology, 121, 720228.

Plag, I. (2018). Word-formation in English, 2nd edition. Cambridge: Cambridge University Press.

Plag, I., Heitmeier, M., & Domahs, F. (2024). German nominal number interpretation in an impaired mental lexicon: A naive discriminative learning perspective. The Mental Lexicon. 18(3), 417–445.

Polišenská, D. (2010). Dutch children’s acquisition of verbal and adjectival inflection (PhD Dissertation). Universiteit van Amsterdam, Amsterdam.

Ramscar, M., Yarlett, D., Dye, M., Denny, K., & Thorpe, K. (2010). The effects of feature-label-order and their implications for symbolic learning. Cognitive Science, 34 (6), 909–957.

Rescorla, R., & Wagner, A. (1972). A theory of pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. Black & W. Prokasy (Eds.), Classical conditioning ii: Current research and theory (p. 64–99). New York: Appleton-Century-Crofts.

Rescorla, R. A. (1988a). Behavioral studies of pavlovian conditioning. Annual Review of Neuroscience, 11 (1), 329–352.

(1988b). Pavlovian conditioning. it’s not what you think it is. American Psychologist, 43(3), 151–160.

Rosen, E. (2022). Modeling human-like morphological prediction. In Proceedings of the Society for Computation in Linguistics 20221 (pp. 133–142). Retrieved from [URL]

Saito, M., Tomaschek, F., & Baayen, R. H. (2020). Relative functional load determines co-articulatory movements of the tongue tip. In Proceedings of the 12th international seminar on speech production. New Haven, CT. Retrieved from [URL] (217)

Schakel, A. M., & Wilson, B. J. (2015). Measuring word significance using distributed representations of words. arXiv preprint arXiv:1508.02297.

Schmid, H.-J., & Günther, F. (2016). Toward a unified socio-cognitive framework for salience in language. Frontiers in Psychology, 71, 1110.

Schmitz, D., Plag, I., Baer-Henney, D., & Stein, S. (2021). Durational differences of word-final /s/ emerge from the lexicon: Modeling morpho-phonetic effects in pseudowords with linear discriminative learning. Frontiers in Psychology, 12. (680889)

Schäfer, M. (2025). The role of meaning in the rivalry of -ity and -ness: Evidence from distributional semantics. English Language and Linguistics.

Shafaei-Bajestan, E., Moradipour-Tari, M., Uhrig, P., & Baayen, R. H. (2021). LDL-auris: A computational model, grounded in error-driven learning, for the comprehension of single spoken words. Language, Cognition and Neuroscience, 1–28.

(2024). The pluralization palette: unveiling semantic clusters in english nominal pluralization through distributional semantics. Morphology, 34 (4), 369–413.

Simoens, H., Housen, A., & De Cuypere, L. (2017). The effect of perceptual salience on processing l2 inflectional morphology. In S. M. Gass, P. Spinner, & J. Behney (Eds.), Salience in second language acquisition (pp. 107–130). New York: Routledge.

Stein, S. D., & Plag, I. (2021). Morpho-phonetic effects in speech production: Modeling the acoustic duration of english derived words with linear discriminative learning. Frontiers in Psychology, 12. (678712)

Taatgen, N. A. (2001). Extending the past-tense debate: A model of the german plural. In Proceedings of the annual meeting of the Cognitive Science Society (Vol. 231, pp. 94–108).

Tomaschek, F., Plag, I., Ernestus, M., & Baayen, R. H. (2021). Phonetic effects of morphology and context: Modeling the duration of word-final S in English with naïve discriminative learning. Journal of Linguistics, 57 (1), 123–161.

Tomaschek, F., & Ramscar, M. (2022). Understanding the phonetic characteristics of speech under uncertainty-implications of the representation of linguistic knowledge in learning and processing. Frontiers in Psychology, 131, 754395.

Trommer, J. (2021). The subsegmental structure of german plural allomorphy. Natural Language & Linguistic Theory, 39 (2), 601–656.

Van der Maaten, L., & Hinton, G. (2008). Visualizing data using t-sne. Journal of Machine Learning Research, 9 (11).

van de Vijver, R., & Uwambayinema, E. (2022). A word-based account of comprehension and production of Kinyarwanda nouns in the Discriminative Lexicon. Linguistics Vanguard. 8(1), 197–207.

Wiese, R. (2000). The phonology of German. Oxford: Oxford University Press.

(2009). The grammar and typology of plural noun inflection in varieties of German. Journal of Comparative Germanic Linguistics, 12 (2), 137–173.

Wulf, D. J. (2002). Applying analogical modeling to the german plural. In R. Skousen, D. Lonsdale, & D. B. Parkinson (Eds.), Analogical modeling: An exemplar-based approach to language (pp. 109–122). Amsterdam: John Benjamins.

Yamada, I., Asai, A., Sakuma, J., Shindo, H., Takeda, H., Takefuji, Y., & Matsumoto, Y. (2020, October). Wikipedia2Vec: An efficient toolkit for learning and visualizing the embeddings of words and entities from Wikipedia. In Q. Liu & D. Schlangen (Eds.), Proceedings of the 2020 conference on empirical methods in natural language processing: System demonstrations (pp. 23–30). Online: Association for Computational Linguistics. Retrieved from [URL].