Article published in: The Mental Lexicon
Vol. 17:1 (2022), pp. 34–75
Processing Spanish gender in a usage‑based model with special reference to dual‑gendered nouns
Published online: 8 April 2022
https://doi.org/10.1075/ml.21011.ell
Abstract
In an experiment, Spanish speakers assigned gender to nouns. Some nouns had biological referents. Others had a
mismatch between their gender and their final phones (e.g. problema). Nouns with biological referents and nouns
with matching gender and phonology were responded to faster, suggesting that gender assignment does not depend solely on a noun’s gender.
Gender was also assigned to dual-gendered nouns, which are feminine nouns that take the masculine article el
(e.g. agua). Most participants assigned them masculine gender.
Dual-gendered nouns are often preceded by masculine modifiers, which is due to analogy to el. The
idea explored here is that the gender of el, along with all the modifiers a noun has been experienced with, explains
gender assignment. To test this, computational simulations were carried out using exemplar, naive Bayes, and decision tree
algorithms. They made accurate predictions without referencing the noun’s gender. In dual-gendered nouns, a shift towards preposed
masculine modifiers was observed. A simulation predicted the gender of bare dual-gendered nouns, and its predictions mirrored the masculine gender the
experimental participants provided. These results suggest a usage-based model in which a noun’s gender is determined by the
modifiers it has been experienced with.
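The usage-based idea summarized above can be illustrated with a small sketch. It is not the exemplar, naive Bayes, or decision tree simulation reported in the article; it is a simpler frequency-weighted vote, swapped in for illustration only. The modifier lookup, the function name predict_gender, and all counts are hypothetical and invented for this example; they are not corpus data.

```python
from collections import Counter

# Hypothetical lookup: the gender each modifier form usually signals.
# "el" is treated as masculine here, following the analogy described above.
MODIFIER_GENDER = {
    "el": "m", "un": "m", "mucho": "m", "frío": "m",
    "la": "f", "una": "f", "mucha": "f", "fría": "f",
}


def predict_gender(modifier_counts):
    """Return 'm' or 'f' by a frequency-weighted vote over the modifiers
    a noun has been experienced with, or None if no known modifier is stored.

    The noun's own gender is never consulted.
    """
    votes = Counter()
    for modifier, count in modifier_counts.items():
        gender = MODIFIER_GENDER.get(modifier)
        if gender is not None:
            votes[gender] += count
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Invented toy counts of modifier tokens stored with each noun (assumption).
EXPERIENCE = {
    "agua": Counter({"el": 8, "fría": 3, "mucha": 2}),
    "casa": Counter({"la": 9, "una": 4}),
    "problema": Counter({"el": 7, "un": 5}),
}

for noun, counts in EXPERIENCE.items():
    print(noun, predict_gender(counts))
```

With these toy counts, the frequent article el outweighs the feminine modifiers stored with agua, so the bare noun is predicted to be masculine, which is the kind of outcome the abstract describes for dual-gendered nouns.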
Keywords: Spanish, gender, usage-based model, corpus, experiment
Article outline
- Introduction
- Dual-gendered nouns
- The experiment
- Procedure
- Test items
- Method
- Participants
- Statistical results
- Biological gender and error rates
- Biological gender and reaction time
- Incongruently-gendered words and error rates
- Incongruently-gendered words and reaction time
- Dual-gendered nouns and error rates
- Dual-gendered nouns and reaction times
- Discussion of the results of the experiment
- Processing of dual-gendered nouns
- Usage-based models
- Modeling gender in an exemplar model
- Other computation models
- Bayes classifiers
- Decision trees
- Modeling gender
- Data
- Variables used in simulations
- Data sets used in simulations
- Simulations
- Discussion
- Conclusions
- Acknowledgements
- Notes
