Article published In: The Mental Lexicon
Vol. 18:1 (2023) ► pp.120–150
Native and foreign language orthotactic probability and neighborhood density in word learning
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Open Access publication of this article was funded through a Transformative Agreement with Radboud University Nijmegen.
Published online: 17 August 2023
https://doi.org/10.1075/ml.22006.rin
https://doi.org/10.1075/ml.22006.rin
Abstract
Laboratory studies on word learning in a foreign language (L2) have identified several variables involved in the learning process, key amongst them the orthotactic probability and neighborhood density of new words relative to learners’ native (L1) lexicons. More recently, learners’ sensitivity to orthotactic probability and neighborhood density relative to their developing L2 lexicons has come into focus. Past studies on word learning have largely focused on early stages of learning, in controlled studies spanning hours or days. Few studies have considered large corpora of ‘real-life’ learning data, spanning several weeks. In this study, we validate past findings outside of controlled laboratory conditions, by analyzing a dataset collected from Duolingo (Settles, B., Brust, C., Gustafson, E., Hagiwara, M., & Madnani, N. (2018). Second language acquisition modeling. Proceedings of the NAACL-HLT Workshop on Innovative Use of NLP for Building Educational Applications (BEA). ), a popular language learning app. Effects of orthotactic probability and neighborhood density observed in controlled studies persist under uncontrolled, big-data conditions for learners of Spanish, but not French. As learning progresses, we observe a previously unreported reversal of the effects of L1 orthotactic probability and neighborhood density, challenging theoretical models of word learning. Finally, we confirm the importance of orthotactic probability and neighborhood density relative to learners’ developing L2 Spanish lexicons, lending support to theories which posit that the same processes underly both L1 and L2 acquisition.
Keywords: foreign language learning, wordlikeness, e-learning
Article outline
- Effects of orthotactic probability and neighborhood density on word learning
- Models of word learning
- Empirical effects of orthotactic probability and neighborhood density in word learning
- Method
- Data selection
- Selection of participants
- Selection of language task
- Selection of stimuli
- Recorded variables
- Data cleaning and processing
- Duolingo dataset
- SUBTLEX-US lexicon
- Calculation of L1 orthotactic probability and neighborhood density
- Calculation of L2 orthotactic probability and neighborhood density
- Regression analysis
- Regression on L1 variables
- Regression on L2 variables
- Data selection
- Results
- Analysis of L1 variables
- Spanish dataset
- French dataset
- Analysis of L2 variables
- Spanish dataset
- Analysis of L1 variables
- Discussion
- Time-dependent effects of L1 orthotactic probability and neighborhood density in Spanish learners
- Effects of L2 orthotactic probability and neighborhood density in Spanish learners
- Absence of effects in the French analysis
- Future research
- Conclusion
- Notes
References
References (33)
Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing and interpreting interactions. Sage Publications, Inc.
Baayen, R. H., Chuang, Y.-Y., Shafaei-Bajestan, E., & Blevins, J. P. (2019). The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de) composition but in linear discriminative learning. Complexity, 2019.
Baayen, R. H., Milin, P., Durđević, D. F., Hendrix, P., & Marelli, M. (2011). An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. Psychological review, 118 (3), 438.
Bartolotti, J., & Marian, V. (2017). Orthographic knowledge and lexical form influence vocabulary learning. Applied Psycholinguistics, 38 (2), 427–456.
Bordag, D., Kirschenbaum, A., Rogahn, M., & Tschirner, E. (2017). The role of orthotactic probability in incidental and intentional vocabulary acquisition L1 and L2. Second Language Research, 33 (2), 147–178.
Brysbaert, M., & New, B. (2009). Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behavior Research Methods, 41 (4), 977–990.
Chan, K. Y., & Vitevitch, M. S. (2009). The influence of the phonological neighborhood clustering coefficient on spoken word recognition. Journal of Experimental Psychology: Human Perception and Performance, 35 (6), 1934.
Chetail, F. (2015). Reconsidering the role of orthographic redundancy in visual word recognition. Frontiers in Psychology, 61, 645.
Dijkstra, T., Wahl, A., Buytenhuijs, F., Van Halem, N., Al-Jibouri, Z., De Korte, M., & Rekké, S. (2019). Multilink: A computational model for bilingual word recognition and word translation. Bilingualism: Language and Cognition, 22 (4), 657–679.
Ellis, N. C. (2006). Selective attention and transfer phenomena in L2 acquisition: Contingency, cue competition, salience, interference, overshadowing, blocking, and perceptual learning. Applied linguistics, 27 (2), 164–194.
Ellis, N. C., & Beaton, A. (1993). Psycholinguistic determinants of foreign language vocabulary learning. Language Learning, 43 (4), 559–617.
Gaskell, M. G., & Dumay, N. (2003). Lexical competition and the acquisition of novel words. Cognition, 89 (2), 105–132.
Grainger, J., Midgley, K., & Holcomb, P. J. (2010, December). Re-thinking the bilingual interactive-activation model from a developmental perspective (BIA-d). In M. Kail & M. Hickmann (Eds.), Language Acquisition across Linguistic and Cognitive Systems (pp. 267–283). John Benjamins Publishing Company.
Hosmer, D. W., & Lemeshow, S. (2000). Assessing the fit of the model. In Applied logistic regression (pp. 143–202). John Wiley & Sons, Ltd.
Kroll, J. F., Hell, J. G. V., Tokowicz, N., & Green, D. W. (2010). The revised hierarchical model: A critical review and assessment. Bilingualism: Language and Cognition, 13 (3), 373–381.
Leach, L., & Samuel, A. G. (2007). Lexical configuration and lexical engagement: When adults learn new words. Cognitive Psychology, 55 (4), 306–353.
McClelland, J. L., & Elman, J. L. (1986). The TRACE model of speech perception. Cognitive Psychology, 18 (1), 1–86.
Miwa, K., & Baayen, H. (2021). Nonlinearities in bilingual visual word recognition: An introduction to generalized additive modeling. Bilingualism: Language and Cognition, 24 (5), 825–832.
Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. Classical Conditioning II: Current Research and Theory, 64–99.
Serrano, F., Genard, N., Sucena, A., Defior, S., Alegria, J., Mousty, P., Leybaert, J., Castro, S. L., & Seymour, P. H. K. (2011). Variations in reading and spelling acquisition in Portuguese, French and Spanish: A cross-linguistic comparison. Journal of Portuguese Linguistics, 10 (1).
Settles, B., Brust, C., Gustafson, E., Hagiwara, M., & Madnani, N. (2018). Second language acquisition modeling. Proceedings of the NAACL-HLT Workshop on Innovative Use of NLP for Building Educational Applications (BEA).
Seymour, P. H. K., Aro, M., & Erskine, J. M. (2003). Foundation literacy acquisition in European orthographies. British Journal of Psychology, 94 (2), 143–174.
Stamer, M. K., & Vitevitch, M. S. (2012). Phonological similarity influences word learning in adults learning Spanish as a foreign language. Bilingualism: Language and Cognition, 15 (3), 490–502.
Storkel, H. L. (2004). Do children acquire dense neighborhoods? An investigation of similarity neighborhoods in lexical acquisition. Applied Psycholinguistics, 25 (2), 201–221.
Storkel, H. L., Armbruster, J., & Hogan, T. P. (2006). Differentiating phonotactic probability and neighborhood density in adult word learning. Journal of Speech, Language, and Hearing Research, 49 (6), 1175–1192.
Storkel, H. L., & Lee, S.-Y. (2011). The independent effects of phonotactic probability and neighbourhood density on lexical acquisition by preschool children. Language and Cognitive Processes, 26(2), 191–211.
Storkel, H. L., & Rogers, M. A. (2000). The effect of probabilistic phonotactics on lexical acquisition. Clinical Linguistics & Phonetics, 14 (6), 407–425.
van der Velde, M., Sense, F., Borst, J., & van Rijn, H. (2021). Alleviating the cold start problem in adaptive learning using data-driven difficulty estimates. Computational Brain & Behavior, 4 (2), 231–249.
Verbeke, G., & Lesaffre, E. (1997). The effect of misspecifying the random-effects distribution in linear mixed models for longitudinal data. Computational Statistics Data Analysis, 23 (4), 541–556.
Wood, S. N. (2013). A simple test for random effects in regression models. Biometrika, 100 (4), 1005–1010.
