Article published In: International Journal of Learner Corpus Research
Vol. 7:2 (2021) ► pp.230–258
Lexical diversity in an L2 Spanish learner corpus
The effect of topic-related variables
Published online: 11 October 2021
https://doi.org/10.1075/ijlcr.20017.fer
https://doi.org/10.1075/ijlcr.20017.fer
Abstract
This study examines the impact of two topic-related variables (i.e., valence polarity and everyday-life closeness)
on the lexical diversity scores (i.e., MTLD) of learners of L2 Spanish at different proficiency levels. The analysis included
3,045 texts written in response to two pairs of prompts by 1,165 students enrolled in an L2 Spanish program. The first pair of
prompts asked learners to narrate an event: prompt 1 focused on a perfect vacation (positive event), while prompt 2 asked
participants to tell a terrible story (negative event). The second pair asked to describe a person: prompt 1 required that the
subject be famous, thus not close to the writer, whereas prompt 2 required that the subject be special and close to the writer.
Results indicate that lexical diversity scores were higher for the texts written about the positive event and the famous subject
across all proficiency levels.
Keywords: L2 Spanish, lexical diversity, topic-related variables, writing, learner corpus
Article outline
- 1.Introduction
- 2.Background
- 2.1Spanish learner corpora and writing prompts
- 2.2Lexical diversity
- 2.3Task – and topic-related variables that influence writing outcomes and lexical diversity
- 2.4Topic-related variables: Valence and closeness
- 3.Methods
- 3.1Corpus of written Spanish of L2 and Heritage speakers (COWS-L2H)
- 3.2Measure of textual lexical diversity (MTLD)
- 3.3Analytical procedure
- 4.Results
- 4.1Research question 1: Valence condition
- 4.2Research question 2: Closeness
- 5.Discussion and limitations
- 5.1RQ 1: How does the valence polarity (positive vs. negative) associated with a narrated event affect the LD of the texts written by L2 Spanish learners at different proficiency levels?
- 5.2RQ 2: How does the everyday-life closeness with the person described affect the LD of the texts written by L2 Spanish learners at different proficiency levels?
- 5.3Topic-related effects by proficiency level
- 6.Conclusion
- Notes
References
References (58)
Alonso-Ramos, M. (2016). Spanish
learner corpus research: Current trends and future
perspectives. Amsterdam: John Benjamins.
Anooshian, L. J., & Hertel, P. T. (1994). Emotionality
in free recall: Language specificity in bilingual memory. Cognition &
Emotion, 8(6), 503–514.
Bell, P., Collins, L., & Marsden, E. (2021). Building
an oral and written learner corpus of a school programme : Methodological
issues. In B. Le Bruyn & M. Paquot (Eds.), Learner
corpus research meets second language
acquisition (pp. 214–242). Cambridge: Cambridge University Press.
Buyse, K., & González Melón, E. (2012). El
corpus de aprendices Aprescrilov y su utilidad para la didáctica de ELE en la Bélgica
multilingüe. Congreso de ASELE: Multilingüismo y enseñanza de ELE en contextos multiculturales
2012, 247–261.
Castañeda-Jiménez, G., & Jarvis, S. (2014). Exploring
lexical diversity in second language Spanish. In K. Geeslin (Ed.), The
handbook of Spanish second language
acquisition (pp. 498–513). Malden, MA: Wiley & Sons.
Cestero, A., & Penadés, I. (2009). Corpus
de textos escritos para el análisis de errores de aprendices de E/LE
(CORANE) [CD-ROM]. Alcalá de Henares: Universidad de Alcalá.
Cheng, Y.-S. (2008). Factors
associated with foreign language writing anxiety. Foreign Language
Annals, 35(5), 647–656.
Cho, M. (2019). The
effects of prompts on L2 writing performance and engagement. Foreign Language
Annals, 52(3), 576–594.
Conrad, M., Recio, G., & Jacobs, A. (2011). The
time course of emotion effects in first and second language processing: A cross cultural ERP study with German–Spanish
bilinguals. Frontiers in
Psychology, 21, 351.
Cortes, V. (2004). Lexical
bundles in published and student disciplinary writing: Examples from history and
biology. English for Specific
Purposes, 23(4), 397–423.
Crossley, S. A., Salsbury, T., McNamara, D. S., & Jarvis, S. (2010). Predicting
lexical proficiency in language learner texts using computational indices. Language
Testing, 28(4), 561–580.
Davidson, S., Yamada, A., Fernández-Mira, P., Carando, A., Sánchez-Gutiérrez, C. H., & Sagae, K. (2020). Developing
NLP tools with a new corpus of learner Spanish. Proceedings of the 12th Language Resources and
Evaluation Conference 2020, 7238–7243.
Degner, J., Doycheva, C., & Wentura, D. (2012). It
matters how much you talk: On the automaticity of affective connotations of first and second language
words. Bilingualism: Language and
Cognition, 15(1), 181–189.
Dewaele, J. M., & MacIntyre, P. D. (2014). The
two faces of Janus? Anxiety and enjoyment in the foreign language classroom. Studies in Second
Language Learning and
Teaching, 4(2), 237–274.
D’Mello, S., & Mills, C. (2014). Emotions
while writing about emotional and non-emotional topics. Motivation and
Emotion, 38(1), 140–156.
Ellis, R., & Yuan, F. (2004). The
effects of planning on fluency, complexity, and accuracy in second language narrative
writing. Studies in Second Language
Acquisition, 26(1), 59–84.
Emmersion. (2020). WebCAPE/CEFR
Equivalencies. Retrieved on 27
July 2021 from [URL]
Frens, J. (2017). Lexical
diversity [GitHub repository]. Retrieved
on 27 July
2021 from [URL]
Granger, S. (2011). How
to use foreign and second language learner corpora? In A. Mackey & S. Gass (Eds.), A
guide to research methods in second language
acquisition (pp. 7–29). Oxford: Basil Blackwell.
Granger, S., Gilquin, G., & Meunier, F. (2015). Introduction:
Learner corpus research– past, present and future. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The
Cambridge handbook of learner corpus
research (pp. 1–5). Cambridge: Cambridge University Press.
Gregori-Signes, C., & Clavel-Arroitia, B. (2015). Analysing
lexical density and lexical diversity in university students’ written discourse. Procedia –
Social and Behavioral
Sciences, 1981, 546–556.
Hashemian, M., & Heidari, A. (2013). The
relationship between L2 learners’ motivation/attitude and success in L2 writing. Procedia –
Social and Behavioral
Sciences, 701, 476–489.
Instituto Cervantes. (2020). El español:
Una lengua viva. Informe 2020. Madrid: Instituto Cervantes. Retrieved on 27 July
2021 from [URL]
Ishikawa, T. (2006). The
effect of task complexity and language proficiency on task-based language performance. Journal
of Asia
TEFL, 3(4), 193–225.
Jarvis, S. (2002). Short
texts, best-fitting curves and new measures of lexical diversity. Language
Testing, 19(1), 57–84.
Koizumi, R., & In’nami, Y. (2012). Effects
of text length on lexical diversity measures: Using short texts with less than 200
tokens. System, 40(4), 554–564.
Kormos, J. (2011). Task
complexity and linguistic and discourse features of narrative writing performance. Journal of
Second Language
Writing, 20(2), 148–161.
(2012). The
role of individual differences in L2 writing. Journal of Second Language
Writing, 21(4), 390–403.
Kormos, J., & Trebits, A. (2012). The
role of task complexity, modality and aptitude in narrative task performance. Language
Learning, 62(2), 439–472.
Kousta, S. T., Vinson, D. P., & Vigliocco, G. (2009). Emotion
words, regardless of polarity, have a processing advantage over neutral
words. Cognition, 112(3), 473–481.
Kuiken, F., & Vedder, I. (2008). Cognitive
task complexity and written output in Italian and French as a foreign language. Journal of
Second Language
Writing, 17(1), 48–60.
Kuperman, V., Estes, Z., Brysbaert, M., & Warriner, A. B. (2014). Emotion
and language: Valence and arousal affect word recognition. Journal of Experimental Psychology:
General, 143(3), 1065–1081.
Kyle, K. (2020). Measuring
lexical richness. In S. Webb (Ed.), The
Routledge handbook of vocabulary
studies (pp. 454–476). New York, NY: Routledge.
Laufer, B., & Nation, P. (1995). Vocabulary
size and use: Lexical richness in L2 written production. Applied
linguistics, 16(3), 307–322.
Lozano, C. (2009). CEDEL2:
Corpus Escrito del Español como L2. In C. M. Bretones Callejas, J. F. Fernández Sánchez, J. R. Ibáñez Ibáñez, M. E. García Sánchez, M. E. Cortés de los Ríos, S. Salaberri Ramiro, M. S. Cruz Martínez, N. Perdú Honeyman, & B. Cantizano Márquez (Eds.), Applied
linguistics now: Understanding language and mind/La lingüística aplicada actual: Comprendiendo el lenguaje y la
mente (pp. 197–212) Almería: Universidad de Almería.
Mavrou, I., & Bustos-López, F. (2019). La
influencia de la memoria operativa y de la inteligencia emocional en la expresión de la emoción en español como segunda
lengua. Revista de Lingüística y Lenguas
Aplicadas, 141, 117–129.
McCarthy, P. M. (2005). An
assessment of the range and usefulness of lexical diversity measures and the potential of the measure of textual, lexical
diversity (MTLD) (Doctoral
dissertation). Memphis: University of Memphis Retrieved from ProQuest Dissertations & Theses.
McCarthy, P. M., & Jarvis, S. (2010). MTLD,
vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity
assessment. Behavior Research
Methods, 42(2), 381–392.
Mitchell, R., Tracy-Ventura, N., & McManus, K. (2017). Anglophone
students abroad: Identity, social relationships, and language learning. New York, NY: Routledge.
Paquot, M. (2014). Cross-linguistic
influence and formulaic language: Recurrent word sequences in French learner
writing. In L. Roberts, I. Vedder, & J. Hulstijn (Eds.), EuroSLA
Yearbook (pp. 216–237). Amsterdam: John Benjamins.
(2017). L1
frequency in foreign language acquisition: Recurrent word combinations in French and Spanish EFL learner
writing. Second Language
Research 33(1), 13–32.
Paquot, M., Naets, H., & Gries, S. T. (2021). Using
syntactic co-occurrences to trace phraseological complexity development in learner writing: Verb + object structures in
LONGDALE. In B. Le Bruyn & M. Paquot (Eds.), Learner
corpus research meets second language
acquisition (pp. 122–147). Cambridge: Cambridge University Press.
Pavlenko, A. (2012). Affective
processing in bilingual speakers: Disembodied cognition? International Journal of
Psychology, 47(6), 405–428.
Robinson, P. (2001). Task
complexity, task difficulty, and task production: Exploring interactions in a componential
framework. Applied
Linguistics, 22(1), 27–57.
Rojo, G., & Palacios, I. M. (2016). Learner
Spanish on computer: The CAES Corpus de Aprendices de Español
project. In M. Alonso-Ramos (Ed.), Spanish
learner corpus research: Current trends and future
perspectives (pp. 55–87). Amsterdam: John Benjamins.
Sadeghi, K., & Dilmaghani, S. (2013). The
relationship between lexical diversity and genre in Iranian EFL Learners’ writings. Journal of
Language Teaching and
Research, 4(2), 328–334.
Schoonen, R. (2005). Generalizability
of writing scores: An application of structural equation modeling. Language
Testing, 22(1), 1–30.
Schoonen, R., van Gelderen, A., de Glopper, K., Hulstijn, J., Simis, A., Snellings, P., & Stevenson, M. (2003). First
language and second language writing: The role of linguistic knowledge, speed of processing, and metacognitive
knowledge. Language
Learning, 53 (1), 165–202.
Segalowitz, N., Trofimovich, P., Gatbonton, E., & Sokolovskaya, A. (2008). Feeling
affect in a second language: The role of word recognition automaticity. The Mental
Lexicon, 3(1), 47–71.
Sheikh, N. A., & Titone, D. (2016). The
embodiment of emotional words in a second language: An eye-movement study. Cognition and
Emotion, 30(3), 488–500.
Stadthagen-Gonzalez, H., Imbault, C., Sánchez, M. A. P., & Brysbaert, M. (2017). Norms
of valence and arousal for 14,031 Spanish words. Behavior Research
Methods, 49(1), 111–123.
Tabari, M., & Wang, Y. (2021). The
effects of prompt types on L2 learners’ textual emotionality and lexical complexity. Journal of
Second Language Studies.
Tracy-Ventura, N., Mitchell, R., & McManus, K. (2016). The
LANGSNAP Longitudinal Learner Corpus: Design and use. In Alonso Ramos, M. (Ed.), Spanish
learner corpus research: Current trends and future
perspectives (pp. 117–142). Amsterdam: John Benjamins.
Tracy-Ventura, N., & Myles, F. (2015). The
importance of task variability in the design of learner corpora for SLA research. International
Journal of Learner Corpus
Research, 1(1), 58–95.
Yamada, A., Davidson, S., Fernández-Mira, P., Carando, A., Sagae, K., & Sánchez-Gutiérrez, C. (2020). COWS-L2H:
A corpus of Spanish learner writing. Research in Corpus
Linguistics, 8(1), 17–32.
Cited by (9)
Cited by nine other publications
Bozdağ, Fatih Ünal
Sánchez‐Gutiérrez, Claudia, Sophia Minnillo, Ana Ortega Pérez & Ana Ruiz‐Alonso‐Bartol
Minnillo, Sophia, Claudia Sánchez-Gutiérrez, Ana Ruiz-Alonso-Bartol, Emily Morgan & Carmen González Gómez
2024. Predictors of accuracy in L2 Spanish preterit-imperfect production. International Journal of Learner Corpus Research 10:2 ► pp. 301 ff.
Sánchez-Gutiérrez, Claudia H., Sophia Minnillo, Paloma Fernández Mira & Andrea Hernández
Carando, Agustina, Sophia Minnillo, Paloma Fernández-Mira, Sam Davidson, Kenji Sagae & Claudia Sánchez-Gutiérrez
Gudmestad, Aarnes
Lozano, Cristóbal & Paloma Fernández-Mira
Minnillo, Sophia, Claudia Sánchez-Gutiérrez, Agustina Carando, Samuel Davidson, Paloma Fernández Mira & Kenji Sagae
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
