In:Applying Corpora in Teaching and Learning Romance Languages
Edited by Henry Tyne and Stefania Spina
[Studies in Corpus Linguistics 122] 2025
► pp. 12–39
Chapter 1Corpora and the learning and teaching of French and Spanish
Published online: 20 November 2025
https://doi.org/10.1075/scl.122.01per
https://doi.org/10.1075/scl.122.01per
Abstract
This chapter examines the use of corpora in the teaching and learning of French and Spanish. It
reviews research that focuses on potential and actual uses of corpora, and in particular data-driven learning, in the
learning and teaching of written and spoken French and Spanish. This reveals some of the main research trends which
have emerged in the last three decades, with a particular focus on current research developments. We welcome the
emergence of empirical studies which give a voice to both learners and teachers of the two languages and argue that
the dearth of this type of research calls for a reevaluation of how researchers and teachers approach corpus-based
pedagogy in the context of Romance languages.
Keywords: French, Spanish, language teaching, language learning, corpora, DDL
Article outline
- 1.Introduction
- 2.Corpora and written language
- 2.1Spanish
- 2.1.1Spanish language pedagogic and empirical studies
- 2.1.2Spanish language pedagogical resources and tools
- 2.2French
- 2.2.1French language pedagogic and empirical studies
- 2.2.2French language pedagogic resources and tools
- 2.3Corpora and written language: Conclusion
- 2.1Spanish
- 3.Corpora and spoken language
- 3.1Spanish language resources and studies
- 3.2French language resources and studies
- 3.3Corpora and spoken language: Conclusion
- 4.Conclusion
Notes References Appendix
References (62)
Abad Castelló, Magdalena. 2019. Uso
de corpus lingüísticos por y para profesores de español como lengua
extranjera. RedELE: Revista Electrónica de Didáctica Español Lengua
Extranjera 31. [URL]
Abeillé, Anne, Clément, Lionel & Liégeois, Loïc. 2019. Un
corpus annoté pour le français: Le French Treebank. TAL: Traitement automatique
des
langues 60(2): 19–43. [URL]
Adami, Hervé & André, Virginié. 2013. Corpus
et apprentissage du français langue d’intégration
(FLI). Linx 68–69: 135–158.
Alberdi, Carmen, Etienne, Carole & Jouin-Chardon Émilie. 2018. Les
apports des corpus d’interactions naturelles en situation de classe: Enjeux et
pratiques. Action
didactique 1: 1–16. halshs-01955824
Amador Moreno, Carolina. P., O’Riordan, Stéphanie & Chambers, Angela. 2006. Integrating
a corpus of classroom discourse in language teacher education: The case of discourse
markers. ReCALL 18(1): 83–104.
André, Virginie. 2018. Nouvelles
actions didactiques: Faire de la sociolinguistique de corpus pour enseigner et apprendre à interagir en
français langue étrangère. Action
didactique 1: 71–88. [URL]
. 2020. Faire
de la linguistique de corpus avec des apprenants de français langue
étrangère. In La didactisation du français
vernaculaire, Pierre Larrivée & Florence Lefeuvre (eds), 37–66. Caen: Presses universitaires de Caen. ffhal-02993341f
André, Virginie & Ciekanski, Maud. 2018. Apprendre
à interagir à l’oral à partir d’un concordancier multimodal: Effets sur le développement de la conscience
langagière et sur l’autonomie de l’apprenant dans le dispositif FLEURON. EPAL —
Echanger pour apprendre en
ligne, Grenoble, France. ffhal-01996477f
Asención-Delaney, Yuly, Collentine, Joseph G., Collentine, Karina, Colmenares, Jersus & Plonsky, Luke. 2015. El
potencial de la enseñanza del vocabulario basada en corpus: Optimismo con
precaución. Journal of Spanish Language
Teaching 2(2): 140–151.
Benavides, Carlos. 2015. Using
a corpus in a 300-level Spanish grammar course. Foreign Language
Annals 48(2): 218–235.
Benzitoun, Christophe, Debaisieux, Jeanne-Marie & Deulofeu, Henri-José. 2016. Le
projet ORFEO: Un corpus d’études pour le français
contemporain. Corpus 15: 91–114.
Boulton, Alex. 2021. Research
in data-driven learning. In Beyond Concordance Lines.
Corpora in Language Education [Studies in Corpus Linguistics
102], Pascual Pérez-Paredes & Geraldine Mark (eds), 9–34. Amsterdam: John Benjamins.
Boulton, Alex & Tyne, Henry. 2013. Corpus
linguistics and data-driven learning: A critical overview. Bulletin suisse de
linguistique
appliquée 97: 97–118. [URL]
. 2014. Des
documents authentiques aux corpus: Démarches pour l’apprentissage des
langues. Paris: Didier.
Boulton, Alex & Vyatkina, Nina. 2021. Thirty
years of data-driven learning: Taking stock and charting new directions over
time. Language Learning &
Technology 25(3): 66–89. [URL]
Braun, Sabine. 2005. From
pedagogically relevant corpora to authentic language learning
contents. ReCALL 17(1): 47–64.
. 2007. Integrating
corpus work into secondary education: From data-driven learning to needs-driven
corpora. ReCALL 19(3): 307–328.
. 2010. Getting
past ‘Groundhog Day’: Spoken multimedia corpora for student-centred corpus
exploration. In Corpus Linguistics in Language
Teaching, Tony Harris & Maria Moreno Jaén (eds), 75–97. Bern: Peter Lang.
Carter, Ronald & McCarthy, Michael. 2017. Spoken
grammar: Where are we and where are we going? Applied
Linguistics 38(1): 1–20.
Cavalla, Cristelle & Loiseau, Mathieu. 2013. Scientext
comme corpus pour l’enseignement. In L’écrit
scientifique: Du lexique au discours. Autour de Scientext, Agnès Tutin & Francis Grossmann (eds), 163–182. Rennes: Presses universitaires de Rennes. ffhal-01154754f
Chambers, Angela. 2013. Learning
and teaching the subjunctive in French: The contribution of corpus
data. Bulletin suisse de linguistique
appliquée 97: 41–58.
. 2019. Towards
the corpus revolution? Bridging the research-practice gap. Language
Teaching 52(4): 460–75.
Cobb, Tom. 2014. A
resource wish-list for data-driven learning in
French. In French through Corpora: Ecological and
Data-driven Perspectives. In Henry Tyne, Virginie André, Christophe Benzitoun, Alex Boulton & Yan Greub (eds), 255–290. Newcastle upon Tyne: Cambridge Scholars.
Cobb, Tom, Greaves, Chris & Horst, Marlise. 2001. Can
the rate of lexical acquisition from reading be increased? An experiment in reading French with a suite of
on-line resources (trad.). In Regards sur la
didactique des langues secondes, Claudette Cornaire & Patricia Raymond (eds), 133–153. Quebec: Editions Logiques.
Cohen, Louis, Manion, Lawrence & Morrison, Keith (eds). 2018. Research
Methods in Education, 8th
edn. London: Routledge.
Corréard, Marie-Hélène (ed.) 1997. The
Oxford-Hachette French dictionary, 2nd edn. Oxford and Paris: OUP and Hachette.
Council of
Europe. 2020[2001]. Common European Framework of Reference for
Languages: Learning, Teaching, Assessment. Companion Volume. [URL]
Crosthwaite, Peter (ed.) 2019. Data-driven
Learning for the Next Generation: Corpora and DDL for Pre-tertiary
Learners. London: Routledge.
Cruz Piñol, Mar. 2015. Léxico
y ELE: Enseñanza/aprendizaje con tecnologías. Journal of Spanish Language
Teaching 2(2): 165–179.
Da Silva, Iandra Maria. 2006. La
utilización de la lingüística del corpus en la enseñanza del español: Usos de pero y
sino. In Actas del III simposio internacional José
Carlos Lisboa de didáctica del español como lengua extranjera del Instituto Cervantes de Río de
Janeiro, Pedro Benítez Pérez (ed.), 219–228. Rio de Janeiro: Instituto Cervantes. [URL]
Davies, Mark. 2002. Un
corpus anotado de 100.000. 000 palabras del español histórico y
moderno. Procesamiento del Lenguaje
Natural 29: 21–27. [URL]
Di Vito, Sonia. 2019. Teaching
French to young learners through DDL. In Data-driven
Learning for the Next Generation: Corpora and DDL for Pre-tertiary Learners, Peter Crosthwaite (ed.), 171–186. London: Routledge.
Eliche, Fermín Martos & Izquierdo, Narciso. 2018. El
empleo de corpus para el aprendizaje de secuencias formulaicas en ELE/EL2. La frecuencia de uso en el nivel B2
del PCIC. CHIMERA: Revista de Corpus de Lenguas Romances y Estudios
Lingüísticos 5(1): 1–26. [URL]
Farr, Fiona, Chambers, Angela & O’Riordan, Stéphanie. 2010. Corpora
for materials development in language teacher education: Underlying principles and useful
data. In Perspectives on Language Learning Materials
Development, Freda Mishan & Angela Chambers (eds), 33–61. Bern: Peter Lang.
Forti, Luciana. 2023. Corpus
use in Italian Language Pedagogy: Exploring the Effects of Data-driven
Learning. London: Routledge.
Gadet, Françoise & Ludwig. Ralph. 2014. French
language(s) in contact worldwide: History, space, system and other ecological
parameters. Journal of Language
Contact 7(1): 3–35.
Giuliani, Delphine & Hannachi, Radia. 2010. Linguistique
de corpus et didactique du FLE: Une exploitation du corpus IntUne. Cahiers de
praxématique 54–55: 145–160.
Jablonkai, Reka, Forti, Luciana, Abad Castelló, Magdalena, Salengros Iguenane, Isabelle, Schaeffer-Lacroix, Eva & Vyatkina, Nina. 2020. Data-driven
learning for languages other than English: The cases of French, German, Italian, and
Spanish. In CALL for Widening Participation: Short
Papers from EUROCALL 2020, Karen-Margrete Frederiksen, Sanne Larsen, Linda Bradley & Sylvie Thouësny (eds), 132–137. Research-publishing.net.
Kennedy, Claire & Miceli, Tiziana. 2017. Cultivating
effective corpus use by language learners. Computer Assisted Language
Learning 30(1–2): 91–114.
Kerr, Betsy. 2013. Grammatical
description and classroom application: Theory and practice in data-driven
learning. Bulletin suisse de linguistique
appliquée 97: 17–39.
Klinger, Dominique & Véronique, Georges Daniel. 2006. La
grammaire du Français fondamental: Interrogations historiques et
didactiques. Documents pour l’histoire du français langue étrangère ou
seconde 36.
Liu, Yongcan., Fisher, Linda, Forbes, Karen & Evans, Michael. 2017. The
knowledge base of teaching in linguistically diverse contexts: 10 grounded principles of multilingual
classroom pedagogy for EAL. Language and Intercultural
Communication 17(4): 378–395.
Nica, Violeta. 2012. Los
corpus discursivos orales como herramienta didáctica para los niveles
iniciales. Foro de Profesores de
E/LE 8.
O’Sullivan, Íde. 2007. Enhancing
a process-oriented approach to literacy and language learning: The role of corpus consultation
literacy. ReCALL 19(3): 269–286.
O’Sullivan, Íde & Chambers, Angela. 2006. Learners’
writing skills in French: Corpus consultation and learner evaluation. Journal
of Second Language
Writing 15: 49–68.
Paz, Yanira. 2013. Aplicación
pedagógica de la lingüística de corpus a una clase de gramática avanzada de español como segunda
lengua. MarcoELE Revista de Didáctica
ELE 6.
Pérez-Paredes, Pascual. 2010. Corpus
linguistics and language education in perspective: Appropriation and the possibilities
scenario. In Corpus Linguistics in Language
Teaching, Tony Harris & Maria Moreno Jaén (eds), 53–73. Bern: Peter Lang.
. 2019. The
pedagogic advantage of teenage corpora for secondary school
learners. In Data-driven Learning for the Next
Generation: Corpora and DDL for Pre-tertiary Learners, Peter Crosthwaite (ed.), 67–87. London: Routledge.
. 2022. A
systematic review of the uses and spread of corpora and data-driven learning in CALL research during
2011–2015. Computer Assisted Language
Learning, 35(1–2): 36–61.
Pérez-Paredes, Pascual, Ordoñana, Carlos & Aguado, Pilar. 2018. Language
teachers’ perceptions on the use of OER language processing technologies in
MALL. Computer Assisted Language
learning 31(5–6): 522–545.
Pérez-Paredes, P. & Abad, M. 2025. Integrating
language teachers’ voices in the design and exploitation of Spanish corpora in the
UK. In Henry Tyne & Stefania Spina (eds). Applying
corpora in teaching and learning Romance
languages, pp. 326–355. John Benjamins.
Pons, Salvador. 2019. Corpus
Val.Es.Co 2.1. [URL]
Sidoti, Rossana. 2011. Las
colocaciones en el aula de
E/LE. redELE 21. [URL]
Thompson, Paul. 2010. Building
a specialised audio-visual corpus. In The Routledge
Handbook of Corpus Linguistics, Anne O’Keeffe & Michael McCarthy (eds), 93–104. London: Routledge.
Tran, Thi Thu Hoai & Falaise, Achille. 2018. Un
dictionnaire basé sur corpus pour une aide à la rédaction
universitaire. LIDIL 58.
Tran, Thi Thu Hoai, Tutin, Agnès & Cavalla, Cristelle. 2016. Pour
un enseignement systématique des marqueurs discursifs à l’aide de corpus en classe de FLE: L’exemple des
marqueurs de reformulation. Linguistik
Online 78(4): 113–128. ffhalshs-01377955f.
Tyne, Henry. 2012. Corpus
work with ordinary teachers: Data-driven learning
activities. In Input, Process and Product:
Developments in Teaching and Language Corpora, James Thomas & Alex Boulton (eds), 136–151. Brno: Masaryk University Press.
Vandeweerd, Nathan & Keijzer, Merel. 2018. J’ai
l’impression que: Lexical bundles in the dialogues of beginner French
textbooks. Canadian Journal of Applied Linguistics/Revue canadienne de
linguistique
appliquée 21(2): 80–101.
