Article published In: International Journal of Learner Corpus Research
Vol. 8:1 (2022) ► pp.97–120
Corpus reports
Leonide
A longitudinal trilingual corpus of young learners of Italian, German and English
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Published online: 8 March 2022
https://doi.org/10.1075/ijlcr.21004.gla
https://doi.org/10.1075/ijlcr.21004.gla
Abstract
This article presents the longitudinal trilingual corpus of young learners of Italian, German and English called
LEONIDE. The corpus consists of L1, L2 and L3 learner texts. L1 texts were written in two languages of schooling (i.e. Italian and
German), L2 texts in two languages learned as second languages (i.e. German and Italian), and L3 texts in an additional foreign
language (i.e. English). All texts were collected from a group of lower secondary school pupils from the multilingual Italian
province of South Tyrol whose development in all three languages was observed over a period of three years. Each text comes with
rich metadata as well as manual and automatic annotations.
Article outline
- 1.Introduction
- 2.Origin of the corpus data
- 2.1Linguistic situation of South Tyrol
- 2.2The project One School, Many Languages
- 3.Corpus design
- 3.1Written language assessment tasks
- 3.2Participants
- 4.Corpus data
- 4.1Corpus size
- 4.2Transcription
- 4.3Manual annotations
- 4.4Automatic annotations
- 4.5Metadata
- 5.Corpus access and reusability
- 6.Potential future use of LEONIDE in LCR and corpus-based language teaching
- Acknowledgements
- Notes
References
References (38)
Abel, A., Glaznieks, A., Nicolas, L., & Stemle, E. W. (2014). KoKo:
An L1 learner corpus for German. Proceedings of the 9th International Conference on Language
Resources and Evaluation (LREC
2014) (pp. 2414–2421).
Alber, E. (2012). South
Tyrol’s education system: Plurilingual answers for monolinguistic spheres? L’Europe en
Formation,
363
(1), 399–415.
Astat. (2012). Volkszählung 2011 –
Censimento della popolazione 2011. astatinfo 38. [URL]
Bettoni, C., & di Biase, B. (Eds.). (2015). Grammatical
development in second languages: Exploring the boundaries of Processability Theory. The European Second Language Association.
Boyd, A., Hana, J., Nicolas, L., Meurers, D., Wisniewski, K., Abel, A., Schöne, K., Štindlová, B., & Vettori, C. (2014). The
MERLIN corpus: learner language and the CEFR. Proceedings of the 9th International Conference
on Language Resources and Evaluation (LREC
2014) (pp. 1281–1288).
Cummins, J. (1984). Wanted:
A theoretical framework for relating language proficiency to academic achievement among bilingual
students. In C. Rivera (Ed.), Language
proficiency and academic
achievement (pp. 2–19). Mulitlingual Matters.
de Bot, K., & Jaensch, C. (2015). What
is special about L3 processing? Bilingualism: Language and
Cognition,
18
(2), 130–144.
Deutsches
Schulamt (2009). Rahmenrichtlinien des Landes für die Festlegung der Curricula für
die Grundschule und die Mittelschule an den autonomen deutschsprachigen Schulen in
Südtirol. Deutsches Schulamt.
Dipartimento Istruzione e Formazione
italiana (2008). Indicazioni provinciali per la definizione dei curriculi del
primo ciclo d’istruzione della scuola in lingua italiana della Provincia Autonoma di
Bolzano. Provincia Autonoma di Bolzano.
Dirdal, H., Danbolt Drange, E.-M., Graedler, A.-L., Guldal, T. M., Hasund, I. K., Nacey, S. L., & Rørvik, S. (2017). Tracking
Written Learner Language (TRAWL): A longitudinal corpus of Norwegian pupils’ written texts in second/foreign
languages. Book of Abstracts of the 4th Learner Corpus Research Conference – LCR 2017
(Bolzano/Bozen, 5–7 October
2017) (pp. 182–183).
Ehlich, K. (2005). Sprachaneignung
und deren Feststellung bei Kindern mit und ohne Migrationshintergrund – Was man weiß, was man braucht, was man erwarten
kann. In Bundesministerium für Bildung und
Forschung (Ed.), Bildungsreform Band 11: Anforderungen an Verfahren der
regelmäßigen Sprachstandsfeststellung als Grundlage für die frühe und individuelle Förderung von Kindern mit und ohne
Migrationshintergrund (pp. 11–63). BMBF.
Engel, D., & Stopfner, M. (2019). Communicative
competence in the context of increasing diversity in South Tyrolean
schools. In E. Vetter & U. Jessner (Eds.), International
research on multilingualism: Breaking with the monolingual
perspective (pp. 59–80). Springer Nature.
Flick, U. (2011). Das
episodische Interview. In G. Oelerich & H.-U. Otto (Eds.), Empirische
Forschung und Soziale Arbeit: Ein
Studienbuch (pp. 273–280). VS Verlag für Sozialwissenschaften/Springer.
Gogolin, I. (2004). Lebensweltliche
Mehrsprachigkeit. In K.-R. Bausch, F. G. Königs, & H.-J. Krumm (Eds.), Mehrsprachigkeit
im Fokus: Arbeitspapiere der 24. Frühjahrskonferenz zur Erforschung des
Fremdsprachenunterrichts (pp. 55–61). Narr.
Gogolin, I., Lange, I., Hawighorst, B., Bainski, C., Heintze, A., Rutten, S., & Saalmann, W. (2011). Durchgängige
Sprachbildung: Qualitätsmerkmale für den
Unterricht. Waxmann.
Granger, S. (2015). The
contribution of learner corpora to reference and instructional materials
design. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The
Cambridge handbook of learner corpus
research (pp. 486–510). Cambridge University Press.
Granger, S., & Paquot, M. (2017). Core
metadata for learner corpora. Draft 1.0. (Unpublished
manuscript). Université catholique de Louvain.
Grießhaber, W. (2006). Testen
nichtdeutschsprachiger Kinder bei der Einschulung mit dem Verfahren der Profilanalyse – Konzeption und praktische
Erfahrungen. In B. Ahrenholz, & E. Apeltauer (Eds.), Zweitsprachenerwerb
und curriculare Dimensionen. Empirische Untersuchungen zum Deutschlernen in Kindergarten und
Grundschule (pp. 73–90). Stauffenburg.
(2010). Sprachkenntnisse
einschätzen – Schreibfertigkeiten fördern. In C. Benholz, G. Kniffka, & E. Winters-Ohle (Eds.), Fachliche
und sprachliche Förderung von Schülern mit Migrationsgeschichte. Beiträge des Mercator-Symposions im Rahmen des 15.
AILA-Weltkongresses „Mehrsprachigkeit: Herausforderungen und
Chancen“ (pp. 115–135). Waxmann.
Karges, K., Studer, T., & Wiedenkeller, E. (2019). On
the way to a new multilingual learner corpus of foreign language learning in school: Observations about task
variations. In A. Abel, A. Glaznieks, V. Lyding, & L. Nicolas (Eds.), Widening
the Scope of Learner Corpus Research. Selected Papers from the Fourth Learner Corpus Research
Conference (pp. 137–165). Presses universitaires de Louvain.
Keßler, J.-U. (2006). Englischerwerb
im Anfangsunterricht diagnostizieren. Linguistische Profilanalysen am Übergang von der Primarstufe in die Sekundarstufe
I. Narr.
Leech, G. (1997). Teaching
and language corpora: A convergence. In A. Wichmann, S. Fligelstone, T. Mc Enery, & G. Knowles (Eds.), Teaching
and language
corpora (pp. 1–23). Addison Wesley Longman.
Lüdi, G. (2006). Multilingual
repertoires and the consequences for linguistic theory. In K. Bührig, & J. D. ten Thije (Eds.), Beyond
misunderstanding: Linguistic analyses of intercultural
communication (pp. 11–42). John Benjamins.
Okinina, N., Nicolas, L., & Lyding, V. (2018). Transc&Anno:
A graphical tool for the transcription and on-the-fly annotation of handwritten
documents. Proceedings of the Eleventh International Conference on Language Resources and
Evaluation (LREC
2018) (pp. 701–705).
Pienemann, M. (1998). Language
processing and second language development: Processability Theory. John Benjamins.
Reich, H. (2010). Sprachstanderhebung,
ein- und mehrsprachig. In B. Ahrenholz, & I. Oomen-Welke (Eds.), Deutsch
als
Zweitsprache (pp. 420–429). Schneider Hohengehren.
Tono, Y., & Díez-Bedmar, M. B. (2014). Focus
on learner writing at the beginning and intermediate stages: The ICCI corpus. International
Journal of Corpus
Linguistics,
19
(2), 163–177.
Voltmer, L. (2007). Languages
in South Tyrol: Historical and legal aspects. In A. Abel, M. Stuflesser, & L. Voltmer (Eds.), Aspects
of multilingualism in European border regions: Insights and views from Alsace, Eastern Macedonia and Thrace, Lublin
Voivodeship and South
Tyrol (pp. 201–219). Europäische Akademie Bozen.
Cited by (14)
Cited by 14 other publications
Arhar Holdt, Špela & Iztok Kosem
Brasolin, Paolo & Arianna Bienati
Frey, Jennifer-Carmen
Lopopolo, Olga, Arianna Bienati, Jennifer-Carmen Frey, Aivars Glaznieks & Stefania Spina
Pauls, Tobias
Forti, Luciana
2024. Proficiency-rated learner corpora. International Journal of Learner Corpus Research 10:1 ► pp. 216 ff.
Granger, Sylviane
2024. From early to future learner corpus research. International Journal of Learner Corpus Research 10:2 ► pp. 247 ff.
Ramponi, Alan
Spina, Stefania, Irene Fioravanti, Luciana Forti & Fabio Zanda
Xu, Bo
Abidah, Khilda Husnia, Elisa Ratih, Evynurul Laily Zen, Ira Maria Fran Lumbanbatu & Anisatul Ilmiah
Glaznieks, Aivars, Jennifer-Carmen Frey, Andrea Abel, Lionel Nicolas & Chiara Vettori
Lüdtke, Ulrike, Juan Bornman, Febe de Wet, Ulrich Heid, Jörn Ostermann, Lars Rumberg, Jeannie van der Linde & Hanna Ehlert
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
