Introduction published in: Learner translation corpus research
Guest-edited by Sylviane Granger and Marie-Aude Lefer
[International Journal of Learner Corpus Research 9:1] 2023
pp. 1–28
Introduction
Learner translation corpora
Bridging the gap between learner corpus research and corpus-based translation studies
Published online: 11 April 2023
https://doi.org/10.1075/ijlcr.00032.gra
Article outline
- 1. Introduction
- 2. Interfacing learner corpus research and corpus-based translation studies
- 2.1 Learner corpus research
- 2.2 Corpus-based translation studies
- 2.3 Integrating the two perspectives
- 3. Overview of learner translation corpora
- 4. Main research strands in LTC-based research
- 4.1 Translation features
- 4.2 Teaching applications
- 4.3 Some emerging trends
- 5. Overview of the special issue
- Acknowledgements
- Notes
- References
References (94)
Alfuraih, R. F. (2020). The
undergraduate learner translator corpus: A new resource for translation studies and computational
linguistics. Language Resources &
Evaluation, 54, 801–830.
Altenberg, B. (1998). Connectors
and sentence openings in English and Swedish. In S. Johansson & S. Oksefjell (Eds.), Corpora
and cross-linguistic
research (pp. 115–143). Rodopi.
Baker, M. (1993). Corpus
Linguistics and Translation Studies: Implications and
applications. In M. Baker, G. Francis, & E. Tognini-Bonelli (Eds.), Text
and Technology: In honour of John
Sinclair (pp. 233–250). John Benjamins.
Baker, M. (1995). Corpora
in Translation Studies: An overview and some suggestions for future
research. Target, 7(2), 223–243.
Baker, M. (1996). Corpus-based
Translation Studies: The challenges that lie ahead. In H. Somers (Ed.), Terminology,
LSP and Translation: Studies in language engineering in honour of Juan C.
Sager (pp. 175–86). John Benjamins.
Barker, F., Salamoura, A., & Saville, N. (2015). Learner
corpora and language testing. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The
Cambridge Handbook of Learner Corpus
Research (pp. 511–533). Cambridge University Press.
Beeby, A., Rodríguez-Inés, P., & Sánchez-Gijón, P. (Eds.). (2009). Corpus
use and translating: Corpus use for learning to translate and learning corpus use to
translate. John Benjamins.
Borin, L., & Prütz, K. (2004). New
wine in old skins? A corpus investigation of L1 syntactic transfer in learner
language. In G. Aston, S. Bernardini, & D. Stewart (Eds.), Corpora
and language
learners (pp. 67–87). John Benjamins.
Bowker, L. (2012). Meeting
the needs of translators in the age of e-lexicography: Exploring the
possibilities. In S. Granger & M. Paquot (Eds.), Electronic
Lexicography (pp. 379–397). Oxford University Press.
Bowker, L., & Bennison, P. (2003). Student
Translation Archive: Design, development and application. In F. Zanettin, S. Bernardini, & D. Stewart (Eds.), Corpora
in translator
education (pp. 103–117). Routledge.
Castagnoli, S. (2020). Translation
choices compared: Investigating variation in a learner translation
corpus. In S. Granger & M.-A. Lefer (Eds.), Translating
and comparing languages: Corpus-based insights – Selected Proceedings of the Fifth ‘Using Corpora in Contrastive and
Translation Studies’
Conference (pp. 25–44). Presses universitaires de Louvain.
Castagnoli, S. (2016). Investigating
trainee translators’ contrastive pragmalinguistic competence: A corpus-based analysis of interclausal linkage in learner
translations. The Interpreter and Translator
Trainer, 10(3), 343–363.
Castagnoli, S. (2009). Regularities
and variations in learner translations: A corpus-based study of conjunctive
explicitation [Unpublished doctoral dissertation]. Pisa University.
Castagnoli, S., Ciobanu, D., Kübler, N., Kunz, K., & Volanschi, A. (2011). Designing
a learner translator corpus for training purposes. In N. Kübler (Ed.), Corpora,
language, teaching, and resources: From theory to
practice (pp. 221–248). Peter Lang.
Chesterman, A. (2004). Hypotheses
about translation universals. In G. Hansen, K. Malmkjaer, & D. Gile (Eds.), Claims,
Changes and Challenges in Translation
Studies (pp. 1–14). John Benjamins.
Chesterman, A. (2007). Similarity
analysis and the translation profile. Belgian Journal of
Linguistics, 21, 53–66.
Cook, G. (2010). Translation
in language teaching: An argument for reassessment. Oxford University Press.
De Sutter, G., Cappelle, B., De Clercq, O., Loock, R., & Plevoets, K. (2017). Towards
a corpus-based, statistical approach to translation quality: Measuring and visualizing linguistic deviance in student
translation. Linguistica Antverpiensia, New Series: Themes in Translation
Studies, 16, 25–39.
De Sutter, G., & Lefer, M.-A. (2020). On
the need for a new research agenda for corpus-based translation studies: A multi-methodological, multifactorial and
interdisciplinary
approach. Perspectives, 28(1), 1–23.
Díez-Bedmar, M. B. (2021). Error
analysis. In N. Tracy-Ventura & M. Paquot (Eds.), The
Routledge handbook of second language acquisition and
corpora (pp. 90–104). Routledge.
Espunya, A. (2014). The
UPF learner translation corpus as a resource for translator training. Language Resources and
Evaluation, 48, 33–43.
Ferraresi, A. (2019). Collocations
in contact: Exploring constrained varieties of English through corpora. Textus: English Studies
in Italy, 1/2019, 203–222.
Fictumova, J., Obrusnik, A., & Stepankova, K. (2017). Teaching
specialized translation: Error-tagged translation learner
corpora. Sendebar, 28, 209–241.
Frankenberg-Garcia, A. (2015). Training
translators to use corpora hands-on: challenges and reactions by a group of thirteen students at a UK
university. Corpora, 10(3), 351–380.
Gilquin, G. (2000/2001). The
Integrated Contrastive Model: Spicing up your data. Languages in
Contrast, 3(1), 95–123.
Gilquin, G. (2015). From
design to collection of learner corpora. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The
Cambridge Handbook of Learner Corpus
Research (pp. 9–34). Cambridge University Press.
Ginovart Cid, C., Colominas, C., & Oliver, A. (2020). Language
industry views on the profile of the post-editor. Translation
Spaces, 9(2), 283–313.
Graedler, A.-L. (2013). NEST –
A corpus in the brooding box. Studies in Variation, Contacts and Change in
English, 13.
Granger, S. (1996). From
CA to CIA and back: an integrated contrastive approach to computerized bilingual and learner
corpora. In K. Aijmer, B. Altenberg, & M. Johansson (Eds.), Languages
in contrast. Text-based cross-linguistic
studies (pp. 37–51). Lund University Press.
Granger, S. (1998). The
computerized learner corpus: a versatile new source of data for SLA
research. In S. Granger (Ed.), Learner
English on
computer (pp. 3–18). Addison Wesley Longman.
Granger, S. (2009). The
contribution of learner corpora to second language acquisition and foreign language teaching: A critical
evaluation. In K. Aijmer (Ed.), Corpora
and language
teaching (pp. 13–32). John Benjamins.
Granger, S. (2012). How
to use foreign and second language learner corpora. In A. Mackey & S. Gass (Eds.), Research
methods in second language acquisition: A practical
guide (pp. 7–29). Blackwell.
Granger, S. (2015). Contrastive
Interlanguage Analysis: A reappraisal. International Journal of Learner Corpus
Research, 1(1), 7–24.
Granger, S. (2018). Tracking
the third code: A cross-linguistic corpus-driven approach to metadiscursive
markers. In A. Cermakova & M. Mahlberg (Eds.), The
corpus linguistics
discourse (pp. 185–204). John Benjamins.
Granger, S. (2019). Formulaic
sequences in learner corpora: Collocations and lexical
bundles. In A. Siyanova-Chanturia & A. Pellicer-Sanchez (Eds.), Understanding
formulaic language: A second language acquisition
perspective (pp. 228–247). Routledge.
Granger, S., & Lefer, M.-A. (2016). From
general to learners’ bilingual dictionaries: Towards a more effective fulfilment of advanced learners’ phraseological
needs. International Journal of
Lexicography, 29(3), 279–295.
Granger, S., & Lefer, M.-A. (2020). The
Multilingual Student Translation corpus: a resource for translation teaching and
research. Language Resources and
Evaluation, 54, 1183–1199.
Granger, S., & Lefer, M.-A. (2021). Translation-oriented
Annotation System manual (Version 2.0). CECL Papers
3. Centre for English Corpus Linguistics/Université catholique de Louvain. [URL]
Granger, S., & Lefer, M.-A. (2022). Corpus-based
translation and interpreting studies: A forward-looking
review. In S. Granger & M.-A. Lefer (Eds.), Extending
the scope of corpus-based translation
studies (pp. 13–41). Bloomsbury.
Granger, S., & Paquot, M. (2015). Electronic
lexicography goes local: Design and structures of a needs-driven online academic writing
aid. Lexicographica – International Annual for
Lexicography, 31(1), 118–141.
Gries, S. Th. (2015). Statistics for learner corpus
research. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The
Cambridge Handbook of Learner Corpus
Research (pp. 159–181). Cambridge University Press.
Halverson, S. L. (2017). Gravitational
pull in translation. Testing a revised model. In G. De Sutter, M.-A. Lefer, & I. Delaere (Eds.), Empirical
translation studies: New methodological and theoretical
traditions (pp. 9–45). De Gruyter.
Hansen-Schirra, S., Neumann, S., & Steiner, E. (2012). Cross-linguistic
corpora for the study of translations: Insights from the language pair English-German. De Gruyter.
Hasselgård, H., & Ebeling, S. O. (2018). At
the interface between Contrastive Analysis and learner corpus research: A parallel contrastive
approach. Nordic Journal of English
Studies, 17(2), 182–214.
Herbst, T., Faulhaber, S., & Uhrig, P. (Eds.). (2011). The
phraseological view of language: A tribute to John Sinclair. Walter de Gruyter.
Ivaska, I., & Bernardini, S. (2020). Constrained
language use in Finnish: A corpus-driven approach. Nordic Journal of
Linguistics, 43(1), 33–57.
Ivaska, I., Ferraresi, A., & Bernardini, S. (2022). Syntactic
properties of constrained English: A corpus-driven approach. In S. Granger & M.-A. Lefer (Eds.), Extending
the scope of corpus-based translation
studies (pp. 133–157). Bloomsbury.
Johansson, S. (2007). Seeing
through Multilingual Corpora: On the Use of Corpora in Contrastive Studies. John Benjamins.
Kajzer-Wietrzny, M. (2022). An
intermodal approach to cohesion in constrained and unconstrained
language. Target, 34(1), 130–162.
Koletnik Korošec, M. (2013). Translation
in foreign language teaching. In N. K. Pokorn & K. Koskinen (Eds.), New
horizons in translation research and education
1 (pp. 61–74). Publications of the University of Eastern Finland Reports and Studies in Education, Humanities and Theology.
Kotze, H. (2019). Converging
what and how to find out why: An outlook on empirical translation
studies. In L. Vandevoorde, J. Daems, & B. Defrancq (Eds.), New
empirical perspectives on translation and
interpreting (pp. 333–370). Routledge.
Kotze, H. (2022). Translation
as constrained communication: Principles, concepts and
methods. In S. Granger & M.-A. Lefer (Eds.), Extending
the scope of corpus-based translation
studies (pp. 67–97). Bloomsbury.
Kruger, H., & Van Rooy, B. (2016). Constrained
language: A multidimensional analysis of translated English and non-native indigenised varieties of
English. English
World-Wide, 37(1), 26–57.
Kübler, N. (2008). A
comparable learner translator corpus: Creation and use. Proceedings of the Comparable Corpora
Workshop of the LREC Conference
2008 (Marrakech), 73–78. [URL]
Kübler, N., Mestivier, A., & Pecman, M. (2018). Teaching
specialised translation through corpus linguistics: quality assessment and methodology evaluation by experimental
approach. Meta, 63(3), 806–824.
Kübler, N., Mestivier, A., & Pecman, M. (2022). Using
comparable corpora for translating and post-editing complex noun phrases in specialized texts: Insights from English-to-French
specialized translation. In S. Granger & M.-A. Lefer (Eds.), Extending
the scope of corpus-based translation
studies (pp. 237–266). Bloomsbury.
Kunilovskaya, M., Ilyushchenya, T., Morgoun, N., & Mitkov, R. (2022). Source
language difficulties in learner translation: Evidence from an error-annotated
corpus. Target.
Kunilovskaya, M., & Lapshinova-Koltunski, E. (2019). Translationese
features as indicators of quality in English-Russian human
translation. In I. Temnikova, C. Orasan, G. Corpas Pastor, & R. Mitkov (Eds.), Proceedings
of the 2nd Workshop on Human-Informed Translation and Interpreting Technology (HiT-IT
2019) (pp. 47–56).
Kunilovskaya, M., Morgoun, N., & Pariy, A. (2018). Learner
vs. professional translations into Russian: Lexical profiles. Translation and
Interpreting, 10(1), 33–52.
Kutuzov, A., & Kunilovskaya, M. (2014). Russian
Learner Translator Corpus: Design, research potential and
applications. In P. Sojka, A. Horak, I. Kopecek, & K. Palak (Eds.), Text,
speech and dialogue: Lecture notes in computer
science (pp. 315–323). Springer.
Lanstyák, I., & Heltai, P. (2012). Universals
in language contact and translation. Across Languages and
Cultures, 13(1), 99–121.
Lapshinova-Koltunski, E. (2022). Detecting
normalisation and shining-through in novice and professional
translations. In S. Granger & M.-A. Lefer (Eds.), Extending
the scope of corpus-based translation
studies (pp. 182–206). Bloomsbury.
Lapshinova-Koltunski, E., Popović, M., & Koponen, M. (2022). DiHuTra:
a parallel corpus to analyse differences between human
translations. In L. Macken, A. Rufener, J. Van den Bogaert, J. Daems, A. Tezcan, B. Vanroy, M. Fonteyne, L. Barrault, M. R. Costa-jussà, E. Kemp, S. Pilos, Ch. Declercq, M. Koponen, M. L. Forcada, C. Scarton, & H. Moniz (Eds.), Proceedings
of the 23rd Annual Conference of the European Association for Machine
Translation (pp. 335–336). European Association for Machine Translation.
Laviosa, S. (1998). Core
patterns of lexical use in a comparable corpus of English narrative
prose. Meta, 43(4), 557–570.
Leedham, M., & Cai, G. (2013). Besides…on
the other hand: Using a corpus approach to explore the influence of teaching materials on Chinese students’ use
of linking adverbials. Journal of Second Language
Writing, 22, 374–389.
Lefer, M.-A. (2020). Parallel
corpora. In M. Paquot & S. Th. Gries (Eds.), A
practical handbook of corpus
linguistics (pp. 257–282). Springer.
Lefer, M.-A., Piette, J., & Bodart, R. (2022). Machine
Translation Post-Editing Annotation System (MTPEAS) manual (Version
1.0). OER UCLouvain. [URL]
Lefer, M.-A., & Vogeleer, S. (Eds.). (2013). Interference
and normalisation in genre-controlled multilingual corpora. Belgian Journal of
Linguistics, 27.
Loock, R. (2020). It’s
non-canonical word order that you should use! A corpus approach to avoiding standardized word order in translated
French. In S. Granger & M.-A. Lefer (Eds.), Translating
and comparing languages: Corpus-based insights – Selected Proceedings of the Fifth ‘Using Corpora in Contrastive and
Translation Studies’
Conference (pp. 69–85). Presses universitaires de Louvain.
Macken, L., De Clercq, O., & Paulussen, H. (2011). Dutch
Parallel Corpus: A balanced copyright-cleared parallel
corpus. Meta, 56(2), 374–390.
Malmkjaer, K. (1998). Love
thy Neighbour: Will Parallel Corpora Endear Linguists to
Translators? Meta, 43(4), 1–8.
Milton, J., & Tsang, E. S. C. (1993). A
corpus-based study of logical connectors in EFL students’ writing: directions for future
research. In R. Pemberton & E. S. C. Tsang (Eds.), Studies
in
lexis (pp. 215–246). The Hong Kong Institute of Science and Technology.
Nesselhauf, N. (2004). Learner
corpora and their potential in language teaching. In J. Sinclair (Ed.), How
to use corpora in language
teaching (pp. 125–152). John Benjamins.
Neumann, S., Kerz, E., & Heilmann, A. (forthcoming). Comparing
contact effects in translation and second language learning. In H. Kotze & B. Van Rooy (Eds.), Constraints
on language variation and change in complex multilingual contact settings. John Benjamins.
Olohan, M., & Baker, M. (2000). Reporting
that in translated English: Evidence for subconscious processes of explicitation? Across
Languages and
Cultures, 1(2), 141–158.
Paquot, M., & Plonsky, L. (2017). Quantitative
research methods and study quality in learner corpus research. International Journal of Learner
Corpus
Research, 3(1), 61–94.
Penha-Marion, L. A. de S., Gilquin, G., & Lefer, M.-A. (forthcoming). The
effect of directionality on lexico-syntactic simplification in French><English student
translation. In H. Kotze & B. Van Rooy (Eds.), Constraints
on language variation and change in complex multilingual contact settings. John Benjamins.
Redelinghuys, K., & Kruger, H. (2015). Using
the features of translated language to investigate translation expertise: A corpus-based
study. International Journal of Corpus
Linguistics, 20(3), 293–325.
Reynaert, R., Macken, L., Tezcan, A., & De Sutter, G. (2021). Building
a new-generation corpus for empirical translation studies: the Dutch Parallel Corpus
2.0. In V. Wang, L. Lim, & D. Li (Eds.), New
perspectives on corpus translation
studies (pp. 75–100). Springer.
Rundell, M., & Granger, S. (2007). From
corpora to confidence. English Teaching
Professional, 50, 15–18.
Schneider, G., & Gilquin, G. (2016). Detecting
innovations in a parsed corpus of learner English. International Journal of Learner Corpus
Research, 2(2), 177–204.
Seidlhofer, B. (2002). Pedagogy
and local learner corpora. In S. Granger, J. Hung, & S. Petch-Tyson (Eds.), Computer
learner corpora, second language acquisition and foreign language
teaching (pp. 213–234). John Benjamins.
Sinclair, J. (1996). Preliminary
recommendations on corpus typology [Technical report]. EAGLES (Expert Advisory Group on
Language Engineering Standards). [URL]
Swain, M. (1995). Three
functions of output in second language learning. In G. Cook & B. Seidlhofer (Eds.), Principle
and Practice in Applied
Linguistics (pp. 125–144). Oxford University Press.
Tsagari, D., & Floros, G. (Eds.). (2013). Translation
in language teaching and assessment. Cambridge Scholars Publishing.
Uzar, R., & Walinski, J. (2001). Analysing
the fluency of translators. International Journal of Corpus
Linguistics, 6, 155–166.
Vanderbauwhede, G. (2012). The
Integrated Contrastive Model evaluated: The French and Dutch demonstrative determiner in L1 and
L2. International Journal of Applied
Linguistics, 22(3), 392–413.
Van Vuuren, S., & Berns, J. (2018). Same
difference? L1 influence in the use of initial adverbials in English novice
writing. IRAL, 56(4), 427–461.
Wurm, A. (2016). Presentation
of the KOPTE Corpus and Research Project. [URL]
Wurm, A. (2020). Translation
quality in an error-annotated translation learner corpus. In S. Granger & M.-A. Lefer (Eds.), Translating
and comparing languages: Corpus-based insights – Selected Proceedings of the Fifth ‘Using Corpora in Contrastive and
Translation Studies’
Conference (pp. 141–162). Presses universitaires de Louvain.
Xiao, R. (2007). What
can SLA learn from contrastive corpus linguistics? The case of passive constructions in Chinese learner
English. Indonesian Journal of English Language
Teaching, 3(1), 1–19.
Cited by (13)
Al Nafjan, Eman & Sabria Jawhar
Giczela-Pastwa, Justyna
Granger, Sylviane & Magali Paquot
Hovhannisyan, Gayane R., Kristina A. Harutyunova, Hayk B. Hambardzumyan & Srbuhi Aydinyan
Larsson, Tove & Gaëtanelle Gilquin
Yang, Chunwen, Jing Chen & Shuai Hou
Al-Harthi, Maha, Amal Alsaif, Eman Al-Nafjan, Fatma Alshihri, Mahmoud Saleh & Muhammad Afzaal
Bodart, Romane, Justine Piette & Marie-Aude Lefer
2024. The Machine Translation Post-Editing Annotation System (MTPEAS). Translation Spaces 13:2, pp. 265 ff.
Granger, Sylviane
2024. From early to future learner corpus research. International Journal of Learner Corpus Research 10:2, pp. 247 ff.
Hovhannisyan, Gayane
Stritar Kučuk, Mojca
Xu, Wei
This list is based on CrossRef data as of 12 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
