In: Parallel Corpora for Contrastive and Translation Studies: New resources and applications
Edited by Irene Doval and M. Teresa Sánchez Nieto
[Studies in Corpus Linguistics 90] 2019
► pp. 19–38
Comparable parallel corpora
A critical review of current practices in corpus-based translation studies
Published online: 20 March 2019
https://doi.org/10.1075/scl.90.02har
Are papers presented in corpus-based translation studies truly scientific? Such studies are normally conducted on only one language pair, often on purpose-made parallel corpora, and normally cannot be replicated. Their value is therefore limited in a strictly scientific sense. The use of comparable parallel corpora allows both for the replication of studies and for the testing of complex hypotheses such as Halverson’s Gravitational Pull hypothesis. This chapter defines and discusses the concept of comparable parallel corpora and exemplifies their value by illustrating their use. The chapter also presents hopes for the future, as groundbreaking new technology that will allow the linguist to create her own parallel corpora without the aid of computer scientists is currently being launched at the University of León in Spain.
Article outline
- 1. Introduction
- 2. Comparable parallel corpora
- 3. Universals of translation
- 4. What makes parallel corpora comparable, and does size matter?
- 5. The sampling challenge of the NSPC
- 6. Testing of the gravitational pull hypothesis
- 7. Conclusions
Acknowledgment Notes References
References (51)
Altenberg, Bengt & Granger, Sylviane. 2002. Recent trends in cross-linguistic lexical studies. In Lexis in Contrast. Corpus Based Approaches, Bengt Altenberg & Sylviane Granger (eds), 3–48. Amsterdam: John Benjamins.
Baker, Mona. 1993. Corpus Linguistics and Translation Studies. In Text and Technology: in Honour of John Sinclair, G. Francis & E. Tognini-Bonelli (eds), 233–252. Amsterdam: John Benjamins.
Baker, Mona. 1996. Corpus-based translation studies: The challenges that lie ahead. In Terminology, LSP and Translation: Studies in Language Engineering in honour of Juan C. Sager, Harold L. Somers (ed.), 175–186. Amsterdam: John Benjamins.
Baker, Mona. 1999. The role of corpora in investigating the linguistic behaviour of professional translators. International Journal of Corpus Linguistics 4(2): 281–298.
Becher, Victor. 2010. Abandoning the notion of translation-inherent explicitation. Against a dogma of translation studies. Across Languages and Cultures 11(1): 1–28.
Biber, Douglas. 1993. Representativeness in corpus design. Literary and Linguistic Computing 8(4): 243–257.
Blum-Kulka, Shoshana & Levenston, Eddie A. 1983. Universals of lexical simplification. In Strategies in Interlanguage Communication, C. Faerch & G. Kasper (eds), 119–139. London & New York: Longman.
Blum-Kulka, Shoshana. 1986. Shifts of cohesion and coherence in translation. In Interlingual and Intercultural Communication: Discourse and Cognition in Translation and Second Language Acquisition Studies, Julianne House & Shoshana Blum-Kulka (eds), 17–35. Tübingen: Gunter Narr.
Cappelle, Bert. 2012. English is less rich in manner-of-motion verbs when translated from French. Across Languages and Cultures 13(2): 173–195.
Chesterman, Andrew. 2004. Beyond the particular. In Translation Universals: Do they exist?, Anna Mauranen & Pekka Kujamäki (eds), 33–49. Amsterdam: John Benjamins.
De Sutter, Gert, Goethals, Patrick, Leuschner, Torsten & Vandepitte, Sonia. 2012a. Towards methodologically more rigorous corpus-based translation studies. In Across Languages and Cultures 13(2): 137–143.
De Sutter, Gert, Goethals, Patrick, Leuschner, Torsten & Vandepitte, Sonia. 2012b. Towards methodologically more rigorous corpus-based translation studies. Across Languages and Cultures 13(2): 137–143.
Eskola, Sari. 2004. Untypical frequencies in translated language. A corpus-based study on a literary corpus of translated and non-translated Finnish. In Translation Universals: Do they exist?, Anna Mauranen & Pekka Kujamäki (eds), 83–99. Amsterdam: John Benjamins.
Gass, Susan M. & Selinker, Larry. 2008. Second Language Acquisition: An Introductory Course. New York: Routledge.
Goldschneider, Jennifer M. & DeKeyser, Robert. 2005. Explaining the “Natural Order of L2 Morpheme Acquisition” in English: A Meta-analysis of Multiple Determinants. Language Learning 55(1): 22–77.
Halverson, Sandra. 2003. The cognitive basis of translation universals. Target 15(2): 197–241.
Halverson, Sandra. 2007. Investigating Gravitational Pull in Translation: The Case of the English Progressive Construction. In Text, Processes, and Corpora: Research Inspired by Sonja Tirkkonen-Condit, Riitta Jääskeläinen, Tiina Puurtinen & Hilkka Stotesbury (eds). Savonlinna: Savonlinna School of Translation Studies 5.
Halverson, Sandra. 2009. Elements of doctoral training: The logic of the research process, research design and the evaluation of design quality. The Interpreter and Translator Trainer 3(1): 79–106.
Halverson, Sandra. 2010. Cognitive translation studies: Developments in theory and method. In Translation and Cognition, Gregory M. Shreve & Erik Angelone (eds). Amsterdam: John Benjamins.
Hareide, Lidun & Hofland, Knut. 2012. Compiling a Norwegian–Spanish Parallel Corpus: methods and challenges. In Quantitative Methods in Corpus Based Translation Studies, Michael Oakes & Meng Ji (eds), 75–114. Amsterdam: John Benjamins.
Hareide, Lidun. 2014. Is there Gravitational Pull in translation? A corpus-based test of the Gravitational Pull Hypothesis on the language pairs Norwegian–Spanish and English–Spanish. In Testing the Gravitational Pull Hypothesis in translation – A Corpus-Based Study of the Gerund in Translated Spanish. PhD thesis, University of Bergen.
Hareide, Lidun. 2017a. The translation of formal source language lacunas: An empirical study of the overrepresentation of target-language specific features and the unique items hypotheses. In Corpus Methodologies Explained, Meng Ji, Lidun Hareide, Defeng Li & Michael Oakes (eds), 137–187. London & New York: Routledge.
Hareide, Lidun. 2017b. Is there gravitational pull in translation? A corpus-based study of the Gravitational Pull Hypothesis on the language pairs Norwegian–Spanish and English–Spanish. In Corpus Methodologies Explained, Meng Ji, Lidun Hareide, Defeng Li & Michael Oakes (eds), 137–187. London & New York: Routledge.
Izquierdo Fernández, Marlén. 2008. Estudio contrastivo y de traducción de las construcciones de -ing inglesas y sus equivalentes en español, Departamento de Filología Moderna, Universidad de León, León.
Izquierdo Fernández, Marlén. 2012. Corpus-based functionality and translatability: English–Spanish progressive constructions in contrast and translation. Languages in Contrast 12(2): 186–210.
Izquierdo, Marlén, Hofland, Knut & Reigem, Øystein. 2008. The ACTRES parallel corpus: an English–Spanish translation corpus. Corpora 3(1): 31–41.
Jenset, Gard Buen & Hareide, Lidun. 2013. A multidimensional approach to aligned sentences in translated text. In The Many Facets of Corpus Linguistics in Bergen - In Honour of Knut Hofland, Lidun Hareide, Christer Johansson & Michael Oakes (eds), 195–210. Bergen: Bergen Language and Linguistics Studies (BeLLS).
Ji, Meng, Hareide, Lidun, Li, Defeng & Oakes, Michael. 2017. Introduction to Corpus Methodologies Explained: An empirical approach to translation studies. In Corpus Methodologies Explained: An empirical approach to translation studies, Meng Ji, Lidun Hareide, Defeng Li & Michael Oakes (eds), 1–4. London: Routledge.
Kujamäki, Pekka. 2004. What happens to ‘unique items’ in learners’ translation. In Translation Universals: Do they exist?, Anna Mauranen & Pekka Kujamäki (eds), 187–204. Amsterdam: John Benjamins.
Laviosa, Sara. 1997. How comparable can ‘comparable corpora’ be? Target 9(2): 289–319.
Leech, Geoffrey. 1992. Corpora and theories of linguistic performance. In Directions in Corpus Linguistics: Proceedings of the Nobel Symposium 82, Stockholm, 4–8 August 1991, Jan Svartvik (ed.), 105–122. Berlin: Mouton de Gruyter.
Leech, Geoffrey. 2007. New resources, or just better old ones? The Holy Grail of representativeness. In Corpus Linguistics and the Web, Marianne Hundt, Nadja Nesselhauf & Carolin Biewer (eds), 133–149. Amsterdam: Rodopi.
Mauranen, Anna & Kujamäki, Pekka. 2004. Introduction. In Translation Universals: Do they exist?, Anna Mauranen & Pekka Kujamäki (eds), 65–82. Amsterdam: John Benjamins.
McEnery, Tony, Tono, Yukio & Xiao, Richard. 2006. Corpus-Based Language Studies: An Advanced Resource Book [Routledge Applied Linguistics]. London & New York: Routledge.
Rabadán, Rosa, Labrador, Belén & Ramón, Noelia. 2009. Corpus-based contrastive analysis and translation universals. A tool for translation quality assessment English → Spanish. Babel 55(4): 303–328.
Rabadán, Rosa. 2005. Hipótesis, explicaciones y aplicaciones: los caminos de la investigación en traducción inglés-español. In Estudios de Traducción, Lingüística y filología dedicados a Valentín García Yebra, 148–170. Arco Libros.
Rabadán, Rosa. 2005–2008. Tools for English–Spanish Cross-Linguistic Applied Research. Journal of English Studies 5–6: 309–324.
Rabadán, Rosa. 2007. Divisions, descriptions and applications: The interface between DTS, corpus-based research and contrastive analysis. In Doubts and Directions in Translation Studies, Yves Gambier, Miriam Shlesinger & Radegundis Stolze (eds), 237–252. Amsterdam: John Benjamins.
RAE. 2009. Nueva gramática de la lengua Española / Real Academia Española; Asociación de Academias de la Lengua Española. Madrid.
Sanchez, Tara. 2006. The progressive in the Spoken Papiamentu of Aruba. In Structure and Variation in Language Contact, Ana Deumert & Stephanie Durrleman-Tame (eds), 291–314. Amsterdam: John Benjamins.
Sanjurjo González, Hugo. Creación de un Framework para el Tratamiento de Corpus Lingüísticos. PhD thesis, Departamento de Ingeniería Eléctrica y de Sistemas y Automática, Universidad de León.
Shlesinger, Miriam. 1991. Interpreter latitude vs. due process. Simultaneous and consecutive interpretation in multilingual trials. In Empirical Research in Translation and Intercultural Studies, Sonja Tirkkonen-Condit (ed.), 147–155. Tübingen: Gunter Narr.
Snell-Hornby, Mary. 2006. The turns of Translation Studies. New paradigms or shifting viewpoints? Amsterdam/Philadelphia: John Benjamins.
Tirkkonen-Condit, Sonja. 2002. Unique Items – over- or under-represented in translated language? Target 14(2): 207–220.
Tirkkonen-Condit, Sonja. 2004. Unique Items – over- or under-represented in translated language? In Translation Universals: Do they exist?, Anna Mauranen & Pekka Kujamäki (eds), 177–184. Amsterdam: John Benjamins.
Toury, Gideon. 1991. What are descriptive translation studies into translation likely to yield apart from isolated descriptions. In Translation Studies: The State of the Art, Kitty M. Van Leuven-Zwart & Ton Naaijkens (eds), 179–192. Amsterdam: Rodopi.
Tymoczko, Maria. 1998. Computerized Corpora and the Future of Translation Studies. Meta 43(4): 652–660.
Cited by (8)
Cited by eight other publications
Schmidhofer, Astrid & Jesús Manuel Millán Vidal
Shang, Xin
2023. When Contrastive Analysis meets Translation Studies. Target. International Journal of Translation Studies 35:2 ► pp. 186 ff.
Grabowski, Łukasz & Nicholas Groom
2022. Functionally-defined recurrent multi-word units in English-to-Polish translation. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 35:1 ► pp. 1 ff.
Oakes, Michael & Meng Ji
Shevchuk, Mozghov & Vasylkivska
Zavala-Rojas, Diana, Danielly Sorato, Lidun Hareide & Knut Hofland
Karakanta, Alina, Heike Przybyl & Elke Teich
2021. Exploring variation in translation with probabilistic language models. In Corpora in Translation and Contrastive Research in the Digital Age [Benjamins Translation Library, 158], ► pp. 307 ff.
This list is based on CrossRef data as of 1 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
