Article published In: International Journal of Corpus Linguistics
Vol. 22:2 (2017) ► pp.270–297
Methodological issues in the use of directional parallel corpora
A case study of English and French concessive connectives
Published online: 16 October 2017
https://doi.org/10.1075/ijcl.22.2.05dup
https://doi.org/10.1075/ijcl.22.2.05dup
Abstract
The recent emergence of large parallel corpora has represented a leap ahead for cross-linguistic and translation studies. However, the specificities of these corpora and their influence on the nature of observed linguistic phenomena remain underexplored, especially in the field of contrastive linguistics. In this study, we compare the translation equivalences of four concessive adverbial connectives in English and in French across three corpora varying along three dimensions: register, directionality of the translation and translator expertise. Our results indicate that these dimensions affect the cross-linguistic equivalences observed between connectives. We conclude that, in future work, translation-based claims about cross-linguistic equivalences should be balanced according to the type of data analysed. We also identify a pressing need for more rigorously-documented parallel corpora for the English-French language pair.
Keywords: translation corpora, concessive connectives, register variation, English, French
Article outline
- 1.Introduction
- 2.Register effects in contrastive corpus studies
- 3.Cross-linguistic studies of connectives
- 4.Equivalences between English and French connectives in three directional parallel corpora
- 4.1Data
- 4.2Extraction and disambiguation of concessive connectives
- 5.Results
- 5.1Frequencies of concessive connectives across the three corpora
- 5.2Translations of the concessive connectives across the three corpora
- 6.Discussion
- 6.1Register differences in cross-linguistic equivalences
- 6.2Differences between the translation directions
- 6.3Differences related to the translator’s expertise
- 6.4Other potentially relevant factors
- 7.Conclusion
- Acknowledgements
- Notes
References
References (58)
Albl-Mikasa, M. (2013). ELF speakers’ restricted power of expression: Implications for interpreters’ processing. Translation and Interpreting Studies, 8(2), 191–210.
Altenberg, B. (1999). Adverbial connectors in English and Swedish: Semantic and lexical correspondences. In H. Hasselgård & S. Oksefjell (Eds.), Out of Corpora. Studies in Honour of Stig Johansson (pp. 249–268). Amsterdam: Rodopi.
(2007). The correspondence of resultive connectors in English and Swedish. Nordic Journal of English Studies, 6(1), 1–26. Retrieved from [URL] (last accessed November 2015).
Asr, F., & Demberg, V. (2012). Implicitness of discourse relations. In M. Kay & C. Boitet (Eds.), Proceedings of COLING: Technical Papers (pp. 2669–2684). Mumbai: Indian Institute of Technology Bombay.
Baker, M. (1993). Corpus linguistics and translation studies: Implications and applications. In M. Baker, G. Francis & E. Tognini-Bonelli (Eds.), Studies in Text and Technology. In Honour of John Sinclair (pp. 233–250). Amsterdam/Philadelphia: John Benjamins.
(1995). Corpora in translation studies: An overview and some suggestions for future research. Target, 7(2), 223–243.
Barlow, M. (2008). Parallel texts and corpus-based contrastive analysis. In M. de los Ángeles Gómez González, J. Lachlan Mackenzie & E. González Álvarez (Eds.), Current Trends in Contrastive Linguistics. Functional and Cognitive Perspectives (pp. 101–121). Amsterdam/Philadelphia: John Benjamins.
Becher, V. (2011). When and why do translators add connectives? Target, 23(1), 26–47.
Beeby Lonsdale, A. (2009). Directionality. In G. Saldanha & M. Baker (Eds.), Routledge Encyclopedia of Translation Studies (pp. 84–88). London: Routledge.
Bernardini, S., & Baroni, M. (2005). Spotting translationese: A corpus-driven approach using support vector machines. In P. Danielsson & M. Wagenmakers (Eds.), Proceedings of Corpus Linguistics 2005, Vol. 11 (pp. 1–12). Birmingham: University of Birmingham.
Blakemore, D. (2002). Relevance and Linguistic Meaning. The Semantics and Pragmatics of Discourse Markers. Cambridge: Cambridge University Press.
Blumenthal, P. (1980). La Syntaxe du Message. Application au Français Moderne. Tübingen: Max Niemeyer Verlag.
Bowker, L. (2005). Productivity vs. quality? A pilot study on the impact of translation memory systems. Localisation Focus, 4(1), 13–20.
Candel-Mora, M. (2015). Comparable corpus approach to explore the influence of computer-assisted translation systems on textuality. Procedia – Social and Behavioural Sciences, 1981, 67–73.
Cartoni, B., Zufferey, S., & Meyer, T. (2013). Using the Europarl corpus for linguistic research. Belgian Journal of Linguistics, 271, 23–42.
Cettolo, M., Girardi, C., & Federico, M. (2012). WIT: Web Inventory of Transcribed and Translated Talks. Proceedings of the 16th EAMT Conference (pp. 261–268). Trento, Italy.
Das, D., & Taboada, M. (2013). Explicit and implicit coherence relations: A corpus study. In S. Luo (Ed.), Proceedings of the 2013 annual conference of the Canadian Linguistic Association. Victoria: University of Victoria.
Degand, L. (2004). Contrastive analyses, translation and speaker involvement: The case of ‘puisque’ and ‘aangezien’. In M. Achard & S. Kemmer (Eds.), Language, Culture and Mind (pp. 251–270). Stanford: CSLI Publications.
Delaere, I., & De Sutter, G. (2016). Variability of English loanword use in Belgian Dutch translations. Measuring the effect of source language, register, and editorial intervention. In G. De Sutter, M. -A. Lefer & I. Delaere (Eds.), Empirical Translation Studies. New Methodological and Theoretical Traditions. Berlin: Mouton De Gruyter.
Elimam, A. (2007). The impact of translation memory tools on the translation profession. Translation Journal, 11(1).Retrieved from [URL] (last accessed August 2017).
Englund Dimitrova, B. (2005). Expertise and Explicitation in the Translation Process. Amsterdam/Philadelphia: John Benjamins.
Granger, S. (2014). A lexical bundle approach to comparing languages. Stems in English and French. Languages in Contrast, 14(1), 58–72.
Gregory, M., & Carroll, S. (1978). Language and Situation: Language Varieties and their Social Contexts. London: Routledge.
Halverson, S. (2004). Connectives as a translation problem. In H. Kittel, A. Frank, N. Greiner, T. Hermans, W. Koller, J. Lamber & F. Paul (Eds.), Translation: An International Encyclopedia of Translation Studies (pp. 562–572). Berlin/New York: Walter de Gruyter.
Hansen-Schirra, S., Neumann, S., & Steiner, E. (2012). Cross-Linguistic Corpora for the Study of Translations. Insights from the Language Pair English-German. Berlin: Mouton De Gruyter.
Hoek, J., & Zufferey, S. (2015). Factors influencing the implicitation of discourse relations across languages. In H. Bunt (Ed.), Proceedings of the 11th Joint ISO-ACL/SIGSEM Workshop on Interoperable Semantic Annotation (pp. 39–45). Tilburg: Tilburg Centre for Cognition and Communication.
Holland, R. (2012). News translation. In C. Millán & F. Bartrina (Eds.) The Routledge Handbook of Translation Studies (pp. 332–346). London: Routledge.
Johansson, S. (2007). Seeing Through Multilingual Corpora. On the Use of Corpora in Contrastive Studies. Amsterdam/Philadelphia: John Benjamins.
Knott, A., & Dale, R. (1994). Using linguistic phenomena to motivate a set of coherence relations. Discourse Processes, 18(1), 35–62.
Koehn, P. (2005). Europarl: A parallel corpus for statistical machine translation. (pp. 79–86). Phuket, Thailand.
Kunz, K., & Lapshinova-Koltunski, E. (2015). Cross-linguistic analysis of discourse variation across registers. Nordic Journal of English Studies, 14(1), 258–288.
Lefer, M. -A., & Grabar, N. (2015).
Super-creative and over-bureaucratic: A cross-genre corpus-based study on the use and translation of evaluative prefixation in TED talks and EU parliamentary debates. Across Languages and Cultures, 16(2), 187–208.
Lefer, M. -A., & Vogeleer, S. (Eds.) (2014). Genre- and Register-related Discourse Features in Contrast [Special issue]. Languages in Contrast. 14(1).
Liu, D. (2008). Linking adverbials. An across-register corpus study and its implications. International Journal of Corpus Linguistics, 13(4), 491–518.
Macken, I., De Clercq, O., & Paulussen, H. (2011). Dutch Parallel Corpus: A balanced copyright-cleared parallel corpus. Meta, 56(2), 374–390.
Mann, W., & Thompson, S. (1992). Relational discourse structure: A comparison of approaches to structuring text by ‘Contrast’. In S. Hwang & W. Merrifield (Eds), Language in Context: Essays for Robert E. Longacre (pp. 19–45). Dallas, TX: SIL.
Murray, J. (1995). Logical connectives and local coherence. In R. Lorch & E. O’Brien (Eds.), Sources of Cohesion in Text Comprehension (pp. 107–125). Hillsdale, NJ: Erlbaum.
(1997). Connectives and narrative text. The role of continuity. Memory and Cognition, 25(2), 227–236.
Neumann, S. (2014). Cross-linguistic register studies. Theoretical and methodological considerations. Languages in Contrast, 14(1), 35–57.
O’Hagan, M. (2009). Computer-aided translation (CAT). In G. Saldanha & M. Baker (Eds.), Routledge Encyclopedia of Translation Studies (pp. 48–51). London: Routledge.
Pit, M. (2007). Cross-linguistic analyses of backward causal connectives in Dutch, German and French. Languages in Contrast, 7(1), 53–82.
Pokorn, N. (2010). Directionality. In Y. Gambier & L. Van Doorslaer (Eds.), Handbook of Translation Studies, vol. 21 (pp. 37–39). Amsterdam/Philadelphia: John Benjamins.
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., & Webber, B. (2008). The Penn Discourse TreeBank 2.0. In Proceedings of the 6th International Conference on Language Resources and Evaluation (pp. 2961–2968). Marrakech: European Language Resources Association.
Quirk, R., Greenbaum, S., Leech, G., & Svartvik, J. (1985). A Comprehensive Grammar of the English Language. London: Longman.
Nisioi, S., Rabinovich, E., Dinu, L. & Wintner, S. (2016). A corpus of native, non-native and translated texts. In N. Calzolari et al. (Eds.), Proceedings of the 10th Language and Resources Evaluation Conference (LREC) (pp. 4197–4201). Porotrož, Slovenia.
Sanders, T. (1997). Semantic and pragmatic sources of coherence: On the categorization of coherence relations in context. Discourse Processes, 24(1), 119–148.
Schäler, R. (2009). Localization. In G. Saldanha & M. Baker (Eds.), Routledge Encyclopedia of Translation Studies (pp. 157–161). London: Routledge.
Shreve, G. (1997). Cognition and the evolution of translation competence. In J. Danks, G. Shreve, S. Fountain & M. McBeath (Eds.), Cognitive Processes in Translation and Interpreting (pp. 120–136). Thousand Oaks: Sage.
Tirkonnen-Condit, S. (2005). The Monitor Model revisited: Evidence from process research. Meta, 50(2), 405–414.
Vinay, J. -P., & Darbelnet, J. (1995). Comparative Stylistics of French and English. A Methodology for Translation. Amsterdam/Philadelphia: John Benjamins.
Zufferey, S., (2016). Discourse connectives across languages. Factors influencing their explicit or implicit translation. Languages in Contrast, 16(2), 264–279.
Zufferey, S., & Cartoni, B. (2012). English and French causal connectives in contrast. Languages in Contrast, 12(2), 232–250.
(2014). A multifactorial analysis of explicitation in translation. Target, 26(3), 361–384.
Cited by (17)
Cited by 17 other publications
Farahani, Mehrdad Vasheghani & Reza Soltani
Gabarró-López, Sílvia
Oleškevičienė, Giedrė Valūnaitė, Deniz Zeyrek, Amalia Mendes & Dalia Gulbinskienė
2025. Translation Awareness of Sense Shifts of Coveying Discourse Relations Based on English TED Talks and Their Translation into Lithuanian, Portuguese and Turkish. In Technology and Innovation in Learning, Teaching and Education [Communications in Computer and Information Science, 2480], ► pp. 36 ff.
Penha-Marion, Laura, Gaëtanelle Gilquin & Marie-Aude Lefer
2024. The effect of directionality on lexico‑syntactic simplification in French><English student translation. In Constraints on Language Variation and Change in Complex Multilingual Contact Settings [Contact Language Library, 60], ► pp. 153 ff.
Çeltek, Aytaç
2024. The functions of “I think” in TED Talks and their Turkish translations. Pragmatics & Cognition 31:2 ► pp. 391 ff.
Ghane, Zahra & Mehrdad Vasheghani Farahani
Grabowski, Łukasz & Nicholas Groom
2022. Functionally-defined recurrent multi-word units in English-to-Polish translation. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 35:1 ► pp. 1 ff.
Farahani, Mehrdad Vasheghani & Reza Kazemian
Guryev, Alexander
Lefer, Marie-Aude
Oksefjell Ebeling, Signe & Jarle Ebeling
Shin, Jiyoung, Harris Hyun-soo Kim, Eun Mee Kim, Yookyung Choi & Eunhee Ha
Tatsenko, Nataliia, Vitalii Stepanov & Hanna Shcherbak
Crible, Ludivine, Ágnes Abuczki, Nijolė Burkšaitienė, Péter Furkó, Anna Nedoluzhko, Sigita Rackevičienė, Giedrė Valūnaitė Oleškevičienė & Šárka Zikánová
Hoek, Jet, Sandrine Zufferey, Jacqueline Evers-Vermeul & Ted J.M. Sanders
[no author supplied]
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
