How Comparable Can 'Comparable Corpora' Be?
Published online: 1 January 1997
https://doi.org/10.1075/target.9.2.05lav
Abstract
The development of a coherent methodology for corpus-based work in translation studies is essential for the evolution of this new field of research into a fully-fledged paradigm within the discipline. The design of a monolingual, multi-source-language comparable corpus of English as a resource for the systematic study of the nature of translated text can be regarded as an important step towards the development of such a methodology. This paper deals with a crucial and problematic aspect of the design of a monolingual comparable corpus, namely the achievement of an adequate level of comparability between its translational and non-translational components.
Résumé
A coherent methodology for the study of translation corpora is needed if such study is to attain the status of a paradigm within translation studies. The systematic study of translated text, based on a comparable corpus of English with several source languages, can be regarded as an important step in this process. This article examines an essential but problematic aspect of building monolingual comparable corpora, namely achieving an appropriate level of comparability between their translated and non-translated components.
Article outline
- 1. Introduction
- 2. The Design of the English Comparable Corpus (ECC)
- 2.1. Initial Definition of Comparable Corpus
- 2.2. Corpus Typology and Classification of the ECC
- 2.3. The Theoretical and Practical Motivation for the TEC Design
- 2.4. The Identification of the Text Categories of TEC
- 2.5. Selecting Suitable Texts for TEC
- 2.5.1. Selection Criteria
- 2.5.2. Selection Procedures
- 2.5.3. Copyright Permissions
- 2.5.4. The Acquisition of Texts for TEC
- 2.5.5. From a Printed or Electronic Text to a TEC Text
- 2.5.6. The File Structure of TEC
- 2.5.7. The Extra-Textual Attributes of TEC Texts
- TRANSLATOR(S)
- TRANSLATION
- TRANSLATION PROCESS
- SOURCE TEXT
- 2.6. Design of NON-TEC
- 2.6.1. The Dimensions of Comparability
- 2.6.2. Selection, Acquisition and Markup of NON-TEC Texts
- 2.6.3. The File Structure of NON-TEC
- 2.6.4. The Extra-Textual Attributes of NON-TEC Texts
- 3. Evaluation of Comparability in the Design of ECC
- 4. Use of the English Comparable Corpus
- Notes
