In:Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
Edited by Julia Lavid-López, Carmen Maíz-Arévalo and Juan Rafael Zamorano-Mansilla
[Benjamins Translation Library 158] 2021
► pp. 125–146
Chapter 5TAligner 3.0
A tool to create parallel and multilingual corpora
Published online: 8 December 2021
https://doi.org/10.1075/btl.158.05san
https://doi.org/10.1075/btl.158.05san
Abstract
In 2010 the TRALIMA/ITZULIK research group began developing a tool to create parallel and multilingual corpora. This would serve (among other functions) to explain, describe, and predict translational behaviour in terms of translational norms and laws (Toury 2012). Instead of relying on already existing resources, it was decided to invest in the creation of a new tool. As a result, TAligner 3.0 enables users to create corpora simultaneously aligning any number of texts (as required by the research) as well as compile corpora consisting of different text types. Furthermore, unlike most tools the aligned corpora can be queried within the program. The aim of this paper is to present, firstly, TAligner 3.0 and secondly, two parallel corpora built using this tool.
Article outline
- 1.Introduction
- 2.Evolution of TAligner 3.0
- 2.1TRACE Corpus Tagger/Aligner 1.0©
- 2.2TRACEAligner 2.0
- 2.3TAligner 3.0
- 3.Building and querying corpora in TAligner 3.0
- 3.1Building corpora
- 3.2Querying corpora
- 4.Corpora created using TAligner 3.0
- 4.1Narrative corpora
- 4.2Theatre corpora
- 5.Conclusions
Acknowledgements Notes References
References (39)
Anthony, Laurence. 2013. “A critical look at software tools in corpus linguistics.” Linguistic Research 30 (2): 141–161.
. 2017. “Introducing corpora and corpus tools into the technical writing classroom through Data-Driven Learning (DDL).” In Discipline-Specific Writing: Theory into practice, ed. by John Flowerdew, and Tracey Costley, 162–180. Abingdon: Routledge.
Arrula, Garazi. 2018. Autoitzulpenaren teoria eta praktika Euskal Herrian / Theory and practice of self-translation in the Basque Country. Doctoral dissertation. [URL]
Bandín, Elena. 2007. Traducción, recepción y censura de teatro clásico inglés en la España de Franco. Estudio descriptivo-comparativo del Corpus TRACEtci (1939–1985). León: Universidad de León. Doctoral dissertation. [URL]
Barambones, Josu. 2009. La traducción audiovisual en ETB-1: Estudio descriptivo de la programación infantil y juvenil. Doctoral dissertation. [URL]
Cabanillas González, Candelas. 2016. La traducción audiovisual en ETB2: Estudio descriptivo del género Western. Vitoria-Gasteiz: Universidad del País Vasco UPV/EHU. Doctoral dissertation. [URL]
Gutiérrez Lanza, Camino, Elena Bandín Fuertes, José Enrique García González, and Sergio Lobejón Santos. 2015. “Desarrollo de software de etiquetado y alineación textual: TRACE Corpus Tagger/Aligner 1.0©.” II Congreso Internacional de Humanidades Digitales Hispánicas: Innovación, globalización e impacto. Madrid: UNED. [URL]
Hofland, Knut and Johansson, Stig. 1998. “The Translation Corpus Aligner: A program for automatic alignment of parallel texts”. In Corpora and Cross-linguistic research. Theory, Method, and Case Studies, eds. Stig Johansson and Signe Oksefjell, 87–100. Amsterdam; Atlanta: Rodopi.
Kenny, Dorothy. 2000. “Translators at a play: exploitations of collocational norms in German-English translations.” In Working with German corpora, ed. by Bill Dodd, 143–160. Birmingham: The University of Birmingham Press.
Lobejón-Santos, Sergio. 2013. Traducción inglés-español y censura de textos poéticos (1939–1978): TRACEpi (1939–1978). León: Universidad de León. Doctoral dissertation. [URL]
Manso-Osma, Aritz. 2018. Análisis de la traducción de compuestos alemanes del lenguaje científico-técnico mediante corpus paralelo de aprendices. BA dissertation [URL]
Manterola, Elizabete. 2011. Euskal literatura beste hizkuntza batzuetara itzulia. Bernardo Atxagaren lanen itzulpen moten arteko alderaketa. Doctoral dissertation. [URL]
McEnery, Tony, and Andrew Hardie. 2012. Corpus linguistics: Method, theory and practice. Cambridge: Cambridge University Press.
Merino-Álvarez, Raquel. 1992. “Rewriting for the Spanish stage.” KOINÉ. Annali della Scuola Superiore per Interpreti e Traduttori San Pellegrino 2 (1–2): 283–289.
. 1994a. “La réplica como unidad de descripción y comparación de textos dramáticos traducidos.” In IV Encuentros Complutenses en torno a la traducción, ed. by Margit Raders, and Rafael Martín-Gaitero, 397–404. Madrid: Editorial Complutense.
. 1994b. Traducción, tradición y manipulación. Teatro inglés en España 1950–1990. León: Universidad de León / Lejona: Universidad del País Vasco.
. 2017. “Traducción y censura: investigaciones sobre la cultura traducida inglés-español (1938–1985).” Represura. Revista de Historia Contemporánea española en torno a la represión y la censura aplicadas al libro, 2: 139–163. [URL]
Merino-Álvarez, Raquel, and Olaia Andaluz-Pinedo. 2017. “Peter Shaffer en la cultura española.” Creneida. Anuario de Literaturas Hispánicas, 5: 239–278. [URL]
. 2020. “60 años de Beckett en España: Esperando a Godot, de la censura a la audiodescripción.” In Samuel Beckett: Literatura y Traducción; Literature and Translation; Littérature et Traduction, ed. by Bernardo Santano, 37–57. Bern: Peter Lang.
Pérez, M. 2004. Traducciones censuradas de teatro norteamericano en la España de Franco (1939–1963). Bilbao: Universidad del País Vasco.
Riobello Leiva, Janire. 2016. Análisis de la traducción alemán-inglés y alemán-español de las partículas modales en obras de Kafka. BA dissertation. [URL]
. 2018. Creación de un framework para el tratamiento de corpus lingüísticos. León: Universidad de León, Área de Publicaciones.
Sanz-Villar, Zuriñe. 2015. Unitate fraseologikoen itzulpena: alemana-euskara. Literatur testuen corpusean oinarritutako analisia. Doctoral dissertation. [URL]
. 2018. “Diseño, descripción y análisis de un corpus multilingüe (alemán-español-euskera).” TRANS 22: 133–148.
. 2019. “An Overview of Basque Corpora and the Extraction of Certain Multi-Word Expressions from a Translational Corpus.” In Parallel corpora for contrastive and translation studies: New resources and applications, ed. by Irene Doval Reixa, and M. T. Sánchez Nieto, 233–247. Amsterdam/Philadelphia: John Benjamins.
. 2012. Descriptive Translation Studies and Beyond. Amsterdam/Philadelphia: John Benjamins.
Uribarri, Ibon. 2016. “Taligner: itzulpen corpus eleaniztunak sortzeko tresna.” Senez, 47: 251–265. [URL]
Xiao, Richard, and Ming Yue. 2009. “Using corpora in translation studies: The state of the art”. In Contemporary Corpus Linguistics, ed. by Paul Baker, 237–261. London; New York: Continuum.
Zubillaga, Naroa. 2013. Alemanetik euskaratutako haur- eta gazte-literatura: zuzeneko nahiz zeharkako itzulpenen azterketa corpus baten bidez. Doctoral dissertation. [URL]
Zubillaga, Naroa, Zuriñe Sanz-Villar, and Ibon Uribarri. 2015. “Building a trilingual parallel corpus to analyse literary translations from German into Basque.” In New directions in corpus-based translation studies, ed. by Claudio Fantinuoli, and Federico Zanettin, 71–92. Berlin: Language Science Press. [URL]
Cited by (2)
Cited by two other publications
Gutiérrez Lanza, Camino & Cristina Gómez Castro
Gutiérrez Lanza, Camino
2023. Film dialogue synchronization and statistical dubbese. In Corpus Use in Cross-linguistic Research [Studies in Corpus Linguistics, 113], ► pp. 124 ff.
This list is based on CrossRef data as of 3 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
