Article published in: Übersetzen und Dolmetschen im juristischen Bereich / Traduction et interprétation juridiques / Legal Translation and Interpreting: Technologie – Outsourcing – Veränderungen / Technologie – externalisation – transformations / Technology – Outsourcing – Shifts
Edited by Frans De Laet, Reiner Heard and Miodrag Vukčević
[Babel 66:2] 2020
pp. 294–310
Turns of the centuries. The Transkribus automated tool for recognition, transcription and translation of handwritten historical documents
A German-Serbian case study
Published online: 7 April 2020
https://doi.org/10.1075/babel.00159.vuk
Abstract
The translation of handwritten historical documents faces many challenges owing to variation in writing style
and local language and to inevitable language change. Even transliteration from Cyrillic to Latin characters is governed by
a standard, the bijective ISO 9. This paper introduces a number of tools offered by Transkribus for the
automated processing of documents, such as Handwritten Text Recognition (HTR) and Document Understanding, which are needed for the
translation of historical documents. Beyond the problem of decoding handwritten documents, written for example in Kurrentschrift,
archaic terminology, shifts in meaning, and variant spellings must also be considered when translating
texts from earlier centuries. Resolution strategies applied in a case study illustrate different methods for ensuring translation quality.
Keywords: HTR, Kurrentschrift, scripture, Transkribus, transliteration
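The bijective property mentioned in the abstract can be illustrated with a minimal sketch: each Cyrillic letter maps to exactly one Latin sign, so the Latin output can be converted back without loss. The table below reflects one reading of the ISO 9:1995 assignments for the 30 lowercase Serbian Cyrillic letters and should be checked against the standard before any real use. The names `ISO9_SR`, `to_latin` and `to_cyrillic` are hypothetical and are not part of Transkribus or of the article.

```python
# Lowercase Serbian Cyrillic -> Latin, following ISO 9:1995 as understood here
# (an assumption to verify against the standard). Three targets have no
# precomposed Unicode form and use a combining circumflex (U+0302).
ISO9_SR = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "\u0111",  # đ
    "е": "e", "ж": "\u017e", "з": "z", "и": "i", "ј": "\u01f0",      # ž, ǰ
    "к": "k", "л": "l", "љ": "l\u0302", "м": "m", "н": "n",
    "њ": "n\u0302", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t",
    "ћ": "\u0107", "у": "u", "ф": "f", "х": "h", "ц": "c",           # ć
    "ч": "\u010d", "џ": "d\u0302", "ш": "\u0161",                    # č, š
}

# A one-to-one table has no repeated target, so it can be inverted.
assert len(set(ISO9_SR.values())) == len(ISO9_SR)
LATIN_TO_CYR = {latin: cyr for cyr, latin in ISO9_SR.items()}


def to_latin(text: str) -> str:
    """Transliterate lowercase Cyrillic; other characters pass through."""
    return "".join(ISO9_SR.get(ch, ch) for ch in text)


def to_cyrillic(text: str) -> str:
    """Invert to_latin, trying two-code-point signs (e.g. l + U+0302) first."""
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if len(pair) == 2 and pair in LATIN_TO_CYR:
            out.append(LATIN_TO_CYR[pair])
            i += 2
        else:
            out.append(LATIN_TO_CYR.get(text[i], text[i]))
            i += 1
    return "".join(out)


sample = "љубав ћирилица ђак џеп њива"
assert to_cyrillic(to_latin(sample)) == sample
```

The round trip is only guaranteed for input that contains no Latin letters of its own, since the reverse step cannot tell a transliterated sign from an original one. Uppercase letters are left out of this sketch and pass through unchanged.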
Article outline
- 1. Cultural rapprochement process and digital era
- 2. Standards in German-Serbian cooperation in the administrative area
- 3. Historical handwritten documents in legal translation process
- 4. Conclusion
- Notes
- References
