Article published in: International Journal of Corpus Linguistics
Vol. 21:4 (2016), pp. 559–571
The HeliPaD
A parsed corpus of Old Saxon
Published online: 6 December 2016
https://doi.org/10.1075/ijcl.21.4.05wal
This short paper introduces the HeliPaD, a new parsed corpus of Old Saxon (Old Low German). The corpus is annotated according to the standards of the Penn Corpora of Historical English, enriched with lemmatization and additional morphological attributes as well as textual and metrical annotation. The paper provides an overview of the corpus's main features and compares it to existing resources such as the Deutsch Diachron Digital version of the Old Saxon Heliand, which forms part of the Referenzkorpus Altdeutsch. It closes with a roadmap for planned future expansions.
Keywords: parsed corpus, historical corpus, Old Saxon, Low German
