In:Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
Edited by Julia Lavid-López, Carmen Maíz-Arévalo and Juan Rafael Zamorano-Mansilla
[Benjamins Translation Library 158] 2021
► pp. 75–100
Chapter 3Word alignment in a parallel corpus of Old English prose
From asymmetry to inter-syntactic annotation
Published online: 8 December 2021
https://doi.org/10.1075/btl.158.03mar
https://doi.org/10.1075/btl.158.03mar
Abstract
This chapter proposes a model of syntactic annotation for the Parallel Corpus of Old English Prose, an aligned corpus of Old English and Present Day English texts. The research focuses on areas of syntactic divergence between the aligned texts. Syntactic divergence is described in terms of four types of alignment asymmetry (markedness, constituency, order, and configuration) and is represented by means of two components: a structural description and a dependency tree. The main conclusion is that these two components constitute a historical micro-grammar that identifies stability and change with respect to specific categories and constructions.
Keywords: parallel corpus, alignment, syntactic annotation, asymmetry, Old English
Article outline
- 1.Introduction
- 2.Background
- 3.The design of an aligned parallel corpus of Old English prose
- 4.Inter-syntax and asymmetry
- 5.The scope and components of the inter-syntax
- 6.Conclusion and further research
The following abbreviations are used in this section References
References (48)
Attenborough, Frederick L. (ed. and trans.). 1922. The Laws of the Earliest English Kings. Cambridge: Cambridge University Press.
Biber, D. 2007. “Representativeness in corpus design”, in: W. Teubert and R. Krishnamurthy (eds.), Corpus linguistics: Critical concepts in linguistics (Vol. II). London: Routledge. 134–165.
Cameron, Angus, Ashley C. Amos, and Antonette diPaolo Healey (eds.). 2018. The Dictionary of Old English in Electronic Form A-I. Toronto: Dictionary of Old English Project, Centre for Medieval Studies, University of Toronto.
Enrique-Arias, Andrés. 2013. “On the usefulness of using parallel texts in diachronic investigations: insights from a parallel corpus of Spanish medieval Bible translations.” In New Methods in Historical Corpora, ed. by Paul Durrell, Martin Scheible, Silke Whitt, and Richard J. Bennett, 105–116. Tübinguen: Gunter Narr.
Faaß, G. 2017. “Lexicography and corpus linguistics.” In The Routledge Handbook of Lexicography, ed. by Pedro A. Fuertes-Olivera, 123–137. Abingdon: Routledge.
Fernández Cuesta, Julia, Nieves Rodríguez Ledesma, and Gloria Alvárez Benito (eds. and trans.). 1997. Prosa anglosajona. Sevilla: Universidad de Sevilla.
García Fernández, Laura. 2018. “Preterite-present verb lemmas from a corpus of Old English.” In Verbs, Clauses and Constructions: Functional and Typological Approaches, ed. by Pilar Guerrero Medina, Roberto Torre Alonso, and Raquel Vea Escarza, 59–76. Newcastle: Cambridge Scholars Publishing.
Godden, Malcom. 2016. The Old English History of the World. An Anglo-Saxon Rewriting of Orosius. Cambridge, Massachusetts: Dumbarton Oaks.
Godden, Malcom, Susan Irvine (eds.), with Mark Griffith, and Rohini Jayatilaka. 2009. The Old English Boethius. Volume II. Oxford: Oxford University Press.
Hanks, Patrick. 2012. “Corpus Evidence and Electronic Lexicography.” In Electronic Lexicography, ed. by Sylviane Granger, and Magali Paquot, 57–82. Oxford University Press.
Healey, A. diPaolo (ed.) with John P. Wilkin, and Xin Xiang. 2004. The Dictionary of Old English Web Corpus. Toronto: Dictionary of Old English Project, Centre for Medieval Studies, University of Toronto.
Heid, Ulrich. 2008. “Corpus linguistics and lexicography.” In Corpus Linguistics. An International Handbook (Volume 1), ed. by Anke Lüdeling and Merja Kytö, 132–153. Berlin: Mouton de Gruyter.
Hogg, Richard M. and Robert D. Fulk. 2011. A Grammar of Old English. Volume 2: Morphology. Oxford: Blackwell.
Johnson, B. 2009. Using the Levenshtein algorithm for automatic lemmatization in Old English. MA Thesis, The University of Georgia.
K Aijmer, B Altenberg, M Johansson. 1996. Languages in contrast: papers from a Symposium on Text-based Cross-linguistic Studies, Lund, 4–5 March 1994. Lund: Lund University Press.
Krause, Thomas, and Amir Zeldes. 2016. ANNIS3: “A new architecture for generic corpus query and visualization.” Literary and Linguistic Computing 31(1): 118–139.
Kübler, Sandra, and Heike Zinsmeister. 2014. Corpus Linguistics and Linguistically Annotated Corpora. London: Bloomsbury.
Leonard, Henry C. (ed. and trans.). 1881. A Translation of the Anglo-Saxon Version of St. Mark’s Gospel. London: James Clarke & Co.
Martín Arista, Javier. 2000a. “Sintaxis medieval inglesa I: complementación, caso y sintaxis verbal.” In Lingüística histórica inglesa, ed. by Isabel de la Cruz Cabanillas and Javier Martín Arista, 224–312. Barcelona: Ariel.
. 2000b. “Sintaxis medieval inglesa II: funciones, construcciones y orden de constituyentes.” In Lingüística histórica inglesa, ed. by Isabel de la Cruz Cabanillas and Javier Martín Arista, 313–377. Barcelona: Ariel.
. 2013. Nerthus. “Lexical Database of Old English: From word-formation to meaning construction.” Lecture delivered at the Research Seminar, School of English, University of Sheffield.
. 2017a. “Toward a parallel corpus of Old English prose. Preliminary questions and initial design.” Lecture delivered at the Departmental Colloquium Series at the Department of Language and Linguistic Science, University of York.
. 2017b. “The Nerthus Project at the crossroads. From lexical database to parallel corpus of Old English.” Lecture delivered at the 2017 International Conference of SELIM, held at the University of Málaga.
. 2018. “The design and implementation of a pilot parallel corpus of Old English.” In Aspects of Medieval English Language and Literature, ed. by Michiko Ogura and Hans Sauer, 111–134. Berlin: Peter Lang.
Martín Arista, Javier, and Ana E. Ojanguren López. 2018. “Doing Electronic Lexicography of Old English with a Knowledge-Base.” Workshop delivered at the CLASP Project (University of Oxford).
. 2003. “Corpus linguistics.” In Oxford handbook of computational linguistics, ed. by Ruslan Mitkov, 448–463. Oxford: Oxford University Press.
McEnery, Tony & Richard Xiao. 2007a. Parallel and comparable corpora: What are they up to? Incorporating Corpora: Translation and the Linguist. Translating Europe. Clevedon: Multilingual Matters.
McEnery, Tony, and Richard Xiao. 2007b. “Parallel and Comparable Corpora-The State of Play.” In Corpus-Based Perspectives in Linguistics, ed. by Yuji Kawaguchi, Toshihiro Takagaki, Nobuo Tomimori, and Yoichiro Tsuruga, 131–146. Amsterdam: John Benjamins.
Metola Rodríguez, Darío. 2017. “Strong Verb Lemmas from a Corpus of Old English. Advances and issues.” Revista de Lingüística y Lenguas Aplicadas 12: 65–76.
Novo Urraca, Carmen, and Ana E. Ojanguren López. 2018. “Lemmatising Treebanks. Corpus Annotation with Knowledge Bases.” RAEL 17: 99–120.
Pintzuk, Susan, and Leendert Plug (comp.). 2001. The York-Helsinki Parsed Corpus of Old English Poetry. Department of Language and Linguistic Science, University of York.
Ringe, Don, and Ann Taylor. 2014. A Linguistic History of English Volume II: The Development of Old English. Oxford: Oxford University Press.
Rissanen, Matti, Merja Kytö, L. Kahlas-Tarkka, Matti Kilpiö, Saara Nevanlinna, Irma Taavitsainen, Tertu Nevalainen and Helena Raumolin-Brunberg (comp.). 1991. The Helsinki Corpus of English Texts. Department of Modern Languages, University of Helsinki.
Schierholz, Stefan J. 2015. “Methods in Lexicography and Dictionary Research.” Lexikos: 25(1): 323–352.
Scrivner, Olga. 2015. “Tools for Digital Humanities: Parallel Corpus and Visualization.” Paper presented at the Conference Corpora 2015, held at Saint-Petersburg, Russia.
Sweet, Henry (ed.). 1881. King Alfred’s West-Saxon Version of Gregory’s Pastoral Care. London: Trübner & Co.
Taylor, Ann, Anthony Warner, Susan Pintzuk and Frank Beths (comp.) 2003. The York-Toronto-Helsinki Parsed Corpus of Old English Prose. Department of Language and Linguistic Science, University of York.
The Holy Bible Translated from the Latin Vulgate (Douay Rheims Version) 1971 (1899). Rpt. Rockford, Illinois: Tan books.
Thorpe, Benjamin. (ed. and trans). 1844. The Homilies of the Anglo-Saxon Church. Volume I. London: Red Lion Court.
Cited by (1)
Cited by one other publication
Ojanguren López, Ana Elvira
2025. Old English perspectives on the complement shift. Journal of Historical Linguistics 15:1 ► pp. 44 ff.
This list is based on CrossRef data as of 3 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
