Article In: Writing/Reading Interface
Edited by Terry Joyce and Constanze Weth
[Written Language & Literacy 28:1] 2025
► pp. 11–43
Frequency-dependent selection in the English spelling system across 1000 years of history
This content is being prepared for publication; it may be subject to changes.
Abstract
English orthography has largely been left to evolve freely across its thousand-year history with little top-down
design or reform. Yet modern English spelling has become standardized, even if its standardized spellings are
sometimes internally inconsistent. The relative freedom with which English orthography has been allowed to evolve makes it an
interesting test-case for exploring whether spelling standardization might arise through a process of positive frequency-dependent
selection: To maximize the chance of being understood, it pays for a writer to adopt spelling variants that occur with higher
frequency, leading to those forms becoming increasingly entrenched. To evaluate this possibility, we assembled a diachronic
dataset of English spelling variation, SpellEng, which we make freely available. SpellEng reports on the frequency of 112,080
spelling variants (across 32,264 lemmata) over the entire history of written English. We use this dataset along with a model of
frequency-dependent selection to estimate the extent to which English spelling standardization can be explained by such a process.
Our findings suggest that the past 1000 years of English spelling have been characterized by both positive and negative
frequency-dependent selection. We discuss how linguistic dynamics and external events have shaped these periods of stability and
change.
Article outline
- 1.Introduction
- 2.SpellEng: A diachronic dataset of English spelling variants
- 2.1Preparation of the corpus
- 2.2Selection of the lemmata (Step 1)
- 2.3Extraction of the OED data (Step 2)
- 2.4Calculating the frequency of spelling variants (Step 3)
- 2.5Structure of the dataset
- 2.6Validation of the dataset
- 2.7General properties of the dataset
- 3.Modeling frequency-dependent selection in English spelling
- 3.1Model
- 3.2Results
- 4.Discussion
- Acknowledgements
- Notes
References
References (51)
Allen, J. A. & B. C. Clarke. (1984). Frequency
dependent selection: Homage to E. B. Poulton. Biological Journal of the Linnean
Society 23(1): 15–18.
Baddeley, Susan & Anja Voeste (eds.) (2012). Orthographies
in early modern Europe. Berlin: De Gruyter Mouton.
Berg, Kristian & Mark Aronoff. (2017). Self-organization
in the spelling of English suffixes: The emergence of culture out of
anarchy. Language 93(1): 37–64.
. (2021). Is
the English writing system phonographic or lexical/morphological? A new look at the spelling of
stems. Morphology 31(3): 315–328.
Brysbaert, Marc%. (2019). How
many words do we read per minute? A review and meta-analysis of reading rate. Journal of Memory
and Language 1091: Article 104047.
Cahill, Lynne%. (2023). The
standardisation of spelling in Middle English: The case of said. Written
Language &
Literacy 26(1): 131–153.
Carr, Jon W. & Kathleen Rastle. (2024). Why
do languages tolerate heterography? An experimental investigation into the emergence of informative
orthography. Cognition 2491: Article
105809.
Castles, Anne & Kate Nation. (2006). How
does orthographic learning happen? In Sally Andrews (ed.), From
inkmarks to ideas: Current issues in lexical
processing, 151–179. London: Psychology Press.
Chouteau, Mathieu, Mónica Arias & Mathieu Joron. (2016). Warning
signals are under positive frequency-dependent selection in nature. Proceedings of the National
Academy of Sciences of the
USA 113(8): 2164–2169.
Condorelli, Marco%. (2021). Positional
spelling redistribution: Word-initial <u>/<v> and <i>/<j> in Early Modern English
(1500–1700). English Language and
Linguistics 25(4): 799–823.
. (2022). Standardising
English spelling: The role of printing in sixteenth and seventeenth-century graphemic
developments. Cambridge: Cambridge University Press.
Considine, John%. (2012). Standardization:
Dictionaries and the standardization of English. In Alexander Bergs & Laurel J. Brinton (eds.), English
historical
linguistics, 1050–1062. Berlin: De Gruyter.
Durkin, Philip%. (2016). Spelling
variation as documented in historical dictionaries. In Vivian Cook & Des Ryan (eds.), The
Routledge handbook of the English writing
system, 163–173. London: Routledge.
Evans, Melanie%. (2012). A
sociolinguistics of Early Modern spelling? An account of Queen Elizabeth I’s
correspondence. Studies in Variation, Contacts and Change in
English 101. [URL]
Faulkner, Mark%. (2023). Corpus
philology: Using the Dictionary of Old English to get bigger data for Old English spelling
variation. Digital Scholarship in the
Humanities 38(4): 1508–1521.
Glushko, R. J. (1979). The
organization and activation of orthographic knowledge in reading aloud. Journal of Experimental
Psychology: Human Perception and
Performance 5(4): 674–691.
Griffiths, Thomas L. & Michael L. Kalish. (2007). Language
evolution by iterated learning with Bayesian agents. Cognitive
Science 31(3): 441–480.
Guerrero Montero, Juan, Andres Karjus, Kenny Smith & Richard A. Blythe. (2023). Reliable
detection and quantification of selective forces in language change. Corpus Linguistics and
Linguistic
Theory 21(1): 21–73.
Karjus, Andres, Richard A. Blythe, Simon Kirby & Kenny Smith. (2020). Challenges
in detecting evolutionary forces in language change using diachronic
corpora. Glossa 5(1): Article
45.
Kirby, Simon, Mike Dowman & Thomas L. Griffiths. (2007). Innateness
and culture in the evolution of language. Proceedings of the National Academy of Sciences of
the
USA 104(12): 5241–5245.
Koller, Aaron%. (2024). The
evolving definition of “word” in early northwest semitic writing: From דברים to תיבות. Journal
of Near Eastern
Studies 83(1): 123–156.
Kuperman, Victor, Amalia Bar-On, Raymond Bertram, Rober Boshra, Avital Deutsch, Aki-Juhani Kyröläinen, Barbara Mathiopoulou, Gaisha Oralova & Athanassios Protopapas. (2021). Prevalence
of spelling errors affects reading behavior across languages. Journal of Experimental
Psychology:
General 150(10): 1974–1993.
Kupková, Tatiana%. (2023). Spelling
variation trends in early English printed texts (Doctoral
dissertation). Charles University, Prague, Czechia. [URL]
Nation, Kate, Nicola J. Dawson & Yaling Hsiao. (2022). Book
language and its implications for children’s language, literacy, and development. Current
Directions in Psychological
Science 31(4): 375–380.
Neijt, Anneke%. (2023). The
Dutch way of spelling. In Dimitrios Meletis, Martin Evertz-Rittich & Rebecca Treiman (eds.), Handbook
of Germanic writing systems. Berlin: De Gruyter.
Newberry, Mitchell G., Christopher A. Ahern, Robin Clark & Joshua B. Plotkin. (2017). Detecting
evolutionary forces in language
change. Nature 551(7679): 223–226.
Newberry, Mitchell G. & Joshua B. Plotkin. (2022). Measuring
frequency-dependent selection in culture. Nature Human
Behaviour 6(8): 1048–1055.
Pagel, Mark, Mark Beaumont, Andrew Meade, Annemarie Verkerk & Andreea Calude. (2019). Dominant
words rise to the top by positive frequency-dependent selection. Proceedings of the National
Academy of Sciences of the
USA 116(15): 7397–7402.
Rastle, Kathleen%. (2019). EPS
mid-career prize lecture 2017: Writing systems, reading, and language. Quarterly Journal of
Experimental
Psychology 72(4): 677–692.
Rastle, Kathleen, Jonathan Harrington & Max Coltheart. (2002). 358,534
nonwords: The ARC Nonword Database. The Quarterly Journal of Experimental Psychology Section
A 55(4): 1339–1362.
Reali, Florencia & Thomas L. Griffiths. (2010). Words
as alleles: Connecting language evolution with Bayesian learners to models of genetic
drift. Proceedings of the Royal Society B: Biological
Sciences 277(1680): 429–436.
Saenger, Paul%. (1997). Space
between words: The origins of silent
reading. Stanford: Stanford University Press.
Scholfield, Phil%. (2016). Modernization
and standardization since the seventeenth century. In Vivian Cook & Des Ryan (eds.), The
Routledge handbook of the English writing
system, 143–161. London: Routledge.
Sebba, Mark%. (2009). Sociolinguistic
approaches to writing systems research. Writing Systems
Research 1(1): 35–49.
Seymour, Philip H. K., Mikko Aro & Jane M. Erskine. (2003). Foundation
literacy acquisition in European orthographies. British Journal of
Psychology 94(2): 143–174.
Simplified Spelling Board (1920). Handbook
of simplified spelling. New York: Simplified Spelling Board.
Smith, Jeremy%. (1996). An
historical study of English: Function, form and
change. London: Routledge.
Stenroos, Merja & Jeremy J. Smith. (2016). Changing
functions: English spelling before 1600. In Vivian Cook & Des Ryan (eds.), The
Routledge handbook of the English writing
system, 125–141. London: Routledge.
Stickel, Gerhard%. (2012). The
German spelling reform—the end of a long struggle. Linguistica
Lettica 201: 5–17.
Svensson, Erik I. & Tim Connallon. (2019). How
frequency-dependent selection affects population fitness, maladaptation and evolutionary
rescue. Evolutionary
Applications 12(7): 1243–1258.
Ulicheva, Anastasia, Hannah Harvey, Mark Aronoff & Kathleen Rastle. (2020). Skilled
readers’ sensitivity to meaningful regularities in English
writing. Cognition 1951: Article
103810.
Villa, Laura%. (2015). Official
orthographies, spelling debates and nation-building projects after the fall of the Spanish
Empire. Written Language &
Literacy 18(2): 228–247.
Wójcik, Jerzy%. (2021). Measuring
internal spelling variation of an Early Modern English text. Linguistica
Silesiana 421: 107–123.
