Article published In: Studies in Language
Vol. 50:2 (2026) ► pp.259–293
Distance-based approach reveals convergence effects in word order among the languages of the Circum-Baltic linguistic area
Published online: 1 December 2025
https://doi.org/10.1075/sl.24061.ser
https://doi.org/10.1075/sl.24061.ser
Abstract
We probe a new approach to linguistic areas. Instead of similarity of a feature across languages
of the area, we focus on its adaptation to the area. Adaptation is a set of changes and/or retentions in a
language towards, but not necessarily into, similarity with the other languages of the area. Technically, we estimate adaptation
by comparing the distance between the focus language from the area and a geographically and genealogically closely related
language outside of the area (its benchmark language) as tertium comparationis. If the focus language is closer
to the area than its benchmark, we interpret it as evidence for adaptation towards the other languages of the area. Adaptation
includes all possible scenarios of change and non-change. We test word order and find that all languages of the CB area show
effects of adaptation, with Baltic Romani and both Baltic languages being in the center of the area.
Article outline
- 1.Introduction
- 2.Why do we need a new approach to establishing convergence in linguistic areas?
- 3.The distance-based approach
- 4.Word order
- 5.Data
- 6.Computation
- 7.Results
- 8.Conclusions
- Acknowledgements
- Notes
- List of abbreviations
References
References (86)
Aikhenvald, Alexandra Y. & R. M. W. Dixon. 2001. Introduction. In Alexandra Y. Aikhenvald & R. M. W. Dixon (eds), Areal
diffusion and genetic inheritance: Problems in comparative
linguistics, 1–26. Oxford: Oxford University Press.
Aktaṣ, Berfin, Maria Ovsjannikova & Ilja A. Seržant. 2025. Data
& scripts for the paper ‘Distance-based approach to the Circum-Baltic Area’ by Ilja A. Seržant, Berfin Aktaṣ, Masha
Ovsjannikova, Manfred Stede. Studies in Language [Data
set]. Zenodo.
Becker, Laura & Matías Guzmán Naranjo. 2025. Replication
and methodological robustness in quantitative typology. Linguistic
Typology 29(3). 463–505.
Borin, Lars & Anju Saxena, eds. 2013. Approaches
to measuring linguistic differences. Berlin: De Gruyter Mouton.
Bowern, Claire. 2013. Relatedness
as a factor in language contact. Journal of Language
Contact 61. 411–432.
Breu, Walter. 1994. Der
Faktor Sprachkontakt in einer dynamischen Typologie des
Slavischen. In Hans Robert Mehlig (ed.), Slavistische
Linguistik
1993, 41–64. München: Sagner.
Bužarovska, Eleni. 2020. The
contact hypothesis revised: DOM in the South Slavic periphery. Journal of Language
Contact 131. 57–95.
Campbell, Lyle. 1985. Areal
linguistics and its implications for historical linguistic
theory. In Jacek Fisiak (ed.), Proceedings
of the Sixth International Conference of Historical
Linguistics, 25–56. Amsterdam: John Benjamins.
. 2006. Introduction. In Yaron Matras, April McMahon & Nigel Vincent (eds.), Linguistic
areas convergence in historical and typological
perspective, 1–31. Basingstoke: Palgrave Macmillan.
Dahl, Östen & Maria Koptjevskaja-Tamm. 1992. Language
typology around the Baltic Sea: A problem inventory. Papers from the Institute of
Linguistics. Stockholm: University of Stockholm.
Dedio, Stefan, Peter Ranacher & Paul Widmer. 2019. Evidence
for Britain and Ireland as a linguistic
area. Language 95(3). 498–522.
Di Garbo, Francesca & Ricardo Napoleão de Souza. 2023. A
sampling technique for worldwide comparisons of contact scenarios. Linguistic
Typology 27(3). 553–589.
Dickey, Stephen M. 2000. Parameters of Slavic aspect. A
cognitive approach. Stanford: CSLI Publications.
Downing, Pamela. 1995. Word
order in discourse: By way of introduction. In Pamela Downing & Michael Noonan (eds), Word
order in
discourse, 1–28. Amsterdam: Benjamins.
Dressler, Wolfgang. 1971. Zur
Rekonstruktion der indogermanischen Syntax. Kuhns
Zeitschrift 851. 5–22.
Dryer, Matthew S. 1989. Discourse-governed word order
and word order typology. Belgian Journal of
Linguistics 4(1). 69–90.
Dryer, Matthew. S. 1997. On the six-way word order
typology. Studies in
Language 21(I). 69–103.
Dryer, Matthew S. 2013. Order of subject, object and
verb. In Matthew S. Dryer & Martin Haspelmath (eds.), WALS
Online (v2020.3) [Data
set]. Zenodo. (Available online at [URL], Accessed
on 2024-07-10.)
Epps, Patience, John Huehnergard and Na’ama Pat-El. 2013. Introduction.
Contact among genetically related languages. Journal of Language
Contact 61. 209–219.
Erker, Aksana. 2014. Ways
of expressing the past tense in Belarusian mixed subdialects spoken in the Baltic-Slavic contact
zone. In Ilja A. Seržant & Björn Wiemer (eds.), Contemporary
approaches to dialectology: The area of North, Northwest Russian and Belarusian
vernaculars, 130–149. Bergen: John Grieg AS.
Gijn, Rik van. 2020. Separating layers of
information: The anatomy of contact zones. In Norval Smith, Enoch O. Aboh & Tonjes Veenstra (eds.), Advances
in contact linguistics: In honour of Pieter Muysken [Contact Language Library
57], 162–178. Amsterdam: John Benjamins.
Gijn, Rik van & Max Wahlström. 2023. Linguistic
areas. In Rik van Gijn, Hanna Ruch, Max Wahlström & Anja Hasse (eds.), Language
contact: Bridging the gap between individual interactions and areal
pattern, 179–219. Berlin: Language Science Press.
Gumperz, John J. & Robert Wilson. 1971. Convergence
and creolization: A case from the Indo-Aryan/Dravidian border in
India. In Dell H. Hymes (ed.), Pigdinization
and creolization of
languages, 151–167. Cambridge: Cambridge University Press.
Haig, Geoffrey. 2001. Linguistic
diffusion in present-day east Anatolia: From top to bottom. In Alexandra Y. Aikhenvald & Robert M. W. Dixon (eds.), Areal
diffusion and genetic inheritance: Problems in comparative
linguistics, 195–224. Oxford: Oxford University Press.
Hammarström, Harald, Robert Forkel, Martin Haspelmath & Bank, Sebastian. 2024. Glottolog 5.11. Leipzig: Max Planck Institute for Evolutionary Anthropology. (Available online at [URL], Accessed on 2025-02-12.)
Haspelmath, Martin. 2001. The
European linguistic area: Standard Average European. In Martin Haspelmath, Ekkkehard König, Wulf Oesterreicher & Wolfgang Raible (eds.), Language
typology and language
universals, 1492–1510. Berlin: De Gruyter Mouton.
Heeringa, Wilbert & John Nerbonne. 2001. Dialect
areas and dialect continua. Language Variation and
Change 13(3). 375–400.
Jaeger, Florian, Peter Graff, William Croft & Daniel Pontillo. 2011. Mixed
effect models for genetic and areal dependencies in linguistic typology. Linguistic
Typology 15(2). 281–319.
Jakobson, Roman. 1931[1971]. Über
die phonologischen Sprachbünde, Travaux du cercle linguistique de
Prague 41, 234–240. Cited after
the reprint in: Jakobson, Roman. 1971. Selected
writings I: Phonological studies, 137–143. The Hague: De Gruyter Mouton.
Kallio, Petri. 2015. The
language contact situation in prehistoric Northeastern
Europe. In Robert Mailhammer, Theo Vennemann and & Birgit Anette Olsen, (eds.), The
linguistic roots of
Europe, 77–102. Copenhagen: Museum Tusculanum Press, University of Copenhagen.
Kassambara, Alboukadel. 2023. _rstatix:
Pipe-friendly framework for Basic statistical tests_. R package version
0.7.2, 〈[URL]〉.
Khomchenkova, Irina A., Marija D. Vojejkova, Natalia M. Zaika, Maxim L. Kisilier, Georgij A. Mol’kov & Anna Ju. Urmanchieva (eds.), Studies
in the theory of grammar, issue 9. The parallel corpus as a grammar database and the New Testament as a parallel
corpus. Acta Linguistica Petropolitana. Transactions of the Institute for Linguistic Studies. Vol. 19 part 31.
Koptjevskaja-Tamm, Maria & Bernhard Wälchli. 2001. The
Circum-Baltic languages: An areal-typological approach. In Östen Dahl & Maria Koptjevskaja-Tamm (eds.), The
Circum-Baltic languages. Typology and contact. Vol. 2: Grammar and
typology, 615–750. Amsterdam: John Benjamins.
Lang, Valter. 2016. Early
Finnic-Baltic contacts as evidenced by archaeological and linguistic data. Journal of Estonian
and Finno-Ugric
Linguistics 71. 11–38.
Levshina, Natalia. 2015. How
to do linguistics with R: Data exploration and statistical
analysis. Amsterdam: John Benjamins.
Levshina, Natalia, Namboodiripad, Savithry, Allassonnière-Tang, Marc, Kramer, Mathew, Talamo, Luigi, Verkerk, Annemarie, Wilmoth, Sasha, Rodriguez, Gabriela Garrido, Gupton, Timothy Michael, Kidd, Evan, Liu, Zoey, Naccarato, Chiara, Nordlinger, Rachel, Panova, Anastasia and Stoynova, Natalia. 2023. Why
we need a gradient approach to word
order. Linguistics 61(4), 825–883.
Maddieson, Ian. 2013. Tone. In Matthew S. Dryer & Martina Haspelmath (eds.), WALS
Online (v2020.3) [Data
set]. Zenodo. (Available online at [URL], Accessed
on 2024-07-09).
Mair, Patrick., Patrick J. F. Groenen & Jan De Leeuw. 2022. More
on multidimensional scaling in R: smacof version 2. Journal of Statistical
Software 102(10), 1–47.
Massicotte, Philippe & Andy South A. 2023. _rnaturalearth: World map
data from natural earth_. R package version 1.0.1, 〈[URL]〉.
Mathiassen, Terje. 1985. A
discussion of the notion ‘Sprachbund’ and its application in the case of the languages in the eastern Baltic
area, International Journal of Slavic
Philology 21/221, 273–281.
. 2007. The
borrowability of structural categories. In Yaron Matras & Jeanette Sakel (eds.), Grammatical
borrowing in cross-linguistic
perspective, 31–73. Amsterdam: John Benjamins.
Mayer, Thomas & Michael Cysouw. 2014. Creating
a massively parallel Bible corpus. Proceedings of the International Conference on Language
Resources and
Evaluation (LREC), Reykjavik, 3158–3163. [URL]
Mithun, Marianne. 1987. Is
basic word order universal? In Russel S. Tomlin (ed.), Coherence
and grounding in discourse: Outcome of a symposium, Eugene, Oregon, June 1984 Typological studies in
language, 281–328. Amsterdam: John Benjamins.
Moroz, George. 2017. _lingtypology:
Easy mapping for linguistic typology_. 〈[URL]〉.
Nau, Nicole. 1996. Ein
Beitrag zur Arealtypologie der Ostseeanrainersprachen. In Boretzky, Norbert (ed.), Areale,
Kontakte, Dialekte, Sprachen und ihre Dynamik in mehrsprachigen Situationen. [Bochum-Essener Beitrage
zur Sprachwandelforschung,
24], 51–67. Bochum: Brockmeyer.
Nichols, Johanna. 1992. Linguistic
diversity in space and time. Chicago: University of Chicago Press.
Oksanen, Jari, Gavin L. Simpson, F. Guillaume Blanchet, Roeland Kindt, Pierre Legendre, Peter R. Minchin, R. B. O’Hara, Peter Solymos, M. Henry, H. Stevens, Eduard Szoecs, Helene Wagner, Matt Barbour, Michael Bedward, Ben Bolker, Daniel Borcard, Tuomas Borman, Gustavo Carvalho, Michael Chirico, Miquel De Caceres, Sebastien Durand, Heloisa Beatriz Antoniazi Evangelista, Rich FitzJohn, Michael Friendly, Brendan Furneaux, Geoffrey Hannigan, Mark O. Hill, Leo Lahti, Cameron Martino, Dan McGlinn, Marie-Helene Ouellette, Eduardo Ribeiro Cunha, Tyler Smith, Adrian Stier, Cajo J. F. Ter Braak, James Weedon. 2025. _vegan:
Community Ecology Package_. R package version 2.7–2, 〈[URL]〉.
Pebesma, Edzer. 2018. Simple
features for R: Standardized support for spatial vector data. The R
Journal 10 (1), 439–446,
Pebesma, Edzer & Roger Bivand. 2023. Spatial
data science: With applications in R. Chapman and Hall/CRC.
Plungian, Vladimir A. 2023. The parallel corpus as a grammar
database and the New Testament as a parallel corpus
(Preface). In Irina A. Khomchenkova, Maria D. Vojejkova, Natalia M. Zaika, Maxim L. Kisilier, Georgij A. Mol’kov & Anna Ju. Urmanchieva (eds.), Studies
in the theory of grammar, issue 9. The parallel corpus as a grammar database and the New Testament as a parallel
corpus. Acta Linguistica Petropolitana. Transactions of the Institute for Linguistic
Studies 19:3. 15–38.
Pozharickaja, Sofia K. 2011. On the areal distribution of
participial forms in Russian dialects. In Ilja A. Seržant & Björn Wiemer (eds.), Contemporary
approaches to dialectology: The area of North, Northwest Russian and Belarusian
vernaculars, 109–129. Bergen: John Grieg AS.
Ranacher, Peter, Nico Neureiter, Rik van Gijn, Barbara Sonnenhauser, Anastasia Escher, Robert Weibel, Pieter Muysken & Balthasar Bickel. 2021. Contact-tracing
in cultural evolution: A Bayesian mixture model to detect geographic areas of language
contact. Journal of The Royal Society Interface. Royal
Society 18(181). 20201031.
Seifart, Frank. 2015. Does
structural-typological similarity affect borrowability? Language Dynamics and
Change 5(1). 92–113.
Selting, Margret & Elizabeth Couper-Kuhlen. 2000. Argumente
für die Entwicklung einer ‘interaktionalen Linguistik 1. Gesprächsforschung —
Online-Zeitschrift zur verbalen
Interaktion 11, 76–95. ([URL])
Seržant, Ilja A. 2015. Dative experiencer constructions
as a Circum-Baltic isogloss. In Peter Arkadiev, Axel Holvoet & Björn Wiemer (eds.), Contemporary
approaches to Baltic linguistics, 325–348. De Gruyter Mouton.
2016. External possession and
constructions that may have it. Sprachtypologie und Universalienforschung
STUF 69(1). 131–169.
2021. Slavic morphosyntax is primarily
determined by the geographic location and contact
configuration. Scando-Slavica 67(1), 65–90.
2025. Statistical signal vs.
areal/universal/genealogical pressure: Commentary on “Replication and methodological robustness in quantitative typology” by
Becker and Guzmán Naranjo. Linguistic
Typology 29(3). 577–585.
2025. Circum-Baltic convergence
area. In Marc Greenberg (ed.), Encyclopedia
of Slavic languages and linguistics online. Brill.
Seržant, Ilja A., Björn Wiemer, Eleni Bužarovska, Martina Ivanová, Maxim Makartsev, Stefan Savić, Dmitri Sitchinava, Karolína Skwarska, Mladen Uhlik. 2022. Areal
and diachronic trends in argument flagging across Slavic. In Eystein Dahl (ed.), Alignment
and alignment change in the Indo-European
family, 300–327. Oxford: Oxford University Press.
Seržant, Ilja A., Daria Alfimova, Petr Biskup, Ivan Seržants. 2025. Efficient
sentence processing significantly affects the position of objects in
Russian. Linguistics.
Siewierska, Anna & Ludmila Uhliřová. 1998. An
overview of word order in Slavic languages. In Anna Siewierska (ed.), Constituent
order in the languages of
Europe, 105–150. Berlin: De Gruyter Mouton.
Sinnemäki, Kaius, Francesca Di Garbo, Ricardo Napoleão de Souza & T. Mark Ellison. 2024. A
typological approach to language change in contact
situations. Diachronica 41(3). 379–413.
Slowikowski, Kamil. 2024. _ggrepel:
Automatically position non-overlapping text labels with ‘ggplot2’_. R package version
0.9.6, 〈[URL]〉.
South, Andy, Michael Schramm & Phillipe Massicotte. 2024. _rnaturalearthdata:
World vector map data from natural earth used in ‘rnaturalearth’_. R package version
1.0.0, 〈[URL]〉.
Stolz, Thomas. 1991. Sprachbund
im Baltikum? Estnisch und Lettisch im Zentrum einer sprachlichen Konvergenzlandschaft. [Bochum-Essener
Beiträge zur Sprachwandelforschung,
13]. Bochum: Brockmeyer.
Tanaka, Hiroko. 2005. Grammar
and the “timing” of social action: Word order and preference
organization in Japanese Language in
Society 341, 389–430.
Trubetzkoy, Nikolai S. (1928): [Proposition
16]. Acts of the 1st International Congress of Linguistics
17–18. Leiden.
Trubinskij, V. I. 1984. Očerki
russkogo dialektnolo sintaksisa. Leningrad: Izdatel’stvo Leningradskogo universiteta.
Van den Heuvel, Wilco. 2020. Romani
Bible translation and the use of Romani in religious
contexts. In Yaron Matras & Anton Tenser (eds.), The
Palgrave handbook of Romani language and
linguistics, 459–486. London: Palgrave Macmillan.
Verhagen, Arie. 2005. Constructions
of intersubjectivity. Discourse, syntax, and
cognition. Oxford: Oxford University Press.
Vihman, Virve-Anneli & George Walkden. 2021. Verb-second
in spoken and written Estonian. Glossa: A journal of general
linguistics 6(1): 151. 1–23.
Watkins, Calvert. 1963. Preliminaries
to a historical and comparative analysis of the syntax of the Old Irish
verb. Celtica 61. 1–49.
