Article published In: Diachronica
Vol. 34:1 (2017) ► pp.79–101
Modeling language family expansions
Published online: 25 April 2017
https://doi.org/10.1075/dia.34.1.03wic
https://doi.org/10.1075/dia.34.1.03wic
Abstract
This paper presents properties of a computer simulation of language migration. It takes as input a simulated phylogeny and a database of today’s populated places. At each time step, a language moves within a geographical quadrilateral defined by the minimal number, ch, of choices of populated places within the quadrilateral. The result is a constrained random walk defined by a combination of the ch parameter and the landscape, which comes into play via the restriction of the walk to populated places. The distribution of move distances is qualitatively similar across values of ch and resembles a Gamma distribution. Through comparisons with densities of real-world language families, the values of ch which yield the closest fits between real and simulated data are found.
Keywords: migration, linguistic diversity, simulation, random walk, Lévy flight, Gamma distribution
Résumé
Cet article présente les propriétés d’une simulation informatique qui permet l’étude de la migration des langues. Pour ce faire, le modèle intègre comme données de base une simulation de phylogénie ainsi qu’une base de données comprenant la localisation des zones habitées de nos jours. A chaque étape, une langue se déplace dans un quadrilatère géographique défini par le nombre minimal, ch, de choix de zones habitées au sein du quadrilatère. Le résultat obtenu présente une marche aléatoire qui est contrainte et définie par une combinaison du paramètre ch et du paysage, qui intervient via la restriction de la marche vers les zones habitées. La distribution des distances de déplacement est qualitativement similaire entre les valeurs de ch et ressemble à une distribution Gamma. Grâce aux comparaisons effectuées avec les densités des familles de langues du monde réel, les valeurs de ch, qui permettent d’obtenir les ajustements les plus proches entre données réelles et données simulées, peuvent se calculer.
Zusammenfassung
Dieser Artikel präsentiert Eigenschaften einer Computersimulation zur Sprachmigration. Als Grundlagen werden eine simulierte Phylogenie und eine Datenbank heute besiedelter Orte verwendet. Bei jedem Simulationsschritt bewegt sich eine Sprache innerhalb eines geographischen Vierecks, das durch die minimale Anzahl ch von Wahlmöglichkeiten hinsichtlich besiedelter Orte innerhalb des Vierecks definiert wird. Das Ergebnis ist ein restringierter „random walk“, der durch die Verbindung des ch-Parameters und des Terrains definiert wird. Letzteres kommt durch eine Beschränkung des Pfads auf besiedelte Orte ins Spiel. Die Verteilung der Distanzen ist über verschiedene Werte von ch hinweg qualitativ gleichartig und ähnelt einer Gammaverteilung. Durch Vergleiche mit der Sprachdichte innerhalb tatsächlich existierender Sprachfamilien werden diejenigen Werte von ch identifiziert, mit denen sich die größte Übereinstimmung zwischen echten und simulierten Daten ergibt.
Article outline
- 1.Introduction
- 2.A computational model of language migration
- 3.Properties of the simulations
- 3.1Geographical density of languages
- 3.2Migratory patterns are qualitatively similar across values of ch and resemble Gamma distributions
- 4.Discussion and conclusions
- Acknowledgements
- Notes
Software References
References (33)
The R programs accompanying this paper can be downloaded from:
Alves, Isabel, Miguel Arenas, Mathias Currat, Anna Sramkova Hanulova, Vitor C. Sousa, Nicolas Ray & Laurent Excoffier. 2016. Long-distance dispersal shaped patterns of human genetic diversity in Eurasia. Molecular Biology and Evolution 331. 946–958.
Adsera, Alicia & Mariola Pytliková. 2015. The role of language in shaping international migration. The Economic Journal 125(586). F49–F81.
Bivand, Roger & Nicholas Lewin-Koh. 2015. maptools: Tools for reading and handling spatial objects. R package version 0.8–37. [URL].
Bouckaert, Remco, Philippe Lemey, Michael Dunn, Simon J. Greenhill, Alexander V. Alekseyenko, Alexei J. Drummond, Russell D. Gray, Marc A. Suchard & Quentin D. Atkinson. 2012. Mapping the origins and expansion of the Indo-European language family. Science 3371. 957–960.
Brown, Clifford T., Larry S. Liebovitch & Rachel Glendon. 2007. Lévy flights in Dobe Ju/’hoansi foraging patterns. Human Ecology 351. 129–138.
Cameron, Cathrine M. 2013. How people moved among ancient societies: Broadening the view. American Anthropologist 1151. 218–231.
Campbell, Lyle. 2015. Do languages and genes correlate? Some methodological issues. Language Dynamics and Change 51. 202–226.
DeBoer, Warren. 2008. Wrenched bodies. In Catherine M. Cameron (ed.), Invisible citizens: Captives and their consequences, 233–261. Salt Lake City: University of Utah Press.
Edwards, Andrew M., Richard A. Phillips, Nicholas W. Watkins, Mervyn P. Freeman, Eugene J. Murphy, Vsevolod Afanasyev, Sergey V. Buldyrev, M. G. E. da Luz, E. P. Raposo, H. Eugene Stanley & Gandhimohan M. Viswanathan. 2007. Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees and deer. Nature 4491. 1044–1049.
Falck, Oliver, Stephan Heblich, Alfred Lameli & Jens Südekum. 2010. Dialects, cultural identity, and economic exchange. Forschungsinstitut zur Zukunft der Arbeit (IZA), Discussion Paper, No. 47431.
Fellows, Ian, and using the JMapViewer library by Jan Peter Stotz. 2015. OpenStreetMap: Access to Open Street Map raster images. R package version 0.3.2. [URL].
Friedrich, Paul. 1970. Proto-Indo-European trees: The arboreal system of a prehistoric people. Chicago & London: The University of Chicago Press.
Hammarström, Harald, Robert Forkel, Martin Haspelmath & Sebastian Bank. 2015. Glottolog 2.6. Jena: Max Planck Institute for the Science of Human History. [URL] (accessed December 28, 2015).
Hijmans, Robert J. 2015. geosphere: Spherical trigonometry. R package version 1.4–3. [URL].
Holman, Eric W. & Søren Wichmann. 2016. New evidence from linguistic phylogenetics identifies limits to punctuational change. Systematic Biology. . Early online publication.
Hunley, Keith & Jeffrey C. Long. 2005. Gene flow across linguistic boundaries in Native North American populations. Proceedings of the National Academy of Sciences of the U.S.A. 102(5). 1312–1317.
Lemey, Philippe, Andrew Rambaut, John J. Welch & Marc A. Suchard. 2010. Phylogeography takes a relaxed random walk in continuous space and time. Molecular Biology and Evolution 27(8). 1877–1885.
Lieberman, Philip. 1984. The biology and evolution of language. Cambridge, MA: Harvard University Press.
Malmberg, Hannes. 2011. Spatial choice processes and the Gamma distribution. BA thesis, Matematiska institutionen, Stockholms universitet. [URL] (accessed January 7, 2016).
Oliveira, de Paulo Murilo Castro de, Dietrich Stauffer, Søren Wichmann & Suzana Moss de Oliveira. 2008. A computer simulation of language families. Journal of Linguistics 441. 659–675.
Pakendorf, Brigitte, Hilde Gunnink, Bonny Sands & Koen Bostoen. Forthcoming. Prehistoric Bantu-Khoisan language contact: A cross-disciplinary approach. Language Dynamics and Change 71.
Quintero, Ignacio, Petr Keil, Walter Jetz & Forrest W. Crawford. 2015. Historical biogeography using species geographical ranges. Systematic Biology 641. 1059–1073.
Seielstad, Mark T., Erich Minch & L. Luca Cavalli-Sforza. 1998. Genetic evidence for a higher female migration rate in humans. Nature Genetics 201. 278–280.
Tamura, Koichiro, Glen Stecher, Daniel Peterson, Alan Filipski & Sudhir Kumar. 2013. MEGA6: Molecular evolutionary genetics analysis version 6.0. Molecular Biology and Evolution 301. 2725–2729.
Tilly, Charles. 1978. Migration in modern European history. In William H. McNeill & Ruth Adams (eds.), Human migration: Patterns and policies, 48–74. Bloomington: Indiana University Press.
Urbanek, Simon. 2015. rJava: Low-level R to Java interface. R package version 0.9–7. [URL].
Vavilov, Nikolai I. 1926. Centers of origin of cultivated plants. Trudi po Prikl. Bot. Genet. Selek. [Bulletin of Applied Botany and Genetics] 161. 139–248.
Venables, W. N. and B. D. Ripley. 2002. Modern applied statistics with S, 4th edn. Springer, New York.
Viswanathan, Gandhimohan M., Marcos G. E. da Luz, Ernesto P. Raposo & H. Eugene Stanley. 2011. The physics of foraging: An introduction to random searches and biological encounters. Cambridge, UK: Cambridge University Press.
Wichmann, Søren. 2005. On the power-law distribution of language family sizes. Journal of Linguistics 411. 117–131.
Wichmann, Søren, André Müller & Viveka Velupillai. 2010. Homelands of the world’s language families: A quantitative approach. Diachronica 271. 247–276.
Cited by (6)
Cited by six other publications
Polyakov, Vladimir N., Elena A. Makarova & Valery D. Solovyev
Battista, Emiliano
Kauhanen, Henri, Deepthi Gopal, Tobias Galla & Ricardo Bermúdez-Otero
Wichmann, Søren
Wichmann, Søren & Taraka Rama
This list is based on CrossRef data as of 8 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
