In:Language Variation – European Perspectives VIII: Selected papers from the Tenth International Conference on Language Variation in Europe (ICLaVE 10), Leeuwarden, June 2019
Edited by Hans Van de Velde, Nanna Haug Hilton and Remco Knooihuizen
[Studies in Language Variation 25] 2021
► pp. 209–226
Get fulltext
Chapter 9Identification of clusters of lexical areas using geographical factors
A case study in the Occitan language area
Available under the Creative Commons Attribution-NonCommercial-NoDerivatives (CC BY-NC-ND) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Published online: 16 June 2021
https://doi.org/10.1075/silv.25.09cha
https://doi.org/10.1075/silv.25.09cha
Abstract
We propose a multidimensional statistical analysis procedure using projection and clustering methods in order to identify coherent clusters in a set of lexical areas. The methodology includes a geographical factor, such as administrative divisions or land cover features, to help the identification of clusters. By applying this method on data from the Occitan language area in the south of France, we are able to identify new spatial patterns and lexical boundaries that do not match traditional dialect boundaries. Our method helps to suggest possible explanations for these new patterns.
Article outline
- 1.Context
- 2.Method
- 2.1Representation space
- 2.2Barycentric projection
- 2.3Clustering
- 3.Implementation of the method
- 3.1Visual exploration
- 3.2Cluster characterization
- 4.Case study: Occitan
- 5.Conclusion
Notes Bibliography
References (20)
Boberg, Charles, John Nerbonne & Dominic Watt (eds.). 2018. The Handbook of Dialectology. UK & USA: John Wiley & Sons, Inc.
Brun-Trigaud, Guylaine, Yves Le Berre & Jean Le Dû. 2005. Lectures de l’Atlas Linguistique de la France de J. Gilléron et E. Edmont: Du temps dans l’espace. Paris: CTHS.
Brun-Trigaud, Guylaine. 2012. Essai de typologie des aires lexicales dans l’Atlas Linguistique du Centre. Annales de Normandie 62(2). 77–93.
Brun-Trigaud, Guylaine & Albert Malfatto. 2013. Limites dialectales vs limites lexicales dans le domaine occitan: Un impossible accord? In Ernestina Carrilho, Catarina Magro & Xosé Afonso Álvarez Perez (eds.), Current Approaches to Limits and Areas in Dialectology, 293–310. Cambridge Scholars Publishers.
Brun-Trigaud, Guylaine, Albert Malfatto & Maguelone Sauzet. 2020. Essai de typologie des aires lexicales occitanes: Regards dialectométriques. Fidélités et dissidences: 12e Congrès de l’Association Internationale d’Etudes Occitanes. 169–179.
Chagnaud, Clement, Philipe Garat, Paule-Annick Davoine, Elisabetta Carpitelli & Axel Vincent. 2017. Shinydialect: A cartographic tool for spatial interpolation of geolinguistic data. 1st ACM SIGSPATIAL workshop on Geospatial Humanities, 23–30. ACM.
Chambers, J. K. & Peter Trudgill. 1998. Dialectology (Cambridge Textbooks in Linguistics). 2nd edn. Cambridge: Cambridge University Press.
Dalbera, Jean-Philippe, Jean-Claude Ranucci, Pierre-Aurélien Georges, Michèle Oliviéri & Guylaine Brun-Trigaud. 2012. La base de données linguistique occitane Thesoc: Trésor patrimonial et instrument de recherche scientifique. Estudis Romànis 34. 367–387.
Dalbera, Jean-Philippe. 2013. La trajectoire de la dialectologie au sein des sciences du langage: De la reconstruction des systèmes dialectaux à la sémantique lexicale et à l’étymologie. Corpus 12. 173–200.
Everitt, Brian S., Sabine Landau, Morven Leese & Daniel Stahl. 2011. Cluster Analysis. 5th edn. United Kingdom: John Wiley.
Heeringa, Wilbert Jan. 2004. Measuring dialect pronunciation differences using levenshtein distance. University of Groningen: Ph.D. thesis.
Léonard, Jean-Léo. 2001. Aréologie dialectale et modularité des réseaux dialectaux: Étagement spatial et structural des processus (morpho)phonologiques dans le réseau dialectal basque. XVe Congrès international de l’Académie basque, 17–19 septembre 2001, 141–168.
Miller, Frederic P., Agnes F. Vandome & John McBrewster. 2009. Levenshtein distance: Information theory, computer science, string (computer science), string metric, damerau- levenshtein distance, spell checker, hamming distance. Orlando: Alpha Press.
Nakache, Jean-Pierre & Josiane Confais. 2004. Approche pragmatique de la classification: Arbres hiérarchiques, partitionnements. Paris: Technip.
Nerbonne, John, Rinke Colen, Charlotte Gooskens, Peter Kleiweg & Therese Leinonen. 2011. Gabmap: A web application for dialectology. Dialectologia Special Issue II. 65–89.
