In:Spanish Language and Sociolinguistic Analysis
Edited by Sandro Sessarego and Fernando Tejedo-Herrero
[Issues in Hispanic and Lusophone Linguistics 8] 2016
► pp. 3–32
Quantitative analysis in language variation and change
Published online: 25 May 2016
https://doi.org/10.1075/ihll.8.01tag
https://doi.org/10.1075/ihll.8.01tag
The essential goal of variation analysis is to understand the behavior of the dependent variable according to a series of factors, either external (social) or internal (grammatical), (Sankoff, 1988, p. 985). I begin with a brief review of the standard tool, logistic regression using the Varb family of programs. I will also introduce new tools, mixed effects models and condition inference trees relying on the discussions in Baayen (2008), Gries (2009), Johnson (2009).
Keywords: language variation, quantitative analysis, Varb
References (62)
Alfaraz, G.G. (2015). Variation of overt and null subject pronouns in the Spanish of Santo Domingo. In A.M. Carvalho, R. Orozco, & N.L. Shin (Eds.), Subject pronoun expression in Spanish: A cross-dialectal perspective (pp. 5-18). Washington, DC: Georgetown University Press.
Baayen, H. (2008). Analyzing linguistic data: A practical introduction to statistics. Cambridge: Cambridge University Press.
Baayen, H., Davidson, D., & Bates, D. (2008). Mixed-effect modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59, 390-412.
. (2005b). lme4: Linear mixed effects models using S4 classes, R package version 9.9975-7.
Blas Arroyo, J.L., & González Martínez, J. (2014). ‘¿Qué tengo que/de hazer?’: Variación y cambio lingüístico en el seno de las perífrasis de infinitivo a partir de textos escritos de impronta oral en el español clásico. Studies in Hispanic & Lusophone Linguistics, 7(2), 241-274.
Bresnan, J., & Ford, M. (2010). Predicting syntax: Processing dative constructions in American and Australian varieties of English Language, 86(1), 168-213.
Brown, E.L., & Cortés-Torres, M. (2012). Syntactic and pragmatic usage of the [estar + Adjective] construction in Puerto Rican Spanish: ¡Está brutal!
In K. Geeslin & M. Díaz-Campos (Eds.), Selected Proceedings of the 14th Hispanic Linguistics Symposium (pp. 61-74). Somerville, MA: Cascadilla Proceedings Project.
Brown, E.L., & Torres Cacoullos, R. (2003). Spanish /s/: A different story from beginning (initial) to end (final). In R. Núñez-Cedeño, L. López, & R. Cameron (Eds.), A Romance perspective in language knowledge and use. Selected papers from the 31st Linguistic Symposium on Romance Languages (LSRL) (pp. 22-38). Amsterdam: John Benjamins.
Bullock, B.E., & Toribio, A.J. (2015). An experimental approach to hypercorrection in Dominican Spanish. In R. Klassen, J.M. Liceras & E. Valenzuela (Eds.), Hispanic linguistics at the crossroads. Theoretical linguistics, language acquisition and language contact (pp. 251-268). Amsterdam: John Benjamins.
Cameron, R. (1993). Ambiguous agreement, functional compensation, and nonspecific tú in the Spanish of San Juan, Puerto Rico and Madrid, Spain. Language Variation and Change, 5(3), 305-334.
Carvalho, A. (2004). I speak like the guys on TV: Palatalization and the urbanization of Uruguayan Portuguese. Language Variation and Change, 16(2), 127-151.
Carvalho, A.M., & Bessett, R.M. (2015). Subject pronoun expression in Spanish in contact with Portuguese. In A.M. Carvalho, R. Orozco, & N.L. Shin (Eds.), Subject pronoun expression in Spanish: A cross-dialectal perspective (pp. 145-168). Washington, DC: Georgetown University Press.
Cedergren, H.J., & Sankoff, D. (1974). Variable rules: Performance as a statistical reflection of competence. Language, 50(2), 333-355.
Chambers, J.K. (2003). Sociolinguistic theory: Linguistic variation and its social significance. Malden, MA: Blackwell.
Díaz-Campos, M., Fafulas, S., & Gradoville, M. (2010). Going retro: An analysis of the interplay between socioeconomic class and age in Caracas Spanish. In J. Michnowicz & R. Dodsworth (Eds.), Selected Proceedings of the Fifth Workshop on Spanish Sociolinguistics (pp. 65-78). Somerville, MA: Cascadilla Proceedings Project.
Geeslin, K., & Guijarro-Fuentes, P. (2008). Variation in contemporary Spanish: Linguistic predictors of estar in four cases of language contact. Bilingualism, Language and Cognition, 11, 365-380.<[URL]>
Guy, G.R. (1980). Variation in the group and the individual: The case of final stop deletion. In W. Labov (Ed.), Locating language in time and space (pp. 1-36). New York, NY: Academic Press.
. (1988). Advanced VARBRUL analysis. In K. Ferrara, B. Brown, K. Walters, & J. Baugh (Eds.), Linguistic change and contact (pp.124-136). Austin, TX: Department of Linguistics, University of Texas at Austin.
. (1993). The quantitative analysis of linguistic variation. In D. Preston (Ed.), American dialect research (pp. 223-249). Amsterdam: John Benjamins.
. (2006b). Spoken syntax: The phonetics of giving a hand in New Zealand English. The Linguistic Review, 23(3), 321-349.
Huddleston, R., & Pullum, Geoffrey K. (2002). The Cambridge Grammar of the English Language. Cambridge: Cambridge University Press.
Jaeger, F. (2005). Optional that indicates production difficulty: Evidence from disfluencies.
Proceedings of DiSS’05, Disfluency in Spontaneous Speech Workshop
(pp. 103-109). Aix-en-Provence, France, 10-12 September 2005.
Johnson, D.E. (2009). Getting off the GoldVarb standard: Introducing Rbrul for mixed effects variable rule analysis. Language and Linguistics Compass, 3(1), 359-383.
. (2010). Rbrul manual. Retrieved from <[URL]>
Johnson, M., & Barnes, S. (2013).
Haya vs. haiga: An analysis of the variation observed in Mexican Spanish using a mixed effects model. In A.M. Carvalho & S. Beaudrie (Eds.), Selected Proceedings of the Sixth Workshop on Spanish Sociolinguistics (pp. 32-40). Somerville, MA: Cascadilla Proceedings Project.
Johnson, M., & Grinstead, J. (2012). Variation in the voseo and tuteo Negative Imperatives in Argentine Spanish. In M. Tamminga (Ed.), University of Pennsylvania Working Papers in Linguistics, 17(2), Article 12. Available at <[URL]>
. (1990). The intersection of sex and social class in the course of linguistic change. Language Variation and Change, 2(2), 205-254.
Lamy, D.S. (2015). A sociophonetic analysis of trill production in Panamanian Spanish. In R. Klassen, J.M. Liceras, & E. Valenzuela (Eds.), Hispanic linguistics at the crossroads. Theoretical linguistics, language acquisition and language contact (pp. 313-336). Amsterdam: John Benjamins.
Lastra, Y., & Martín Butragueño, P. (2015). Subject pronoun expression in oral Mexican Spanish. In A.M. Carvalho, R. Orozco, & N.L. Shin (Eds.), Subject pronoun expression in Spanish: A cross-dialectal perspective (pp. 41-60). Washington, DC: Georgetown University Press.
Medina-Rivera, Antonio. (2011). Variationist Approaches: External factors conditioning variation in Spanish phonology. In M. Díaz-Campos (Ed.), The handbook of Hispanic sociolinguistics (pp. 36-53). Oxford: Wiley-Blackwell.
Michnowitz, J. (2015). Subject pronoun expression in contact with Maya in Yucatan Spanish. In A.M. Carvalho, R. Orozco, & N.L. Shin (Eds.), Subject pronoun expression in Spanish: A cross-dialectal perspective (pp. 103-122). Washington, DC: Georgetown University Press.
Paolillo, J. (2009). Model vs. software in variation analysis. Presented at a Workshop on Using Statistical Tools to Explain Linguistic Variation. Convenor: Sali A. Tagliamonte. New Ways of Analyzing Variation (NWAV) 38 Ottawa, Ontario, Canada. October 22-25, 2009.
Poplack, S., & Tagliamonte, S.A. (2001). African American English in the diaspora: Tense and aspect. Malden, MA: Blackwell.
Rand, D., & Sankoff, D. (1990). GoldVarb: A variable rule application for the Macintosh. Montreal, Canada: Centre de recherches mathématiques, Université de Montréal.
Rodríguez Riccelli, A. (forthcoming).
Espero estén todos:The distribution of the null complementizer in two varieties of Spanish. In J. King & S. Sessarego (Eds.), The dynamics of language variation and change: Varieties of Spanish across space and time. Amsterdam: John Benjamins.
Rohdenburg, G. (1998). Clausal complementation and cognitive complexity in English. In F.-W. Neumann & S. Schülting (Eds.), Anglistentag Erfurt (pp. 101-112). Trier: Wissenschaftlicher Verlag.
Sankoff, D. (1988). Variable rules. In U. Ammon, N. Dittmar, & K.J. Mattheier (Eds.), Sociolinguistics: An international handbook of the science of language and society Vol. 2 (pp. 984-997). Berlin: Walter de Gruyter.
Sankoff, D., Tagliamonte, S.A., & Smith, E. (2005). Goldvarb X. Department of Linguistics, University of Toronto, Toronto, Canada. Retrieved from <[URL]>
Sankoff, G. (1974). A quantitative paradigm for the study of communicative competence. In R. Bauman & J. Sherzer (Eds.), Explorations in the ethnography of speaking (pp. 18-49). Cambridge: Cambridge University Press.
Sankoff, G., Blondeau, H., & Charity, A. (2001). Individual roles in a real-time change: Montreal (r->R) 1947-1995. In R. van Hout & H. vande Velde (Eds.), R-atics: Sociolinguistic, phonetic and phonological characteristics of /r/ (pp.141-157). Brussels: Université Libre de Bruxelles, Institute des Langues Vivantes et de Phonétique.
Schwenter, S.A. (2011). Variationist approaches to Spanish morphosyntax. In M. Díaz-Campos (Ed.), The handbook of Hispanic sociolinguistics (pp. 123-147). Oxford: Wiley-Blackwell.
Schwenter, S.A., & Torres Cacoullos, R. (2008). Defaults and indeterminacy in temporal grammaticalization: The ‘perfect’ road to perfective. Language Variation and Change, 20(1), 1-39.
. (2002). Comparative sociolinguistics. In J.K. Chambers, P. Trudgill, & N. Schilling-Estes (Eds.), Handbook of language variation and change (pp. 729-763). Malden, MA: Blackwell.
. (2007). Quantitative analysis. In R. Bayley & L. Ceil (Eds.), Sociolinguistic variation: Theory, methods, and applications, Dedicated to Walt Wolfram (pp. 190-214). Cambridge: Cambridge University Press.
. (2012). Variationist Sociolinguistics: Change, observation, interpretation. Malden, MA: Wiley-Blackwell.
Tagliamonte, S.A., & Baayen, R.H. (2012). Models, forests and trees of York English: Was/were variation as a case study for statistical practice. Language Variation and Change, 24(2), 135-178.
Tagliamonte, S.A., & Smith, J. (2005). No momentary fancy! The zero 'complementizer' in English dialects. English Language and Linguistics 9(2), 1-21.
Team, R Development Core (2007). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from <[URL]>
Thompson, S., & Mulac, A. (1991). The discourse conditions for the use of the complementizer that in conversational English. Journal of Pragmatics, 15, 237-251.
Torres-Cacoullos, R., & Walker, J.A. (2009). On the persistence of grammar in discourse formulas: A variaionist study of that
. Linguistics, 47(1), 1-43.
Wolfram, W. (1993). Identifying and interpreting variables. In D.R. Preston (Ed.), American dialect research (pp. 193-221). Amsterdam: John Benjamins.
Young, R., & Bayley, R. (1996). VARBRUL analysis for second language acquisition research. In R. Bayley & D.R. Preston (Eds.), Second language acquisition and linguistic variation (pp. 253-306). Amsterdam: John Benjamins.
Cited by (4)
Cited by four other publications
Brownshire, Katharine & Juliana De la Mora Gutiérrez
Insaidoo, Harry Ato, Joseph Benjamin Archibald Afful & Lucy Amoah Abedu
Solon, Megan, Bret Linford & Kimberly L. Geeslin
2018. Acquisition of sociophonetic variation. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 31:1 ► pp. 309 ff.
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
