Article published in: Linguistic Approaches to Bilingualism: Online-First Articles
Negative existential constructions in bilingual Russian
A corpus-driven approach to variation
Published online: 16 March 2026
https://doi.org/10.1075/lab.25019.nac
Abstract
This paper reports on a study investigating variation in negative existential constructions in varieties of Russian spoken by bilinguals from different regions of Russia. Non-standard marking of the negated subject in such constructions is reported in the literature as a common feature of varieties of bilingual Russian and is usually explained as the result of pattern borrowing due to contact with languages that, unlike Russian, do not mark the subject in any special way in such constructions. In this study, we analyze data from seven spoken corpora of bilingual Russian and compare them to data from a spoken corpus of monolingual Russian. The results show that pattern borrowing can account for some of the variation attested in our data, but not all of it. Applying a quantitative variationist method, we show that the variation is motivated by an interaction of multiple linguistic and extralinguistic factors, not all of which are related to language contact.
Article outline
- 1. Introduction
- 2. Negative existential constructions
- 2.1 Standard Russian
- 2.2 Contact languages
- 2.2.1 East Caucasian
- 2.2.2 Turkic
- 2.2.3 Uralic
- 2.2.4 Romani
- 2.2.5 Research questions and hypotheses
- 3. Data
- 3.1 Corpora and speakers
- 3.2 Annotation
- 4. Results
- 4.1 Data analysis and descriptive statistics
- 4.2 Statistical modeling
- 5. Discussion and concluding remarks
- Data availability statement
- CRediT statement
- Acknowledgements
- Notes
- Abbreviations
- References