Cover not available

Article published In: Reproducibility, Replicability, and Robustness in Corpus Linguistics
Edited by Martin Schweinberger and Michael Haugh
[International Journal of Corpus Linguistics 30:2] 2025
► pp. 171194

References (52)
References
Abitbol, J. L., Karsai, M., Magué, J.-P., Chevrot, J.-P., & Fleury, E. (2018). Socioeconomic dependencies of linguistic patterns in Twitter: A multivariate analysis use. In P.-A. Champin, F. Gandon, M. Lalmas, & P. G. Ipeirotis (Eds.), Proceedings of the 2018 World Wide Web Conference (pp. 1125–1134). ACM Press. Google Scholar logo with link to Google Scholar
Altahmazi, T. H. (2020). Collective pragmatic acting in networked spaces: The case of #activism in Arabic and English Twitter discourse. Lingua, 2391, Article e102837. Google Scholar logo with link to Google Scholar
An, J., & Weber, I. (2016). #greysanatomy vs. #yankees: Demographics and hashtag use on Twitter. In K. P. Gummadi & M. Strohmaier (Eds.), Proceedings of the tenth international AAAI conference on web and social media (ICWSM 2016) (pp. 523–526). Google Scholar logo with link to Google Scholar
Biber, D. (1993). Representativeness in corpus design. Literary and Linguistic Computing, 8(4), 243–257. Google Scholar logo with link to Google Scholar
Biber, D., Conrad, S., & Reppen, R. (1998). Corpus linguistics: Investigating language structure and use. Cambridge University Press. Google Scholar logo with link to Google Scholar
Biber, D., & Reppen, R. (2015). The Cambridge handbook of English corpus linguistics. Cambridge University Press. Google Scholar logo with link to Google Scholar
Bruns, A. (2019). After the ‘APIcalypse’: Social media platforms and their fight against critical scholarly research. Information, Communication & Society, 22(11), 1544–1566. Google Scholar logo with link to Google Scholar
Clausen, Y., & Scheffler, T. (2020). A corpus-based analysis of meaning variations in German tag questions: Evidence from spoken and written conversational corpora. Corpus Linguistics and Linguistic Theory, 18(1), 1–31. Google Scholar logo with link to Google Scholar
Coats, S. (2017). Gender and lexical type frequencies in Finland Twitter English. In T. Hiltunen, J. McVeigh, & T. Säily (Eds.), Big and rich data in English corpus linguistics: Methods and explorations (Studies in variation, contacts and change in English 19). Varieng. [URL]
Davies, M. (2013). Corpus of Global Web-Based English. [URL]
(2015). Corpora: An introduction. In D. Biber & R. Reppen (Eds.), The Cambridge handbook of English corpus linguistics (pp. 11–31). Cambridge University Press. Google Scholar logo with link to Google Scholar
Dijkstra, J., Heeringa, W., Jongbloed-Faber, L., & Van de Velde, H. (2021). Using Twitter data for the study of language change in low-resource languages. A panel study of relative pronouns in Frisian. Frontiers in Artificial Intelligence, 41, Article e644554. Google Scholar logo with link to Google Scholar
Dunbar, R. I. M. (2020). Structure and function in human and primate social networks: Implications for diffusion, network stability and health. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 476(2240), Article e20200446. Google Scholar logo with link to Google Scholar
Eisenstein, J., O’Connor, B., Smith, N. A., & Xing, E. P. (2014). Diffusion of lexical change in social media. PLOS ONE, 9(11), Article e113114. Google Scholar logo with link to Google Scholar
Fiil-Flynn, S. M., Butler, B., Carroll, M., Cohen-Sasson, O., Craig, C., Guibault, L., Jaszi, P., Jütte, B. J., Katz, A., Quintais, J. P., Margoni, T., Souza, A. R., Sag, M., Samberg, R., Schirru, L., Senftleben, M., Tur-Sinai, O., & Contreras, J. L. (2022). Legal reform to enhance global text and data mining research. Science, 378(6623), 951–953. Google Scholar logo with link to Google Scholar
Flanagan, J. (2017). Reproducible research: Strategies, tools, and workflows. In T. Hiltunen, J. McVeigh, & T. Säily (Eds.), Big and rich data in English corpus linguistics: Methods and explorations (Studies in variation, contacts and change in English 19). Varieng. [URL]
Francis, W. N., & Kučera, H. (1964). A Standard Corpus of Present-Day Edited American English, for use with digital computers. Department of Linguistics, Brown University.Google Scholar logo with link to Google Scholar
Gonçalves, B., Loureiro-Porto, L., Ramasco, J. J., & Sánchez, D. (2018). Mapping the Americanization of English in space and time. PLOS ONE, 13(5), Article e0197741. Google Scholar logo with link to Google Scholar
Grieve, J. (2021). Observation, experimentation, and replication in linguistics. Linguistics, 59(5), 1343–1356. Google Scholar logo with link to Google Scholar
Grieve, J., Montgomery, C., Nini, A., Murakami, A., & Guo, D. (2019). Mapping lexical dialect variation in British English using Twitter. Frontiers in Artificial Intelligence, 21, Article 11. Google Scholar logo with link to Google Scholar
Grieve, J., Nini, A., & Guo, D. (2017). Analyzing lexical emergence in American English online. English Language and Linguistics, 21(1), 99–127. Google Scholar logo with link to Google Scholar
(2018). Mapping lexical innovation on American social media. Journal of English Linguistics, 46(4), 293–319. Google Scholar logo with link to Google Scholar
Groves, R. M., & Couper, M. P. (1998). Nonresponse in household interview surveys. Wiley. Google Scholar logo with link to Google Scholar
Hardie, A. (2012). CQPweb — combining power, flexibility and usability in a corpus analysis tool. International Journal of Corpus Linguistics, 17(3), 380–409. Google Scholar logo with link to Google Scholar
Huang, Y., Guo, D., Kasakoff, A., & Grieve, J. (2016). Understanding U.S. regional linguistic variation with Twitter data analysis. Computers, Environment and Urban Systems, 591, 244–255. Google Scholar logo with link to Google Scholar
Hundt, M., Lehmann, H. M., & Schneider, G. (2023). The International Corpus of English. Retrieved January 7, 2025, from [URL]
Kathpalia, S. (2023). Satiric parody through Indian English tweets in Twitter. World Englishes, 42(4), 606–623. Google Scholar logo with link to Google Scholar
Kazansky, B., Torres, G., van der Velden, L., Wissenbach, K., & Milan, S. (2019). Data for the social good: Toward a data-activist research agenda. In A. Daly, S. K. Devitt, & M. Mann (Eds.), Good data (pp. 244–259). Institute of Network Cultures.Google Scholar logo with link to Google Scholar
Kellert, O., & Matlis, N. H. (2022). Geolocation of multiple sociolinguistic markers in Buenos Aires. PLOS ONE, 17(9), Article e0274114. Google Scholar logo with link to Google Scholar
Laitinen, M., & Fatemi, M. (2022). Big and rich social networks in computational sociolinguistics. In P. Rautionaho, H. Parviainen, M. Kaunisto, & A. Nurmi (Eds.), Social and regional variation in World Englishes: Local and global perspectives (pp. 166–189). Routledge. Google Scholar logo with link to Google Scholar
Laitinen, M., & Lundberg, J. (2020). ELF, language change, and social networks: Evidence from real-time social media data. In A. Mauranen & S. Vetchinnikova (Eds.), Language change: The impact of English as a lingua franca (pp. 179–204). Cambridge University Press. Google Scholar logo with link to Google Scholar
Laitinen, M., Lundberg, J., Levin, M., & Lakaw, A. (2017). Utilizing multilingual language data in (nearly) real time: The case of the Nordic Tweet Stream. The Journal of Universal Computer Science, 231, 1038–1056. Google Scholar logo with link to Google Scholar
Li, H., Dunn, J., & Nini, A. (2022). Register variation remains stable across 60 languages. Corpus Linguistics and Linguistic Theory, 19(3), 397–426. Google Scholar logo with link to Google Scholar
Liimatta, A. (2021). Using lengthwise scaling to compare feature frequencies across text lengths on Reddit. In S. Rüdiger & D. Dayter (Eds.), Corpus approaches to social media (pp. 111–130). John Benjamins. Google Scholar logo with link to Google Scholar
Love, R., Dembry, C., Hardie, A., Brezina, V., & McEnery, T. (2017). The Spoken BNC2014: Designing and building a spoken corpus of everyday conversations. International Journal of Corpus Linguistics, 22(3), 319–344. Google Scholar logo with link to Google Scholar
Lüdeling, A., & Kytö, M. (2009). Corpus linguistics: An international handbook. De Gruyter Mouton. Google Scholar logo with link to Google Scholar
Lundberg, J., Nordqvist, J., & Laitinen, M. (2019). Towards a language independent Twitter bot detector. In C. Navarretta, M. Agirrezabal, & B. Maegaard (Eds.), Proceedings of the Digital Humanities in the Nordic Countries 4th Conference (pp. 308–319). University of Copenhagen. [URL].
Ma, X., Cheng, J., Iyer, S., & Naaman, M. (2019). When do people trust their social groups? In S. Brewster, G. Fitzpatrick, A. Cox, & V. Kostakos (Eds.), CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Paper 67. Association for Computing Machinery.Google Scholar logo with link to Google Scholar
Milroy, L., & Milroy, J. (1992). Social network and social class: Toward an integrated sociolinguistic model. Language in Society, 21(1), 1–26. Google Scholar logo with link to Google Scholar
Morstatter, F., Wu, L., Nazer, T. H., Carley, K. M., & Liu, H. (2016). A new approach to bot detection: Striking the balance between precision and recall. In R. Kumar, J. Caverlee, & H. Tong (Eds.), Proceedings of the 2016 IEEE/ACM international conference on advances in social networks analysis and mining (pp. 533–540). IEEE press. Google Scholar logo with link to Google Scholar
National Academies of Sciences, Engineering, and Medicine (NAS). (2019). Reproducibility and replicability in science. The National Academies Press. Google Scholar logo with link to Google Scholar
Nevalainen, T., Raumolin-Brunberg, H., Keränen, J., Nevala, M., Nurmi, A., & Palander-Collin, M. (1993–). Corpus of Early English Correspondence. [URL]
Nevalainen, T., Tyrkkö, J., & Minna Palander-Collin, M. (n.d.). Corpus Resource Database. [URL]
Schweinberger, M., & Flanagan, J. (2021). Replication and reproducibility in English corpus linguistics [Workshop]. 6th meeting of the International Society for the Linguistics of English (ISLE 6), Joensuu, Finland.
Shakir, M., & Deuber, D. (2018). A multidimensional study of interactive registers in Pakistani and US English. World Englishes, 37(4), 607–623. Google Scholar logo with link to Google Scholar
Sönning, L., & Werner, V. (2021). The replication crisis, scientific revolutions, and linguistics. Linguistics, 59(5), 1179–1206. Google Scholar logo with link to Google Scholar
Statista. (2022). Number of worldwide social media users. Retrieved November 19, 2022, from [URL]
Tyrkkö, J., Levin, M., & Laitinen, M. (2021). Actually in Nordic tweets. World Englishes, 40(4), 631–649. Google Scholar logo with link to Google Scholar
Van Hee, C., Lefever, E., & Hoste, V. (2018). We usually don’t like going to the dentist: Using common sense to detect irony on Twitter. Computational Linguistics, 44(4), 793–832. Google Scholar logo with link to Google Scholar
Wachter, S., Mittelstadt, B., & Russell, C. (2024). Do large language models have a legal duty to tell the truth? Royal Society Open Science, 111, Article e240197. Google Scholar logo with link to Google Scholar
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., Bonino da Silva Santos, L., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R.Mons, B. (2016). The FAIR guiding principles for scientific data management and stewardship. Scientific Data, 31, Article e160018. Google Scholar logo with link to Google Scholar
Cited by (1)

Cited by one other publication

Becker, Laura & Matías Guzmán Naranjo
2025. Authors’ response to “Replication and methodological robustness in quantitative typology”. Linguistic Typology 29:3  pp. 591 ff. DOI logo

This list is based on CrossRef data as of 30 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Mobile Menu Logo with link to supplementary files background Layer 1 prag Twitter_Logo_Blue