In:Investigating Wikipedia: Linguistic corpus building, exploration and analysis
Edited by Céline Poudat, Harald Lüngen and Laura Herzberg
[Studies in Corpus Linguistics 121] 2024
► pp. 76–106
Chapter 3Exploring interactions in Wikipedia talk pages
Published online: 31 October 2024
https://doi.org/10.1075/scl.121.03tan
https://doi.org/10.1075/scl.121.03tan
Abstract
In this chapter we analyze how users interact on Wikipedia talk pages, focusing on the patterns that emerge from a large
corpus of 5 million threads across three languages. These patterns take three simple features into account: who posts,
when they post, and after whom they post. We begin with an overview and a closer examination of some extreme
behaviours: threads with the highest number of users and posts, or those with the longest duration. We then propose
fine-grained typologies to analyze the content of a sample of monologal and trilogal openings. This analysis allows us
to identify recurring behaviours, some of which we consider specific to the Wikipedia and its contributors, and others
more common in online discussions.
Article outline
- 1.Introduction
- 2.What corpus framework is suitable for exploring
Wikipedia Talk Pages?
- 2.1Wikipedia talk pages
- 2.2A corpus framework for exploring interactions
- 3.A coarse-grained view of Wikipedia talks
- 3.1Talks as sequences
- 3.2Data overview
- 3.3A look at extreme behaviors
- 3.3.1The longest threads
- 3.3.2The most populated threads
- 3.3.3The longest-lasting talks
- 3.4Interaction patterns
- 4.Case studies: Monologues and trilogues
- 4.1AA openings and monologues: Are Wikipedia talks really talks?
- 4.1.1AA openings
- 4.1.2Long monologues
- 4.2ABC openings and multiparty conversations
- 4.2.1Why is C posting a message?
- 4.2.2Does C align with A and/or B?
- 4.1AA openings and monologues: Are Wikipedia talks really talks?
- 5.Discussion
Notes References
References (35)
Beaudouin, Valérie, & Velkovska, Julia. 1999. Constitution
d’un espace de communication sur Internet (forums, pages personnelles, courrier
électronique…). Réseaux. Communication — Technologie —
Société 17(97): 121–177.
Beißwenger, Michael, Ermakova, Maria, Geyken, Alexander, Lemnitzer, Lothar & Storrer, Angelika. 2012. A
TEI schema for the representation of computer-mediated communication. Journal
of the Text Encoding Initiative 3.
Davis, James. 2007. Dialogue,
monologue and soliloquy in the large lecture class. International Journal of
Teaching and Learning in Higher
Education 19(2): 178–182
Ferschke, Oliver, Daxenberger, Johannes & Gurevych, Iryna. 2013. A
survey of NLP methods and resources for analyzing the collaborative writing process in
Wikipedia. In The People’s Web Meets NLP:
Collaboratively Constructed Language Resources, Iryna Gurevych & Jungi Kim (eds). Berlin: Springer.
Ferschke, Oliver, Gurevych, Iryna & Chebotar, Yevgen. 2012. Behind
the article: Recognizing dialog acts in Wikipedia talk
pages. In Proceedings of the 13th Conference of the
European Chapter of the Association for Computational Linguistics, Walter Daelemans (ed.), 777–786. Stroudsburg PA: ACL.
Frobenius, Maximiliane. 2011. Beginning
a monologue: The opening sequence of video blogs. Journal of
Pragmatics 43: 814–827.
. 2014. Audience
design in monologues: How vloggers involve their viewers. Journal of
Pragmatics 72: 59–72.
Gómez, Vincenc, Kappen, Hilbert J. & Kaltenbrunner, Andreas. 2011. Modeling
the structure and evolution of discussion
cascades. In Proceedings of the 22nd ACM Conference
on Hypertext and Hypermedia, Eindhoven, the Netherlands, 181–190. New York NY: ACM.
Herring, Susan C. 2010. Computer-mediated
conversation, Part I : Introduction and
overview. Language@Internet 7(2).
Kaltenbrunner, Andreas & Laniado, David. 2012. There
is no deadline: Time evolution of Wikipedia
discussions. In Proceedings of the 8th Annual
International Symposium on Wikis and Open
Collaboration, 1–10. New York NY: ACM.
Kopf, Susanne. 2022. A
Discursive Perspective on Wikipedia: More than an Encyclopaedia? Palgrave Macmillan.
Laniado, David, Tasso, Riccardo, Volkovich, Yana & Kaltenbrunner, Andreas. 2011. When
the Wikipedians talk: Network and tree structure of Wikipedia discussion
pages. In Fifth International AAAI
Conference on Weblogs and Social
Media, Barcelona, 17–21
July.
Marcoccia, Michel. 2004. L’analyse
conversationnelle des forums de discussion: Questionnements
méthodologiques. Les Carnets du Cediscor. Publication du Centre de Recherches
sur la Didacticité des Discours Ordinaires 8.
Mehler, Alexander, Gleim, Rüdiger, Lücking, Andy, Uslu, Tolga, & Stegbauer, Christian. 2018. On
the self-similarity of Wikipedia talks: A combined discourse-analytical and quantitative
approach. Glottometrics 40: 1–45.
Meredith, Joanne, Giles, David, & Stommel, Wyke (eds). 2021. Analysing
Digital
Interaction. Berlin: Springer.
Mondada, Lorenza. 2006. Interactions
en situations professionnelles et institutionnelles: De l’analyse détaillée aux retombées
pratiques. Revue Française de Linguistique
Appliquée 11(2): 5–16.
. 2012a. Conversation
analysis and institutional interaction. In The
Encyclopedia of Applied Linguistics, 1st ed., Carol A. Chapelle (ed.). Malden MA: John Wiley & Sons.
. 2012b. The
conversation analytic approach to data
collection. In The Handbook of Conversation
Analysis, Jack Sidnell & Tanya Stivers (eds), 32–56. Malden MA: John Wiley & Sons.
Pentzold, Christian, Weltevrede, Esther, Mauri, Michele, Laniado, David, Kaltenbrunner, Andreas & Borra, Erik. 2017. Digging
Wikipedia: The online encyclopedia as a digital cultural heritage gateway and
site. Journal on Computing and Cultural Heritage
(JOCCH) 10(1), 1–19.
Poudat, Céline, & Frédéric Landragin. 2017. Explorer
un corpus textuel: Méthodes — pratiques — outils. De Boeck Supérieur.
Poudat, Céline, Wigham, Ciara R., & Liégeois, Loïc. 2020. Les
corpus de la communication médiée par les réseaux: Une
introduction. Corpus 20: Article
20.
Sacks, Harvey, Schegloff, Emanuel A., & Jefferson, Gail. 1974. A
simplest systematics for the organization of turn-taking for
conversation. Language 50(4): 696–735.
Schegloff, Emanuel A. 1968. Sequencing in
conversational openings. American
Anthropologist 70(6): 1075–95.
Schneider, Jodi, Passant, Alexandre & Breslin, John G. 2010. A
content analysis: How Wikipedia talk pages are
used. In Proceedings of the 2nd International
Conference of Web Studies, Everardo Reyes (ed.), 1–7. New York NY: ACM.
Stivers, Tanya & Sidnell, Jack. 2012. Introduction. In The
Handbook of Conversation Analysis, Jack Sidnell & Tanya Stivers (eds), 1–8. Malden MA: John Wiley & Sons.
