In: Investigating Wikipedia: Linguistic corpus building, exploration and analysis
Edited by Céline Poudat, Harald Lüngen and Laura Herzberg
[Studies in Corpus Linguistics 121] 2024
► pp. 1244

References (31)
Baldwin, Timothy, Cook, Paul, Lui, Marco, MacKinlay, Andrew & Wang, Li. 2013. How noisy social media text, how different social media sources? In Proceedings of the Sixth International Joint Conference on Natural Language Processing, Ruslan Mitkov & Jong C. Park (eds), 356–364. Nagoya, Japan.
Beißwenger, Michael & Lüngen, Harald. 2020. CMC-core: A schema for the representation of CMC corpora in TEI. Corpus 20. 〈[URL]〉
Beißwenger, Michael, Wigham, Ciara, Etienne, Carole, Grumt Suárez, Holger, Herzberg, Laura, Fišer, Darja, Hinrichs, Erhard, Horsmann, Tobias, Karlova-Bourbonus, Natali, Lemnitzer, Lothar, Longhi, Julien, Lüngen, Harald, Ho-Dac, Lydia-Mai, Parisse, Christophe, Poudat, Céline, Schmidt, Thomas, Stemle, Egon, Storrer, Angelika & Zesch, Torsten. 2017. Connecting resources: Which issues have to be solved to integrate CMC corpora from heterogeneous sources and for different languages? In Proceedings of the 5th Conference on CMC and Social Media Corpora for the Humanities (Cmccorpora17), Egon W. Stemle & Ciara Wigham (eds), 52–55. Bolzano, Italy.
Borra, Erik, Weltevrede, Esther, Ciuccarelli, Paolo, Kaltenbrunner, Andreas, Laniado, David, Magni, Giovanni, Mauri, Michele, Rogers, Richard & Venturini, Tommaso. 2015. Societal controversies in Wikipedia articles. In CHI ’15: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 193–196. New York, NY: ACM.
Chang, Jonathan P., Chiam, Caleb, Fu, Liye, Wang, Andrew Z., Zhang, Justine & Danescu-Niculescu-Mizil, Cristian. 2020. ConvoKit: A toolkit for the analysis of conversations. In Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Olivier Pietquin, Smaranda Muresan, Vivian Chen, Casey Kennington, David Vandyke, Nina Dethlefs, Koji Inoue, Erik Ekstedt & Stefan Ultes (eds), 57–60. [System demo]. Stroudsburg PA: ACL.
Chang, Jonathan P. & Danescu-Niculescu-Mizil, Cristian. 2019. Trouble on the horizon: Forecasting the derailment of online conversations as they develop. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (XXth EMNLP). Stroudsburg PA: ACL.
Elia, Antonella. 2009. Quantitative data and graphics on lexical specificity and index readability: The case of Wikipedia. Revista Electrónica de Lingüística Aplicada 8: 248–271.
Ferschke, Oliver, Gurevych, Iryna & Chebotar, Yevgen. 2012. Behind the article: Recognizing dialog acts in Wikipedia talk pages. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, 777–786. Stroudsburg PA: ACL.
Ho-Dac, Lydia-Mai. 2024. EFG WikiCorpus – discussions in Wikipedia’s backstage (English, French, German) [Corpus]. ORTOLANG (Open Resources and TOols for LANGuage) – [URL], [URL]
Ho-Dac, Lydia-Mai & Laippala, Veronika. 2017. Le corpus WikiDisc: Ressource pour la caractérisation des discussions en ligne. In Corpus de communication médiée par les réseaux: Construction, structuration, analyse, Ciara R. Wigham & Gudrun Ledegen (eds), 107–124. Paris: l’Harmattan.
Ho-Dac, Lydia-Mai, Laippala, Veronika, Poudat, Céline & Tanguy, Ludovic. 2017. Exploring Wikipedia talk pages for conflict detection. In Investigating Computer-Mediated Communication: Corpus-Based Approaches to Language in the Digital World, Darja Fišer & Michael Beißwenger (eds), 146–168. Ljubljana: Ljubljana University Press, Faculty of Arts.
Hua, Yiqing, Danescu-Niculescu-Mizil, Cristian, Taraborelli, Dario, Thain, Nithum, Sorensen, Jeffery & Dixon, Lucas. 2018. WikiConv: A corpus of the complete conversational history of a large online collaborative community. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, 2818–2823. Stroudsburg PA: ACL.
Konieczny, Piotr. 2010. Adhocratic governance in the internet age: A case of Wikipedia. Journal of Information Technology & Politics 7(4): 263–283.
Laniado, David, Tasso, Riccardo, Volkovich, Yana & Kaltenbrunner, Andreas. 2011. When the Wikipedians talk: Network and tree structure of Wikipedia discussion pages. In Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 11), Barcelona, 17–21 July.
Langlais, Pierre-Carl. 2014. La négociation contre la démocratie : le cas Wikipedia. Négociations 1: 21–34.
Lehmann, Jens, Isele, Robert, Jakob, Max, Jentzsch, Anja, Kontokostas, Dimitri, Mendes, Pablo N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S. & Bizer, C. 2015. DBpedia – A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6(2): 167–195.
Lih, Andrew. 2004. Wikipedia as Participatory Journalism: Reliable Sources? Metrics for evaluating collaborative media as a news resource.
Linguatools. 2018. Wikipedia Monolingual Corpora. 〈[URL]〉 (1 June 2024).
Lüngen, Harald & Herzberg, Laura. 2019. Types and annotation of reply relations in computer-mediated communication. European Journal of Applied Linguistics 7(2): 305–331.
Margaretha, Eliza & Lüngen, Harald. 2014. Building linguistic corpora from Wikipedia articles and discussions. Journal for Language Technology and Computational Linguistics 29(2): 59–82.
Medelyan, Olena, Milne, David, Legg, Catherine & Witten, Ian H. 2009. Mining meaning from Wikipedia. International Journal of Human-Computer Studies 67(9): 716–754.
Mintzberg, Henry. 1979. The Structuring of Organizations. Englewood Cliffs NJ: Prentice-Hall.
Mitrevski, Blagoj, Piccardi, Tiziano & West, Robert. 2020. WikiHist.html: English Wikipedia’s full revision history in HTML Format. Proceedings of the International AAAI Conference on Web and Social Media 14: 878–884.
Myers, Greg. 2010. The Discourse of Blogs and Wikis. London: Continuum.
Poudat, Céline, Grabar, Natalia, Paloque-Bergès, Camille, Chanier, Thierry & Jin, Kun. 2017. Wikiconflits: Un corpus de discussions éditoriales conflictuelles du Wikipédia francophone. In Corpus de communication médiée par les réseaux: Construction, structuration, analyse, Ciara R. Wigham & Gudrun Ledegen (eds). Paris: l’Harmattan.
Poudat, Céline, Vanni, Laurent & Grabar, Natalia. 2016. How to explore conflicts in French Wikipedia talk pages? In Statistical Analysis of Textual Data, Nice, France, June, 645–656. 〈[URL]〉 (1 June 2024).
Potthast, Martin, Stein, Benno & Gerling, Robert. 2008. Automatic Vandalism Detection in Wikipedia. In Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, Vol. 4956, Craig Macdonald, Iadh Ounis, Vassilis Plachouras, Ian Ruthven & Ryen W. White (eds), 663–668. Berlin, Heidelberg: Springer.
Walton, Aengus. 2009. A Statistical Analysis of Stylistics and Homogeneity in the English Wikipedia. PhD dissertation, Trinity College Dublin.
Wulczyn, Ellery, Thain, Nithum & Dixon, Lucas. 2017. Ex machina: Personal attacks seen at scale. In Proceedings of the 26th International Conference on World Wide Web, 1391–1399. International World Wide Web Conferences Steering Committee.
Zesch, Torsten, Müller, Christof & Gurevych, Iryna. 2008. Extracting lexical semantic knowledge from Wikipedia and Wiktionary. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08), Marrakech, Morocco. Paris: European Language Resources Association (ELRA).
Zhang, Justine, Chang, Jonathan P., Danescu-Niculescu-Mizil, Cristian, Dixon, Lucas, Hua, Yiqing, Thain, Nithum & Taraborelli, Dario. 2018. Conversations gone awry: Detecting early signs of conversational failure. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics: Vol. 1: Long Papers, Iryna Gurevych & Yusuke Miyao (eds), 1350–1361. Stroudsburg PA: ACL.
Cited by (2)

Tanguy, Ludovic, Céline Poudat & Lydia-Mai Ho-Dac
2025. Investigating extreme cases in Wikipedia talk pages: Some insights on user behaviours. In Exploring digitally-mediated communication with corpora, pp. 453 ff.
[no author supplied]
2025. Index. In Exploring digitally-mediated communication with corpora, pp. 475 ff.

This list is based on CrossRef data as of 1 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.