Investigating Wikipedia

Linguistic corpus building, exploration and analysis

Hardbound: Available
ISBN 9789027215963 | EUR 120.00 | USD 156.00
 
e-Book
ISBN 9789027246462 | EUR 120.00 | USD 156.00
 
The present volume is intended as a reference book on Wikipedia corpus studies, from corpus construction to exploration and analysis. Wikipedia is a complex object that is difficult for linguists and corpus researchers to manipulate. In addition to the encyclopedic articles consulted by millions of users, it contains vast spaces of written discussions, also known as talk pages, where Wikipedia authors negotiate the collaborative editing of articles, make evaluations, or discuss related topics. The volume covers Wikipedia articles, their revision histories, and discussions, with a focus on discussions, which have not been studied extensively so far and have also been neglected in previous corpus building efforts. Wikipedia discussions are instances of computer-mediated communication (CMC) and thus constitute a completely different, interaction-oriented linguistic genre. Sophisticated tools and methods of linguistic annotation and corpus exploration are needed to exploit the huge and valuable corpus resources that can be constructed from the Wikipedia discussions. The present volume aims to encourage and facilitate Wikipedia corpus studies by providing standards, recommendations, and innovative methods to build and explore Wikipedia corpora, and by presenting corpus studies that make the most of the peculiarities of Wikipedia.
[Studies in Corpus Linguistics, 121] 2024. vi, 264 pp.
Publishing status: Available
Published online on 25 October 2024
Cited by two other publications

Tanguy, Ludovic, Céline Poudat & Lydia-Mai Ho-Dac
2025. Investigating extreme cases in Wikipedia talk pages: Some insights on user behaviours. In Exploring digitally-mediated communication with corpora, pp. 453 ff.
[no author supplied]
2025. Index. In Exploring digitally-mediated communication with corpora, pp. 475 ff.

This list is based on CrossRef data as of 3 December 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Subjects and metadata

LoC, MARC XML

U.S. Library of Congress Control Number: 2024033457