In: Challenges in Corpus Linguistics: Rethinking corpus compilation and analysis
Edited by Mark Kaunisto and Marco Schilk
[Studies in Corpus Linguistics 118] 2024
pp. v–vi
Published online: 19 September 2024
https://doi.org/10.1075/scl.118.toc
Table of contents
Acknowledgements VII
From fallacies and pitfalls to solutions and future directions: Navigating the evolving terrain of corpus linguistics 1
Mark Kaunisto
Engaging with bad (meta)data in historical corpus linguistics 9
Turo Vartiainen
Tanja Säily
Named entities as potentially problematic items in corpora 35
Mark Kaunisto
Challenges in the compilation, annotation and analysis of learner corpus data 55
Marcus Callies
Early newspapers as data for corpus linguistics (and Digital Humanities): Issues in using the British Library Newspapers database as a corpus 68
Turo Hiltunen
Open Corpus Linguistics – Or how to overcome common problems in dealing with corpus data by adopting open research practices 89
Stefan Hartmann
Text length and short texts: An overview of the problem 106
Aatu Liimatta
Corpus genre categories: Issues at the intersection of linguistics and literature 126
Daniel Ocic Ihrmark
Modeling fine-grained sociolinguistic variation: The promises and pitfalls of Twitter corpora and neural word embeddings 142
Filip Miletić
Anne Przewozny-Desriaux
Ludovic Tanguy
Subject index
