In: Challenges in Corpus Linguistics: Rethinking corpus compilation and analysis
Edited by Mark Kaunisto and Marco Schilk
[Studies in Corpus Linguistics 118] 2024
pp. v–vi

Published online: 19 September 2024

https://doi.org/10.1075/scl.118.toc

Table of contents

Acknowledgements, p. VII

From fallacies and pitfalls to solutions and future directions: Navigating the evolving terrain of corpus linguistics, p. 1

Mark Kaunisto

Engaging with bad (meta)data in historical corpus linguistics, p. 9

Turo Vartiainen

Tanja Säily

Named entities as potentially problematic items in corpora, p. 35

Mark Kaunisto

Challenges in the compilation, annotation and analysis of learner corpus data, p. 55

Marcus Callies

Early newspapers as data for corpus linguistics (and Digital Humanities): Issues in using the British Library Newspapers database as a corpus, p. 68

Turo Hiltunen

Open Corpus Linguistics – Or how to overcome common problems in dealing with corpus data by adopting open research practices, p. 89

Stefan Hartmann

Text length and short texts: An overview of the problem, p. 106

Aatu Liimatta

Corpus genre categories: Issues at the intersection of linguistics and literature, p. 126

Daniel Ocic Ihrmark

Modeling fine-grained sociolinguistic variation: The promises and pitfalls of Twitter corpora and neural word embeddings, p. 142

Filip Miletić

Anne Przewozny-Desriaux

Ludovic Tanguy

Subject index