In:Frequency, Dispersion, Association, and Keyness: Revising and tupleizing corpus-linguistic measures
Stefan Th. Gries
[Studies in Corpus Linguistics 115] 2024
► pp. 269–303
Chapter 6What should be next
Published online: 4 July 2024
https://doi.org/10.1075/scl.115.c6
https://doi.org/10.1075/scl.115.c6
Article outline
- 6.1Quantifying uncertainty
- 6.1.1An example of bootstrapping
- 6.1.2Excursus: On significance
- 6.2Scaling things up
- 6.2.1Speed: Scaling up with parallelization
- 6.2.2Speed: Scaling up with Rcpp
- 6.2.2.1Dispersion
- 6.2.2.2Association
- 6.2.2.3Keyness
- 6.2.3Size: Scaling up with base
- 6.2.4Size and speed: Scaling up with data.table
- 6.3The dimensions to tupleize
- 6.3.1Dimensions of information: Type frequencies & distributions
- 6.3.2What are our tokens?
Notes
