Article published In: The Mental Lexicon
Vol. 17:2 (2022) ► pp.178–212
A note on the modeling of the effects of experimental time in psycholinguistic experiments
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Published online: 4 April 2022
https://doi.org/10.1075/ml.21012.baa
https://doi.org/10.1075/ml.21012.baa
Abstract
Thul, R., Conklin, K., and Barr, D. J. (2020). Using
gamms to model trial-by-trial fluctuations in experimental data: More risks but hardly any
benefit. Open Science
Framework. downloaded October
2020. called attention to problems that arise when chronometric
experiments implementing specific factorial designs are analysed with the generalized additive mixed model (GAMM), using factor smooths to
capture trial-to-trial dependencies. From a series of simulations incorporating such dependencies, they conclude that GAMMs are
inappropriate for between-subject designs. They argue that in addition GAMMs come with too many modeling possibilities, and advise using the
linear mixed model (LMM) instead. As clarified by the title of their paper, their conclusion is: “Using GAMMs to model trial-by-trial
fluctuations in experimental data: More risks but hardly any benefit”.
We address the questions raised by Thul, R., Conklin, K., and Barr, D. J. (2020). Using
gamms to model trial-by-trial fluctuations in experimental data: More risks but hardly any
benefit. Open Science
Framework. downloaded October
2020. , who clearly demonstrated
that problems can indeed arise when using factor smooths in combination with factorial designs. We show that the problem does not arise when
using by-smooths. Furthermore, we have traced a bug in the implementation of factor smooths in the mgcv package, which will
have been removed from version 1.8–36 onwards.
To illustrate that GAMMs now produce correct estimates, we report simulation studies implementing different by-subject
longitudinal effects. The maximal LMM emerges as slightly conservative compared to GAMMs, and GAMMs provide estimated coefficients that can
be less variable across simulation runs. We also discuss two datasets where time-varying effects interact with numerical predictors in a
theoretically informative way.
Article outline
- 1.Introduction
- 2.Sine waves
- 2.1Sine waves with varying amplitudes
- 2.2Sine waves with varying phase
- 3.Power and Type-I error rate
- 3.1Sine waves with varying absolute amplitude
- 3.2Sine waves with varying phase
- 3.3Simulations with the parameters of Thul et al. (2021)
- 3.4From sine waves to random time-varying effects
- 4.Interactions with time in experimental data
- 5.Discussion
- 6.Open practices statement
- Acknowledgements
- Notes
- Mathematical notation
References
References (24)
Baayen, R. H. (2010). The
directed compound graph of English. an exploration of lexical connectivity and its processing
consequences. In Olsen, S., editor, New
impulses in word-formation (Linguistische Berichte Sonderheft
17), pages 383–402. Buske, Hamburg.
Baayen, R. H., van Rij, J., de Cat, C., and Wood, S. N. (2017a). Autocorrelated
errors in experimental data in the language sciences: Some solutions offered by generalized additive mixed
models. In Speelman, D., Heylen, K., and Geeraerts, D., editors, Mixed
Effects Regression Models in
Linguistics, pages 49–69. Springer, Berlin.
Baayen, R. H., Vasishth, S., Bates, D., and Kliegl, R. (2017b). The
cave of shadows. Addressing the human factor with generalized additive mixed models. Journal of
Memory and Language, 941:206–234.
Barr, D. J., Levy, R., Scheepers, C., and Tily, H. J. (2013). Random
effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and
Language, 68(3):255–278.
Bates, D. M., Kliegl, R., Vasishth, S., and Baayen, R. H. (2015). Parsimonious
mixed models. arXiv.org.
Chuang, Y.-Y. (2017). The
effect of phonetic variation on word recognition in Taiwan Mandarin. PhD
thesis, National Taiwan University, Taipei.
Chuang, Y.-Y., Fon, J., Papakyritsis, I., and Baayen, R. H. (2021). Analyzing
phonetic data with generalized additive mixed models. In Ball, M. J., editor, Manual
of Clinical Phonetics. Routledge.
Francis, G. (2012). Publication
bias and the failure of replication in experimental psychology. Psychonomic Bulletin &
Review, 19(6):975–991.
Harm, M. W. and Seidenberg, M. S. (2004). Computing
the meanings of words in reading: Cooperative division of labor between visual and phonological
processes. Psychological
Review, 1111:662–720.
Keuleers, E., Lacey, P., Rastle, K., and Brysbaert, M. (2012). The
British Lexicon Project: Lexical decision data for 28,730 monosyllabic and disyllabic English
words. Behavior Research
Methods, 441:287–304.
Matuschek, H., Kliegl, R., Vasishth, S., Baayen, R. H., and Bates, D. M. (2017). Balancing
Type I Error and Power in Linear Mixed Models. Journal of Memory and
Language.
Mock, P., Tibus, M., Ehlis, A.-C., Baayen, H., and Gerjets, P. (2018). Predicting
adhd risk from touch interaction data. In Proceedings of the 20th ACM
International Conference on Multimodal
Interaction, pages 446–454.
Open Science
Collaboration (2015). Estimating the reproducibility of psychological
science. Science, 349(6251):aac4716.
Pinheiro, J. C. and Bates, D. M. (2000). Mixed-effects
models in S and S-PLUS. Statistics and
Computing. Springer, New York.
Segalowitz, N. S. and Segalowitz, S. J. (1993). Skilled
performance, practice, and the differentiation of speed-up from automatization effects: Evidence from second language word
recognition. Applied
Psycholinguistics, 14(03):369–385.
Thul, R., Conklin, K., and Barr, D. J. (2020). Using
gamms to model trial-by-trial fluctuations in experimental data: More risks but hardly any
benefit. Open Science
Framework. downloaded October
2020.
(2021). Using
gamms to model trial-by-trial fluctuations in experimental data: More risks but hardly any
benefit. Journal of Memory and
Language, 1201:104247.
van Rij, J., Hendriks, P., van Rijn, H., Baayen, R. H., and Wood, S. N. (2019). Analyzing
the time course of pupillometric data. Trends in
hearing, 231:2331216519832483.
Westfall, J., Kenny, D. A., and Judd, C. M. (2014). Statistical
power and optimal design in experiments in which samples of participants respond to samples of
stimuli. Journal of Experimental Psychology:
General, 143(5):2020.
Wieling, M. (2018). Analyzing
dynamic phonetic data using generalized additive mixed modeling: a tutorial focusing on articulatory differences between l1
and l2 speakers of english. Journal of
Phonetics, 701:86–116.
Wood, S. N. (2013). On
p-values for smooth components of an extended generalized additive
model. Biometrika, 1001:221–228.
Cited by (7)
Cited by seven other publications
Chen, Yingzhao & Danni Zhang
Cho, Sun-Joo, Sarah Brown-Schmidt, Sharice Clough & Melissa C. Duff
Widmann, Andreas, Björn Herrmann & Florian Scharf
Winn, Matthew B.
Heitmeier, Maria, Yu-Ying Chuang & R. Harald Baayen
Ke, Sihui (Echo), Rui Jin & Keiko Koda
2023. Adult L2 learners’ morphological sensitivity in a morphosyllabic language. The Mental Lexicon 18:3 ► pp. 446 ff.
This list is based on CrossRef data as of 27 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
