Article published In: Interaction Studies
Vol. 25:3 (2024) ► pp.393–434
A cross-cultural approach to cognitive state attribution based on inter-turn speech pauses
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Open Access publication of this article was funded through a Transformative Agreement with University of Vienna.
Published online: 27 June 2025
https://doi.org/10.1075/is.24044.mat
https://doi.org/10.1075/is.24044.mat
Abstract
This study explores how inter-turn speech pauses influence the perception of cognitive states such as knowledge,
confidence, and willingness to grant requests in conversational settings. Longer pauses are typically associated with lower
competence and willingness, but Matzinger, T., Pleyer, M., & Żywiczyński, P. (2023). Pause
Length and Differences in Cognitive State Attribution in Native and Non-Native
Speakers. Languages, 8(1), 26. discovered that this
attribution varies when non-native speakers are involved. They found that listeners were more tolerant of long pauses from
non-native than from native speakers when assessing their willingness to grant requests. This may result from the fact that
listeners may attribute long pauses to the additional cognitive load non-native speakers face during cognitive processing and
response formulation in a second language. This tolerance towards long pauses by non-native speakers did not extend to judgments
about non-native speakers’ knowledge and confidence — potentially because knowledge questions are less socially engaging than
requests. Here, we replicated and extended Matzinger, T., Pleyer, M., & Żywiczyński, P. (2023). Pause
Length and Differences in Cognitive State Attribution in Native and Non-Native
Speakers. Languages, 8(1), 26. experiment, which
focussed on speakers of Polish, to a cross-cultural context with speakers of Chinese. Our results confirmed that non-native accent
mediates perceptions of willingness, but not knowledge or confidence. These findings suggest that inter-turn speech pauses play a
nuanced role in cognitive state attribution of native and non-native speakers and that cultural factors minimally influence these
perceptions. This may indicate that the mechanisms involved are rooted in evolutionarily fundamental aspects of human social
communication and cognition.
Article outline
- 1.Introduction
- 2.Methods
- 2.1Experimental conditions and procedure
- 2.2Participants and setting
- 2.3Stimulus material
- 2.4Analyses
- 3.Results
- 3.1Rating data
- 3.1.1Perceived knowledge
- 3.1.2Perceived confidence
- 3.1.3Perceived willingness to grant requests
- 3.2Questionnaire data
- 3.1Rating data
- 4.Discussion and conclusion
- Acknowledgements
- Notes
References
References (110)
Abreu, F. & Pika, S. (2022). Turn-taking
Skills in Mammals: A Systematic Review Into Development and Acquisition. Frontiers in Ecology
and Evolution, 101.
Alim, H. S., Rickford, J. R. & Ball, A. F. (Eds.) (2016). Raciolinguistics:
How Language Shapes Our Ideas about Race. Oxford University Press.
Atkinson, J. M. & Drew, P. (1979). Order
in Court: The Organization of Verbal Interaction in Juridical
Settings. Macmillan.
Badihi, G., Graham, K. E., Grund, C., Safryghin, A., Soldati, A., Donnellan, E., Hashimoto, C., Mine, J. G., Piel, A. K., Stewart, F., Slocombe, K. E., Wilke, C., Townsend, S. W., Zuberbühler, K., Zulberti, C., & Hobaiter, C. (2024). Chimpanzee
gestural exchanges share temporal structure with human language. Current
Biology, 34(14), R673–R674.
Barr, D. J., Levy, R., Scheepers, C. & Tily, H. J. (2013). Random
Effects Structure for Confirmatory Hypothesis Testing: Keep It Maximal. Journal of Memory and
Language, 681, 255–78.
Bartón, K. (2018). MuMIn:
Multi-Model Inference. R Package Version 1.42.1. Available online: [URL] (accessed
on 2 January 2023).
Bates, D., Mächler, M., Bolker, B. M. & Walker, S. C. (2015). Fitting
Linear Mixed-Effects Models Using Lme4. Journal of Statistical
Software, 671, 1–48.
Boduch-Grabka, K. & Lev-Ari, S. (2021). Exposing
Individuals to Foreign Accent Increases their Trust in What Nonnative Speakers Say. Cognitive
Science, 451, e13064.
Bosker, H. R., Quené, H., Sanders, T. & De Jong, N. H. (2014). The
perception of fluency in native and nonnative speech. Language
Learning, 641, 579–614.
Bradlow, A. R., Kim, M., & Blasingame, M. (2017). Language-independent
talker-specificity in first-language and second-language speech production by bilingual talkers: L1 speaking rate predicts L2
speaking rate. Journal of the Acoustic Society of
America, 141(2), 886.
Brennan, S. E., & Williams, M. (1995). The
Feeling of Another′s Knowing: Prosody and Filled Pauses as Cues to Listeners about the Metacognitive States of
Speakers. Journal of Memory and
Language, 341, 383–98.
Brown, P., & Levinson, S. C. (1987). Politeness:
Some universals in language usage. Cambridge University Press.
Boersma, Paul & Weenink, David (2022). Praat: doing phonetics by computer [Computer program]. Version 6.3, retrieved from [URL]
Carstensen, A., Cao, A., Tan, A. W., Liu, D., Liu, Y., Bui, M. K., et al. (2024). Cognitive
diversity in context: US-China differences in children’s reasoning, visual attention, and social
cognition. Proceedings of the Annual Meeting of the Cognitive Science
Society, 461.
CBOS (Centrum Badania Opinii
Publicznej). (2020). Tożsamość narodowa i postrzeganie praw mniejszości narodowych
i etnicznych. Warszawa: Fundacja Centrum Badania Opinii Społecznej.
Cenoz, J. (2000). Pauses
and hesitation phenomena in second language production. ITL: Review of Applied
Linguistics, 1271, 53–69.
Chang, C. B. (2012). Rapid
and multifaceted effects of second-language learning on first-language speech
production. Journal of
Phonetics, 40(2), 249–268.
Chafe, W. L. (1980). Some
reasons for hesitating. In H. W. Dechert & M. Raupach (Eds.) Temporal
Variables in Speech: Studies in Honour of Frieda
Goldman-Eisler (pp. 169–180). De Gruyter Mouton.
Corriveau, K. H., DiYanni, C. J., Clegg, J. M., Min, G., Chin, J., & Nasrini, J. (2017). Cultural
differences in the imitation and transmission of inefficient actions. Journal of Experimental
Child Psychology, 1611, 1–18.
Crown, C. L., & Feldstein, S. (1985). Psychological
correlates of silence and sound in conversational
interaction. In D. Tannen & M. Saville-Troike (Eds.), Perspectives.
on
silence (pp. 31–54). Ablex Publishing.
Coupland, N. & Bishopm, H. (2007). Ideologised
values for British accents 1. Journal of
Sociolinguistics, 111, 74–93.
Davis, N. (2003). God’s
Playground A History of Poland: Volume II: 1795 to the Present. Oxford University Press.
De Jong, N. H. (2016). Predicting
pauses in L1 and L2 speech: The effects of utterance boundaries and word
frequency. International Review of Applied Linguistics in Language
Teaching, 541, 113–32.
Dingemanse, M. & Liesenfeld, A. (2022). From
text to talk: Harnessing conversational corpora for humane and diversity-aware language
technology. In Proceedings of the 60th Annual Meeting of the
Association for Computational Linguistics 1 (Long
Papers) (pp. 5614–33). Association for Computational Linguistics.
Feng, R. (2022). Cognitive
Factors Influencing Utterance Fluency in L2 Dialogues: Monadic and Non-monadic
Perspectives. Frontiers in
Psychology, 131, 926367.
Finger, H., Goeke, C., Diekamp, D., Standvoß, K. & König, P. (2017). LabVanced:
A unified JavaScript framework for online studies. In International Conference on Computational Social
Science (Cologne) (pp. 1–3). University of Osnabrück.
Fox Tree, J. E. (2002). Interpreting
pauses and ums at turn exchanges. Discourse
Processes, 341, 37–55.
Gleitman, L. R., January, D., Nappa, R. & Trueswell, J. C. (2007). On
the give and take between event apprehension and utterance formulation. Journal of Memory and
Language, 571, 544–69.
Gluszek, A., & Dovidio, J. F. (2010). Speaking
with a nonnative accent: Perceptions of bias, communication difficulties, and belonging in the United
States. Journal of Language and Social
Psychology, 291, 224–34.
Goffman, E. (1963). Embarrassment
and Social Organization. In N. J. Smelser & W. T. Smelser (Eds.), Personality
and social
systems (pp. 541–548). John Wiley & Sons, Inc..
Goupil, L., Ponsot, E., Richardson, D., Reyes, G., & Aucouturier, J.-J. (2021). Listeners’
perceptions of the certainty and honesty of a speaker are associated with a common prosodic
signature. Nature
Communications, 121, 1–17.
Griffin, Z. M. & Bock, K. (2000). What
the eyes say about speaking. Psychological
Science, 111, 274–79.
Guyer, J. J., Fabrigar, L. R. & Vaughan-Johnston, T. I. (2019). Speech
rate, intonation, and pitch: Investigating the bias and cue effects of vocal confidence on
persuasion. Personality and Social Psychology
Bulletin, 451,389–405.
Hanzlíková, D., & Skarnitzl, R. (2017). Credibility
of native and non-native speakers of English revisited: Do non-native listeners feel the
same? Research in
Language, 151, 285–98.
Harumi, S., & King, J. (2020). East
Asian Perspectives on Silence in English Language Education: An
Introduction. In J. King & S. Harumi (Eds.) East
Asian Perspectives on Silence in English Language
Education (pp. 1–16). Multilingual Matters.
Heldner, M. & Edlund, J. (2010). Pauses,
gaps and overlaps in conversations. Journal of
Phonetics, 381, 555–68.
Huang, L. F., & Gráf, T. (2020). Speech
Rate and Pausing in English: Comparing Learners at Different Levels of Proficiency with Native
Speakers. Taiwan Journal of
TESOL, 17(1), 57–86.
Ji, L. J., Peng, K., & Nisbett, R. E. (2000). Culture,
control, and perception of relationships in the environment. Journal of Personality and Social
Psychology, 78(5), 943–955.
Ji, L. J., Nisbett, R. E., & Su, Y. (2001). Culture,
change, and prediction. Psychological
Science, 12(6), 450–456.
Johnson, P. C. D. (2014). Extension
of Nakagawa & Schielzeth’s R2GLMM to random slopes models. Methods in Ecology and
Evolution 51, 44–946.
Kendrick, K. H., & Torreira, F. (2015). The
timing and construction of preference: A quantitative study. Discourse
Processe, 521, 255–89.
Kitayama, S., & Salvador, C. E. (2024). Cultural
psychology: Beyond east and west. Annual Review of
Psychology, 75(1), 495–526.
Koo, M., Choi, J. A., & Choi, I. (2018). Analytic
versus holistic cognition: Constructs and measurement. In J. Spencer-Rodgers & K. Peng (Eds.), The
psychological and cultural foundations of East Asian cognition: Contradiction, change, and
holism (pp. 105–134). Oxford University Press.
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest
Package: Tests in Linear Mixed Effects Models. Journal of Statistical
Software, 821, 1–26.
Lehtonen, J. & Sajavaara, K. (1985). The
Silent Finn. In D. Tannen & M. Saville-Troike (Eds.) Perspectives
on
Silence (pp 193–201). Ablex Publishing.
Lev-Ari, S. & Keysar, B. (2010). Why
don’t we believe non-native speakers? The influence of accent on credibility. Journal of
Experimental Social
Psychology, 461, 1093–96.
Levelt, W. M. (1999). Producing
spoken language: A blueprint of the speaker. In C. M. Brown & P. Hagoort (Eds.) The
Neurocognition of
Language (pp. 83–122). Oxford University Press.
Levinson, S. C. (2016). Turn-taking
in Human Communication–Origins and Implications for Language Processing. Trends in Cognitive
Sciences 201: 6–14.
Levinson, S. C., & Torreira, F. (2015). Timing
in turn-taking and its implications for processing models of language. Frontiers in
Psychology, 61, 731.
Levy, B. J., McVeigh, N. D., Marful, A., & Anderson, M. C. (2007). Inhibiting
Your Native Language. Psychological
Science, 18(1), 29–34.
Linck, J. A., Kroll, J. F., & Sunderman, G. (2009). Losing
Access to the Native Language While Immersed in a Second Language: Evidence for the Role of Inhibition in Second-Language
Learning. Psychological
Science, 20(12), 1507–1515.
Lippi-Green, R. (2012). English
with an Accent: Language, Ideology, and Discrimination in the United
States. Routledge.
Liu, P., Rigoulot, S., & Pell, M. D. (2017). Cultural
immersion alters emotion perception: Neurophysiological evidence from Chinese immigrants to
Canada. Social
Neuroscience, 12(6), 685–700.
Liu, P., Rigoulot, S., & Pell, M. D. (2016). Cultural immersion alters emotion perception: Neurophysiological evidence from Chinese immigrants to Canada. Social Neuroscience, 12(6), 685–700.
Lundholm-Fors, K. (2015). Production
and Perception of Pauses in Speech. [Doctoral
dissertation, University of Gothenburg], Gothenburg University Publications Electronic Archive. [URL]
Matzinger, T., Ritt, N. & Fitch, W. T. (2020). Non-native
speaker pause patterns closely correspond to those of native speakers at different speech
rates. PLoS
ONE, 151: e0230710.
Matzinger, T., Pleyer, M., & Żywiczyński, P. (2023). Pause
Length and Differences in Cognitive State Attribution in Native and Non-Native
Speakers. Languages, 8(1), 26.
Morris, M. W., & Peng, K. (1994). Culture
and cause: American and Chinese attributions for social and physical events. Journal of
Personality and Social
Psychology, 67(6), 949–971.
Nakagawa, S. & Schielzeth, H. (2013). A
general and simple method for obtaining R2 from generalized linear mixed-effects
models. Methods in Ecology and
Evolution, 41, 133–42.
Nakagawa, S., Johnson, P. C. D. & Schielzeth, H. (2017). The
coefficient of determination R2 and intra-class correlation coefficient from generalized linear mixed-effects models revisited
and expanded. Journal of the Royal Society
Interface, 141, 20170213.
Nakane, I. (2007). Silence
in Intercultural Communication. John Benjamins.
(2012). Silence. In C. B. Paulston, S. F. Kiesling, & E. S. Rangel (Eds.), The
Handbook of Intercultural Discourse and
Communication (pp. 158–179). Blackwell.
Nisbett, R. (2003). The
Geography of Thought: How Asians and Westerners Think Differently... and why. Simon and Schuster.
Peltonen, P. (2017). Temporal
fluency and problem-solving in interaction: An exploratory study of fluency resources in L2
dialogue. System, 701, 1–13.
Pika, S., Wilkinson, R., Kendrick, K. H. & Vernes, S. C. (2018). Taking
turns: bridging the gap between human and animal communication. Proceedings of the Royal
Society B, 2851, 28520180598.
Powell, M. J. (2009). The
BOBYQA algorithm for bound constrained optimization without derivatives. Cambridge NA Report
NA2009/06, University of Cambridge,
Cambridge, 261, 26–46.
R Development Core Team. (2018). R: A
Language and Environment for Statistical
Computing. Vienna: R Foundation for Statistical Computing.
Ramjattan, V. A. (2022). Accenting
racism in labour migration. Annual Review of Applied
Linguistics, 421, 87–92.
Raupach, M. (1980). Temporal
variables in first and second language speech and perception of
fluency. In H. W. Dechert & M. Raupach (Eds.) Temporal
Variables in Speech: Studies in Honour of Frieda
Goldman-Eisler (pp. 263–70). De Gruyter Mouton.
Reisman, J. M., & Yamokoski, T. (1974). Psychotherapy
and Friendship: An Analysis of the Communications of Friends. Journal of Counseling
Psychology, 21 (4), 269–73.
Roberts, F. & Francis, A. L. (2013). Identifying
a temporal threshold of tolerance for silent gaps after requests. The Journal of the Acoustical
Society of
America, 1331, EL471–EL477.
Roberts, F., Margutti, P. & Takano, S. (2011). Judgments
concerning the valence of inter-turn silence across speakers of American English, Italian, and
Japanese. Discourse
Processes, 481, 331–54.
Roberts, S. G., Torreira, F., and Levinson, S. C. (2015). The
effects of processing and sequence organization on the timing of turn taking: a corpus
study. Frontiers in
Psychology, 61, 509.
Roessel, J., Schoel, C. & Stahlberg, D. (2020). Modern
notions of accent-ism: Findings, conceptualizations, and implications for interventions and research on nonnative
accents. Journal of Language and Social
Psychology, 391, 87–111.
R Development Core Team. (2023). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing.
Sbranna, S., Cangemi, F. & Grice, M. (2020). Quantifying
L2 interactional competence. In L. Romito (Ed.) La variazione linguistica in condizioni di contatto: contesti acquisizionali, lingue, dialetti e minoranze in
italia e nel mondo [Language change under contact conditions: acquisitional
contexts, languages, dialects and minorities in Italy and around the
world] (pp. 383–405). Milan: Officinaventuno. . [URL]
Sbranna, S. (2023). Developing
linguistic competence in second language learning: Fluency and prosody in Italian learners of
German. Unpublished PhD Thesis, University of Cologne. [URL]
Schwartz, S. (2008). Cultural
value orientations: Nature & implications of national differences. Psychology. Journal of
Higher School of
Economics, 5(2), 37–67.
Schielzeth, H. & Forstmeier, W. (2009). Conclusions
beyond support: Overconfident estimates in Mixed Models. Behavioral
Ecology, 201, 416–20.
Shigemitsu, Y. (2005). Different
interpretations of pauses in natural conversation--Japanese, Chinese and Americans. 東京工芸大学工学部紀要.
人文・社会編, 28(2), 8–14.
Stivers, T. (2010). An
overview of the question–response system in American English conversation. Journal of
Pragmatics, 421, 2772–81.
Stivers, T., Enfield, N. J., Brown, P., Englert, E., Hayashi, M., Heinemann, T., Hoymann, G., Rossano, F., de Ruiter, J. P., Yoon, K.-E., et al. (2009). Universals
and cultural variation in turn-taking in conversation. Proceedings of the National Academy of
Sciences, 1061, 10587–92.
Suzuki, S., Kormos, J. & Uchihara, T. (2021). The
relationship between utterance and perceived fluency: A meta-analysis of correlational
studies. The Modern Language
Journal, 1051, 435–63.
Tannen, D. (1985). Silence:
Anything but. In D. Tannen and M. Saville-Troike (Eds.) Perspectives
on
Silence (pp. 93–11). Ablex.
Tavakoli, P. (2011). Pausing
patterns: Differences between L2 learners and native speakers. ELT
Journal, 651, 71–79.
Timming, A. R. (2017). The
effect of foreign accent on employability: A study of the aural dimensions of aesthetic labour in customer-facing and
non-customer-facing jobs. Work, Employment and
Society, 311, 409–28.
Trouvain, J., Fauth, C. & Möbius, B. (2016). Breath
and Non-breath Pauses in Fluent and Disfluent Phases of German and French L1 and L2 Read
Speech. Speech Prosody
(SP8), 311, 31–35.
Tsalikis, J., DeShields, O. W. Jr., & LaTour, M. S. (1991). The
role of accent on the credibility and effectiveness of the salesperson. Journal of Personal
Selling and Sales
Management, 111, 31–41.
Tychmanowicz, A., Filipiak, S. & Sprynska, Z. Extravert individualists or introvert collectivists? Personality traits and individualism and collectivism in students in Poland and Ukraine. Curr Psychol 401, 5947–5957 (2021).
van Hell, J. G., & Dijkstra, T. (2002). Foreign
language knowledge can influence native language performance in exclusively native
contexts. Psychonomic Bulletin &
Review, 9(4), 780–789.
Van Maastricht, L., Zee, T., Krahmer, E., & Swerts, M. (2021). The
interplay of prosodic cues in the L2: How intonation, rhythm, and speech rate in speech by Spanish learners of Dutch
contribute to L1 Dutch perceptions of accentedness and comprehensibility. Speech
Communication, 1331, 81–90.
Van Os, M., De Jong, N. H., & Bosker, H. R. (2020). Fluency
in dialogue: Turn-taking behavior shapes perceived fluency in native and nonnative
speech. Language
Learning, 701, 1183–217.
Verga, L., Kotz, S. A., & Ravignani, A. (2023). The
Evolution of Social Timing. Physics of Life
Reviews, 461, 131–51.
Verouden, N. W. & van der Sanden, M. C. A. (2018). Silence
in intercultural collaboration: A Sino-Dutch research centre. Advances in Applied
Sociology, 81, 125–151.
Walker, Michael B., & Trimboli, C. 1982. Smooth
transitions in conversational interactions. The Journal of Social
Psychology 1171: 305–6.
Wardęga, J. (2017). Chińczycy
w Polsce–niewidoczna mniejszość. Studia Migracyjne-Przegla ̨d
Polonijny, 21, 121–41.
Weilhammer, K. & Rabold, S. (2003). Durational
aspects in turn taking. Proceedings of the international conference of phonetic
sciences, 2145–2148.
Wootton, A. J. (1981). The
Management of Grantings and Rejections by Parents in Request
Sequences. Semiotica, 371, 59–90.
