Article published In: Journal of Second Language Pronunciation: Online-First Articles
Fluency assessment
Incorporating syntactic distance in a new measure of pause location
Published online: 13 March 2026
https://doi.org/10.1075/jslp.25042.cou
https://doi.org/10.1075/jslp.25042.cou
Abstract
Speech fluency is often assessed using articulation rate and pause frequency. However, not all pauses hinder
fluency: when placed strategically, they structure discourse and enhance comprehensibility. To better characterize speaker
fluency, it is crucial to consider where pauses occur. Traditional approaches rely on categorical syntactic
boundaries (e.g., clauses or phrases), but inadequately capture syntactic complexity. We propose a continuous measure of pause
placement based on syntactic distance between adjacent words. Using spontaneous English speech from Japanese learners and native
speakers, we show that syntactic distance robustly predicts both pause location and duration across proficiency levels. We compare
its contribution to proficiency classification against baseline and categorical models. The syntactic distance model outperforms
all others, explaining 87% of variance (versus 65% for baseline and 76% for clause/phrase models), with strongest model fit and
lowest prediction error. This measure provides a robust and meaningful predictor of L2 speech fluency.
Keywords: fluency, syntactic complexity, pauses, syntax, L2 speech, assessment
Article outline
- 1.Introduction
- 2.Literature Review
- 2.1Syntactic Structure Predicting L1 and L2 Pauses
- 2.2Pause Position and L2 Fluency Perception
- 2.3Related Work
- 2.4Research Questions
- 3.Method
- 3.1Data
- CLES-JP Corpus
- CLES-EN Corpus
- 3.2Pause Prediction
- 3.3Statistical Analyses
- Pause Occurrence Prediction
- Pause Duration Prediction
- Proficiency Classification
- 3.1Data
- 4.Results
- 4.1Descriptive Statistics
- 4.2Pause Occurrence Prediction
- Pause Occurrence and Clause/Phrase Boundaries
- Pause Occurrence and Syntactic Distance
- 4.3Pause Duration Prediction
- Pause Duration and Clause/Phrase Boundaries
- Pause Duration and Syntactic Distance
- 4.4Overall Proficiency Prediction
- Baseline Model Performance
- Categorical Syntactic Boundary Measures
- Continuous Syntactic Distance
- Collinearity Assessment
- 5.Discussion
- 5.1Theoretical Implications
- 5.2Methodological Contributions
- 5.3Practical Applications and Pedagogical Implications
- 5.4Limitations and Future Directions
- Conclusion
- Code
- Notes
References
References (50)
Baevski, A., Zhou, Y., Mohamed, A., & Auli, M. (2020). Wav2Vec
2.0: A framework for self-supervised learning of speech representations. Advances in Neural
Information Processing
Systems, 331, 12449–12460.
Bain, M., Huh, J., Han, T., Zisserman, A. (2023). WhisperX:
Time-Accurate Speech Transcription of Long-Form Audio. Proc.
Interspeech 20231, 4489–4493.
Bies, A., Ferguson, M., Katz, K., & MacIntyre, R. (1995). Bracketing
Guidelines For Treebank II Style Penn Treebank Project. University of Pennsylvania Department of Computer and Information
Science Technical Report No. MS-CIS-95-06-07. LINC LAB 281 [URL]
Bhatt, R. (2008). Pharse
Structure rules, Tree rewriting and recursion. Amherst: UMASS. Carnie, A. (2002). Syntax: A
Generative Introduction. Wiley-Blackwell.
Boomer, D. S., & Dittmann, A. T. (1962). Hesitation
pauses and juncture pauses in speech. Language and
Speech, 5(4), 215–220.
Bredin, H. (2023). pyannote.audio
2.1 speaker diarization pipeline: principle, benchmark, and
recipe. Interspeech 20231, 1983–1987.
Burnham, K. P., & Anderson, D. R. (2002). Model
selection and multimodel inference (2nd ed.; K. P. Burnham & D. R. Anderson, Eds.).
Candea, M. (2000). Contribution
à l’étude des pauses silencieuses et des phénomènes dits ‘“d’hésitation”’ en français oral
spontané. Etude sur un corpus de récits en classe de
français (Université de la Sorbonne nouvelle — Paris III). [URL]
Cao, Y., & Chen, H. (2019). World
Englishes and Prosody: Evidence from the Successful Public Speakers. 2019 Asia-Pacific Signal
and Information Processing Association Annual Summit and Conference (APSIPA
ASC), 2048–2052.
Coulange, S. (2025). Évaluation
automatique de la parole spontanée en anglais langue étrangère : le rôle des pauses et de l’accent lexical dans la
compréhensibilité du locuteur. Thèse de doctorat en Sciences du langage Spécialité
Informatique, dirigée par Monica Masperi, Solange Rossato, et Tsuneo Kato, Université Grenoble Alpes.
Coulange, S., de Jong, N. H. (2025). Measuring
L2 Speech Fluency Based on Syntactic Distribution of Pauses. Proc. 12th edition of the
Disfluency in Spontaneous Speech Workshop (DiSS
2025), 37–41.
Coulange, S., Konishi, T., Sugahara, M., & Kato, T. (2024a). A
corpus of spontaneous dialogues in L2 English by French and Japanese L1 speakers for automated assessment of
fluency. 6th International Symposium on Learner Corpus Studies in Asia and the World
(LCSAW6). [URL]
Coulange, S., Kato, T., Rossato, S., & Masperi, M. (2024b). Enhancing
language learners’ comprehensibility through automated analysis of pause positions and syllable
prominence. Languages, 9(3), 78.
(2024c). Exploring
impact of pausing and lexical stress patterns on L2 English comprehensibility in real
time. Interspeech 20241, 1030–1034. Presented
at the Interspeech 2024.
de Jong, N. H. (2016). Predicting
pauses in L1 and L2 speech: the effects of utterance boundaries and word
frequency. International Review of Applied Linguistics in Language
Teaching, 54(2), 113–132.
Fauth, C., & Trouvain, J. (2018). Détails
phonétiques dans la réalisation des pauses en Français : étude de parole lue en langue maternelle vs en langue
étrangère. Langages, 211(3), 81–95.
Fox, B. A., Hayashi, M., & Jasperson, R. (1996). Resources
and repair : a cross-linguistic study of syntax and repair. In E. Ochs, E. A. Schegloff & S. A. Thompson (Éd.), Interaction
and Grammar (p. 185–237). Cambridge Univ. Press.
Goldman-Eisler, F. (1968). Psycholinguistics:
Experiments in Spontaneous Speech. Academic Press Inc.
Götz, S. (2013). Fluency
in Native and Nonnative English
Speech (Vol. 531). John Benjamins Publishing Company.
Grosjean, F., & Deschamps, A. (1975). Analyse
contrastive des variables temporelles de l’anglais et du français: vitesse de parole et variables composantes, phénomènes
d’hésitation. Phonetica, 31(3–4), 144–184.
(2009). Analyse
contrastive des variables temporelles de l’anglais et du français: vitesse de parole et variables composantes, phénomènes
d’hésitation. Phonetica, 31(3–4), 144–184.
Grosman, I., Simon, A. C., & Degand, L. (2018). Variation
de la durée des pauses silencieuses : impact de la syntaxe, du style de parole et des
disfluences. Langages, 211(3), 13–40.
Heldner, M., & Edlund, J. (2010). Pauses,
gaps and overlaps in conversations. Journal of
Phonetics, 38(4), 555–568.
Hsieh, C.-N., Zechner, K., & Xi, X. (2019). Features
measuring fluency and pronunciation. In Automated Speaking
Assessment (pp. 101–122).
Isaacs, T., Trofimovich, P., & Foote, J. A. (2018). Developing
a user-oriented second language comprehensibility scale for English-medium
universities. Language
Testing, 35(2), 193–216.
Kahng, J. (2018). The
effect of pause location on perceived fluency. Applied
Psycholinguistics, 39(3), 569–591.
(2014). Exploring
Utterance and Cognitive Fluency of L1 and L2 English Speakers: Temporal Measures and Stimulated Recall: Utterance and
Cognitive Fluency in L2. Language
Learning, 64(4), 809–854.
Kallio, H., Kuronen, M., & Koivusalo, L. (2022). The
role of pause location in perceived fluency and proficiency in L2 Finnish. Proc. ISAPh 2022,
4th International Symposium on Applied Phonetics, 22–27.
Kitaev, N., Cao, S., & Klein, D. (2019). Multilingual
Constituency Parsing with Self-Attention and Pre-Training. Proceedings of the 57th Annual
Meeting of the Association for Computational
Linguistics, 3499–3505.
Kuang, J., Chan, M. P. Y., Rhee, N., Liberman, M., Ding, H. (2022). The
mapping between syntactic and prosodic phrasing in English and Mandarin. Proc.
Interspeech 20221, 3443–3447.
Maynard, S. K. (1989). Japanese
conversation: Self-contextualization through structure and interactional
management. Praeger.
Montani, I., Honnibal, M., Honnibal, M., Boyd, A., Van Landeghem, S., & Peters, H. (2023). spaCy:
Industrial-strength Natural Language Processing in Python.
Nagle, C., Trofimovich, P., & Bergeron, A. (2019). Toward
a dynamic view of second language comprehensibility. Studies in Second Language
Acquisition, 41(04), 647–672.
Révész, A., Jeong, H., Suzuki, S., Cui, H., Matsuura, S., Saito, K., & Sugiura, M. (2024). Task-generated
processes in second language speech production: Exploring the neural correlates of task complexity during silent
pauses. Studies in Second Language
Acquisition, 46(4), 1179–1205.
Riazantseva, A. (2001). Second
Language Proficiency and Pausing: A Study of Russian Speakers of English. Studies in Second
Language
Acquisition, 23(4), 497–526.
Riggenbach, H. (1991). Toward
an understanding of fluency: A microanalysis of nonnative speaker conversations. Discourse
Processes, 14(4), 423–441.
Ruder, K. F., & Jensen, P. J. (1972). Fluent
and hesitation pauses as a function of syntactic complexity. Journal of speech and hearing
research, 15(1), 49–60.
Segalowitz, N. (01 2010). Cognitive
bases of second language fluency. New York and London: Routledge.
Schweitzer, A., & Haase, M. (2000). Zwei
Ansätze zur syntaxgesteuerten Prosodiegenerierung. KONVENS 2000 / Sprachkommunikation, Vorträge
Der Gemeinsamen Veranstaltung 5. Konferenz Zur Verarbeitung Natürlicher Sprache (KONVENS), 6. ITG-Fachtagung
“Sprachkommunikation,” 197–202.
Shigemitsu, Y. (2007). A
pause in conversation for Japanese native speakers : a case study of successful and unsuccessful conversation in terms of
pause though intercultural communication. Academic
Report, Tokyo Polytechnic University, 30(2), 11–18. [URL]
Shea, C., & Leonard, K. (2019). Evaluating
measures of pausing for second language fluency research. Canadian Modern Language
Review, 75(3), 216–235.
Skehan, P., Foster, P., & Shum, S. (2016). Ladders
and Snakes in Second Language Fluency. International Review of Applied Linguistics in Language
Teaching, 54(2).
Suzuki, S., & Kormos, J. (2020). linguistic
dimensions of comprehensibility and perceived fluency: an investigation of complexity, accuracy, and fluency in second
language argumentative speech. Studies in Second Language
Acquisition, 42(1), 143–167.
Tauberer, J. (2008). Predicting
intrasentential pauses: is syntactic structure useful? Proc. Speech
Prosody 20081, 405–408.
Tavakoli, P., Nakatsuhara, F., & Hunter, A. (2020). Aspects
of Fluency Across Assessed Levels of Speaking Proficiency. The Modern Language
Journal, 104(1), 169–191.
Tavakoli, P., & Skehan, P. (2005). Strategic
planning, task structure and performance testing. In Planning and
Task Performance in a Second
Language (pp. 239–273).