Article published In: Journal of Second Language Pronunciation
Vol. 5:1 (2019) ► pp.98–118
ASR-based dictation practice for second language pronunciation improvement
Published online: 13 March 2019
https://doi.org/10.1075/jslp.16034.mcc
https://doi.org/10.1075/jslp.16034.mcc
Abstract
In pronunciation learning, there is a need for resources and tools that help students monitor their speech or
provide feedback on errors. While researchers have seen ASR-based technologies as potential tools, little attention
has been paid to dictation programs, which have been criticized for low levels of recognition, but offer advantages such as
accessibility and flexibility. This study examines two groups of learners in a pronunciation workshop: CONV, which had fully
face-to-face instruction, and HYBRID, which had half of the instruction face-to-face and half using the computer, practicing
production using a dictation program, Windows Speech Recognition. Results show that both groups improved from pre- to post-test
and that there were no statistically significant differences between the two groups. Results indicate that dictation programs may
be useful as a complement to face-to-face pronunciation teaching, especially if in-class time for pronunciation teaching is
limited.
Article outline
- 1.Introduction
- 1.1Autonomy and strategy use in language learning
- 1.2Computer assisted pronunciation training
- 1.3Benefits of ASR technology for pronunciation learning
- 1.4Choosing dictation programs
- 1.5Research question
- 2.Methodology
- 2.1Participants
- 2.2Workshop design
- 2.3Language learning logs
- 2.4Pre- & post-workshop recordings
- 2.5Raters
- 2.6Analysis of ratings
- 3.Results
- 3.1Time spent on activities
- 3.2Inter-rater reliability
- 3.3Student improvement
- 4.Discussion and conclusion
- Note
References
References (54)
Beddor, P. S. & Strange, W. (1982). Cross-language study of perception of the oral-nasal distinction. Journal of the Acoustical Society of America, 711, 1551–1561.
Benson, P. & Voller, P. (1997). Introduction: Autonomy and independence in language learning. In P. Benson & P. Voller (Eds.), Autonomy and independence in language learning (pp. 1–12). Edinburgh, UK: Addison Wesley Longman Ltd.
Blankenship, B. (1991). Second language vowel perception. Journal of the Acoustical Society of America, 901, 2252–2252.
Bouselmi, G., Fohr, D., & Illina, I. (2012). Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling. International Journal of Speech Technology, 151, 203–213.
Brown, A. (1988). Functional load and the teaching of pronunciation. TESOL Quarterly, 22 (4), 593–606.
Burlington English: Meeting the challenges of adult language acquisition. (2014). Retrieved from [URL]
Celce-Murcia, M., Brinton, D., & Goodwin, J. (2010). Teaching pronunciation (2nd ed.). Cambridge, England: Cambridge University Press.
Chang, C. (2012). Instruction on pronunciation learning strategies: Research findings and current pedagogical approaches (Master’s thesis). Retrieved from the University of Texas Digital Repository. UT-Austin, Austin, TX.
Cincarek, T., Gruhn, R., Hacker, C., Nöth, E., & Nakamura, S. (2008). Automatic pronunciation scoring of words and sentences independent from the non-native’s first language. Computer Speech and Language, 231, 65–88.
Coniam, D. (1999). Voice recognition software accuracy with second language speakers of English. System, 271, 49–64.
Cordier, D. (2009). Speech recognition software for language learning: Toward an evaluation of validity and student perceptions (Doctoral dissertation). Retrieved from: [URL]
Cucchiarini, C., Strik, H., & Boves, L. (2000). Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms. Speech Communication, 301, 109–119.
Derwing, T., Munro, M., & Carbonaro, M. (2000). Does popular speech recognition software work with ESL speech? TESOL Quarterly, 34 (3), 592–603.
Dickerson, W. B. (1994). Empowering students with predictive skills. In J. Morley (Ed.), Pronunciation pedagogy and theory: New views, new directions (pp. 17–35). Alexandria, VA: TESOL, Inc.
Dryer, M. S. & Haspelmath, M. (Eds.). (2013). The world atlas of language structures online. Retrieved from [URL]
Foote, J. A. (2010). Second language learners’ perceptions of their own recorded speech. PMC Working Paper Series W P10-02, 3–27. Edmonton, Alberta: Prairie Metropolis Centre.
Flege, J. E., Munro, M. J., & Fox, R. A. (1993). Auditory and categorical effects on cross-language vowel perception. Journal of the Acoustical Society of America, 951, 3623–3641.
Fraser, H. (2001). Teaching pronunciation: A handbook for teachers and trainers. Sydney, Australia: TAFE NSW.
Furtak, E. M. & Kunter, M. (2012). Effects of autonomy-supportive teaching on student learning and motivation. The Journal of Experimental Education, 801, 284–316.
Hardison, D. (2003). Acquisition of second language speech: Effects of visual cues, context, and talker variability. Applied Psycholinguistics, 261, 579–596.
Hardison, D. M. (2004). Generalization of computer-assisted prosody training: quantitative and qualitative findings. Language Learning and Technology, 8 (1), 34–52.
Hardison, D. (2005). Second language spoken word identification: Effects of training, visual cues, and phonetic environment. Applied Psycholinguistics, 261, 579–596.
Hincks, R. (2003). Speech technologies for pronunciation feedback and evaluation. ReCALL. 15(1), 3–20.
(2015). Technology and leaning pronunciation. In Reed, M. and Levis, J. (Eds). The handbook of English pronunciation (pp. 505–519). Malden, MA: John Wiley & Sons, Inc..
Ingels, S. A. (2011). The effects of self-monitoring strategy use on the pronunciation of learners of English (Doctoral dissertation). University of Illinois, Urbana, IL.
Larson-Hall, J. (2010). A guide to doing statistics in second language research using SPSS. New York, NY: Taylor & Francis.
Levis, J. & Pickering, L. (2004). Teaching intonation in discourse using speech visualization technology. System, 321, 505–524.
Levis, J. & Suvorov, R. (2013). Automated speech recognition. In C. Chapelle (Ed.) The encyclopedia of applied linguistics. Retrieved from [URL]
Lima, E. (2015). Development and evaluation of online pronunciation instruction for international teaching assistants’ comprehensibility (Doctoral Dissertation). Retrieved from: [URL].
McCrocklin, S. (2012). Effect of audio vs. video on aural discrimination of vowels. TESL-EJ, 16 (2), 1–16.
(2016). Pronunciation learner autonomy: The potential of Automatic Speech Recognition. System, 571, 25–42.
Meisel, W. (2010). “Life on-the-go”: The role of speech technology in mobile applications. In A. Neustein (Ed.) Advances in speech recognition: Mobile environments, call centers, and clinics. New York, NY: Springer.
Microsoft. (2014). Microsoft accessibility: Products: Windows Speech Recognition. Retrieved from [URL]
Moustroufas, N. & Digalakis, V. (2007). Automatic pronunciation evaluation of foreign speakers using unknown text. Computer Speech and Language, 211, 219–230.
Murray, G. (1999). Autonomy, technology, and language-learning in a sheltered ESL immersion program. TESL Canada Journal, 17 (1), 1–15.
Neri, A., Cucchiarini, C., & Strik, H. (2006). ASR-based corrective feedback on pronunciation: does it really work? Proceedings of the ISCA Interspeech 2006, Pittsburgh, PA, 1982–1985.
Neri, A., Cucchiarini, H., Strik, L. & Boves, L. (2002). The pedagogy-technology interface in Computer-Assisted Pronunciation Teaching. Computer Assisted Language Learning, 15(5), 441–467.
Neri, A., Mich, O., Gerosa, M., & Giuliani, D. (2008). The effectiveness of computer assisted pronunciation training for foreign language learning by children. Computer Assisted Language Learning, 21(5), 393–408.
Nilsen, D. L. & Nilsen, A. P. (2002). Pronunciation contrasts in English. Prospect Heights, IL: Waveland Press.
Oxford, R. L. (1990). Language learning strategies: What every teacher should know. Boston: Heinle & Heinle.
Rogerson-Revell, P. (2011). English phonology and pronunciation teaching. Broadway, NY: Bloomsbury Academic.
Rosetta Stone: Cutting-edge technology. (2015). Retrieved from [URL]
Saito, K. & Lyster, R. (2012). Effects of form-focused instruction and corrective feedback on L2 pronunciation development of /ɹ/ by Japanese learners of English. Language Learning, 62(2), 595–633.
Saraçlar, M. (2000). Pronunciation modeling for conversational speech recognition (Doctoral dissertation). Ann Arbor, MI: UMI Dissertation Services.
Sardegna, V. (2009). Improving English stress through pronunciation learning strategies. Dissertation Abstracts International: Section A. The Humanities and Social Sciences, 70(1), 0114.
Sardegna, V. G. (2012). Learner differences in strategy use, self-efficacy beliefs, and pronunciation improvement. In. J. Levis & K. LeVelle (Eds.). Proceedings of the 3rd Pronunciation in Second Language Learning and Teaching Conference, Ames, IA: Iowa State University, 39–53.
Seferoğlu, G. (2005). Improving students’ pronunciation through accent reduction software. British Journal of Educational Technology, 36 (2), 303–316.
Sheerin, S. (1997). An exploration of the relationship between self-access and independent learning. In P. Benson and P. Voller (Eds.), Autonomy and independence in language learning (pp. 54–65). London: Longman.
Strik, H., Neri, A., & Cucchiarini, C. (2008). Speech technology for language tutoring. Proceedings of Language and Speech Technology (LangTech ‘08) Conference, Rome, Italy, 73–76.
Cited by (43)
Cited by 43 other publications
Chen, Hao-Jan, Fu-Chan Hsieh & Yu-Ci Yen
Feng, Yisu & Ye Tian
2025. Assessing the accuracy of Chinese speech-to-text tools for Chinese as foreign language learners. Chinese as a Second Language (漢語教學研究—美國中文教師學會學報). The journal of the Chinese Language Teachers Association, USA
Huang, Fang, Dingyang Peng & Timothy Teo
Jakonen, Teppo, Derya Duran & Pauliina Peltonen
Lao-un, Jiraporn & Dararat Khampusaen
Leis, Adrian
Li, Ling, Xiaoyue Zhang, Bin Zou & Qin Yang
Liu, Di, Chun Lai, Rongxin Lin & Tan Jin
Liu, Yao, Faizahani binti Ab Rahman & Farah binti Mohamad Zain
Ll, Ke
McCrocklin, Shannon
Ngo, Thuy Thi-Nhu & Howard Hao-Jan Chen
Rogti, Maroua
Shimanskaya, Elena
Ueda, Ruri
2025. The effect of oral repetition during discrimination training on L2
perceptual learning. Journal of Second Language Pronunciation
Zou, Bin & Qing Ma
Fouz-González, Jonás
2024. Using technology to facilitate the integration of pronunciation into the classroom. In La integración de la pronunciación en el aula de ELE [IVITRA Research in Linguistics and Literature, 42], ► pp. 207 ff.
Johnson, Carol, Walcir Cardoso, Beau Zuercher, Kathleen Brannen & Suzanne Springer
2024. Assessing pronunciation using dictation tools. Journal of Second Language Pronunciation 10:1 ► pp. 10 ff.
Levis, John M. & Charlie Nagle
Nickolai, Dan, Emma Schaefer & Paula Figueroa
Thi-Nhu Ngo, Thuy, Howard Hao-Jan Chen & Kyle Kuo-Wei Lai
Zou, Bin, Sara Liviero, Qing Ma, Weilei Zhang, Yiran Du & Peiling Xing
CENGİZ, Behice Ceyda
Dai, Yuanjun & Zhiwei Wu
Hirai, Akiyo & Angelina Kovalyova
Inceoglu, Solène, Wen-Hsin Chen & Hyojung Lim
Inceoglu, Solène, Wen-Hsin Chen & Hyojung Lim
Jiang, Michael Yi‐Chao, Morris Siu‐Yung Jong, Wilfred Wing‐Fat Lau, Ching‐Sing Chai & Na Wu
Liu, Di, Hyangeun Ji & Dominique Kliger
Pack, Austin & Jeffrey Maloney
Sun, Weina
Vančová, Hana
Chun, Dorothy M. & Yan Jiang
Jiang, Michael Yi-Chao, Morris Siu-Yung Jong, Na Wu, Bin Shen, Ching-Sing Chai, Wilfred Wing-Fat Lau & Biyun Huang
Lai, Kuo-Wei Kyle & Hao-Jan Howard Chen
Busà, Maria Grazia
Evers, Katerina & Sufen Chen
Evers, Katerina & Sufen Chen
Oh, Eun Young & Donggil Song
McCrocklin, Shannon & Idée Edalatishams
Levis, John M.
Levis, John M.
2024. Key issues in L2 pronunciation research. Journal of Second Language Pronunciation 10:3 ► pp. 293 ff.
This list is based on CrossRef data as of 13 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
