Review article published In: Journal of Second Language Pronunciation
Vol. 10:3 (2024) ► pp.404–426
Review article
Artificial intelligence integration in three iOS pronunciation apps
ELSA Speak, Loora, and Vocal Image
Published online: 27 February 2025
https://doi.org/10.1075/jslp.24052.kai
https://doi.org/10.1075/jslp.24052.kai
Abstract
Claims of integrating artificial intelligence (AI) into mobile applications for pronunciation training date back
to at least 2011 with the iOS app T Accent (Arivoc Education International. (2011). T
Accent (Version 2.0.1) [Mobile application software]. [URL]),
which used automatic speech recognition (ASR) to provide “Goodness of Pronunciation” (GOP) ratings (Witt, S. M., & Young, S. J. (2000). Phone-level
pronunciation scoring and assessment for interactive language learning. Speech
Communication, 30(2), 95–108. ). AI has advanced significantly since 2011, most noticeably with the 2022 release
of OpenAI’s ChatGPT, which made chatbots powered by generative AI more widely available. AI has led to new applications in
pronunciation apps that can evaluate pronunciation and integrate communicative role-play activities. This article examines ASR and
chatbot integration in three iOS apps: ELSA Speak, Loora, and Vocal Image. Feedback provided by these apps is frequently
inaccurate and often limited to consonant and vowel sounds. This article cautions teachers and learners about the current
limitations of these apps and provides recommendations for incorporating AI-powered tools into today’s pronunciation
classrooms.
Article outline
- Introduction
- Artificial intelligence
- Three iOS apps using artificial intelligence
- Claims about artificial intelligence
- Automatic speech recognition
- Chatbots
- Other pronunciation features
- Limitations
- Conclusions
References
References (37)
@Vocal_Image. (2024). Vocal image [YouTube
channel]. [URL]
Al-Shallakh, M. A. I. (2023). Artificial
intelligence-based mobile learning in English language teaching (ELT) for EFL learners: Enhancing pronunciation with ELSA
SPEAK in Oman. Arab Humanities
Journal, 4(3), 208–221.
Arivoc Education International. (2011). T
Accent (Version 2.0.1) [Mobile application software]. [URL]
Beach, R., & O’Brien, D. (2015). Using
apps for learning across the curriculum: A literacy-based framework and
guide. Routledge.
Canale, M., & Swain, M. (1980). Theoretical
bases of communicative approaches to second language teaching and testing. Applied
Linguistics, 1(1), 1–47.
Celce-Murcia, M., Dörnyei, Z., & Thurrell, S. (1995). Communicative
competence: A pedagogically motivated model with content specifications. Issues in Applied
Linguistics, 6(2), 5–35.
Chun, D. M. (2023). Review
of ELSA, English Language Speech Assistant ([URL]). Journal of Second Language
Pronunciation, 9(1), 139–150.
ELSA. (2024a). ELSA
AI. [URL]
. (2024b). ELSA conversation
bundle. [URL]
. (2024c). This is
ELSA. [URL]
F6S. (2024). Rusya Shukiurava [Online
profile]. [URL]
Fryer, L., & Carpenter, R. (2006). Bots
as language learning tools. Language Learning &
Technology, 10(3), 8–14.
Indari, A. (2023). Detection
of pronunciation errors in English speaking skills based on artificial intelligence
(AI). Jurnal Serunai Bahasa
Inggris, 15(2), 67–77.
Kaiser, D. (2018). Mobile-assisted
pronunciation training: The iPhone pronunciation app project. IATEFL Pronunciation Special
Interest Group
Journal, 581, 38–52.
Kholis, A. (2021). ELSA
Speak app: Automatic speech recognition (ASR) for supplementing English pronunciation
skills. Pedagogy: Journal of English Language
Teaching, 9(1), 1–14.
Kovalyova, A. (2024). Speech-to-text
applications’ accuracy in English language learners’ speech
transcription. 28(1), 1–21.
(2018). Intelligibility,
oral communication, and the teaching of pronunciation. Cambridge University Press.
Loora A.I LTD. (2024). Loora: “Speak
English with Loora AI” (Version 1.59.5) [Mobile application software].
Mansuri, A. (2014). PronounceApp
(Version 1.0) [Mobile application software]. [URL]
Maulidyah, L., Achadiyah, R., & Azmi, M. U. (2024). Elsa
Speak application as artificial intelligence tools to enhance students’ pronunciation skills in rural
area. ANCOLT: International Proseeding on Language
Teaching, 1(1), 461–469.
Murphy, J. M. (2018). Teacher
training in the teaching of pronunciation. In O. Kang, R. I. Thomson, & J. M. Murphy (Eds.), The
Routledge handbook of contemporary English
pronunciation (pp. 298–319). Routledge.
Neri, A., Cucchiarini, C., Strik, H., & Boves, L. (2002). The
pedagogy-technology interface in computer assisted pronunciation
training. 15(5), 1–27.
OECD. (2024). Recommendation of the Council
on Artificial Intelligence, OECD/LEGAL/0449. [URL]
OpenAI. (2024). Voice mode
FAQ. [URL]
Research and Markets. (2024). E-learning
market report by technology (Online e-learning, learning management system, mobile e-learning, rapid e-learning, virtual
classroom, and others), provider (services, content), application (academic, corporate, government), and region
2024–2032. [URL]
Rudnik, Y. (2024). The
use of artificial intelligence chatbots in teaching foreign languages as an innovative interactive
technology. Educological
Discourse, 45(2), 16–24.
Senowarsito, S., & Ardini, S. N. (2023). The
use of artificial intelligence to promote autonomous pronunciation learning: Segmental and suprasegmental features
perspective. IJELTAL (Indonesian Journal of English Language Teaching and Applied
Linguistics), 8(2), 133–147.
Shehata, M. G. M. (2024). A
program based on artificial intelligence to enhance prospective teachers’ English
pronunciation. CDELT Occasional Papers in the Development of English
Education, 861, 145–179.
Sholekhah, M. F., & Fakhrurriana, R. (2023). The
use of ELSA Speak as a mobile-assisted language learning (MALL) towards EFL students
pronunciation. JELITA: Journal of Education, Language Innovation, and Applied
Linguistics, 2(2), 93–100.
Spezzini, S., Franks, S., & Carter, C. (2018). Accent
reduction versus intelligibility. In J. I. Liontas (Ed.), The
TESOL encyclopedia of English language
teaching (pp. 1–6). Wiley.
Vocal Image. (2024). Vocal
image. [URL]
Weizenbaum, J. (1966). ELIZA
— A computer program for the study of natural language communication between man and
machine. Communications of the
ACM, 9(1), 36–45.
Wiggers, K. (2024, February 21). Loora
wants to leverage AI to teach English. Techcrunch. [URL]
