In:Crossing Boundaries through Corpora: Innovative corpus approaches within and beyond linguistics
Edited by Sarah Buschfeld, Patricia Ronan, Theresa Neumaier, Andreas Weilinghoff and Lisa Westermayer
[Studies in Corpus Linguistics 119] 2024
► pp. 41–61
Chapter 3A corpus-based comparative acoustic analysis of target-like vowel production by L1-Japanese learners and
native speakers of English
Published online: 17 October 2024
https://doi.org/10.1075/scl.119.03sch
https://doi.org/10.1075/scl.119.03sch
Abstract
This study combines acoustic phonetics, (applied) corpus linguistics, machine learning, and speech
recognition to analyse the production of the monophthongal vowels / ɐ ɒ æ e ɛ i: ɪ u; ʊ ʌ / in the speech of
L1-Japanese learners and L1-speakers of English based on transcripts and audio data from the Japanese spoken monologue
section of the International Corpus Network of Asian Learners of English (ICNALE).
The aim of this analysis is to evaluate what vowels L1-Japanese learners struggle with in terms of target-like vowel
production and to provide insights into the determining factors causing divergencies from L1-English produced vowels.
The results of a Multifactorial Prediction and Deviation Analysis Using Regression/Random Forests
(MuPDARF) show that Japanese learners of English do indeed have difficulties in producing English vowels in a
target-like manner but that these difficulties are confined to a relatively small set of vowels (/ ɪ u ʊ ɛ /). In
addition, the analysis shows that difficulties are predominantly correlated with language-internal factors while
language external factors (the age and gender of speakers) as well as their target variety and proficiency do not
significantly correlate with non-target-like vowel production. The results suggest that Japanese learners of English
can focus on specific vowels to enhance their target-like vowel production and that difficulties are caused by
L1-interference due to a lack of phonemic vowel duration in Japanese and the similarity of Japanese and English vowels
leading learners to use their L1 vowels rather than the slightly but notably different English vowel variants. The
results can be used to raise awareness of L1-specific difficulties among this learner cohort due to their
L1-background.
Article outline
- 1.Introduction
- 2.Previous research
- 3.Corpus data
- 3.1Data processing
- 4.Statistical analysis
- 5.Results
- 5.1Initial Random Forest Model
- 5.2Second Random Forest model
- 5.3Correlates of non-target like production
- 6.Discussion
- 7.Conclusion
Notes References
References (46)
Australian National
Corpus. 2012. Australian Corpus of English (Version 2)
[Data set]. <[URL]> (29 May
2024).
Baayen, Harald. 2008. Analyzing
Linguistic Data: A Practical Introduction to Statistics Using
R. Cambridge: CUP.
Bates, Douglas, Maechler, Martin, Bolker, Ben & Walker, Steve. 2015. Fitting
linear mixed-effects models using lme4. Journal of Statistical
Software 67(1): 1–48.
Bombien, Lasse, Winkelmann, Raphael & Scheffers, Michel. 2021. wrassp:
An R wrapper to the ASSP Library (Version 1.0.1) [Computer software]. <[URL]> (29 May
2024).
Bořil, Tomáš & Skarnitzl, Radek. 2016. Tools
rPraat and mPraat. In Text, Speech, and
Dialogue, Petr Sojka, Aleš Horák, Ivan Kopeček & Karel Pala (eds), 367–374. Springer.
Council of
Europe. 2018. Common European Framework of Reference for
Languages: Learning, teaching,
assessment. Cambridge: CUP.
Darcy, Isabelle, Ewert, Doreen & Lidster, Ryan. 2012. Bringing
pronunciation instruction back into the classroom: An ESL teachers’ pronunciation
“toolbox”. In Proceedings of the 3rd Pronunciation in
Second Language Learning and Teaching Conference, John Levis & Kimberly LeVelle (eds), 93–108. Ames IA: Iowa State University.
Derwing, Tracey M. & Munro, Murray J. 1997. Accent,
comprehensibility, and intelligibility: Evidence from four L1s. Studies in
Second Language
Acquisition 19(1): 1–16.
Deterding, David. 2009. The
formants of monophthong vowels in standard southern British English
pronunciation. Journal of the International Phonetic
Association 27(1–2): 47–55.
Flege, James E. 1995. Second language
speech learning theory, findings, and
problems. In Speech Perception and Linguistic
Experience: Issues in Cross-linguistic Research, Winifred Strange (ed.), 233–277. Timonium MD: York Press.
Francis, Winthrop N. & Kucera, Henry. 1982. The
Brown University Standard Corpus of Present-Day American English (BROWN) [Text
corpus]. <[URL]> (29 May
2024).
Franklin, Amber D. & Stoel-Gammon, Carol. 2014. Using
multiple measures to document change in English vowels produced by Japanese, Korean, and Spanish speakers: The
case for goodness and intelligibility. American Journal of Speech-Language
Pathology 23(4): 625–640.
Gries, Stefan T. & Deshors, Sandra C. 2014. Using
regressions to explore deviations between corpus data and a standard/target: Two
suggestions. Corpora 9(1): 109–136.
Hasegawa, Y. & Katsuki, K. 2008. Perception
of vowel duration by native speakers of Japanese and English. Journal of
Phonetics 36(4): 655–680.
Heller, Benedikt, Bernaisch, Tobias & Gries, Stefan T. 2017. Empirical
perspectives on two potential epicenters: The genitive alternation in Asian
Englishes. ICAME
Journal 41: 111–144.
Hirai, Y. 2017. The
role of speaking practice in Japanese learners’ development of English oral
proficiency. Journal of Language Teaching and
Research 8(1): 16–26.
Ingram, John C. L. & Park, See-Gyoon 1997. Cross-language
vowel perception and production by Japanese and Korean learners of
English. Journal of
Phonetics 25(3): 343–370.
Ishikawa, Shin’ichiro. 2014. Design
of the ICNALE Spoken: A new database for multi-modal contrastive interlanguage
analysis. Learner Corpus Studies in Asia and the
World 2: 63–76.
Isphording, I. E. 2021. Does
speaking English with a foreign accent reduce trust? Experimental evidence from
Germany. Journal of Economic Behavior &
Organization 184: 1–16.
Johansson, Stig. 1978. The
LOB corpus: A computer-readable corpus of large amounts of contemporary British
English. In Computer Corpora in English Language
Research, Udo Fries, L. F. Matthiessen & H. Dirven (eds), 33–71. Amsterdam: Rodopi.
Kato, Hiroaki, Tajima, Keiichi & Akahane-Yamada, Reiko. 2001. Native
and nonnative perception of phonemic length contrasts in Japanese. The Journal
of the Acoustical Society of
America 110(5): 2686.
Kiritani, Shigeru. 1994. Acoustic
and perceptual characteristics of vowel length in Japanese. The Journal of the
Acoustical Society of
America 95(1): 481–492.
Kisler, Thomas, Reichel, Uwe D. & Schiel, Florian. 2017. Multilingual
processing of speech via web services. Computer Speech and
Language 45: 326–347.
Kubozono, Haruo. 2005. The
phonetics and phonology of Japanese vowel devoicing. Journal of East Asian
Linguistics 14(1): 1–42.
Lev-Ari, Shiri & Keysar, Boaz. 2010. Why
don’t we believe non-native speakers? The influence of accent on
credibility. Journal of Experimental Social
Psychology 46(6): 1093–1096.
Levis, John M. & Moyer, Alene. 2014. Japanese
learners’ perception of English word stress and vowel quantity. Journal of
Second Language
Pronunciation 1(2): 169–188.
Lüdecke, Daniel. 2021. sjPlot:
Data visualization for statistics in social science (Version 2.8.10) [Computer
software]. <[URL]> (29 May
2024).
Morrison, Geoffrey S. 2002. Japanese
listeners’ use of duration cues in the identification of English high front
vowels. In Proceedings of the 28th Annual Meeting of
the Berkeley Linguistics Society, J. Larson & M. Paster (eds), 189–200. Berkeley CA: Berkeley Linguistics Society.
Munro, Murray J. & Derwing, Tracey M. 2006. The
functional load principle in ESL pronunciation instruction: An exploratory
study. Language Learning &
Technology 10(3): 55–75.
Munro, Murray J., Derwing, Tracey M. & Burgess, S. 2010. Detection
of accent, comprehensibility, and intelligibility in ESL speakers. Journal of
Phonetics 38(4): 607–620.
Peterson, Gordon E. & Barney, Harold L. 1952. Control
methods used in a study of the vowels. The Journal of the Acoustical Society of
America 24(2): 175–184.
R Core
Team. 2023. R: A language and environment for statistical
computing. R Foundation for Statistical Computing. <[URL]> (29 May
2024).
Rogers, C. L. & Meara, P. 2004. The
influence of pronunciation quality on judgments of intelligence and
friendliness. Journal of
Phonetics 32(3): 237–251.
RStudio
Team. 2022. RStudio: Integrated Development Environment for
R. RStudio PBC. <[URL]> (29 May
2024).
Schweinberger, Martin, & Komiya, Yuki. 2022. A
corpus-based computational analysis of high-front and -back vowel production of L1-Japanese learners of
English and L1-English speakers. In Proceedings of
the Eighteenth Australasian International Conference on Speech Science and
Technology, Rosey Billington (ed.), 196–200. Canberra: Australasian Speech Science and Technology Association.
Vance, Timothy J. 1987. An Introduction to
Japanese Phonology. New York NY: State University of New York Press.
Visceglia, Tanya, Chiu-Yu, Tseng, Kondo, Mariko, Meng, Helen & Sagisaka, Yoshinori. 2009. Phonetic
aspects of content design in AESOP (Asian English Speech cOrpus
Project). In Oriental COCOSDA International
Conference on Speech Database and
Assessments, 60–65. IEEE.
Wickham, Hadley, Averick, Mara, Bryan, Jennifer, Chang, Winston, McGowan, Lucy D., François, Romain, Grolemund, Garret, Hayes, Alex, Henry, Lionel, Hester, Jim, Kuhn, Max, Pedersen, Thomas L., Miller, Evan, Bache, Stephan M., Müller, Kirill, Ooms, Jeroen, Robinson, David, Seidel, Dana P., Spinu, Vitalie, … Yutani, Hiroaki. 2019. Welcome
to the tidyverse. Journal of Open Source
Software 4(43): 1686.
Yang, Byunggon. 1996. A
comparative study of American English and Korean vowels produced by male and female
speakers. Journal of
Phonetics 24: 245–261.
Yazawa, Kakeru, & Kondo, Mariko. 2019. Acoustic
characteristics of Japanese short and long vowels: Formant displacement effect
revisited. In Proceedings of the 19th international
congress of phonetic sciences, Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.), 671-675. Canberra: Australasian Speech Science and Technology Association.
