Article published In: English World-Wide
Vol. 42:1 (2021) ► pp.54–84
Extending automatic vowel formant extraction to New Englishes
A comparison of different methods
Published online: 27 January 2021
https://doi.org/10.1075/eww.00060.mee
https://doi.org/10.1075/eww.00060.mee
Abstract
While different automated procedures for vowel formant prediction have recently been proposed, it is unclear how reliably
these methods perform in the phonetic study of vowels in New Englishes and how such approaches could be applied to specific varieties. This
paper compares different automatic methods for vowel formant prediction in New Englishes, using manual measurements of Trinidadian English
as a baseline. The results show that all methods perform significantly better than default formant parameters often used in speech analysis
packages, and that a Bayesian formant tracker calibrated with American (US-FAVE) and Trinidadian English (TRINI-FAVE) generally provides
better results than an automatic procedure that optimizes formant ceilings on a vowel- and speaker-specific level. TRINI-FAVE measures
vowels characteristic of Trinidadian English most accurately. Phonetic studies of vowels in New Englishes can benefit from these
methods.
Article outline
- 1.Introduction
- 2.Trinidadian English
- 3.Data and methods
- 3.1Data
- 3.2TRINI-FAVE
- 3.3Formant ceiling optimization
- 3.4Analysis
- 4.Results
- 4.1Overall comparison to the manual baseline
- 4.2Automatic prediction of monophthongs and diphthongs
- 4.3Differences in predicting individual vowel classes
- 4.4Visual inspection of formant plots
- 5.Discussion
- 6.Conclusion and recommendations
- Notes
References
References (41)
Adank, Patti. 2003. Vowel Normalization. A Perceptual-Acoustic Study of Dutch Vowels. Wageningen: Ponsen and Looijen.
Atal, B. S., and S. L. Hanauer. 1971. “Speech Analysis and Synthesis by Linear Prediction of the Speech Wave”. The Journal of the Acoustical Society of America 501: 637–655.
Boersma, Paul, and David Weenink. 2019. “Praat”, <[URL]>.
Clopper, Cynthia G., David B. Pisoni, and Kenneth de Jong. 2005. “Acoustic Characteristics of the Vowel Systems of Six Regional Varieties of American English”. The Journal of the Acoustical Society of America 1181: 1661–1676.
Deng, Li, Xiaodong Cui, Robert Pruvenok, Jonathan Huang, Safivy Momen, Yanyi Chen, and Abeer Alwan. 2006. “A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing”. Proceedings of ICASSP 31. Toulouse: Institute of Electrical and Electronics Engineers, 369–372.
Deuber, Dagmar. 2014. English in the Caribbean. Variation, Style and Standards in Jamaica and Trinidad. Cambridge: Cambridge University Press.
Escudero, Paola, Paul Boersma, Andréia Schurt Rauber, and Ricardo A. H. Bion. 2009. “A Cross-Dialect Acoustic Description of Vowels: Brazilian and European Portuguese”. The Journal of the Acoustical Society of America 1261: 1379–1393.
Evanini, Keelan. 2009. “The Permeability of Dialect Boundaries. A Case Study of the Region Surrounding Erie, Pennsylvania”. Ph.D. Dissertation, University of Pennsylvania.
Evanini, Keelan, Stephen Isard, and Mark Liberman. 2009. “Automatic Formant Extraction for Sociolinguistic Analysis of Large Corpora”. Proceedings of Interspeech 10. Brighton: International Speech Communication Association, 1639–1642.
Ewald, Otto, Eva Liina Asu, and Susanne Schötz. 2017. “The Formant Dynamics of Long Close Vowels in Three Varieties of Swedish”. Proceedings of Interspeech 18. Stockholm: International Speech Communication Association, 1412–1416.
Fruehwald, Josef. 2013. “The Phonological Influence on Phonetic Change”. Ph.D. Dissertation, University of Pennsylvania.
Glasberg, Brian R., and Brian C. J. Moore. 1990. “Derivation of Auditory Filter Shapes from NotchedNoise Data”. Hearing Research 171: 103–138.
Harrison, Philip T. 2013. “Making Accurate Measurements. An Empirical Investigation of the Influence of the Measurement Tool, Analysis Settings and Speaker on Formant Measurements”. Ph.D. Dissertation, University of York.
Heeringa, Wilbert, and Hans van de Velde. 2017. “Visible Vowels. A Tool for the Visualization of Vowel Variation”. Proceedings of Interspeech 18. Stockholm: International Speech Communication Association, 4034–4035.
Hillenbrand, James, Laura A. Getty, Michael J. Clark, and Kimberlee Wheeler. 1995. “Acoustic Characteristics of American English Vowels”. The Journal of the Acoustical Society of America 971: 3099–111.
Hoffmann, Thomas. 2011. “The Black Kenyan English Vowel System. An Acoustic Phonetic Analysis”. English World-Wide 321: 147–173.
Huber, Jessica E., Elaine T. Stathopoulos, Gina M. Curione, Theresa A. Ash, and Kenneth Johnson. 1999. “Formants of Children, Women, and Men: The Effects of Vocal Intensity Variation”. The Journal of the Acoustical Society of America 1061: 1532–1542.
Kendall, Tyler, and Charlotte Vaughn. 2020. “Exploring Vowel Formant Estimation through Simulation-Based Techniques”. Linguistics Vanguard 61: 1–13.
Kretzschmar, William A. 2008. “Standard American English Pronunciation”. In Bernd Kortmann, and Edgar W. Schneider. eds. The Americas and the Caribbean. Berlin: De Gruyter, 37–51.
Labov, William, Sharon Ash, and Charles Boberg. 2006. The Atlas of North American English. Phonetics, Phonology and Sound Change. Berlin: De Gruyter.
Labov, William, Ingrid Rosenfelder, and Josef Fruehwald. 2013. “One Hundred Years of Sound Change in Philadelphia. Linear Incrementation, Reversal, and Reanalysis”. Language 891: 30–65.
Lee, Sungbok, Alexandros Potamianos, and Shrikanth Narayanan. 1999. “Acoustics of Children’s Speech. Developmental Changes of Temporal and Spectral Parameters”. The Journal of the Acoustical Society of America 1051: 1455–1468.
Leung, Glenda A. 2013. “A Synchronic Sociophonetic Study of Monophthongs in Trinidadian English”. Ph.D. Dissertation, University of Freiburg.
Lobanov, Boris. M. 1971. “Classification of Russian Vowels Spoken by Different Speakers”. The Journal of the Acoustical Society of America 491: 606–608.
Maxwell, Olga, and Janet Fletcher. 2009. “Acoustic and Durational Properties of Indian English Vowels”. World Englishes 281: 52–69.
McAuliffe, Michael, Michaela Socolof, Sarah Mihuc, Michael Wagner, and Morgan Sonderegger. 2017. “Montreal Forced Aligner. Trainable Text-Speech Alignment Using Kaldi”. Proceedings of Interspeech 18. Stockholm: International Speech Communication Association, 498–502.
Meer, Philipp. 2019. “Sociolinguistic Variation in (Standard) Trinidadian English Vowels. A Semi-Automatic Sociophonetic Study of Selected Monophthongs and Diphthongs”. Paper presented at Congress of the Brazilian Linguistics Association, Maceió.
. 2020. “Automatic Alignment for New Englishes. Applying State-of-the-Art Aligners to Trinidadian English”. The Journal of the Acoustical Society of America 1471: 2283–2294.
Meer, Philipp, and José A. Matute Flores. 2018. “Making FAVE Ready for New Englishes. Applying and Modifying FAVE for Semi-Automatic Acoustic Analyses of Trinidadian English Vowels”. Paper presented at NWAV 47, New York University.
Mielke, Jeff, Erik R. Thomas, Josef Fruehwald, Michael McAuliffe, Morgan Sonderegger, Jane Stuart-Smith, and Robin Dodsworth. 2019. “Age Vectors vs. Axes of Intraspeaker Variation in Vowel Formants Measured Automatically from Several English Speech Corpora”. Proceedings of ICPhS 19. Canberra: Australasian Speech Science and Technology Association, 1258–1262.
Moore, Brian C. J. 2010. “Aspects of Auditory Processing Related to Speech Perception”. In Fiona E. Gibbon, John Laver, and William J. Hardcastle. eds. The Handbook of Phonetic sciences. Hoboken: Wiley-Blackwell, 454–488.
Pilgrim, Imelda, Ken Haworth, Anthony Perry, Maria Darlington, Joyce Stewart, and Arlene Dwarika. 2017. English A for CSEC (2nd ed.). Oxford: Oxford University Press.
Rosenfelder, Ingrid, Josef Fruehwald, Keelan Evanini, Scott Seyfarth, Kyle Gorman, Hilary Prichard, and Jiahong Yuan. 2014. “FAVE (Forced Alignment and Vowel Extraction)”, <[URL]>.
Severance, Nathan, Keelan Evanini, and Aaron Dinkin. 2016. “Examining the Reliability of Automated Vowel Analyses Using FAVE”. Paper presented at Second North-West Phonetics and Phonology Conference, University of Oregon.
Tan, Rachel Siew Kuang, and Ee-Ling Low. 2010. “How Different are the Monophthongs of Malay Speakers of Malaysian and Singapore English?” English World-Wide 311: 162–189.
Toefy, Tracey Lynn. 2014. “Sociophonetics and Class Differentiation. A Study of Working- and Middle-Class English in Cape Town’s Coloured Community”. Ph.D. Dissertation, University of Cape Town.
Vallabha, Gautam K., and Betty Tuller. 2002. “Systematic Errors in the Formant Analysis of Steady-State Vowels”. Speech Communication 381: 141–160.
Watson, Catherine I. and Zoe E. Evans. 2016. “Sound change or experimental artifact? A study on the impact of data preparation on measuring sound change”. Proceedings of the 16th Australasian International Conference on Speech Science and Technology. Parramatta: Australasian Speech Science and Technology Association, 261–264.
Cited by (9)
Cited by nine other publications
Coats, Steven
Fuchs, Robert
Hansen Edwards, Jette G.
Jackson, Samantha, Philipp Meer & Mirjam Schmalz
Meer, Philipp & Ryan Durgasingh
Schneider, Edgar W.
Wilson, Guyanne & Michael Westphal
2023. Conclusion. In New Englishes, New Methods [Varieties of English Around the World, G68], ► pp. 263 ff.
[no author supplied]
This list is based on CrossRef data as of 9 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
