Extending automatic vowel formant extraction to New Englishes: A comparison of different methods

Meer, Philipp; Brato, Thorsten; Matute Flores, José Alejandro

doi:10.1075/eww.00060.mee

Article published in: English World-Wide
Vol. 42:1 (2021), pp. 54–84

Philipp Meer | University of Münster, Germany | University of Campinas, Brazil

Thorsten Brato | University of Regensburg, Germany

José Alejandro Matute Flores | University of Münster, Germany

Published online: 27 January 2021

Abstract

While different automated procedures for vowel formant prediction have recently been proposed, it is unclear how reliably these methods perform in the phonetic study of vowels in New Englishes and how such approaches could be applied to specific varieties. This paper compares different automatic methods for vowel formant prediction in New Englishes, using manual measurements of Trinidadian English as a baseline. The results show that all methods perform significantly better than default formant parameters often used in speech analysis packages, and that a Bayesian formant tracker calibrated with American (US-FAVE) and Trinidadian English (TRINI-FAVE) generally provides better results than an automatic procedure that optimizes formant ceilings on a vowel- and speaker-specific level. TRINI-FAVE measures vowels characteristic of Trinidadian English most accurately. Phonetic studies of vowels in New Englishes can benefit from these methods.
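
The formant ceiling optimization mentioned above can be illustrated with a minimal sketch. The abstract does not state the selection criterion, so the code assumes one common approach (along the lines of Escudero et al. 2009): for each speaker and vowel class, formants are measured at several candidate ceilings, and the ceiling with the smallest combined variance of log F1 and log F2 is kept. The function name `pick_ceiling` and the toy measurements are hypothetical and are not taken from the paper.

```python
from math import log
from statistics import pvariance


def pick_ceiling(measurements):
    """Return the candidate ceiling with the least within-category spread.

    measurements: dict mapping a formant ceiling in Hz to a list of
    (F1, F2) tuples in Hz, all from one vowel class of one speaker.
    Assumed criterion: sum of the variances of log F1 and log F2.
    """
    def spread(tokens):
        log_f1 = [log(f1) for f1, _ in tokens]
        log_f2 = [log(f2) for _, f2 in tokens]
        return pvariance(log_f1) + pvariance(log_f2)

    return min(measurements, key=lambda ceiling: spread(measurements[ceiling]))


# Hypothetical toy data: three tokens of one vowel, measured at two ceilings.
candidates = {
    5000: [(300, 2200), (310, 2250), (295, 2190)],  # tightly clustered
    5500: [(300, 2200), (450, 1800), (290, 2600)],  # scattered (tracking errors)
}
best = pick_ceiling(candidates)
print(best)
```

In practice the candidate measurements would come from a speech analysis package run once per ceiling; the sketch covers only the selection step.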

Keywords: automatic formant prediction, FAVE, Trinidadian English, vowels, New Englishes, formant ceiling optimization, automated acoustic analysis

Article outline

1. Introduction
2. Trinidadian English
3. Data and methods
- 3.1 Data
- 3.2 TRINI-FAVE
- 3.3 Formant ceiling optimization
- 3.4 Analysis
4. Results
- 4.1 Overall comparison to the manual baseline
- 4.2 Automatic prediction of monophthongs and diphthongs
- 4.3 Differences in predicting individual vowel classes
- 4.4 Visual inspection of formant plots
5. Discussion
6. Conclusion and recommendations
Notes
References

