Article published In: Australian Review of Applied Linguistics
Vol. 22:1 (1999) ► pp.1–42
Differences and distinguishability in the acoustic characteristics of hello in voices of similar-sounding speakers
A forensic phonetic investigation
Published online: 1 January 1999
https://doi.org/10.1075/aral.22.1.01ros
https://doi.org/10.1075/aral.22.1.01ros
Abstract
Forensic Phonetics is an important application of Linguistics that has emerged as a discipline over the last decade. This paper describes a Forensic Phonetic experiment which investigates the nature of within- and between-speaker variation in the acoustic characteristics of the word hello in demonstrably similar-sounding voices. The nature of within-segment variation is determined in repeats of the same word said under different prosodic conditions in order to exclude as much of the linguistically determined variation as is consistent with the realities of the forensic situation, thus providing a good estimate of variation associated with speakers. Intonationally varying tokens of the naturally produced single word utterance hello from six adult Australian males are compared with respect to fundamental frequency, and to centre frequencies and bandwidths of the F-pattern below 5 KHz. This comprises the first five formants and extra resonances, including a possible singer’s formant and tracheal resonance. Results show that between-speaker acoustic differences are pervasive, though not ubiquitous. Magnitudes of between-speaker differences are presented for all parameters, and their forensic significance evaluated. ANOVA and chi-square tests show that even similar-sounding voices differ significantly in their acoustics, especially centre frequencies of F2-F4, formant bandwidth, and incidence of extra resonances. Simulated forensic conditions show that some of these differences are not realistically demonstrable. Nevertheless, there remain sufficient significant differences to distinguish 13 out of 15 pairs: a value of 13% for the denominator of the associated Bayesian Likelihood Ratio for the prosecution hypothesis. Directions for future research are indicated.
References (35)
Aitken, C.G.G. (1995) Statistics and the evaluation of evidence for forensic scientists. Chichester, John Wiley & Sons.
Broad, David J. (1972) Formants in automatic speech recognition. International Journal of Man-Machine Studies 141:411–424.
Butcher, Andrew (1996) Getting the voice line-up right: analysis of a multiple auditory confrontation. In Paul McCormak and Alison Russell (eds) Proceedings of the sixth Australian international conference on speech science and technology. Canberra, Australian Speech Science and Technology Association.
Gibbons, J. (1994) Introduction: Forensic Linguistics. In John Gibbons (ed.) Language and the Law. Longman, London.
van der, Giet (1987) Der Einsatz des Computers in der Sprechererkennung. In H. Kunzel (ed.) Sprechererkennung: Grundziige forensischer Sprachverarbeitung. Heidelberg, Kriminalistik Verlag.
Greisbach, Reinhold, Otto Esser and Constanze Weinstock (1995) Speaker identification by formant contours. In Angelika Braun and Peter Köster (eds) Studies in forensic phonetics. Beiträge zur Phonetik und Linguistik 64. Trier, Wissenschaftlicher Verlag.
Hirson, Allen (1995) Human laughter - A forensic phonetic perspective. In Angelika Braun and Peter Köster (eds) Studies in forensic phonetics. Beiträge zur Phonetik und Linguistik 64. Trier, Wissenschaftlicher Verlag.
Ingram, J. (1996) Formant trajectories as indices of phonetic variation for speaker identification. Journal of Forensic Linguistics 3, 1.
Jakobson, Roman, C. Gunnar M. Fant and Morris Halle (1952) Preliminaries to speech analysis [tenth reprint 1972]. Cambridge, Mass., MIT Press.
Jones, Alex (1994) The limitations of voice identification. In John Gibbons (ed.) Language and the Law. Harlow, Longman.
Kunzel, H.J. (1987) Sprechererkennung: Grundziige forensischer Sprachverarbeitung. Heidelberg, Kriminalistik Verlag.
Labov, William and Wendell Harris (1994) Addressing social issues through linguistic evidence. In John Gibbons (ed.) Language and the Law. Harlow, Longman.
LaRiviere, C. (1975) Contributions of fundamental frequency and formant frequencies to speaker identification. Phonetica 311: 185–197.
Naik, Jay (1994) Speaker verification over the telephone network: databases, algorithms and performance assessment. ESCA Workshop on Automatic Speaker Recognition, Identification and Verification 31–38.
Nolan, Francis (1983) The phonetic bases of speaker recognition. Cambridge, Cambridge University Press.
(1990) The limitations of auditory-phonetic speaker identification. In H. Kniffka (ed.) Texte zur Theorie and Praxis forensischer Linguistik. Tubingen, Max Niemayer Verlag.
(1997) Speaker recognition and forensic phonetics. In W. Hardcastle and J.M.D. Laver (eds) The handbook of phonetic sciences. Oxford, Blackwell.
Rose, Phil (1982) Acoustic characteristics of the Shanghai-Zhenhai phonation types. In D. Bradley (ed.) Tonation. Papers in South-East Asian Linguistics 8. Pacific Linguistics series A, 62.
(1989) On the non-equivalence of fundamental frequency and pitch in tonal description. In D. Bradley, E. Henderson and M. Mazaudon (eds) Prosodic Analysis and Asian Linguistics: to Honour R.K.Sprigg. Canberra, Pacific Linguistics.
(1995) On the Acoustics of Similar Voices. Paper given at the International Conference on Forensic Linguistics, University of New England.
(1996a) Speaker verification under realistic forensic conditions. In Paul McCormak and Alison Russell (eds) Proceedings of the sixth Australian international conference on speech science and technology. Canberra, Australian Speech Science and Technology Association.
(1996b) Aerodynamic involvement in intrinsic F0 perturbations - evidence from Thai-phake. In Paul McCormak and Alison Russell (eds) Proceedings of the sixth Australian international conference on speech science and technology. Canberra, Australian Speech Science and Technology Association.
Rose, Phil and Duncan, S. (1995) Naive auditory identification and discrimination of similar voices by familiar listeners. Journal of Forensic Linguistics 2, 1: 1–17.
Rose, Phil and Alison Simmons (1996) F-pattern variability in disguise and over the telephone - comparisons for forensic speaker identification. In Paul McCormak and Alison Russel (eds) Proceedings of the 6th Australian International Conference on Speech Science and Technology. Canberra, Australian Speech Science and Technology Association.
Schegloff, Emanuel A. (1968) Sequencing in conversational openings. American Anthropology 70, 6: 1075–95.
Stevens, Kenneth N. (1971) Sources of inter- and intra-speaker variability in the acoustic properties of speech sounds. Proceedings of the 7th international congress of phonetic sciences.
Cited by (10)
Cited by ten other publications
Hernandez-Laredo, Enrique, Marco Antonio Hernández-Galicia, René Arnulfo García-Hernández & Yulia Ledeneva
Padmini, Palli, C. Paramasivam, G. Jyothish Lal, Sadeen Alharbi & Kaustav Bhowmick
Gresse, Adrien, Mathias Quillot, Richard Dufour, Vincent Labatut & Jean-Francois Bonastre
Zuo, Donghui & Peggy Pik Ki Mok
Morrison, Geoffrey Stewart
Clermont, Frantz
McDougall, Kirsty
Zhang, Cuiling, Joost van de Weijer & Jingxu Cui
Rose, Phil
This list is based on CrossRef data as of 14 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
