Article published In: Register Studies: Online-First Articles
Experimental assessment of phonetic register variation in situated interaction
Available under the Creative Commons Attribution (CC BY) 4.0 license.
For any use beyond this license, please contact the publisher at rights@benjamins.nl.
Open Access publication of this article was funded through a Transformative Agreement with Bielefeld University.
Published online: 25 September 2025
https://doi.org/10.1075/rs.24020.dur
https://doi.org/10.1075/rs.24020.dur
Abstract
This study analyses phonetic variation in cross-situational conversational speech styles in German by varying
situations in terms of the level of perceived formality of the addressee and the function of an interaction as well as the space
in which the interaction took place. We propose an experimental paradigm for controlled elicitation of intra-individual phonetic
variation in situated interaction. Several stimulus videos were created and rated in which the identical person was dressed in
different guises varying between formal and informal. The person acted in a choreographed way across the videos. Two types of
tasks were given to each participant of the production experiment: one posed a potential face-threat, while the other consisted of
casually conversing with the informal persona shown in the video. We analyzed vowel dispersion, mean fundamental frequency and its
range in formal vs. informal situations. Results show complex effects of formality interacting with space and speaker gender.
Article outline
- 1.Introduction
- 1.1Our study
- 1.1.1Register based variation in fundamental frequency
- 1.1.2Register based variation in vowel space size
- 1.2Structure of this paper
- 1.1Our study
- 2.Video stimuli
- 2.1Creation of videos
- 2.2Rating of videos
- 2.3Results
- 2.3.1Formality ratings of picture stimuli
- 2.3.2Job ratings of pictures and videos
- 2.3.3Personality ratings of pictures and videos
- 2.4Summary: Stimulus rating
- 3.Production experiment
- 3.1Experimental Setup
- 3.2Participants
- 3.3Speech material
- 3.4Data processing and analysis
- 3.5Results
- 3.5.1Modelling mean f
- 3.5.2Modelling f range
- 3.5.3Modelling vowel dispersion
- 3.5.4Summary: Production data
- 4.Discussion
- Ethics approval
- Acknowledgments
References
References (117)
Avery, J. D., & Liss, J. M. (1996). Acoustic
characteristics of less-masculine-sounding male speech. The Journal of the Acoustical Society
of
America, 99(6), 3738–3748.
Aung, T., & Puts, D. (2020). Voice
pitch: a window into the communication of social power. Current opinion in
psychology, 331, 154–161.
Aung, T., Hill, A. K., Hlay, J. K., Hess, C., Hess, M., Johnson, J., Doll, L., Carlson, S. M., Magdinec, C., G-Santoyo, I., Walker, R. S., Bailey, D., Arnocky, S., Kamble, S., Vardy, T., Kyritsis, T., Atkinson, Q., Jones, B., Burns, J., … Puts, D. (2024). Effects
of Voice Pitch on Social Perceptions Vary With Relational Mobility and Homicide
Rate. Psychological
Science, 35(3), 250–262.
(2005). Registers
of Language. In A. Duranti (Ed.), A
Companion to Linguistic Anthropology (1st
ed., pp. 23–45). Wiley.
Barlow, M., & Kemmer, S. (Eds.). (2000). Usage-based
models of language. CSLI Publications, Center for the Study of Language and Information.
Bartoń, K. (2024). MuMIn:
Multi-Model Inference. Retrieved from [URL]
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting
Linear Mixed-Effects Models Using lme4. Journal of Statistical
Software, 67(1), 1–48.
Beattie, G. W., Cutler, A., & Pearson, M. (1982). Why
is Mrs Thatcher interrupted so
often? Nature, 300(5894), 744–747.
Ben-Aderet, T., Gallego-Abenza, M., Reby, D., & Mathevon, N. (2017). Dog-directed
speech: Why do we use it and do dogs pay attention to it? Proceedings of the Royal Society B:
Biological
Sciences, 284(1846), 20162429.
Berry, D. S. (1991). Accuracy
in social perception: Contributions of facial and vocal information. Journal of Personality and
Social
Psychology, 61(2), 298–307.
Biber, D. (1995). Dimensions
of Register Variation: A Cross-Linguistic Comparison. Cambridge University Press.
Biber, D., & Egbert, J. (2016). Using
Grammatical Features for Automatic Register Identification in an Unrestricted Corpus of Documents from the Open
Web. Journal of Research Design and Statistics in Linguistics and Communication
Science, 2(1), 3–36.
Boersma, P., & Weenink, D. (2022). Praat:
Doing phonetics by computer. [URL]
Bradlow, A. R., Torretta, G. M., & Pisoni, D. B. (1996). Intelligibility
of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics. Speech
Communication, 20(3–4), 255–272.
Brown, B. L. (1982). Experimentelle
Untersuchungen zur Personenwahrnehmung aufgrund vokaler
Hinweisreize. In: Scherer, K. (Hg.): Vokale
Kommunikation. Weinheim und Basel: Beltz Verlag, 211–227.
Brown, P. (2015). Politeness
and Language. In International Encyclopedia of the Social &
Behavioral
Sciences (pp. 326–330). Elsevier.
Brown, P., & Levinson, S. C. (1987). Politeness:
Some universals in language usage. Cambridge University Press.
Burgoon, J. K., Manusov, V., & Guerrero, L. K. (2021). Nonverbal
Communication (2nd ed.). New York: Routledge.
Cartei, V., Bond, R., & Reby, D. (2014). What
makes a voice masculine: Physiological and acoustical correlates of women’s ratings of men’s vocal
masculinity. Hormones and
Behavior, 66(4), 569–576.
Claßen, K., Dogil, G., Jessen, M., Marasek, K., & Wokurek, W. (1998). Stimmqualität
und Wortbetonung im Deutschen. Linguistische
Berichte, 1741, 202–245.
Costa, P. T. jr. & McCrae, R. (1992). Revised
NEO Personality Inventory (NEO PI — R) and NEO Five Factor Inventory. Professional
Manual. Odessa, Florida: Psychological Assessment Resources.
DiCanio, C., Nam, H., Amith, J. D., García, R. C., & Whalen, D. H. (2015). Vowel
variability in elicited versus spontaneous speech: Evidence from Mixtec. Journal of
Phonetics, 481, 45–59.
Diehl, R. L., Lindblom, B., Hoemeke, K. A., & Fahey, R. P. (1996). On
explaining certain male-female differences in the phonetic realization of vowel
categories. Journal of
Phonetics, 241, 187–208.
Duran, D., & Jannedy, S. (2023). Register
differences in ÄH–ÄHM filler particles? In S. Betz, B. Ludusan, & P. Wagner (Eds.), Disfluency
in Spontaneous Speech (DiSS) Workshop
2023 (pp. 48–52). ISCA.
Duran, D., Weirich, M., & Jannedy, S. (2023). Assessing
register variation in local speech rate. In R. Skarnitzl & J. Volín (Eds.), Proceedings
of the 20th International Congress of Phonetic Sciences, Prague
2023 (pp. 2314–2318). [URL]
Eckert, P. (2012). Three
Waves of Variation Study: The Emergence of Meaning in the Study of Sociolinguistic
Variation. Annual Review of
Anthropology, 41(1), 87–100.
Eckert, P., & Labov, W. (2017). Phonetics,
phonology and social meaning. Journal of
Sociolinguistics, 21(4), 467–496.
Fant, G. (1966). A
note on vocal tract size factors and non-uniform F-pattern
scalings. STL-QPSR, 41, 22–30.
(1970). Acoustic
theory of speech production: With Calculations based on X-Ray Studies of Russian Articulations (2nd
ed.). The Hague, Paris: Mouton.
Feinberg, D. R. (2008). Are
human faces and voices ornaments signaling common underlying cues to mate value? Evolutionary
Anthropology: Issues, News, and
Reviews, 17(2), 112–118.
Ferguson, S. H., & Kewley-Port, D. (2007). Talker
Differences in Clear and Conversational Speech: Acoustic Characteristics of Vowels. Journal of
Speech, Language, and Hearing
Research, 50(5), 1241–1255.
Fiske, S. T. (2018). Stereotype
Content: Warmth and Competence Endure. Current Directions
in Psychological
Science, 27(2), 67–73.
Fiske, S. T., Cuddy, A. J. C., & Glick, P. (2007). Universal
dimensions of social cognition: Warmth and competence. Trends in Cognitive
Sciences, 11(2), 77–83.
Fox, R. A., & Jacewicz, E. (2017). Reconceptualizing
the vowel space in analyzing regional dialect variation and sound change in American
English. The Journal of the Acoustical Society of
America, 142(1), 444–459.
Giles, H. (1973). Accent
mobility: A model and some data. Anthropological
Linguistics, 15(2), 87–105.
(Ed.). (2016). Communication
Accommodation Theory: Negotiating Personal Relationships and Social Identities Across
Contexts. Cambridge University Press.
Goldinger, S. D. (1996). Words
and voices: Episodic traces in spoken word identification and recognition memory. Journal of
Experimental Psychology: Learning, Memory, and
Cognition, 22(5), 1166–1183.
Guillemot, C., & Sano, S. (2020). Gender-
and register-biased patterns in f use: How does prosody contribute to (in)formality in
Japanese? In Proceedings of Speech Prosody
2020, 725–729.
Gussenhoven, C. (2002). Intonation
and Interpretation: Phonetics and Phonology. Proc. Speech Prosody
2002, 47–57. ISCA Archive.
Harnsberger, J. D., Wright, R., & Pisoni, D. B. (2008). A
new method for eliciting three speaking styles in the laboratory. Speech
Communication, 50(4), 323–336.
Hay, J., Jannedy, S., & Mendoza-Denton, N. (2010). Oprah
and /ay/: Lexical frequency, referee design, and style. In E. Schleef & M. Meyerhoff (Eds.), The
Routledge sociolinguistics
reader (pp. 53–59). Routledge.
Hazan, V., & Baker, R. (2010). Does
reading clearly produce the same acoustic-phonetic modifications as spontaneous speech in a clear speaking
style? DiSS-LPSS Joint Workshop (DiSS
2010), 7–10.
Heffernan, K. (2010). Mumbling
is macho: Phonetic distinctiveness in the speech of American radio DJs. American
Speech, 85(1), 67–90.
Hillenbrand, J., Getty, L. A., Clark, M. J., & Wheeler, K. (1995). Acoustic
characteristics of American English vowels. The Journal of the Acoustical Society of
America, 97(5), 3099.
Jacewicz, E., Fox, R. A., & Salmons, J. (2007). Vowel
Space Areas Across Dialects and Gender. In J. Trouvain & W. J. Barry (Eds.), ICPhS
XVI: 16th International Congress of Phonetic
Sciences (pp. 1465–1468).
Jannedy, S., Mendoza-Denton, N., & Weirich, M. (2019). Social
capital and the production and perception of fine phonetic detail in
Berlin. In T. Heyd, F. Mengden, & B. Schneider (Eds.), The
Sociolinguistic Economy of
Berlin (pp. 125–144). De Gruyter.
Johnson, K. (1997). Speech
Perception without Speaker Normalization: An Exemplar Model. In K. Johnson & J. Mullennix (Eds.), Talker
Variability in Speech
Processing (pp. 145–165). Academic Press.
Jongman, A., Wayland, R. & Wong, S. (2000). Acoustic
characteristics of English fricatives. The Journal of the Acoustical Society of
America 1081, 1252–1263.
Kisler, T., Reichel, U., & Schiel, F. (2017). Multilingual
processing of speech via web services. Computer Speech &
Language, 451, 326–347.
Kleiner, S., Knöbl, R., Dudenredaktion
(Bibliographisches Institut), & Institut für Deutsche
Sprache (Eds.). (2015). Duden, das
Aussprachewörterbuch (7th ed.). Dudenverlag; Institut für Deutsche Sprache.
Klofstad, C. A., Anderson, R. C., & Nowicki, S. (2015). Perceptions
of Competence, Strength, and Age Influence Voters to Select Leaders with Lower-Pitched
Voices. PLOS
ONE, 10(8), e0133779.
Knief, U., & Forstmeier, W. (2021). Violating
the normality assumption may be the lesser of two evils. Behavior Research
Methods, 53(6), 2576–2590.
Ko, S. J., Judd, C. M., & Blair, I. V. (2006). What
the Voice Reveals: Within- and Between-Category Stereotyping on the Basis of Voice. Personality
and Social Psychology
Bulletin, 32(6), 806–819.
Koch, P., & Oesterreicher, W. (1985). Sprache
der Nähe — Sprache der Distanz. Mündlichkeit und Schriftlichkeit im Spannungsfeld von Sprachtheorie und
Sprachgeschichte. Romanistisches
Jahrbuch, 36(1), 15–43.
(2012). Language
of immediacy — language of distance. Orality and Literacy from the perspective of language theory and linguistic
history. In C. Lange, B. Weber, & G. Wolf (Eds.), Communicative
Spaces: Variation, Contact, and Change. Papers in Honour of Ursula
Schaefer (pp. 441–473). Peter Lang.
Kohler, K. J. (2000). Investigating
Unscripted Speech: Implications for Phonetics and
Phonology. Phonetica, 57(2–4), 85–94.
Kuhl, P. K., Andruski, J. E., Chistovich, I. A., Chistovich, L. A., Kozhevnikova, E. V., Ryskina, V. L., Stolyarova, E. I., Sundberg, U., & Lacerda, F. (1997). Cross-language
analysis of phonetic units in language addressed to
infants. Science, 277(5326), 684–686.
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest
Package: Tests in Linear Mixed Effects Models. Journal of Statistical
Software, 82(13), 1–26.
Labov, W. (1966). The
Social Stratification of English in New York City. Washington, DC: Center for Applied Linguistics.
(2006). The
Social Stratification of English in New York City (2nd
ed.). Cambridge University Press. First published
1966.
Lancien, M., & Côté, M.-H. (2021). Hypoarticulation
as a tool for assessing social distance: An acoustic study of speech addressed to different types of
interlocutors. Studii de
Lingvistica, 111, 55–84.
Lehmann, N., Serova, D., Lukassek, J., Döring, S., Goymann, F., Lüdeling, A., & Akbari, R. (2023). Guidelines
for the Annotation of Parameters of Narration. REALIS: Register Aspects of Language in
Situation, 21.
Lenth, R. V. (2022). emmeans:
Estimated Marginal Means, aka Least-Squares Means. Retrieved from [URL]
Leongómez, J. D., Mileva, V. R., Little, A. C., & Roberts, S. C. (2017). Perceived
differences in social status between speaker and listener affect the speaker’s vocal
characteristics. PLoS
One, 12(6), e0179407.
Lüdecke, D., Ben-Shachar, M., Patil, I., Waggoner, P., & Makowski, D. (2021). performance:
An R Package for Assessment, Comparison and Testing of Statistical Models. Journal of Open
Source Software, 6(60), 3139.
Luke, S. G. (2017). Evaluating
significance in linear mixed-effects models in R. Behavior Research
Methods, 49(4), 1494–1502.
Mathôt, S., Schreij, D., & Theeuwes, J. (2012). OpenSesame:
An open-source, graphical experiment builder for the social sciences. Behavior Research
Methods, 44(2), 314–324.
Munson, B. (2007). The
Acoustic Correlates of Perceived Masculinity, Perceived Femininity, and Perceived Sexual
Orientation. Language and
Speech, 50(1), 125–142.
Ohala, J. J. (1984). An
Ethological Perspective on Common Cross-Language Utilization of F₀ of
Voice. Phonetica, 41(1), 1–16.
Osgood, C. E., Suci, G. J., Tannenbaum, P. H. (1957). The
measurement of meaning. University of Illinois.
Pätzold, M., & Simpson, A. P. (1997). Acoustic
analysis of German vowels in the Kiel Corpus of Read
Speech. In Arbeitsberichte Des Instituts Für Phonetik Und Digitale
Sprachverarbeitung (AIPUK): Vol. 32. The Kiel Corpus of Read/Spontaneous Speech Acoustic data base, processing tools and
analysis
results (pp. 215–247). Kiel: Universität Kiel.
Pescuma, V. N., Serova, D., Lukassek, J., Sauermann, A., Schäfer, R., Adli, A., Bildhauer, F., Egg, M., Hülk, K., Ito, A., Jannedy, S., Kordoni, V., Kuehnast, M., Kutscher, S., Lange, R., Lehmann, N., Liu, M., Lütke, B., Maquate, K., … Knoeferle, P. (2023). Situating
language register across the ages, languages, modalities, and cultural aspects: Evidence from complementary
methods. Frontiers in Psychology, 131.
Picheny, M. A., Durlach, N. I., & Braida, L. D. (1986). Speaking
Clearly for the Hard of Hearing II: Acoustic Characteristics of Clear and Conversational
Speech. Journal of Speech, Language, and Hearing
Research, 29(4), 434–446.
Pierrehumbert, J. B. (2001). Exemplar
dynamics: Word frequency, lenition, and contrast. In J. L. Bybee & P. J. Hopper (Eds.), Frequency
and the Emergence of Linguistic
Structure (pp. 137–157). John Benjamins Publishing Company.
Pisanski, K., & Bryant, G. A. (2019). The
Evolution of Voice Perception. In N. S. Eidsheim & K. Meizel (Eds.), The
Oxford Handbook of Voice
Studies (pp. 268–300). Oxford University Press.
Podesva, R. J. (2007). Phonation
type as a stylistic variable: The use of falsetto in constructing a persona. Journal of
Sociolinguistics, 11(4), 478–504.
Puts, D. A., Hodges, C. R., Cárdenas, R. A., & Gaulin, S. J. C. (2007). Men’s
voices as dominance signals: Vocal fundamental and formant frequencies influence dominance attributions among
men. Evolution and Human
Behavior, 28(5), 340–344.
R Core Team. (2022). R: A Language and
Environment for Statistical Computing (version 4.2.2). R Foundation for Statistical Computing. [URL]
Raiche, G., & Magis, D. (2022). nFactors:
Parallel Analysis and Other Non Graphical Solutions to the Cattell Scree Test. [URL]
Schielzeth, H., Dingemanse, N. J., Nakagawa, S., Westneat, D. F., Allegue, H., Teplitsky, C., Réale, D., Dochtermann, N. A., Garamszegi, L. Z., & Araya-Ajoy, Y. G. (2020). Robustness
of linear mixed-effects models to violations of distributional assumptions. Methods in Ecology
and
Evolution, 11(9), 1141–1152.
Schild, C., Aung, T., Kordsmeyer, T. L., Cardenas, R. A., Puts, D. A., & Penke, L. (2020). Linking
human male vocal parameters to perceptions, body morphology, strength and hormonal profiles in contexts of sexual
selection, Scientific
Reports, 10 (1), 1–16.
Sell, A., Bryant, G. A., Cosmides, L., Tooby, J., Sznycer, D., Von Rueden, C., Krauss, A., & Gurven, M. (2010). Adaptations
in humans for assessing physical strength from the voice. Proceedings of the Royal Society B:
Biological
Sciences, 277(1699), 3509–3518.
Sharma, D., & McCarthy, K. (2018). Attentional
Load and Style Control. University of Pennsylvania Working Papers in
Linguistics, 24(2), 127–135.
Silverstein, M. (2003). Indexical
order and the dialectics of sociolinguistic life. Language &
Communication, 23(3–4), 193–229.
Simpson, A. P., & Weirich, M. (2015). Gender-specific
differences in sibilant contrast realizations in English and German. 18th International
Congress of the Phonetic Sciences (ICPhS), Glasgow.
Smiljanić, R., & Bradlow, A. R. (2005). Production
and perception of clear speech in Croatian and English. The Journal of the Acoustical Society
of
America, 118(3), 1677–1688.
(2009). Speaking
and Hearing Clearly: Talker and Listener Factors in Speaking Style Changes. Language and
Linguistics
Compass, 3(1), 236–264.
Szmrecsanyi, B., & Engel, A. (2023). A
variationist perspective on the comparative complexity of four registers at the intersection of mode and
formality. Corpus Linguistics and Linguistic
Theory, 19(1), 79–113.
Sorokowski, P., Puts, D., Johnson, J., Zółkiewicz, O., Oleszkiewicz, A., Sorokowska, A., Kowal, M., Borkowska, B., & Pisanski, K. (2019). Voice
of Authority: Professionals Lower Their Vocal Frequencies When Giving Expert Advice. Journal of
Nonverbal
Behavior, 43(2), 257–269.
Steiner, M. D., & Grieder, S. (2020). EFAtools:
An R package with fast and flexible implementations of exploratory factor analysis
tools. Journal of Open Source
Software, 5(53), 2521.
Stevens, K. N. (1972). The
Quantal Nature of Speech: Evidence from Articulatory-Acoustic
Data. In Edward E. David Jr. & P. B. Denes (Eds.), Human
Communication: A Unified
View (pp. 51–66). New York: McGraw-Hill.
Titze, I. R. (1989). Physiologic
and acoustic differences between male and female voices. The Journal of the Acoustical Society
of
America, 85(4), 1699–1707.
Turnbull, R. (2019). Listener-oriented
phonetic reduction and theory of mind. Language, Cognition and
Neuroscience, 34(6), 747–768.
Weirich, M., & Simpson, A. P. (2014). Differences
in acoustic vowel space and the perception of speech tempo. Journal of
Phonetics, 431, 1–10.
Weirich, M., & Simpson, A. (2019). Effects
of gender, parental role, and time on infant-and adult-directed read and spontaneous
speech. Journal of Speech, Language, and Hearing
Research, 62(11), 4001–4014.
Whiteside, S. P. (2001). Sex-specific
fundamental and formant frequency patterns in a cross-sectional study. The Journal of the
Acoustical Society of
America, 1101, 464–478.
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., Takahashi, K., Vaughan, D., Wilke, C., Woo, K., Yutani, H. (2019). Welcome
to the tidyverse. Journal of Open Source
Software, 4(43), 1686.
Winkler, R., Fuchs, S., & Perrier, P. (2006). The
relation between differences in vocal tract geometry and articulatory control strategies in the production of French vowels:
Evidence from MRI and modelling. Proceedings of the International Seminar on Speech
Production, 509–516. CEFALA Ubatuba, Brazil. Retrieved from [URL]
Winter, B., & Grawunder, S. (2012). The
phonetic profile of Korean formal and informal speech registers. Journal of
Phonetics, 40(6), 808–815.
Winter, B., Oh, G. E., Hübscher, I., Idemaru, K., Brown, L., Prieto, P., & Grawunder, S. (2021). Rethinking
the frequency code: A meta-analytic review of the role of acoustic body size in communicative
phenomena. Philosophical Transactions of the Royal Society B: Biological
Sciences, 376(1840), 20200400.
Cited by (1)
Cited by one other publication
This list is based on CrossRef data as of 30 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.