Article published In: Asia-Pacific Language Variation
Vol. 2:1 (2016) ► pp.48–81
Sociotonetics using connected speech
A study of Sui tone variation in free-speech style
Published online: 29 September 2016
https://doi.org/10.1075/aplv.2.1.02sta
https://doi.org/10.1075/aplv.2.1.02sta
Abstract
This is the first variationist sociotonetic study to use free-speech data for exploring tone. Due to the challenges of analyzing tone in free-speech data, prior work on sociotonetics has been limited to relatively formal speech styles: word lists, sentence frames, and phrase lists. But connected speech styles, including free speech and reading passages, are important for segmental sociophonetics and most other linguistic variables. Will free-speech data always be out of reach for sociotonetics? Can tone variation in connected speech data be normalized and meaningfully analyzed for sociolinguistic research questions? Using field data from the Sui language of China, this paper develops a practical approach for analyzing tone variation in connected speech data, and then applies it to a specific research question about dialect contact in exogamous Sui villages. Results show that some types of intra- and inter-speaker tone variation in connected speech can be effectively analyzed, although other types of tone variables are neutralized in this speech style.
抽象
本文是首篇利用自然言语来探讨声调的变异派社会语音学论。由于分析自然言语技巧上的限制以往文献限于单字表,单字表等正式言语。然而,连续式的言语,例如自然言语及朗读,是音段社会语音学内非常重要的领域。社会音调学永远无法被用来研究连续言语数据吗?连续言语数据能被标准化及分析吗?这篇论文利用田野调查发展出一个分析声调变异的实用方法,并且将其运用于研究外婚制水语族的方言接触。研究发现连续言语的声调变化能被分析,但某些类型的声调变数仍被中和。
Article outline
- 1.Introduction
- 1.1Normalization for variationist purposes
- 1.2Vowel formant normalization
- 1.3Tone normalization
- 1.4Background on the Sui field data study
- 1.4.1Sui tones
- 1.4.2The sociolinguistic question for the Sui field data study
- 2.Normalization methods
- 2.1Duration
- 2.2Mean pitch and pitch range
- 2.3Challenges of measuring tone in connected speech
- 2.4Testing moving-window normalization methods
- 2.5Examining the normalization method
- 3.Results of the Sui field data study
- 3.1Group analysis
- 3.2Conversation
- 4.Conclusion
- Notes
References
References (127)
Adank, Patti, van Hout, Roeland, & Smits, Roel (2004). An acoustic description of the vowels of Northern and Standard Dutch. Journal of the Acoustical Society of America, 1161, 1729–1738.
Adda-Decker, Martine, & Snoeren, Natalie (2011). Quantifying temporal speech reduction in French using forced speech alignment. Journal of Phonetics, 391, 261–270.
Aston, John, Chiou, Jeng-Min, & Evans, Jonathan (2010). Linguistic pitch analysis using functional principal component mixed effect models. Journal of the Royal Statistical Society, 591, 297–317.
Astruc, Lluisa (2013). Prosody. In Mark Jones & Rachel-Anne Knight (Eds.), The Bloomsbury companion to phonetics (pp. 126–139). London: Bloomsbury.
Bauer, Robert, Cheung, Kwan-Hin, & Cheung, Pak-Man (2003). Variation and merger of the rising tones in Hong Kong Cantonese. Language Variation and Change, 151, 211–225.
Boersma, Paul, & Weenink, David (2014). Praat: Doing phonetics by computer (Version 5.3.82) [Computer program]. Available from: [URL]
Bradley, David (2007). East and Southeast Asia. In Ronald E. Asher & Christopher Moseley (Eds.), Atlas of the world’s languages (2nd ed.) (pp. 157–208). London; New York: Routledge.
Brown, J. Marvin (1975). The great tone split: Did it work in two opposite ways? In Jimmy Harris & James Chamberlain (Eds.), Studies in Tai linguistics in honor of William J. Gedney (pp. 33–48). Bangkok: Central Institute of English Language.
Castro, Andy (2011). Southern Sui: A fourth Sui dialect. Journal of the Southeast Asian Linguistics Society, 41, 1–31.
Castro, Andy, & Pan, Xingwen (2014). Shuiyu diaocha yanjiu [Investigation of the Sui language]. Guiyang, China: Guizhou People’s Publishing.
Chasaide, Ailbhe, & Gobl, Christer (1993). Contextual variation of the vowel voice source as a function of adjacent consonants. Language and Speech, 361, 303–330.
Chen, C.-J., Gopinath, R., Monkowski, M., Picheny, M., & Shen, K. (1997). New methods in continuous Mandarin speech recognition. Proceedings of the fifth European Conference on Speech Communication and Technology, Rhodes, Greece, 1543–1546.
Clark, John, & Yallop, Colin (1995). An introduction to phonetics and phonology (2nd ed.). Oxford: Blackwell.
Clopper, Cynthia (2009). Computational methods for analyzing acoustic vowel data for talker differences. Language and Linguistics Compass, 31, 1430–1442.
Drager, Katie, & Hay, Jennifer (2012). Exploiting random intercepts: Two case studies in sociophonetics. Language Variation and Change, 241, 59–78.
Edmondson, Jerold, Esling, John, Harris, Jimmy, & Wei, James (2004). A phonetic study of Sui consonants and tones. Mon-Khmer Studies, 341, 47–66.
Edmondson, Jerold, & Solnit, David (Eds.). (1988). Comparative Kadai: Linguistic studies beyond Tai. Dallas, Texas: SIL.
Ernestus, Mirjam, & Warner, Natasha (2011). An introduction to reduced pronunciation variants. Journal of Phonetics, 391, 253–260.
Evans, Betsy (2011). Aspects of the acoustic analysis of imitation. In Dennis Preston & Nancy Niedzielski (Eds.), A reader in sociophonetics (pp. 379–391). New York: De Gruyter Mouton.
Evans, Jonathan , Chu, Man-ni, Aston, John, & Su, Chao-yu (2010). Linguistic and human effects on F0 in a tonal dialect of Qiang. Phonetica, 671, 82–99.
Flynn, Nicholas (2011). Comparing vowel formant normalization procedures. York Papers in Linguistics, 21, 1–28.
Fought, Carmen (1999). A majority sound change in a minority community: /u/-fronting in Chicano English. Journal of Sociolinguistics, 31, 5–23.
Foulkes, Paul, & Docherty, Gerard (2006). The social life of phonetics and phonology. Journal of Phonetics, 341, 409–438.
Fridland, Valerie, & Kendall, Tyler (2012). Exploring the relationship between production and perception in the mid front vowels of U.S. English. Lingua, 1221, 779–793.
Gerstman, Louis (1968). Classification of self-normalized vowels. IEEE Transactions of Audio Electroacoustics, AU-161, 78–80.
Hart, Johan’t, Collier, René, & Cohen, Antonie (1990). A perceptual study of intonation: An experimental approach to speech melody. Cambridge: Cambridge University Press.
Hermes, Dik, & van Gestel, Joost (1991). The frequency scale of speech intonation. Journal of the Acoustical Society of America, 901, 97–102.
Hildebrandt, Kristine (2003). Manange tone: Scenarios of retention and loss in two communities. Doctoral dissertation, University of California, Santa Barbara.
(2005). A phonetic analysis of Manange segmental and suprasegmental properties. Linguistics of the Tibeto-Burman Area, 281, 1–36.
Hombert, Jean-Marie (1978). Consonant types, vowel quality, and tone. In Victoria Fromkin (Ed.), Tone: A linguistic survey (pp. 77–111). New York: Academic Press.
Hombert, Jean-Marie, Ohala, John, & Ewan, William (1979). Phonetic explanations for the development of tones. Language, 551, 37–58.
House, Arthur, & Fairbanks, Grant (1953). The influence of consonant environment upon the secondary acoustical characteristics of vowels. Journal of the Acoustical Society of America, 251, 105–113.
Huang, Jingyuan, & Holt, Lori (2009). General perceptual contributions to lexical tone normalization. Journal of the Acoustical Society of America, 1251, 3983–3994.
(2011). Evidence for the central origin of lexical tone normalization. Journal of the Acoustical Society of America, 1291, 1145–1148.
Huang, Hank C.-H., & Seide, Frank (2000). Pitch tracking and tone features for Mandarin speech recognition.
Proceedings of the 2000 International Conference on Acoustics, Speech and Signal Processing
, Istanbul, Turkey, 1523–1526.
Johnson, Daniel E. (2009). Getting off the GoldVarb standard: Rbrul for mixed-effects variable rule analysis. Language and Linguistics Compass, 31, 359–383.
Johnson, Keith (1997). Speech perception without speaker normalization. In K. Johnson & J. W. Mullennix (Eds.), Talker variability in speech processing (pp. 145–166). San Diego: Academic Press.
(2003). Speaker normalization in speech perception. In David Pisoni & Robert Remez (Eds.), The handbook of speech perception (pp. 361–389). Oxford: Blackwell.
Kendall, Tyler & Thomas, Erik (2010). Vowels: Vowel manipulation, normalization, and plotting in R (software package). Available from: [URL]
Labov, William (1966). The social stratification of English in New York City. Washington, DC: Center for Applied Linguistics. (2nd edition, 2006, Cambridge: Cambridge University Press)
Labov, William, Ash, Sharon, & Boberg, Charles (2006). The linguistic atlas of North America. Berlin: Mouton de Gruyter.
Ladd, Robert, & Silverman, Kim (1984). Vowel intrinsic pitch in connected speech. Phonetica, 411, 31–40.
Ladefoged, Peter (2003). Phonetic data analysis: An introduction to fieldwork and instrumental techniques. Oxford: Blackwell.
Leather, Jonathan (1983). Speaker normalization in perception of lexical tone. Journal of Phonetics, 111, 373–382.
Lee, Tan, Lau, Wai, Wong, Y. W., & Ching, P. C. (2002). Using tone information in Cantonese continuous speech recognition. ACM Transactions on Asian Language Processing, 11, 83–102.
Lei, Xin (2006). Modeling lexical tones for Mandarin large vocabulary continuous speech recognition. Doctoral dissertation, University of Washington.
Li, Fang Kuei (1948). The distribution of initials and tones in the Sui language. Language, 241, 160–167.
Liang, Lei, & Meng, Xiaolin (2011). A sociophonetic study on tones of Chongqing Mandarin in gender and age difference. Paper presented at the International Conference of Phonetic Sciences XVII, Hong Kong.
(2013). Chongqing fangyan danzidiaode gongshibianyi [Chongqing dialect synchronic tone variation]. Language and Linguistics, 141, 929–959.
Liberman, Mark, & Pierrehumbert, Janet (1984). Intonational variance under changes in pitch range and length. In Mark Aronoff & Richard T. Oehrle (Eds.), Language sound structure (pp. 157–233). Cambridge, Massachusetts; London, England: The MIT Press.
Lobanov, Boris (1971). Classification of Russian vowels spoken by different listeners. Journal of the Acoustical Society of America, 491, 606–608.
Loveday, Leo (1981). Pitch, politeness and sexual role: An exploratory investigation into the pitch correlates of English and Japanese politeness formulae. Language and Speech, 241, 71–89..
Matisoff, James (1970). Glottal dissimilation and the Lahu high-rising tone: A tonogenetic case study. Journal of the American Oriental Society, 901, 13–44.
(1973). Tonogenesis in Southeast Asia. In Larry Hyman (Ed.), Consonant types and tone (pp. 71–95). Southern California Occasional Papers in Linguistics, No. 1. Los Angeles: UCLA.
Meyerhoff, Miriam (in press). Writing a linguistic symphony: Analysing variation while doing language documentation. Canadian Journal of Linguistics..
Miller, James (1989). Auditory-perceptual interpretation of the vowel. Journal of the Acoustical Society of America, 851, 2114–2134.
Mok, Peggy, Zuo, Donghui, & Wong, Peggy W. Y. (2013). Production and perception of a sound change in progress: Tone merging in Hong Kong Cantonese. Language Variation and Change, 251, 341–370.
Moore, Corinne, & Jongman, Allard (1997). Speaker normalization in the perception of Mandarin Chinese tones. Journal of the Acoustical Society of America, 1021, 1864–1877.
Nearey, Terrance (1977). Phonetic feature systems for vowels. Doctoral dissertation, University of Alberta, Edmonton, Canada.
(1989). Static, dynamic, and relational properties in vowel perception. Journal of the Acoustical Society of America, 851, 2088–2113.
Nguyen, Hong Quang, Nocera, Pascal, Castelli, Eric, & Trinh, Van Loan (2008). Tone recognition of Vietnamese continuous speech using Hidden Markov Model. Second International Conference on Communications and Electronics, 235–239. Available from [URL].
Nolan, Francis (2003). Intonational equivalence: An experimental evaluation of pitch scales. In Maria-Josep Solé, Daniel Recasens, & Joaquin Romero (Eds.), Proceedings of the 15th International Congress of the Phonetic Sciences (pp. 771–774). Barcelona.
Ohala, John (1978). Production of tones. In Victoria Fromkin (Ed.), Tone: A linguistic survey (pp. 5–39). New York: Academic Press.
Ohala, John, & Ewan, W. (1973). Speed of pitch change [Abstract]. Journal of the Acoustical Society of America, 531, 345.
Ohara, Yumiko (1992). Gender-dependent pitch levels: A comparative study in Japanese and English. In Kira Hall, Mary Bucholtz, & Birch Moonwomon (Eds.), Locating power: Proceedings of the Second Berkeley Women and Language Conference (pp. 469–477). Berkeley: Berkeley Women and Language Group.
Peng, Gang, & Wang, William S.-Y. (2005). Tone recognition of continuous Cantonese speech based on support vector machines. Speech Communication, 251, 49–62.
Pisoni, David (1997). Some thoughts on 'normalization' in speech perception. In Keith Johnson & John Mullenix (Eds.), Talker variability in speech processing (pp. 9–32). San Diego, California: Academic Press.
Podesva, Robert (2007). Phonation type as a stylistic variable: The use of falsetto in constructing a persona. Journal of Sociolinguistics, 111, 478–504.
Rose, Philip (1987). Considerations in the normalization of the fundamental frequency of linguistic tone. Speech Communication, 61, 343–351.
(1991). How effective are long term mean and standard deviation as normal parameters for tonal fundamental frequency? Speech Communication, 101, 229–247.
(1993). A linguistic-phonetic acoustic analysis of Shanghai tones. Australian Journal of Linguistics, 131, 185–220.
(1997). A seven-tone dialect in Southern Thai with super high: Pakphanang tonal acoustics and physiological inferences. In Arthur Abramson (Ed.), Southeast Asian linguistics studies in honour of Vichin Panupong (pp. 191–208). Bangkok: Chulalongkorn University Press.
Rosner, B., & Pickering, J. (1994). Vowel perception and production. New York; Oxford: Oxford University Press.
Ross, Elliot, Edmondson, Jerold, & Seibert, Burton G. (1986). The effect of affect on various acoustic measures of prosody in tone and non-tone languages: A comparison based on computer analysis of voice. Journal of Phonetics, 141, 283–302.
Sanchez, Kauyumari, Hay, Jennifer, & Nilson, Elissa (2015). Contextual activation of Australia can affect New Zealanders’ vowel productions. Journal of Phonetics, 481, 76–95.
Sankoff, Gillian, & Blondeau, Hélène (2007). Language change across the lifespan: /r/ in Montreal French. Language, 831, 560–588.
Shuiyu Diaocha Baogao [Sui Language Research Report] (1956). Unpublished manuscript, Gui yang, Guizhou, China. Acquired from Jerold Edmondson, University of Texas, Arlington.
Stanford, James (2007). Dialect contact and identity: A case study of exogamous Sui clans. Doctoral dissertation, Michigan State University.
(2008a). A sociotonetic analysis of Sui dialect contact. Language Variation and Change, 201, 409–450.
(2008b). Child dialect acquisition: New perspectives on parent/peer influence. Journal of Sociolinguistics, 121, 567–596.
(2009a). 'Eating the food of our place': Sociolinguistic loyalties in multidialectal Sui villages. Language in Society, 381, 287–309.
(2009b). Clan as a sociolinguistic variable. In James Stanford & Dennis Preston (Eds.), Variation in indigenous minority languages (pp. 463–484). Amsterdam; Philadelphia: John Benjamins.
(2011). A 50-year comparison of regional dialect variation in the Sui language. Journal of the Southeast Asian Linguistics Society, 41, 120–143.
Sundberg, Johan (1973). Data on maximum speed of pitch changes. Speech Transmission Laboratory Quarterly Progress and Status Reports, 41, 39–47.
(1979). Maximum speed of pitch changes in singers and untrained subjects. Journal of Phonetics, 71, 71–79.
Thomas, Erik (2002). Instrumental phonetics. In J. K. Chambers, Peter Trudgill, & Natalie Schil ling-Estes (Eds.), The handbook of language variation and change (pp. 168–200). Oxford, UK; Malden, MA: Blackwell.
(2010). A longitudinal analysis of the durability of the Northern-Midland dialect boundary in Ohio. American Speech, 851, 375–430.
Thomas, Erik, & Kendall, Tyler (2007). NORM: The vowel normalization and plotting suite. Available from [URL]
Tingsabadh, Kalaya M.R., & Deeprasert, Daranee (1997). Tones in Standard Thai Connected Speech. In Arthur Abramson (Ed.), Southeast Asian linguistics studies: In honor of Vichin Panupong (pp. 297–308). Chulalongkorn University Press. Available from [URL]
Tran, Do-Dat, Castelli, Eric, Serignat, Jean-Francois, Trinh, Van Loan, & Le, Xuan Hung (2005). Influence of F0 on Vietnamese syllable perception. INTERSPEECH, Lisbon, Portugal, September 2005.
Traunmüller, Hartmut, & Eriksson, Anders (1995). The perceptual evaluation of F0 excursions in speech as evidenced by liveliness estimations. Journal of the Acoustical Society of America, 971, 1905–1915.
Van Niekerk, Daniel, & Barnard, Etienne (2014). Predicting utterance pitch targets in Yoruba for tone realization in speech synthesis. Speech Communication, 561, 229–242.
Verbrugge, Robert, Strange, Winifred, Shankweiler, Donald, & Edman, Thomas (1976). What information enables a listener to map a talker’s vowel space? Journal of the Acoustical Society of America, 601, 198–212.
Watt, Dominic, & Fabricius, Anne (2002). Evaluation of a technique for improving the mapping of multiple speakers’ vowel spaces in the F1~F2 plane. Leeds Working Papers in Linguistics and Phonetics, 91, 159–173.
Wolfram, Walt (1986). Language variation in the United States. In O. L. Taylor (Ed.), Nature of communication disorders in culturally and linguistically diverse populations (pp. 73–115). San Diego, CA: College-Hill Press.
Wolfram, Walt, & Schilling-Estes, Natalie (2006). American English: Dialects and variation (2nd ed.). Malden, Massachusetts; Oxford, UK: Blackwell.
Wong, Patrick C. M., & Diehl, Randy L. (2003). Perceptual normalization for inter- and intratalker variation in Cantonese level tones. Journal of Speech, Language, and Hearing Research, 461, 413–421.
Wright, Richard, & Shryock, Aaron (1993). The effects of pitch on implosives in SiSwati. Journal of the International Phonetic Association, 231, 16–23.
Xia, Yongliang (1988/2008). Duyunshi Yanghexiang Pandong Shuiyu diaocha [Duyun city Yanghe village Pandong Sui language investigation]. In Yongqu Liang (Ed.), Guizhou Liushanliushui Minzu Diaocha Ziliao Xuanbian [Selections from Guizhou Nationalities Investigations] (pp. 254–265). Guiyang, China: Guizhou Nationalities Publishing.
Xu, Yi (1994). Production and perception of coarticulated tones. Journal of the Acoustical Society of America, 951, 2240–2253.
(1999). Effects of tone and focus on the formation and alignment of F0 contours. Journal of Phonetics, 271, 55–105.
(2004). Understanding tone from the perspective of production and perception. Language and Linguistics, 51, 757–797.
(2005). Speech melody as articulatorily implemented communicative functions. Speech Communication, 461, 220–251.
(2006). Tone in connected discourse. In Keith Brown (Ed.), Encyclopedia of language and linguistics (2nd ed.) (pp. 742–750). Oxford: Elsevier.
Xu, Yi, & Sun Xuejing (2002). Maximum speed of pitch change and how it may relate to speech. Journal of the Acoustical Society of America, 1111, 1399–1413.
Xu, Yi, & Wang, Q. Emily (2001). Pitch targets and their realization: Evidence from Mandarin Chinese. Speech Communication, 331, 319–337.
Yaeger-Dror, Malcah, & Fagyal, Zsuzsanna (2011). Anayzing prosody: Best practices for the analysis of prosody. In Marianna Di Paolo & Malcah Yaeger-Dror (Eds.), Sociophonetics: A student’s guide (pp. 119–130). London; New York: Routledge.
Yuasa, Patricia (2008). Culture and gender of voice pitch: A sociophonetic comparison of the Japanese and Americans. London: Equinox.
Zeng, Xiaoyu, & Yao, Fuxiang (1996). Han-Shui cidian [Chinese-Sui dictionary]. Chengdu: Sichuan Minorities Publishing.
Cited by (8)
Cited by eight other publications
Nagy, Naomi, Holman Tse & James N. Stanford
Yang, Cathryn
Yang, Cathryn, James N. Stanford, Chunxia Luo & Naluo Zhang
Yu, Alan C. L., Crystal W. T. Lee, Chen Lan & Peggy P. K. Mok
Yang, Cathryn, James N. Stanford, Yang Liu, Jinjing Jiang & Liufang Tang
2019. Variation in the tonal space of Yangliu Lalo, an endangered language of Yunnan, China. Linguistics of the Tibeto-Burman Area 42:1 ► pp. 2 ff.
Zhang, Jingwei
Kuo, Jennifer
2018. A large-scale smartphone-based sociophonetic study of Taiwan Mandarin. Asia-Pacific Language Variation 4:2 ► pp. 197 ff.
This list is based on CrossRef data as of 14 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
