Article published In: Concentric
Vol. 47:2 (2021) ► pp.184–224
From speech to language
An alternative corpus account of prosodic highlight in continuous speech
Published online: 17 November 2021
https://doi.org/10.1075/consl.00027.che
https://doi.org/10.1075/consl.00027.che
Abstract
This study proposes a novel exploration of perceived prosodic highlights in continuous speech, focusing on the alternative function of indexing and projecting information content deployment in the speech context. Given the assumption that prosodic highlight allocation directly reflects the interlocutors’ information content deployment, this study foregrounds perception-based prominences for indexing both the key information (KEY) and the projector (PJR) that projects the deployment of key/focal information. Two information content planning units (PJR plus its respective projection PJN, and KEY) prompted by prosodic highlights were established, based on quantitative analyses and discriminative acoustic features. Additional analyses confirm a general heavy-to-light information distribution across both units, showcasing that the relative projection trajectory size in the PJR-PJN unit is positively correlated to its position within discourse-prosodic units. Current results, therefore, directly substantiate the cognitive explanation of prosodic projection in speech, as evidence beyond syntactic relationships are drawn and prosodic projection is shown to involve perceived prosodic highlight allocation and information deployment in a fixed pattern. Explorations of prosody-prompted projection shed light on a more comprehensive account of the mechanism behind information planning, hence facilitating a deeper understanding of the composition of context prosody and the derivation of linguistic invariants from speech.
Article outline
- 1.Introduction
- 2.On projection in previous studies
- 3.The present study: A preview
- 4.Speech materials and data pre-processing
- 4.1The speech data
- 4.2Data pre-processing and annotations
- 4.2.1Annotations for discourse-prosodic unit
- 4.2.2Annotations for perceived prosodic highlight
- 4.3Categorizing prominence prompted information content
- 4.3.1Key information (KEY)
- 4.3.2Projector (PJR plus its respective projection PJN)
- 4.3.3Referring expression (KEY-REF)
- 4.3.4Inferred key information (KEY-INF)
- 5.Analysis I: The information content category
- 5.1Distribution of information content categories
- 5.1.1Discussion
- 5.2Prosodic profiles of KEY and PJR
- 5.2.1Discussion
- 5.3Interim summary
- 5.4PJR-PJN by the discourse-prosodic unit boundaries
- 5.4.1Discussion
- 5.1Distribution of information content categories
- 6.Analyses by information planning units: Calculation of emphasis density and their position by discourse-prosodic units
- 6.1Analysis II: Distribution of emphasis density
- 6.1.1Emphasis density score calculation
- 6.1.2Results of emphasis density score calculation
- 6.1.3Discussions
- 6.2Analysis III: Locating information planning units by discourse-prosodic units
- 6.2.1Estimating the average position of information planning units
- 6.2.2Analysis result
- 6.2.3Discussions
- 6.1Analysis II: Distribution of emphasis density
- 7.General discussions and summary
- 7.1On prosodic highlight-prompted information content units
- 7.2Toward a cognitive significance for prosodic highlight-prompted projection
- Acknowledgements
- Notes
- Abbreviations
References
References (39)
Auer, Peter. 1996. On the prosody and syntax of turn-continuations. Prosody and Conversation: Interactional Studies, ed. by Elizabeth Couper-Kuhlen and Margret Selting, 57–100. Cambridge, UK: Cambridge University Press.
. 2015. The temporality of language in interaction: Projection and latency. Temporality in Interaction, ed. by Arnulf Deppermann and Susanne Günthner, 27–56. Amsterdam & Philadelphia: John Benjamins.
Baumann, Stefan, Oliver Niebuhr, and Bastian Schroeter. 2016. Acoustic cues to perceived prominence levels: Evidence from German spontaneous speech. Proceedings of 8th Speech Prosody Conference, ed. by Jon Barnes, Alejna Brugos, Stefanie Shattuck-Hufnagel and Nanette Veilleux, 711–715. Baixas, France: ISCA Archive.
Boersma, Paul, and David Weenink. 2015. Praat: Doing phonetics by computer. Retrieved November 20, 2015, from [URL]
Chafe, Wallace. 1994. Discourse, Consciousness, and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing. Chicago, IL: University of Chicago Press.
Chen, Kai-Yun, Laurent Prévot, Roxane Bertrand, Béatrice Priego-Valverde, and Philippe Blache. 2012. Toward a Mandarin-French corpus of interactional data. Proceedings of the 16th Workshop on the Semantics and Pragmatics of Dialogues, ed. by Sarah Brown-Schmidt, Jonathan Ginzburg and Staffan Larsson, 147–148. Paris, France: SEMDIAL.
Chen, Kai-Yun, Wei-Te Fang, and Chiu-Yu Tseng. 2015. Information content, weighting and distribution in continuous speech prosody–A cross-genre comparison. Paper presented at the 2015 International Conference Oriental COCOSDA, Shanghai Jiao Tong University, Shanghai.
Chomsky, Noam. 1986. Knowledge of Language: Its Nature, Origin, and Use. Westport, CT: Greenwood Publishing Group.
Clark, Andy. 2013. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences 36.3:181–204.
De Ruiter, Jan-Peter, Holger Mitterer, and Nick J. Enfield. 2006. Projecting the end of a speaker’s turn: A cognitive cornerstone of conversation. Language 82.3:515–535.
Dilley, Laura. 2016. Rhythm, context effects, and prediction. Paper presented at the 8th Speech Prosody Conference (SP 2016), Boston University, Boston.
Falk, Simone. 2014. On the notion of salience in spoken discourse–Prominence cues shaping discourse structure and comprehension. TIPA: Travaux Interdisciplinaires Sur La Parole et Le Langage 301:1–23.
Ford, Cecilia E., and Sandra A. Thompson. 1996. Interactional units in conversation: Syntactic, intonational, and pragmatic resources for the management of turns. Interaction and Grammar, ed. by Elinor Ochs, Emanuel A. Schegloff and Sandra A. Thompson, 135–184. Cambridge, UK: Cambridge University Press.
Goodwin, Charles. 1996. Transparent vision. Interaction and Grammar, ed. by Elinor Ochs, Emanuel A. Schegloff and Sandra A. Thompson, 370–404. Cambridge, UK: Cambridge University Press.
Haegeman, Lillian. 1994. Introduction to Government and Binding Theory. Oxford, UK: Blackwell Publishing.
Halliday, Michael Alexander Kirkwood. 1967. Notes on transitivity and theme in English: Part 1. Journal of Linguistics 3.1:37–81.
Huang, Shuanfan. 2013. Chinese Grammar at Work. Amsterdam & Philadelphia: John Benjamins.
Jefferson, Gail. 1973. A case of precision timing in ordinary conversation: Overlapped tag-positioned address terms in closing sequences. Semiotica 9.1:47–96.
Kohler, Klaus. J. 1997. Modelling prosody in spontaneous speech. Computing Prosody: Computational Models for Processing Spontaneous Speech, ed. by Sagisaka Yoshinori, Nick Campbell and Norio Higuchi, 187–210. New York: Springer.
Lambrecht, Knud. 1994. Information Structure and Sentence Form: Topic, Focus, and the Mental Representations of Discourse Referents. Cambridge, UK: Cambridge University Press.
Lerner, Gene. H. 1996. On the “semi-permeable” character of grammatical units in conversation: Conditional entry into the turn space of another speaker. Interaction and Grammar, ed. by Elinor Ochs, Emanuel A. Schegloff and Sandra A. Thompson. 238–271. Cambridge, UK: Cambridge University Press.
Pierrehumbert, Janet, and Julia Bell Hirschberg. 1990. The meaning of intonational contours in the interpretation of discourse. Intentions in Communication, ed. by Philip R. Cohen, Jerry L. Morgan, and Martha E. Pollack, 271–311. Cambridge, MA: MIT Press.
Sacks, Harvey, Emanuel. A. Schegloff, and Gail Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50.4:696–735.
Silverman, Kim, Mary Beckman, John Pitrelli, Mari Ostendorf, Colin Wightman, Patti Price, Janet Pierrehumbert and Julia Hirschberg. 1992. ToBI: A standard for labeling English prosody. Proceedings of the 2nd International Conference on Spoken Language Processing (ICSLP 92), ed. by John J. Ohala, Terrance M. Nearey, Bruce L. Derwing, Megan M. Hodge and Grace Weibe, 867–870. Alberta, Canada: University of Alberta.
Su, Chao-yu, and Chiu-yu Tseng. 2015. Melody of Mandarin L2 English⎯When L1 transfer and L2 planning come together. Paper presented at the 2015 International Conference Oriental COCOSDA, Shanghai Jiao Tong University, Shanghai.
. 2017. How prosodic cues could lead to information center in speech⎯An alternative to ASR. Paper presented at the 2017 International Conference on Speech Database and Assessments (Oriental COCOSDA), Seoul National University, Seoul.
Tseng, Chiu-yu. 2010. An F0 analysis of discourse construction and global information in realized narrative prosody. Language and Linguistics 11.2:183–218.
. 2013. Output prosody⎯How information highlights are piggybacked by discourse structure. Zhongguo Yuyin Xuebao 41:109–124.
Tseng, Chiu-yu, and Chao-yu Su. 2012. Information allocation and prosodic expressiveness in continuous speech: A Mandarin cross genre analysis. Paper presented at the 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012), Innocentre, Hong Kong.
. 2014. L2 discourse and information planning and their prosodic implications. Paper presented at the 2014 International Conference Oriental COCOSDA. Cape Panwa Hotel, Phuket.
Tseng, Chiu-yu, Lin-shan Lee, and Zhao-yu Su. 2008. Spontaneous Mandarin speech prosody-the NTU DSP lecture corpus. Paper presented at the 2008 International Conference Oriental COCOSDA. National Institute of Communication Technology, Kyoto.
Tseng, Chiu-yu, Shao-huang Pin, Yehlin Lee, Hsin-min Wang, and Yong-cheng Chen. 2005. Fluent speech prosody: Framework and modelling. Speech Communication 46.3–4:284–309.
Tseng, Chiu-yu, Yun-Ching Cheng, Wei-Shan Lee, and Feng-Lan Huang. 2003. Collecting Mandarin speech databases for prosody investigation. Paper presented at the Oriental-COCOSDA Workshop 2003. Sijori Resort Sentosa, Sentosa.
Tseng, Chiu-yu, and Zhao-yu Su. 2008. Discourse prosody and context–Global F0 and tempo modulations. Paper presented at the 9th Annual Conference of the International Speech Communication Association (INTERSPEECH 2008). Brisbane Convention and Exhibition Centre (BCEC), Brisbane.
Tseng, Chiu-yu, Zhao-yu Su, and Chi-feng Huang. 2011. Prosodic highlights in Mandarin continuous speech⎯Cross-genre attributes and implications. Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), ed. by Piero Cosi, Renato De Mori, Giuseppe Di Fabbrizio and Roberto Pieraccini, 1381–1384. Baixas, France: ISCA Archive.
