In:Spoken Corpora and Linguistic Studies
Edited by Tommaso Raso and Heliana Mello
[Studies in Corpus Linguistics 61] 2014
► pp. 411–467
Prosodic constraints for discourse markers
Published online: 14 November 2014
https://doi.org/10.1075/scl.61.14ras
https://doi.org/10.1075/scl.61.14ras
This article aims to show how a corpus driven theory that analyses speech through information units can better account for Discourse Markers (DM) identification and analysis. We propose that the speech flow can only be properly analyzed if segmented into utterances and tone units through prosodic parameters. Utterances correspond to speech acts and tone units to information units (IU); therefore, it is possible for DMs to be identified since they correspond to dialogic information units (DU). Each IU is submitted to different prosodic conditions in order to carry their function. This allows for: (i) identifying DUs; (ii) distinguishing different DUs, thus recognizing the specific function of each DM. We present data from comparable corpora from different Romance languages. The DM data are studied focusing on their functions, frequency, distribution and lexical fillers.
References (45)
Acciardi, Daniela. 2009. Congiunzioni frasali e connettori testuali (C-ORAL-ROM e GRITTEXT). In Sintassi storica e sincronica dell’italiano, Angela Ferrari (ed.). Basilea: Fondo Nazionale Svizzero per la Ricerca Scientifica.
. 2010. I focalizzatori in un corpus di italiano parlato (C-ORAL-ROM Italia). In Atti del Convegno Internazionale G.S.C.P. “La comunicazione parlata”, Napoli 23–25 febbraio 2009, Massimo Pettorino, Antonella Giannini & Francesca M. Dovetto (eds).
. 2010. La subordinazione causale esplicita in un corpus di italiano parlato spontaneo (C-ORAL-ROM Italia). In
Atti del XLII Congresso Internazionale SLI, “Scuola Normale Superiore
”,
Pisa
, 25–27 settembre 2008, Pier Marco Bertinetto, Valentina Bambini & Irene Ricci et al. (eds).
Arruda, Adriellen. 2013. A unidade informacional de comentários múltiplos no português do Brasil: Um trabalho baseado em corpus, Monografia Final. Belo Horizonte: Faculdade de Letras, Universidade Federal de Minas Gerais.
Barbosa, Plinio Almeida. 1999. Pela integração de ciência e tecnologia de fala. In Estudos de Prosódia, Ester Scarpa (ed.), 21–52. Campinas: Editora da Unicamp.
. 2013. Semi-automatic and automatic tools for generating prosodic descriptors for prosody research. In
Proceedings from TRASP 2013, Tools and Resources for the Analysis of Speech Prosody
, August 30, 2013, Brigitte Bigi & Daniel Hirst (eds), 86–90. Aix-en-Provence: Laboratoire Parole et Langage.
Bazzanella, Carla, Bosco, Cristina, Gili Fivela, Barbara, Mieznikowski, Johanna, & Brunozzi, Francesca Tini. 2008. Polifunzionalità dei segnali discorsivi, sviluppo conversazionale e ruolo dei tratti fonetici e fonologici. In La comunicazione parlata, Vol. II, Massimo Pettorino, Alessandra Giannini, Marianna Vallone & Renata Savy (eds), 934–963. Napoli: Liguori.
Boersma, Paul & Weenink, Daniel. 2013. Praat. [URL].
. 2002. Illocuzione e modalit�. In: Gian Luigi Beccaria and Carla Marello, (Eds.). La parola al testo. Scritti in onore di Bice Mortara Garavelli. Alessandria: Edizioni dell’Orso.
. 2012. Topic-Focus and Comment-Focus in the Language into Act Theory. In Illocution, Modality, Attitude, Information Patterning and Speech Annotation, Heliana Mello, Alessandro Panunzi & Tommaso Raso (ed.). Florence: Firenze University Press. [URL]
Cresti, Emanuela & Moneglia, Massimo (eds). 2005 C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages [Studies in Corpus Linguistics 15]. Amsterdam: John Benjamins.
Firenzuoli, Valentina. 2003. Le forme intonative di valore illocutivo dell’italiano parlato: Analisi sperimentaledi un corpus di parlato spontaneo (LABLITA). PhD dissertation, Università di Firenze.
Fischer, Kerstin (ed.). 2006. Approaches to Discourse Particles [Studies in Pragmatics 1]. Bingley: Emerald.
Fleiss, Joseph L. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76: 378–382.
Fraser, Bruce. 2006. Towards a theory of discourse markers. In Fischer (ed.), 189–205.
Frosali, Fabrizio. 2008. L’unità di informazione di ausilio dialogico: Valori percentuali, caratteri intonativi, lessicali e morfo-sintattici in un corpus di italiano parlato (C-ORAL-ROM). In Prospettive nello studio del lessico italiano, Emanuela Cresti (ed.), 417–424. Florence: Firenze University Press. [URL].
Gregori, Lorenzo & Panunzi, Alessandro. 2012. DBIPIC: An XML database for information patterning analysis. In.
Proceedings of the GSCP 2012. Speech and Corpora. Heliana Mello, Massimo Pettorino & Tommaso Raso (eds). [URL]
Maia Rocha, Bruna & Raso, Tommaso. 2011. A unidade informacional de Introdutor Locutivo no português do Brasil: uma primeira descrição baseada em corpus. In: Domínios de Lingu@agem, 327–343. [URL]
Martin, Philippe. 2005. WinPitch Corpus. A text-to-speech analysis and alignment tool. In Cresti & Moneglia (eds), 40–51.
. 2012. WinPitch. [URL].
Mello, Heliana & Raso, Tommaso. 2012. Illocution, modality, attitude: Different names for different categories. In Illocution, Modality, Attitude, Information Patterning and Speech Annotation, Heliana Mello, Alessandro Panunzi & Tommaso Raso (eds). Firenze: FUP. [URL]
Mello, Heliana, Raso, Tommaso, Mittmann, Maryualê M., Vale, Heloísa P. & Cõrtes, Priscila O. 2012. Transcrição e segmentação prosódica do corpus C-ORAL-BRASIL: Critérios de implementação e validação. In C-ORAL-BRASIL I: Corpus de referência do português brasileiro falado informal, Tommaso Raso & Heliana Mello (eds). Belo Horizonte: UFMG.
Mettouchi, Amina, Caubet, Dominique, Vanhove, Martine, Tosco, Mauro, Comrie, Bernard & Izre’el, Shlomo. 2010. CORPAFROAS. A corpus for spoken Afroasiatic languages: Morphosyntactic and prosodic analysis. In
CAMSEMUD2007 Proceedings of the 13th Italian Meeting of Afro-Asiatic Linguistics
, Frederick M. Fales & Giulia F. Grassi (eds), 177–180. Padova: Sargon.
Mittmann, Maryualê. 2012. O C-ORAL-BRASIL e o estudo da fala informal: um novo olhar sobre o Tópico no Português Brasileiro. PhD Dissertation. Belo Horizonte: Universidade Federal de Minas Gerais.
Mittmann, Maryualê M., Panunzi, Alessandro, Cresti, Emanuela, Moneglia, Massimo, Mello, Heliana & Raso, Tommaso. 2012. Information patterning strategies in spontaneous speech: A cross-linguistic study. In
Proceedings of the GSCP 2012. Speech and Corpora
, Heliana Mello, Massimo Pettorino & Tommaso Raso (eds). [URL]
Mittmann, Maryualê M. & Raso, Tommaso. 2012. The C-ORAL-BRASIL informationally tagged minicorpus
. In Illocution, Modality, Attitude, Information Patterning and Speech Annotation, Heliana Mello & Alessandro Panunzi & Tommaso Raso (eds). Florence: Firenze University Press. [URL]
Moneglia, Massimo. 2005. The C-ORAL-ROM resource. In Cresti & Moneglia (eds), 1–70.
. 2011. Spoken corpora and pragmatics. Revista Brasileira de Linguística Aplicada, 479–519. [URL]
Moneglia, Massimo, Scarano, Antonietta & Spinu, Marius. 2002. Validation by expert transcribers of the CORAL-ROM prosodic tagging criteria on Italian, Spanish, Portuguese corpora of spontaneous speech. [URL]
Moneglia, Massimo, Fabbri, Marco, Quazza, Silvia, Panizza, Andrea, Danieli, Morena, Garrido, Juan Maríia & Swerts, Marc. 2005. Evaluation of consensus on the annotation of terminal and non-terminal prosodic breaks in the C-ORAL-ROM corpus
. In Cresti & Moneglia (eds), 257–276.
Moneglia, Massimo, Raso, Tommaso, Mittmann, Maryualê M. & Mello, Heliana. 2010. Challenging the perceptual relevance of prosodic breaks in multilingual spontaneous speech corpora: C-ORAL-BRASIL/C-ORAL-ROM. Prosodic prominence perceptual and automatic identification. In
Proceedings of the Speech Prosody International Conference Satellite Workshop
, Chicago, IL
. [URL]
Panunzi, Alessandro & Gregori, Lorenzo. 2012. An XML model for multi-layer representation of spoken Language. In Illocution, Modality, Attitude, Information Patterning and Speech Annotation, Heliana Mello, Alessandro Panunzi & Tommaso Raso (eds). Florence: Firenze University Press. [URL]
Raso, Tommaso. 2012. O C-ORAL-BRASIL e a Teoria da Língua em Ato. In C-ORAL-BRASIL I: Corpus de referência do português brasileiro falado informal, Tommaso Raso & Heliana Mello (eds). Belo Horizonte: UFMG.
Raso, Tommaso & Leite, Flávia. 2010. Estudo contrastivo do uso de alocutivos em italiano, português e espanhol europeus e português brasileiro. Domínios de Lingu@gem. [URL]
Raso, Tommaso & Mello, Heliana. 2009. Parâmetros de compilação de um corpus oral: O caso do C-ORAL—BRASIL. Veredas, 20–35. [URL]
. 2010. The C-ORAL-BRASIL corpus. In Bootstrapping Information from Corpora in a Cross-Linguistic Perspective, Massimo Moneglia & Alesseandro Panunzi (eds). Florence: Firenze University Press. [URL]
(eds). 2012a. C-ORAL-BRASIL I. Corpus de referência do português brasileiro falado informal. Belo Horizonte: UFMG.
. 2012b. The C-ORAL-BRASIL corpus
. In
Proceedings of the GSCP 2012. Speech and Corpora
. Heliana Mello, Massimo Pettorino & Tommaso Raso. Firenze: Firenze University Press. [URL]
Raso, Tommaso & Mittmann, Maryualê M. 2009. Validação estatística dos critérios de segmentação da fala espontânea no corpus C-ORAL-BRASIL. Revista de Estudos da Linguagem, 73–91. [URL]
Rocha, Bruno. 2013. Metodologia empírica para o estudo de ilocuções naturais do PB. Domínios de Lingu@agem. [URL]
Cited by (15)
Cited by 15 other publications
Moneglia, Massimo & Giorgina Cantalini
Raso, Tommaso, Saulo Mendes Santos, Albert Rilliard & João A. Moraes
Raso, Tommaso, Bruno Neves Rati de Melo Rocha, João Vinícius Salgado, Breno Fiuza Cruz, Lucas Machado Mantovani & Heliana Mello
Raso, Tommaso, Albert Rilliard & Saulo Mendes Santos
Bossaglia, Giulia, Heliana Mello & Tommaso Raso
2020. Illocution as a unit of reference for spontaneous speech. In In search of basic units of spoken language [Studies in Corpus Linguistics, 94], ► pp. 221 ff.
Cavalcante, Frederico Amorim, Tommaso Raso, Giulia Bossaglia, Maryualê Mittmann & Bruno Rocha
Cresti, Emanuela
2020. The pragmatic analysis of speech and its illocutionary classification according to the
Language into Act Theory. In In search of basic units of spoken language [Studies in Corpus Linguistics, 94], ► pp. 181 ff.
Haselow, Alexander
2020. Local and global structures in discourse and interaction. In Grammar and Cognition [Human Cognitive Processing, 70], ► pp. 267 ff.
Raso, Tommaso, Plínio A. Barbosa, Frederico A. Cavalcante & Maryualê M. Mittmann
2020. Segmentation and analysis of the two English excerpts. In In search of basic units of spoken language [Studies in Corpus Linguistics, 94], ► pp. 309 ff.
Degand, Liesbeth
2019. Causal relations between discourse and grammar. In Empirical studies of the construction of discourse [Pragmatics & Beyond New Series, 305], ► pp. 131 ff.
Cresti, Emanuela & Massimo Moneglia
2018. The illocutionary basis of information structure. In Information structure in lesser-described languages [Studies in Language Companion Series, 199], ► pp. 359 ff.
Cresti, Emanuela & Massimo Moneglia
2018. The definition of the TOPIC within Language into Act Theory and its identification in spontaneous speech corpora. Revue Romane. Langue et littérature. International Journal of Romance Languages and Literatures 53:1 ► pp. 30 ff.
Cresti, Emanuela & Massimo Moneglia
2020. Some notes on the Hearts and Navy excerpts according
to the Language into Act Theory. In In search of basic units of spoken language [Studies in Corpus Linguistics, 94], ► pp. 383 ff.
Čermáková, Anna, Zuzana Komrsková, Marie Kopřivová & Petra Poukarová
[no author supplied]
2020. Dualistic approaches to the analysis of forms and structures in languages. In Grammar and Cognition [Human Cognitive Processing, 70], ► pp. 157 ff.
This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
