Article published In: Chinese Language and Discourse
Vol. 15:1 (2024) ► pp.73–104
Colloquialism and genre variation in Chinese
A corpus-driven study
Published online: 16 October 2023
https://doi.org/10.1075/cld.21010.li
https://doi.org/10.1075/cld.21010.li
Abstract
Colloquialization has been identified as one of the most crucial change processes of the English language, through
which written genres gradually shift towards spoken styles. Colloquialism refers to the synchronous features resulting from
colloquialization. There is limited research into the lexical and phrasal patterns indicating colloquialism in Chinese or how
Chinese colloquialism manifests in different spoken and written genres. To extract the lexical and phrasal patterns indicating
colloquialism in Chinese, this study used a corpus-driven approach to determine and compare keywords and phrase frames from two
training Chinese corpora and those keywords and phrase frames with higher frequencies were retained for further testing.
Keywords: colloquialism, corpus, features extraction, genre variation
Article outline
- 1.Introduction
- 2.Colloquialism in Chinese
- 3.Corpora and methodology
- 3.1Corpora
- 3.2Delimiting colloquial features in Chinese
- 3.2.1Training to determine the categories of positive keywords and key phrase frames
- 3.2.2Testing to verify the significance of positive keywords and key phrase frames
- 3.3Cluster analysis of colloquial features in each genre
- 4.Colloquialism and genre variation
- 5.Discussion of the colloquial features in each group
- 5.1Colloquialism of genres in the left branch
- 5.2Colloquialism of genres in the right branch
- 6.Conclusion
- Acknowledgements
- Notes
References
References (42)
Anthony, Laurence. 2020a. AntConc (Version
3.5.9) [Computer Software]. Tokyo, Japan: Waseda University (Available
from [URL]).
. 2020b. AntGram (Version
1.2.3) [Computer Software]. Tokyo, Japan: Waseda University. ([URL]).
. 2003. “Compressed
Noun-Phrase Structures in Newspaper Discourse: The Competing Demands of Popularization vs.
Economy.” In New Media Language, eds.
by Aitchison, Jean, and Diana M. Lewis, 169–181. London: Routledge.
Biber, Douglas, Susan Conrad, and Viviana Cortes. 2004. “If
You Look at...: Lexical Bundles in University Teaching and Textbooks.” Applied
Linguistics 25 (3): 371–405.
Biber, Douglas, and Susan Conrad. 2009. Register,
Genre, and Style. Cambridge: Cambridge University Press.
Biber, Douglas, and Jesse Egbert. 2018. Register
Variation Online. Cambridge: Cambridge University Press.
Chafe, Wallace. 1982. “Integration
and Involvement in Speaking, Writing, and Oral Literature.” In Spoken
and Written Language. Exploring Orality and Literacy, ed. by Deborah Tannen, 35–53. Norwood, NJ: Ablex.
Chappell, Hilary. 1991. “Strategies
for the Assertion of Obviousness and Disagreement in Mandarin: A Semantic Study of the Modal Particle
me.” Australian Journal of Linguistics: Journal of the Australian Linguistic
Society 11 (1): 39–65.
Collins, Peter, and Xinyue Yao. 2018. “Colloquialisation
and the Evolution of Australian English: A Cross-Varietal and Cross-Generic Study of Australian, British, and American English
from 1931 to 2006.” English
World-Wide 39 (3): 253–277.
Cook, Angela. 2023. “Reconsidering
the shi…(de) Construction in Spoken Mandarin.” Chinese Language and
Discourse 14 (2): 209–231.
Culpeper, Jonathan, and Merja Kytö. 2010. Early
Modern English Dialogues: Spoken Interaction as
Writing. Cambridge: Cambridge University Press.
Cunningham, Kelly J. 2017. “A Phraseological Exploration of
Recent Mathematics Research Articles Through Key Phrase Frames.” Journal of English for
Academic Purposes 251: 71–83.
Egbert, Jesse, and Michaela Mahlberg. 2020. “Fiction-One
Register or Two? Speech and Narration in Novels.” Register
Studies 2 (1): 72–101.
Fairclough, Norman. 1994. “Conversationalization
of Public Discourse and the Authority of the Consumer.” In The
Authority of the Consumer, eds. by Keat, Russell, Nigel Whiteley, and Nicholas Abercrombie, 253–268. London: Routledge.
Feng, Debing, and Xiaoping Wu. 2018. “Weibo
Interaction in the Discourse of Internet Anti-Corruption: The Case of “Brother Watch”
Event.” Discourse, Context, and
Media 241: 99–108.
Hundt, Marianne, and Christian Mair. 1999. “‘Agile’
and ‘Uptight’ genres: The Corpus-Based Approach to Language Change in Progress.” International
Journal of Corpus
Linguistics 4 (2): 221–242.
Hutchby, Ian. 1999. “Rhetorical
Strategies in Audience Participation Debates on Radio and TV.” Research on Language and Social
Interaction 32 (3): 243–267.
Ji, Chuanbo, and Fangfang Liu. 2015. “留学生汉语书面语中的口语化倾向研究[Colloquial Trends in Chinese
Compositions of Foreign Students].” Yuyan Jiaoxue yu
Yanjiu [Language Teaching and Linguistic
Studies] 11: 31–37.
Jing-Schmidt, Zhuo. 2019. “Grammatical
Constructions and Chinese Discourse.” In Routledge Handbook of
Chinese Discourse Analysis, ed. by C. Shei, 102–115. London: Routledge.
. 2022. “Sentence-Final
Particles: Sociolinguistic and Discourse Perspectives.” In The
Cambridge Handbook of Chinese Linguistics, eds. by Chu-Ren Huang, Yen-Hwei Lin, I-Hsuan Chen, and Yu-Yin Hsu, 597–615. Cambridge: Cambridge University Press.
Jing-Schmidt, Zhuo, and Hongyin Tao. 2009. “The
Mandarin Disposal Constructions: Usage and Development.” Language and
Linguistics 10 (1): 29–58.
Jones, Rodney H., Sylvia Jaworska, and Erhan Aslan. 2021. Language
and Media. A Resource Book for Students. London and New York: Routledge.
Kruger, Haidee, and Adam Smith. 2018. “Colloquialization
Versus Densification in Australian English: A Multidimensional Analysis of the Australian Diachronic Hansard corpus
(ADHC).” Australian Journal of
Linguistics 38 (3): 293–328.
Liang, Maocheng, and Wenxin Xiong. 2008. “文本分析工具PatCount在外语教学与研究中的应用 [Applications of PatCount in
Foreign Language Teaching and Research].” Waiyu Dianhua
Jiaoxue [Technology Enhanced Foreign Language
Education] 1231: 71–76.
Ma, Mingyan. 2017. “汉语学习者书面语作文“口语化”倾向的语体表征 [Features of Oral Style in Chinese
Compositions Written by CSL learners].” Hanyuxuexi [Chinese Language
Learning] 11: 81–90.
Mair, Christian, and Marianne Hundt. 1995. “Why
is the Progressive Becoming More Frequent in English? A Corpus-Based Investigation of Language Change in
Progress.” Zeitschrift für Anglistik und
Amerikanistik 431: 111–122.
McCarthy, Michael. 1993. “Spoken
Discourse Markers in Written Texts.” In Techniques of Description:
Spoken and Written Discourse, eds. by Gwyneth Fox, Michael Hoey, and John M. Sinclair, 170–182. London and New York: Routledge.
O’Keeffe, Anne. 2012. “Media
and Discourse Analysis.” In The Routledge Handbook of Discourse
Analysis, eds. by Michael Handford, and James Paul Gee, 35–53. London and New York: Routledge.
R Development Core Team. 2017. R: A
Language and Environment for Statistical Computing. R Foundation for Statistical
Computing ([URL]).
Samson, Christina. 2004. “Interaction
in Written Economics Lectures: The Meta-Discursive Role of Person
Markers.” In Discourse Patterns in Spoken and Written
Corpora, eds. by Karin Aijmer, and Anna-Brita Stenström, 199–216. Amsterdam/Philadelphia: John Benjamins Publishing Company.
Schegloff, Emanuel A. 1968. “Sequencing in Conversational
Openings.” American
Anthropologist 701: 1075–1095.
Soh, Hooi Ling. 2018. “Mandarin Chinese Sentence
Final de as a Marker of Private Evidence.” In Proceedings of the 92nd
Annual Meeting of the Linguistic Society of America, eds. by Farrell Patrick, 1–14.
Tannen, Deborah. 1980. Spoken
and Written Language and the Oral/Literate Continuum. Paper presented at
the Proceedings of the Sixth Annual Meeting of the Berkeley Linguistics
Society. Berkeley: University of California.
Thornborrow, Joanna. 2001. “Authenticating
Talk: Building Public Identities in Audience Participation Broadcasting.” Discourse
Studies 3 (4): 459–479.
Xiao, Richard, and Tony McEnery Tony. 2004. Aspect
in Mandarin Chinese. A Corpus-Based
Study. Amsterdam/Philadelphia: John Benjamins Publishing Company.
Cited by (1)
Cited by one other publication
This list is based on CrossRef data as of 7 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
