Article published in: International Journal of Corpus Linguistics
Vol. 14:4 (2009) ► pp. 433–466
From real-life situated discourse to video-stream data-mining
An argument for agent-oriented modeling for multimodal corpus compilation
Published online: 15 December 2009
https://doi.org/10.1075/ijcl.14.4.01gu
This paper presents an argument for agent-oriented modeling (AOM) as a research methodology and a metalanguage for corpus linguistics. The argument is prompted by three closely related issues that arise in compiling multimodal corpora such as the Spoken Chinese Corpora of Situated Discourse (SCCSD). A real-life situation can be represented in three ways: (i) the Written Word representation, (ii) audio recording, and (iii) video recording. All three are shown to be data-transformative, to involve data loss, and to be intrinsically flawed. The current multiple-layered approach to data integration is also shown to be inadequate. AOM is proposed as a potential solution to these problems. A modeling decision tree, levels of modeling, and a modeling schema written in XML are demonstrated. The philosophical basis of AOM and its theoretical implications are also discussed.
