Article published In: Units of Language – Units of Writing
Edited by Terry Joyce and David Roberts
[Written Language & Literacy 15:2] 2012
► pp. 254–278
Orthographic representation and variation within the Japanese writing system
Some corpus-based observations
Published online: 10 August 2012
https://doi.org/10.1075/wll.15.2.07joy
https://doi.org/10.1075/wll.15.2.07joy
Given its multi-scriptal nature, the Japanese writing system can potentially yield some important insights into the complex relationships that can exist between units of language and units of writing. This paper discusses some of the difficult issues surrounding the notions of orthographic representation and variation within the Japanese writing system, as seen from the perspective of creating word lists based on the Kokuritsu Kokugo Kenkyūjo’s ‘Balanced Corpus of Contemporary Written Japanese’ (BCCWJ) Project. More specifically, the paper (i) reflects on the treatment of lemmas within UniDic, the morphological analyzer dictionary developed for the project, (ii) notes some concerns for extracting word lists that stem from the project’s approach towards defining orthographic words which draws on its conceptualization of short and long unit words, and (iii) attempts to quantify the extent of orthographic variation within the Japanese writing system as represented by the BCCWJ. Keywords: Japanese; Balanced Corpus of Contemporary Written Japanese (BCCWJ); kanji; hiragana; katakana; orthographic variation; UniDic
Cited by (10)
Cited by ten other publications
Robertson, Wesley C. & Tamaki Mihic
Joyce, Terry & Dimitrios Meletis
Robertson, Wes
Joyce, Terry & Robert Crellin
Joyce, Terry & Hisashi Masuda
2018. Introduction to the multi-script Japanese writing system and word processing. In Writing Systems, Reading Processes, and Cross-Linguistic Influences [Bilingual Processing and Acquisition, 7], ► pp. 179 ff.
Joyce, Terry & Hisashi Masuda
Masuda, Hisashi & Terry Joyce
2018. Constituent-priming investigations of the morphological activation of Japanese compound words. In Writing Systems, Reading Processes, and Cross-Linguistic Influences [Bilingual Processing and Acquisition, 7], ► pp. 221 ff.
Joyce, Terry, Bor Hodošček & Hisashi Masuda
2017. Constructing an ontology and database of Japanese lexical properties. Written Language & Literacy 20:1 ► pp. 27 ff.
Robertson, Wesley C.
Joyce, Terry, Hisashi Masuda & Taeko Ogawa
2014. Jōyō kanji as core building blocks of the Japanese writing system. Written Language & Literacy 17:2 ► pp. 173 ff.
This list is based on CrossRef data as of 24 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
