Article published In: Semantic Considerations of Lexical Processing
[The Mental Lexicon 9:1] 2014
► pp. 131–140
Word frequency of written Urdu
Published online: 25 April 2014
https://doi.org/10.1075/ml.9.1.06kha
https://doi.org/10.1075/ml.9.1.06kha
Performance on word processing tasks is known to be influenced by the frequency with which words occur in a language. Large and robust effects of word frequency occur across languages and the processes thought to be sensitive to word frequency are considered fundamentally important characteristics of the mental lexicon. To our knowledge, word frequency data is non-existent for Urdu. This important language has characteristics that make it appealing to psycholinguists. Unfortunately, most of the Urdu published electronically is in the form of image files rather than text and therefore, has been largely inaccessible by programs designed to generate word counts. Consequently, unlike other important orthographies (e.g., English) orthographic word frequencies in Urdu are not readily available. We describe here a database that addresses this methodological gap. We have constructed a word frequency database for written Urdu and describe that development. We also describe data from simple tests of the effects of Urdu word frequency to demonstrate that our measure results in effects considered to be the hallmark of frequency effects. The frequency counts from this database will help psycholinguists and cognitive psychologists conduct and control future studies on the mental lexicon using Urdu. This database can be downloaded from http://web2.uwindsor.ca/psychology/urdufrequency/
Keywords: Urdu, word frequency
References (21)
Allen, P. A., McNeal, M., & Kvak, D. (1992). Perhaps the lexicon is coded as a function of word frequency. Journal of Memory and Language, 311, 826– 844.
Baker, P., Hardie, A., McEnery, T., & Jayaram, B. D. (2003). Corpus data for South Asian language processing. Paper presented at the EACL 2003, Workshop on Computational Linguistics for South Asian Languages – Expanding Synergies with Europe, 12–17 April 2003, Budapest.
Comrie, B. (1998). Microsoft Encarta 98 encyclopedia. Redmond: Microsoft Corporation. Retrieved from: [URL]
Dash, N. S., & Chaudhari, B. B. (2001). Paper presented at the SCALLA conference, Bangalore, India. Retrieved from: [URL]
de Groot, A. M. B., Borgwaldt, S., Bos, M., & van den Eijnden, E. (2002). Lexical decision and word naming in bilinguals: Language effects and task effects. Journal of Memory and Language, 471, 91–124.
Durda, K., & Buchanan, L. (2006). WordMine2 [Online]. Retrieved from: [URL]
EMILLE Corpus Documentation. (2003). Retrieved from: [URL]
Ethonologue Survey. (1999). Summer Institute for Linguistics (SIL). Retrieved from: [URL]
Everaert, C. (2009). Tracing the boundaries between Hindi and Urdu: Lost and added in translation between 20th century short stories. Leiden, The Netherlands: Brill.
Gordon, B. (1983). Lexical access and lexical decision: Mechanisms of frequency sensitivity. Journal of Verbal Learning and Verbal Behaviour, 221, 24–44.
1985). Subjective frequency and the lexical decision latency function: Implications for mechanisms of lexical access. Journal of Memory and Language, 241, 631–645.
Khan, Q. H. (2006). A study of word frequency in written Urdu. (Unpublished Master’s Thesis). University of Windsor, Canada.
2012). Lexical representation and processing in cross-script Urdu-English bilinguals: The case of frequency-balanced and frequency-unbalanced cognates and noncognates. (Unpublished Doctoral Dissertation). University of Windsor, Canada.
Khan, Q. H., & Buchanan, L. (2006). A study of word frequency in written Urdu. Poster presented at the Fifth International Conference on the Mental Lexicon, Montreal, QC.
Kucera, H., & Francis, W. N. (1967). Computational analysis of present-day American English. Providence, R. I.: Brown University Press.
Martynyuk, S. (2003). Statistical approach to the debate on Urdu and Hindi. The Annual of Urdu Studies, 181.
Monsell, S. (1991). The nature and locus of word frequency effects in reading. In D. Besner & G. Humphreys (Eds.), Basic processes in reading: Visual word recognition (pp. 148–197). New Jersey: Lawrence Erlbaum Associates, Inc.
Monsell, S., Doyle, M. C., & Haggard, P. N. (1989). Effects of frequency on visual word recognition tasks: Where are they? Journal of Experimental Psychology: General, 118(1), 43–71.
