In:Applied Linguistics in the Middle East and North Africa: Current practices and future directions
Edited by Atta Gebril
[AILA Applied Linguistics Series 15] 2017
► pp. 289–304
Chapter 13Corpora and the study of Arabic vocabulary
Published online: 18 July 2017
https://doi.org/10.1075/aals.15.13abd
https://doi.org/10.1075/aals.15.13abd
Abstract
Corpora have made many significant contributions to our understanding of how language works and how it is used in the society. In particular, they have become an indispensable tool in almost any systematic investigation of the lexicon. This chapter gives an overview of some key aspects of Arabic lexis that can be examined straightforwardly with corpora. Among the topics addressed are word frequency, clusters, synonymy, polysemy, and semantic prosody. Sample corpus analyses are provided to show how corpora can help uncover major patterns in the linguistic behavior of Arabic lexical items. Finally, the chapter discusses some implications for teaching Arabic as a foreign language.
Keywords: Arabic corpora, Arabic vocabulary, clusters, synonymy, and semantic prosody
Article outline
- Introduction
- Data
- Word frequency
- Clusters
- Synonymy
- Polysemy
- Semantic prosody
- Implications for teaching Arabic as a foreign language
Note References
References (30)
Al-Sulaiti, L. (2004). Designing and developing a corpus of contemporary Arabic. (Unpublished MA thesis). University of Leeds, UK.
Buckwalter, T., & Parkinson, D. (2011). A frequency dictionary of Arabic: Core vocabulary for learners. London: Routledge.
(2013). Modern lexicography. In J. Owens (Ed.), The Oxford handbook of Arabic linguistics (pp. 539–559). Oxford: Oxford University Press.
Chambers, A. (2010). What is data-driven learning? In A. O’Keeffe, & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 345–358). London: Routledge.
Channell, J. (2003). Corpus-based analysis of evaluative lexis. In S. Hunston & G. Thompson (Eds.), Evaluation in text: Authorial stance and the construction of discourse (pp. 38–55). Oxford: Oxford University Press.
Conrad, S., & Biber, D. (2001). Adverbial marking of stance in speech and writing. In S. Hunston & G. Thompson (Eds.), Evaluation in text: Authorial stance and the construction of discourse (pp. 56–73). Oxford: Oxford University Press.
Elewa, A. (2004). Collocation and synonymy in classical Arabic: A corpus-based study. (Unpublished Doctoral dissertation). The University of Manchester Institute of Science and Technology, UK.
Fellbaum, C. (2007). Introduction. In C. Fellbaum (Ed.), Idioms and collocations: Corpus-based linguistic and lexicographic studies (pp. 1–19). London: Continuum.
Frankenberg-Garcia, A. (2012). Learners’ use of corpus examples. International Journal of Lexicography, 25(3), 273–296.
Granger, S. (2005). Pushing back the limits of phraseology. How far can we go? In C. Cosme, C. Gouverneur, F. Meunier, & M. Paquot (Eds.), Proceedings of phraseology 2005: An interdisciplinary conference (pp. 165–168). Louvain-la-Neuve: Université catholique de Louvain.
Johns, T. (1991). Should you be persuaded: Two examples of data-driven learning. In T. Johns & P. King (Eds.), Classroom Concordancing. ELR Journal, 4, 1–16.
Johnson, A., & Raish, M. (2013). Making vocabulary corporeal: Arabic learners, vocabulary development & arabiCorpus. Journal of the National Council of Less Commonly Taught Languages, 13, 71–85.
Jones, M., & Durrant, P. (2010). What can a corpus tell us about vocabulary teaching materials? In A. O’Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 387–400). London: Routledge.
Louw, B. (1993). Irony in the text or insincerity in the writer? The diagnostic potential of semantic prosodies. In M. Baker, G. Francis, & E. Tognini-Bonelli (Eds.), Text and technology: In honour of John Sinclair (pp. 157–176). Amsterdam: John Benjamins.
(2000). Contextual prosodic theory: Bringing semantic prosodies to life. In C. Heffer, H. Sauntson, & G. Fox (Eds.), Words in context: A tribute to John Sinclair on his retirement. Birmingham: University of Birmingham.
McCarten, J. (2007). Teaching vocabulary: Lessons from the corpus, lessons for the classroom. Cambridge: Cambridge University Press.
. (2010) Corpus-informed course book design. In A. O’Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 413–427). London: Routledge.
McCarthy, M. (2006). Fluency and confluence: What fluent speakers do. In M. McCarthy (Ed.), Explorations in corpus linguistics (pp. 1–6). Cambridge: Cambridge University Press.
McCarthy, M., & Carter, R. (2006). This that and the other: Multi-word clusters in spoken English as visible patterns of interaction. In M. McCarthy (Ed.), Explorations in corpus linguistics (pp. 7–26). Cambridge: Cambridge University Press.
O’Keffee, A., McCarthy, M., & Carter, R. (2007). From corpus to classroom: Language use and language teaching. Cambridge: Cambridge University Press.
Schmitt, N., Grandage, S., & Adolphs, S. (2004). Are corpus-derived recurrent clusters psycholinguistically valid? In N. Schmitt (Ed.), Formulaic sequences: Acquisition, processing, and use (pp. 127–151). Amsterdam: John Benjamins.
Scott, M. (2008). WordSmith Tools (Version 5) [Computer software]. Liverpool: Lexical Analysis Software.
Sripicharn, P. (2010). How can we prepare learners for using language corpora? In A. O’Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 371–384). London: Routledge.
