In:Applied Linguistics in the Middle East and North Africa: Current practices and future directions
Edited by Atta Gebril
[AILA Applied Linguistics Series 15] 2017
► pp. 203–225
Chapter 9The development and validation of an Arabic language test in Saudi Arabia
Published online: 18 July 2017
https://doi.org/10.1075/aals.15.09nor
https://doi.org/10.1075/aals.15.09nor
Abstract
This chapter describes the development and validation of the Standardized Test of Arabic Proficiency in Speakers of Other Languages (STAPSOL) at the National Center for Assessment (NCA) in Saudi Arabia. The chapter describes the theoretical foundations and blueprint of the test, including the test components, their selection criteria and respective weight. The chapter also addresses issues related to the scoring process, with specific focus on rater training, scoring rubrics, and investigation of psychometric qualities using both G-theory and multifaceted item response theory. In closing, the chapter looks at planned and possible future developments of and improvements to the test, particularly by formally linking it to the CEFR using well-established procedures.
Keywords: standardized testing, Arabic, CEFR, Assessment Use Arguments, Saudi Arabia
Article outline
- Introduction
- Arabic L2 tests
- STAPSOL
- Test objective
- Theoretical framework
- Specifications – components and weights
- Item writing and review
- Scoring
- Research
- Validity and reliability
- Differentiating between different levels of proficiency
- The FW component
- A Simplified Assessment Use Argument (AUA) – generalizability of tasks and relevance of research
- Future directions: Formally linking STAPSOL to the CEFR
- Summary and conclusions
References Appendix
References (47)
Al-Arabiyya Institute. Al-Arabiyya Test. Retrieved from <[URL]>
(2009). Test review: Test of English as a Foreign Language: Internet-based Test (TOEFL iBT). Language Testing, 26(4), 621–631.
Alhaqbani, A., & Riazi, M. (2012). Metacognitive awareness of reading strategy use in Arabic as a second language. Reading in a Foreign Language, (24)2, 231–255.
Al-Harbi, K. (2013a) October. STAPSOL: construct validity with reference to structure equation modelling. In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.
(2013b) October. STAPSOL: sensitivity to different levels of language attainment. In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.
Al-Kahtani, S. (2013) October. Dependability of ratings for the NCA writing test (FW). In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.
Al-Owidha, A., & Al-Shamrani, A. (2012) September. Standardized Test of Arabic in Speakers of Other Languages (STAPSOL): Evidence of its reliability and validity. Paper presented at the 38th IAEA Conference, Astana, Kazakhstan.
American Council on the Teaching of Foreign Language. (2012). National Arabic consensus project. Retrieved from <[URL]>
. (2013). Testing for proficiency. Retrieved from <[URL]>
American Educational Research Association (AERA), American Psychological Association (APA), National Council on Measurement in Education (NCME). (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
Arab Academy. (n.d.). Retrieved from <[URL]>
. Structure of the Arab Language Proficiency Test (ALPT). Retrieved from <[URL]>
Bernstein, J., & Suzuki, M. (2011). Versant Arabic test: Test description and validation summary. Palo Alto, CA: Pearson Education.
Buckwalter, T., & Parkinson, D. (2011). A frequency dictionary of Arabic: Core vocabulary for learners. Abingdon: Routledge.
Byrne, B. (2006). Structural equation modelling with EQS: basic concepts, application, and programming (2nd ed.). Mahwah, NJ: Lawrence Erlbaum Associates.
Chapelle, C. A., Enright, M. K., & Jamieson, J. M. (Eds.). (2008). Building a validity argument for the Test of English as a Foreign Language. New York, NY: Routledge.
Cito. (n.d.). Retrieved from <[URL]>
Council of Europe. (2001). Common European Framework of Reference for languages: Learning, teaching and assessment. Cambridge: Cambridge University Press.
. (2009). Relating language examinations to the Common European Framework of Reference for Languages: Learning, teaching and assessment (CEFR): A manual. Strasbourg: Language Policy Division Council of Europe.
Cronbach, L. J., Gleser, G. C., Nanda, H., & Rajaratnam, N. (1972). The dependability of behavioral measurements. New York, NY: Wiley.
de Graaf, A. (2012). The Netherlands: Arabic in education. In F. Grande, J. J. de Ruiter, & M. Spotti (Eds.), Mother tongue and intercultural valorization: Europe and its migrant youth (pp. 49–60). Milan: Angeli.
Dimitrov, D. (2012). Statistical models for validation of assessment scale data in counseling and related fields. Alexandria, VA: American Counseling Association.
DukeAMES. (n.d.). DukeAMES [YouTube]. Available from <[URL]>
Gebril, A., & Taha-Thomure, H. (2014). Assessing Arabic. In A. Kunnan (Ed.), The companion to language assessment. Chichester, UK: John Wiley & Sons.
Green, K. E., & Frantom, C. G. (2002, November). Survey development and validation with the Rasch Model. Paper presented at the International Conference on Questionnaire Development, Charleston, SC.
Hirsch, B. J. (2009). Integrating dialects into the Modern Standard Arabic high school classroom (Master’s thesis). Retrieved from <[URL]>
IELTS. (2013). IELTS guide for teachers. Retrieved from <[URL]>
Kantarcioğlu, E. (2012). Relating an institutional proficiency exam to the CEFR: A case study (Unpublished Doctoral thesis). Roehampton University, London, UK. Retrieved from <[URL]>
Language Testing International. (n.d.). Oral Proficiency Interview (OPI). Retrieved from <[URL]>
Linacre, J. M. (2009). Winsteps ® (Version 3.69.1.10) [Computer Software]. Beaverton, OR: >Winsteps.com.
Muthén, L. K., & Muthén, B. O. (2008). Mplus user’s guide version 6.1. Los Angeles, CA: Muthén & Muthén.
National Middle East Language Resource Center. (2011). Middle East language learning in higher education. Provo, UT: National Middle East Language Resource Center.
Norrbom, B. (2014, May). Arabic profile: CEFR for Arabic – A learner corpus approach. Paper presented at 11th EALTA Conference, University of Warwick.
Norrbom, B., Yong, L., & Al-Shamrani, A. (2015). ECD for MSA – Developing a comprehensive construct definition. Paper presented at the 38th Language Testing Research Colloquium, Palermo, Italy.
O’Sullivan, B. (2008). City & Guilds Communicator IESOL Examination (B2) CEFR linking project: Case study report.
Pearson. (n.d.). Versant Arabic Test. Harlow: Pearson.
Pearson Versant. (n.d.). Test details. Retrieved from <[URL]>
. (n.d.). Versant Arabic test. Retrieved from <[URL]>
Rasch, G. (1960). Probalistic models for some intelligence and attainment tests (Reprinted by Chicago University Press in 1980).
Schultz, E., & Maisel, S. (2013). Modern Standard Arabic – Integrating main Arabic dialects. Leipzig: University of Leipzig.
Surface, E., & Dierdorff, E. (2003). Reliability and the ACTFL oral proficiency interview: reporting indices of interrater consistency and agreement for 19 languages. Foreign Language Annals, (36)4, 507–519.
Swender, E. (2003). Oral proficiency testing in the real world: Answers to frequently asked questions. Foreign Language Annals, (36)4, 520–526.
telc. (2011). Arabic Language Practice Test 1. Retrieved from <[URL]>
Cited by (1)
Cited by one other publication
This list is based on CrossRef data as of 10 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
