Chapter 9. The development and validation of an Arabic language test in Saudi Arabia

Norrbom, Bjorn; Al-Shamrani, Abdulrahman

doi:10.1075/aals.15.09nor

In:Applied Linguistics in the Middle East and North Africa: Current practices and future directions
Edited by Atta Gebril
[AILA Applied Linguistics Series 15] 2017
► pp. 203–225

Get fulltext from our e-platform

Download Book PDF

Chapter 9
The development and validation of an Arabic language test in Saudi Arabia

Bjorn Norrbom | National Center for Assessment (NCA), KSA

Abdulrahman Al-Shamrani | National Center for Assessment (NCA), KSA

Published online: 18 July 2017

https://doi.org/10.1075/aals.15.09nor

Abstract

This chapter describes the development and validation of the Standardized Test of Arabic Proficiency in Speakers of Other Languages (STAPSOL) at the National Center for Assessment (NCA) in Saudi Arabia. The chapter describes the theoretical foundations and blueprint of the test, including the test components, their selection criteria and respective weight. The chapter also addresses issues related to the scoring process, with specific focus on rater training, scoring rubrics, and investigation of psychometric qualities using both G-theory and multifaceted item response theory. In closing, the chapter looks at planned and possible future developments of and improvements to the test, particularly by formally linking it to the CEFR using well-established procedures.

Keywords: standardized testing, Arabic, CEFR, Assessment Use Arguments, Saudi Arabia

Article outline

Introduction
Arabic L2 tests
STAPSOL
- Test objective
- Theoretical framework
- Specifications – components and weights
- Item writing and review
- Scoring
- Research
- Validity and reliability
- Differentiating between different levels of proficiency
- The FW component
- A Simplified Assessment Use Argument (AUA) – generalizability of tasks and relevance of research
- Future directions: Formally linking STAPSOL to the CEFR
Summary and conclusions
References
Appendix

References (47)

References

Al-Arabiyya Institute. Al-Arabiyya Test. Retrieved from <[URL]>

Alderson, C. J. (2000). Assessing reading. Cambridge: Cambridge University Press.

(2009). Test review: Test of English as a Foreign Language: Internet-based Test (TOEFL iBT). Language Testing, 26(4), 621–631.

Alhaqbani, A., & Riazi, M. (2012). Metacognitive awareness of reading strategy use in Arabic as a second language. Reading in a Foreign Language, (24)2, 231–255.

Al-Harbi, K. (2013a) October. STAPSOL: construct validity with reference to structure equation modelling. In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.

(2013b) October. STAPSOL: sensitivity to different levels of language attainment. In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.

Al-Kahtani, S. (2013) October. Dependability of ratings for the NCA writing test (FW). In Symposium on International Arabic Teaching Programs and Outcomes Assessment. Symposium conducted at the meeting of National Center for Assessment in Higher Education, Riyadh.

Al-Owidha, A., & Al-Shamrani, A. (2012) September. Standardized Test of Arabic in Speakers of Other Languages (STAPSOL): Evidence of its reliability and validity. Paper presented at the 38th IAEA Conference, Astana, Kazakhstan.

American Council on the Teaching of Foreign Language. (2012). National Arabic consensus project. Retrieved from <[URL]>

. (2013). Testing for proficiency. Retrieved from <[URL]>

American Educational Research Association (AERA), American Psychological Association (APA), National Council on Measurement in Education (NCME). (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.

Arab Academy. (n.d.). Retrieved from <[URL]>

. Structure of the Arab Language Proficiency Test (ALPT). Retrieved from <[URL]>

Bachman, L., & Palmer, A. (2010). Language assessment in practice. Oxford: Oxford University Press.

Bernstein, J., & Suzuki, M. (2011). Versant Arabic test: Test description and validation summary. Palo Alto, CA: Pearson Education.

Brennan, R. L. (2001). Generalizability theory. New York, NY: Springer Verlag.

Buckwalter, T., & Parkinson, D. (2011). A frequency dictionary of Arabic: Core vocabulary for learners. Abingdon: Routledge.

Byrne, B. (2006). Structural equation modelling with EQS: basic concepts, application, and programming (2nd ed.). Mahwah, NJ: Lawrence Erlbaum Associates.

Chapelle, C. A., Enright, M. K., & Jamieson, J. M. (Eds.). (2008). Building a validity argument for the Test of English as a Foreign Language. New York, NY: Routledge.

Cito. (n.d.). Retrieved from <[URL]>

Council of Europe. (2001). Common European Framework of Reference for languages: Learning, teaching and assessment. Cambridge: Cambridge University Press.

. (2009). Relating language examinations to the Common European Framework of Reference for Languages: Learning, teaching and assessment (CEFR): A manual. Strasbourg: Language Policy Division Council of Europe.

Cronbach, L. J., Gleser, G. C., Nanda, H., & Rajaratnam, N. (1972). The dependability of behavioral measurements. New York, NY: Wiley.

de Graaf, A. (2012). The Netherlands: Arabic in education. In F. Grande, J. J. de Ruiter, & M. Spotti (Eds.), Mother tongue and intercultural valorization: Europe and its migrant youth (pp. 49–60). Milan: Angeli.

Dimitrov, D. (2012). Statistical models for validation of assessment scale data in counseling and related fields. Alexandria, VA: American Counseling Association.

DukeAMES. (n.d.). DukeAMES [YouTube]. Available from <[URL]>

Gebril, A., & Taha-Thomure, H. (2014). Assessing Arabic. In A. Kunnan (Ed.), The companion to language assessment. Chichester, UK: John Wiley & Sons.

Green, K. E., & Frantom, C. G. (2002, November). Survey development and validation with the Rasch Model. Paper presented at the International Conference on Questionnaire Development, Charleston, SC.

Hirsch, B. J. (2009). Integrating dialects into the Modern Standard Arabic high school classroom (Master’s thesis). Retrieved from <[URL]>

IELTS. (2013). IELTS guide for teachers. Retrieved from <[URL]>

Kantarcioğlu, E. (2012). Relating an institutional proficiency exam to the CEFR: A case study (Unpublished Doctoral thesis). Roehampton University, London, UK. Retrieved from <[URL]>

Language Testing International. (n.d.). Oral Proficiency Interview (OPI). Retrieved from <[URL]>

Linacre, J. M. (2009). Winsteps ® (Version 3.69.1.10) [Computer Software]. Beaverton, OR: >Winsteps.com.

Muthén, L. K., & Muthén, B. O. (2008). Mplus user’s guide version 6.1. Los Angeles, CA: Muthén & Muthén.

National Middle East Language Resource Center. (2011). Middle East language learning in higher education. Provo, UT: National Middle East Language Resource Center.

Norrbom, B. (2014, May). Arabic profile: CEFR for Arabic – A learner corpus approach. Paper presented at 11th EALTA Conference, University of Warwick.

Norrbom, B., Yong, L., & Al-Shamrani, A. (2015). ECD for MSA – Developing a comprehensive construct definition. Paper presented at the 38th Language Testing Research Colloquium, Palermo, Italy.

O’Sullivan, B. (2008). City & Guilds Communicator IESOL Examination (B2) CEFR linking project: Case study report.

Pearson. (n.d.). Versant Arabic Test. Harlow: Pearson.

Pearson Versant. (n.d.). Test details. Retrieved from <[URL]>

. (n.d.). Versant Arabic test. Retrieved from <[URL]>

Purpura, J. (2004). Assessing grammar. Cambridge: Cambridge University Press.

Rasch, G. (1960). Probalistic models for some intelligence and attainment tests (Reprinted by Chicago University Press in 1980).

Schultz, E., & Maisel, S. (2013). Modern Standard Arabic – Integrating main Arabic dialects. Leipzig: University of Leipzig.

Surface, E., & Dierdorff, E. (2003). Reliability and the ACTFL oral proficiency interview: reporting indices of interrater consistency and agreement for 19 languages. Foreign Language Annals, (36)4, 507–519.

Swender, E. (2003). Oral proficiency testing in the real world: Answers to frequently asked questions. Foreign Language Annals, (36)4, 520–526.

telc. (2011). Arabic Language Practice Test 1. Retrieved from <[URL]>

Cited by (1)

Cited by one other publication

Syarofit, Miqdarul Khoir, Hanik Mahliatussikah, Muhammad Alfan & Eiman F. Abushihab

2025. Tāḥlīl Kitāb TOAFL “Muqārrār Tāḥdīd āl-Mustāwā” āl-Mustānīdī īlā āl-Iṭār āl-Ūrūbī āl-Mārjāʿī āl-Mushtārāk līl-Lughāt. Mantiqu Tayr: Journal of Arabic Language 5:2 ► pp. 420 ff.

This list is based on CrossRef data as of 10 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Chapter 9The development and validation of an Arabic language test in Saudi Arabia

Cited by one other publication

Chapter 9
The development and validation of an Arabic language test in Saudi Arabia