Article published In: Perspectives on Chinese Language Acquisition
Edited by Henghua Su
[International Journal of Chinese Linguistics 12:2] 2025
► pp. 315–345
Cloze test and self-assessment as measures of L2 Chinese proficiency
A contextualized approach for university language programs
Published online: 6 October 2025
https://doi.org/10.1075/ijchl.00041.qia
https://doi.org/10.1075/ijchl.00041.qia
Abstract
This study presents the development and validation of a Chinese cloze test for second language (L2) learners in
U.S. university-level programs, targeting proficiency levels typical of large public universities, especially those using the
Integrated Chinese textbook series. Constructed using a rational deletion method, the test contains 40 blanks within a passage of
approximately 400 characters and takes about 20 minutes to complete. Validation was conducted with students from a large state
university and benchmarked against multiple proficiency measures, including a placement test and a final exam. Results demonstrate
that the cloze test effectively differentiates between adjacent proficiency levels and correlates with both placement and final
exam scores, supporting its use as a practical in-house assessment tool. The study also investigates the reliability of
self-assessment as a supplementary measure of proficiency. Findings indicate that self-assessments are more accurate within
proficiency levels than across them: lower-proficiency learners more accurately evaluate reading and writing skills, while
higher-proficiency learners better assess listening and speaking. Framed as a curriculum-based classroom assessment project, this
study offers a quick, effective, and easy-to-administer tool for both instructional and research use.
Article outline
- 1.Introduction
- 1.1Validity, reliability, and design principles
- 1.2Use of cloze tests for placement and instructional differentiation
- 1.3Self-assessment and its application to Chinese learners
- 1.4Rationale and development of the Chinese cloze test
- 2.Methods
- 2.1Participants
- 2.2Materials and design
- The cloze test
- The placement test
- The language background information form
- 2.3Procedure
- 3.Results
- 4.Discussion and conclusion
References
References (67)
Abraham, R. G., & Chapelle, C. A. (1992). The meaning of cloze test scores: An item difficulty perspective. The Modern Language Journal, 761, 468–479.
ACTFL. (2017). NCSSFL-ACTFL Can-Do statements. Retrieved September 23, 2020, from [URL]
AlFallay, I. (2004). The role of some selected psychological and personality traits of the rater in the accuracy of self- and peer-assessment. System, 321, 407–425.
Allen, J. R. (2008). Why learning to write Chinese is a waste of time: A modest proposal. Foreign Language Annals, 411, 237–251.
Bachman, L. F. (1985). Performance on cloze tests with fixed-ratio and rational deletions. TESOL Quarterly, 161, 61–70.
Birckbichler, D., Corl, K., & Deville, C. (1993). The dynamics of language program testing: Implications for articulation and program revision. The Dynamics of Language Program Direction, Boston, MA: Heinle & Heinle.
Birdsong, D. (2005). Interpreting age effects in 2nd language acquisition. In J. Kroll, & A. de Groot. (Eds.). Handbook of Bilingualism (pp. 109–127). Oxford: Oxford University Press.
Brantmeier, C. (2006). Advanced L2 learners and reading placement: Self-assessment, CBT, and subsequent performance. System, 341, 15–35.
Brière, E. J., Clausing, G., Senko, D., & Purcell, E. (1978). A look at cloze testing across languages and levels. The Modern Language Journal, 621, 23–26.
Brown, J. D. (1980). Relative merits of four methods for scoring cloze tests. The Modern Language Journal, 641, 311–317.
(2013). My twenty-five years of cloze testing research: So what? International Journal of Language Studies, 71, 1–32.
Brown, J. D., Yamashiro, A. D., & Ogane, E. (2001). The emperor’s new cloze: Strategies for revising cloze tests. A focus on language test development: Expanding the language proficiency construct across a variety of tests, 143–161.
Carlisle, J., & Rice, M. (2004). Assessment of reading comprehension. Handbook of Language and Literacy, 521–555.
Caulfield, J., & Smith, W. C. (1981). The reduced redundancy test and the cloze procedure as measures of global language proficiency, The Modern Language Journal, 651, 54–58.
Chapelle, C. A. (1999). Validity in language assessment. Annual Review of Applied Linguistics, 191, 254–272.
Chihara, T., Oller, J., Weaver, K., & Chavez-Oller, M. A. (1997). Are cloze tests sensitive to constraints across sentences? Language Learning, 271, 63–73.
Dastjerdi, H. V., & Talebinezhad, M. R. (2006). Chain-preserving deletion procedure in cloze: A discoursal perspective. Language Testing, 231, 58–72.
Deville, M., & Deville, C. (1999). Computer adaptive testing in second language contexts. Annual Review of Applied Linguistics, 191, 273–299.
Douglas, M. O. (1994). Japanese cloze tests: Toward their construction. Japanese Language Education around the Globe, 41, 117–131.
Drackert, A. (2015). Validating language proficiency assessments in second language acquisition research: Applying an argument-based approach. Frankfurt am Main: Peter Lang.
Drackert, A., & Timukova, A. (2020). What does the analysis of C-test gaps tell us about the construct of a C-test? A comparison of foreign and heritage language learners’ performance. Language Testing, 371, 107–132.
Eckes, T., & Grotjahn, R. (2006). A closer look at the construct validity of C-tests. Language Testing, 231, 290–325.
Ehrlinger, J., Johnson, K., Banner, M., Dunning, D., & Kruger, J. (2008). Why the unskilled are unaware: Further explorations of (absent) self-insight among the incompetent. Organizational Behavior and Human Decision Processes, 1051, 98–121.
Fischer, R. (1981). Measuring linguistic competence in a foreign language, International Review of Applied Linguistics, 191, 207–217.
Flege, J. E., MacKey, I. A., & Piske, T. (2002). Assessing bilingual dominance. Applied Psycholinguistics, 231, 567–598.
Fotos, S. (1991). The cloze test as an integrative measure of EFL proficiency: A substitute for essays on college entrance examinations? Language Learning, 411, 313–336.
Gellert, A. S., & Elbro, C. (2013). Cloze tests may be quick, but are they dirty? Development and preliminary validation of a cloze test of reading comprehension. Journal of Psychoeducational Assessment, 311, 16–28.
Greene, B. B. (2001). Testing reading comprehension of theoretical discourse with cloze. Journal of Research in Reading, 241, 82–98.
Grotjahn, R., Klein-Braley, C., & Raatz, U. (2002). C-tests: An overview. In J. Coleman, R. Grotjahn, & U. Raatz. (Eds.), University language testing and the C-test (pp. 93–114). Bochum, Germany: AKS-Verlag.
Hanania, E., & Shikhani, M. (1986). Interrelationships among three tests of language proficiency: Standardized ESL, cloze and writing. TESOL Quarterly, 201, 97–110.
Harrington, M., & Carey, M. (2009). The online Yes/No test as a placement tool. System, 371, 614–626.
Heilenman, L. K. (1983). The use of a cloze procedure in foreign language placement. The Modern Language Journal, 671, 121–126.
Jia, G., Aaronson, D., & Wu, Y. (2002). Long-term language attainment of bilingual immigrants: Predictive variables and language group differences. Applied Psycholinguistics, 231, 599–621.
Jonz, J. (1994). The effects of textual cohesion and prior knowledge on native and nonnative cloze test scores. In J. W. Jr. Oller, & J. Jonz, (Eds.), Cloze and Coherence (pp. 269–285). London: Associated University Press.
Kleijn, S., Pander Maat, H., & Sanders, T. (2019). Cloze testing for comprehension assessment: The HyTeC-cloze. Language Testing, 361, 553–572.
Kobayashi, M. (2002). Cloze tests revisited: Exploring item characteristics with special attention to scoring methods. The Modern Language Journal, 861, 571–586.
Krausert, S. R. (1991). Determining the usefulness of self-assessment of foreign language skills: Post-secondary ESL students’ placement contribution. [Doctoral Dissertation, University of Southern California].
LeBlanc, R., & Painchaud, G. (1985). Self-assessment as a second language placement instrument. TESOL Quarterly, 191, 673–687.
Li, Y., & Wen, X., & Xie, T. (2014). CLTA 2012 survey of college-level Chinese language programs in North America. Journal of the Chinese Language Teachers Association, 491, 1–49.
Liu, J. L. C., & Li, Y. (2010). Encounters: A Cognitive Approach to Advanced Chinese. Bloomington, IN: Indiana University Press.
Looney, D., & Lusin, N. (2018). Enrollments in Languages Other than English in United States Institutions of Higher Education, Summer 2016 and Fall 2016: Final Report. Modern Language Association.
MacKey, I., & Flege, J. E. (2004). Effects of the age of second language learning on the duration of first and second language sentences. Applied Psycholinguistics, 251, 373–396.
Marian, V., Blumenfeld, H. K., & Kaushanskaya, M. (2007). The language experience and proficiency questionnaire (LEAP-Q): Assessing language profiles in bilinguals and multilinguals. Journal of Speech, Language and Hearing Research, 501, 940–967.
McKenna, M. C., & Layton, K. (1990). Concurrent validity of cloze as a measure of intersentential comprehension, Journal of Educational Psychology, 821, 372–377.
Mozgalina, A., & Ryshina-Pankova, M. (2015). Meeting the challenges of curriculum construction and change: Revision and validity evaluation of a placement test. The Modern Language Journal, 991, 346–370.
Oller, J. W., Jr. (1973). Cloze tests of second language proficiency and what they measure. Language Learning, 231, 105–118.
O’Toole, J., & King, R. (2011). The deceptive mean: Conceptual scoring of cloze entries differentially advantages more able readers. Language Testing, 281, 127–144.
Pearson, P. D., & Hamm, D. (2005). The assessment of reading comprehension: A review of practices — past, present, and future. In S. G. Paris, & S. A. Stahl. (Eds.), Children’s reading comprehension and assessment (pp. 13–69). Mahwah, NJ: Erlbaum.
Ross, S. (1998). Self-assessment in second language testing: A meta-analysis and analysis of experimental factors. Language Testing, 151, 1–20.
Sasaki, M. (2000). Effects of cultural schemata on students’ test-taking processes for cloze tests: A multiple-data source approach, Language Testing, 171, 85–114.
Spear-Swerling, L. (2004). Fourth graders’ performance on a state-mandated assessment involving two different measures of reading comprehension, Reading Psychology, 251, 121–148.
Stankowv, L. (1998). Calibration curves, scatterplots, and the distinction between general knowledge and perceptual tests. Learning and Individual Differences, 81, 28–51.
Taylor, W. L. (1953). Cloze procedure: A new tool for measuring readability. Journalism Quarterly, 301, 414–438.
(1957). “Cloze” readability scores as indices of individual differences in comprehension and aptitude. Journal of Applied Psychology, 411, 19–26.
Trace, J., Brown, J. D., Janssen, G., & Kozhevnikova, L. (2016). Determining cloze item difficulty from item and passage characteristics across different learner backgrounds. Language Testing, 341, 151–174.
Tremblay, A. (2011). Proficiency assessment standards in second language acquisition research: “Clozing” the gap. Studies in Second Language Acquisition, 331, 339–372.
Wang, X. (2008). Chinese Odyssey Volume 5 Textbook (Simplified & Traditional) (English and Chinese Edition), Cheng & Tsui.
