
Review article published in: International Journal of Learner Corpus Research
Vol. 11:2 (2025), pp. 309–335

References (38)
Alsufieva, A., Kisselev, O., & Freels, S. (2012). Results 2012: Using flagship data to develop a Russian learner corpus of academic writing. Russian Language Journal, 62(1), 79–105.
Arhar Holdt, Š., Gantar, P., Bon, M., Gapsa, M., Lavrič, P., & Klemen, M. (2023). Dataset for evaluation of Slovene spell- and grammar-checking tools Šolar-Eval 1.0. (Slovenian language resource repository CLARIN.SI). [URL]
Arhar Holdt, Š., & Kosem, I. (2024). Šolar, the developmental corpus of Slovene. Language Resources and Evaluation, 1–27.
Arnardóttir, Þ., Xu, X., Guðmundsdóttir, D., Stefánsdóttir, L., & Ingason, A. (2021). Creating an Error Corpus: Annotation and Applicability. In Proceedings of CLARIN 2021 Annual Conference (pp. 59–63).
Bol, T., de Vaan, M., & van de Rijt, A. (2018). The Matthew effect in science funding. Proceedings of the National Academy of Sciences, 115(19), 4887–4890.
Boyd, A. (2018). Using Wikipedia edits in low resource grammatical error correction. In Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text (pp. 79–84). Association for Computational Linguistics.
Boyd, A., Hana, J., Nicolas, L., Meurers, D., Wisniewski, K., Abel, A., Schöne, K., Štindlová, B., & Vettori, C. (2014). The MERLIN corpus: Learner language and the CEFR. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14) (pp. 1281–1288). European Language Resources Association (ELRA).
Council of Europe. (2020). Common European Framework of Reference for Languages: Learning, teaching, assessment. Companion volume with new descriptors. Council of Europe Publishing.
Darģis, R., Auziņa, I., Kaija, I., Levāne-Petrova, K., & Pokratniece, K. (2022). LaVA–Latvian Language Learner corpus. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 727–731).
Darģis, R., Auziņa, I., Levāne-Petrova, K., & Kaija, I. (2020). Quality focused approach to a learner corpus development. In Proceedings of the Twelfth Language Resources and Evaluation Conference (pp. 392–396).
Davis, C., Caines, A., Andersen, Ø., Taslimipoor, S., Yannakoudakis, H., Yuan, Z., Bryant, C., Rei, M., & Buttery, P. (2024). Prompting open-source and commercial language models for grammatical error correction of English learner text. In Findings of the Association for Computational Linguistics: ACL 2024 (pp. 11952–11967). Association for Computational Linguistics.
Ducel, F., Fort, K., Lejeune, G., & Lepage, Y. (2022). Do we name the languages we study? The #BenderRule in LREC and ACL articles. In N. Calzolari, F. Béchet, P. Blache, K. Choukri, C. Cieri, T. Declerck, S. Goggi, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, J. Odijk & S. Piperidis (Eds.), Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 564–573). European Language Resources Association.
Gantar, P., Bon, M., Gapsa, M., & Arhar Holdt, Š. (2023). Šolar-Eval: Evalvacijska množica za strojno popravljanje jezikovnih napak v slovenskih besedilih. Jezik in Slovstvo, 68(4), 89–108.
Glišić, I., & Ingason, A. K. (2022). The Nature of Icelandic as a second language: An insight from the Learner Error Corpus for Icelandic. In Proceedings of the CLARIN Annual Conference (pp. 23–33).
Godfroid, A., & Andringa, S. (2023). Uncovering sampling biases, advancing inclusivity, and rethinking theoretical accounts in Second Language Acquisition: Introduction to the special issue SLA for all? Language Learning, 73(4), 981–1002.
Hammarstedt, M., Schumacher, A., Borin, L., & Forsberg, M. (2022). Sparv 5 user manual (Tech. Rep.). Språkbanken Text.
Ingason, A. K., Stefánsdóttir, L. B., Arnardóttir, Þ., & Xu, X. (2021). Icelandic Error Corpus (IceEC) Version 1.1. (CLARIN-IS).
Ingason, A. K., Stefánsdóttir, L. B., Arnardóttir, Þ., Xu, X., Glišić, I., & Guðmundsdóttir, D. (2022). The Icelandic L2 Error Corpus (IceL2EC) 1.3 (22.10). (CLARIN-IS).
Masciolini, A., Caines, A., De Clercq, O., Kruijsbergen, J., Kurfalı, M., Muñoz Sánchez, R., Volodina, E., & Östling, R. (2025a). The MultiGEC-2025 shared task on multilingual grammatical error correction at NLP4CALL. In R. Muñoz Sánchez, D. Alfter, J. Kallas, & E. Volodina (Eds.), Proceedings of the 14th workshop on Natural Language Processing for Computer Assisted Language Learning. Tallinn, Estonia: University of Tartu. [URL]
Masciolini, A., Caines, A., De Clercq, O., Kruijsbergen, J., Kurfalı, M., Muñoz Sánchez, R., … Zesch, T. (2025b). An overview of grammatical error correction for the twelve MultiGEC-2025 languages. GU-ISS Forskningsrapporter från Institutionen för svenska språket. Institution for Swedish, Multilingualism, Language Technology; University of Gothenburg. [URL]
Merton, R. K. (1968). The Matthew effect in science: The reward and communication systems of science are considered. Science, 159(3810), 56–63.
Náplava, J., Straka, M., Straková, J., & Rosen, A. (2022). Czech grammar error correction with a large and diverse corpus. Transactions of the Association for Computational Linguistics, 10(1), 452–467.
Nicholls, D., Caines, A., & Buttery, P. (2024). The Write & Improve Corpus 2024: Error-annotated and CEFR-labelled essays by learners of English. Cambridge University Press Assessment.
Palma Gomez, F., & Rozovskaya, A. (2024). Multi-reference benchmarks for Russian grammatical error correction. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (volume 1: Long papers) (pp. 1253–1270). Association for Computational Linguistics.
Perc, M. (2014). The Matthew effect in empirical data. Journal of The Royal Society Interface, 11(98), 20140378.
Rosen, A., Hana, J., Hladká, B., Jelínek, T., Škodová, S., & Štindlová, B. (2020). Compiling and annotating a learner corpus for a morphologically rich language — CzeSL, a corpus of non-native Czech. Karolinum, Charles University Press.
Rozovskaya, A., & Roth, D. (2019). Grammar error correction in morphologically rich languages: The case of Russian. Transactions of the Association for Computational Linguistics, 7(1), 1–17.
Rudebeck, L., & Sundberg, G. (2021). SweLL correction annotation guidelines. (Tech. Rep.). GU-ISS Research report series, Department of Swedish, University of Gothenburg.
Sakaguchi, K., Napoles, C., Post, M., & Tetreault, J. (2016). Reassessing the goals of grammatical error correction: Fluency instead of grammaticality. Transactions of the Association for Computational Linguistics, 4(1), 169–182.
Šebesta, K., Bedřichová, Z., Šormová, K., Straňák, P., & Peterek, N. (2014). ROMi 1.0. (LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University).
Šebesta, K., Goláňová, H., Letafková, J., & Jelínková, B. (2016). AKCES 1. (LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University).
Søgaard, A. (2022). Should we ban English NLP for a year? In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (pp. 5254–5260). Association for Computational Linguistics.
Syvokon, O., Nahorna, O., Kuchmiichuk, P., & Osidach, N. (2023). UA-GEC: Grammatical error correction and fluency corpus for the Ukrainian Language. In Proceedings of the second Ukrainian Natural Language Processing workshop (UNLP) (pp. 96–102). Association for Computational Linguistics.
Syvokon, O., & Romanyshyn, M. (2023). The UNLP 2023 Shared Task on Grammatical Error Correction for Ukrainian. In Proceedings of the second Ukrainian Natural Language Processing workshop (UNLP) (pp. 132–137). Association for Computational Linguistics.
Tantos, A., Amvrazis, N., & Drakonaki, E. (2023). Greek Learner Corpus II (GLCII): Design and development of an online corpus for L2 Greek. Journal of Applied Linguistics, 36(1), 125–150.
Volodina, E., Granstedt, L., Matsson, A., Megyesi, B., Pilán, I., Prentice, J., … & Wirén, M. (2019). The SweLL language learner corpus: From design to annotation. Northern European Journal of Language Technology (NEJLT), 6(1), 67–104.
Volodina, E., Granstedt, L., Matsson, A., Megyesi, B., Pilán, I., Prentice, J., … & Wirén, M. (2022). SweLL-gold. Språkbanken Text. Distributed via SBX/CLARIN.
Wisniewski, K., Schöne, K., Nicolas, L., Vettori, C., Boyd, A., Meurers, D., … Hana, J. (2013). MERLIN: An online trilingual learner corpus empirically grounding the European Reference Levels in authentic learner data. In International Conference, ICT for Language Learning, 6th edition.