Cover not available

Article published In: International Journal of Corpus Linguistics
Vol. 23:1 (2018) ► pp.2854

Get fulltext from our e-platform
References (31)
References
Berzak, Y., Huang, Y., Barbu, A., Korhonen, A., & Katz, B. (2016a). Anchoring and agreement in syntactic annotations. In J. Su (Ed.), Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (pp. 2215–2224). Austin, TX: ACL. Google Scholar logo with link to Google Scholar
Berzak, Y., Kenney, J., Spadine, C., Wang, J. X., Lam, L., Mori, K. S., Garza, S., & Katz, B. (2016b). Universal dependencies for learner English. In K. Erk & N. A. Smith (Eds.), Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 737–746). Berlin: ACL. Google Scholar logo with link to Google Scholar
Buchholz, S., & Marsi, E. (2006). CoNLL-X shared task on multilingual dependency parsing. In L. Marquez & D. Klein (Eds.), Proceedings of the Tenth Conference on Computational Natural Language Learning (pp. 149–164). New York, NY: ACL. Google Scholar logo with link to Google Scholar
Cer, D. M., De Marneffe, M. -C., Jurafsky, D., & Manning, C. D. (2010). Parsing to Stanford dependencies: Trade-offs between speed and accuracy. In N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner & D. Tapias (Eds.), Proceedings of the Seventh International Conference on Language Resources and Evaluation (pp. 1628–1632). Valletta: ELRA.Google Scholar logo with link to Google Scholar
Charniak, E., & Johnson, M. (2005). Coarse-to-fine n-best parsing and MaxEnt discriminative reranking. In K. Knight (Ed.), Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (pp. 173–180). Stroudsburg: ACL.Google Scholar logo with link to Google Scholar
Council of Europe. (2001). Common European Framework of Reference for Languages: Learning, Teaching, Assessment. Cambridge: Cambridge University Press.Google Scholar logo with link to Google Scholar
De Marneffe, M. -C., MacCartney, B., & Manning, C. D. (2006). Generating typed dependency parses from phrase structure parses. In N. Calzolari, K. Choukri, A. Gangemi, B. Maegaard, J. Mariani, J. Odijk & D. Tapias (Eds.), Proceedings of the Fifth International Conference on Language Resources and Evaluation (pp. 449–454). Genoa: ELRA.Google Scholar logo with link to Google Scholar
De Marneffe, M. -C., & Manning, C. D. (2008). Stanford typed dependencies manual (Technical Report). Retrieved from [URL] (last accessed February 2018).
Dickinson, M., & Lee, C. M. (2013). Modifying corpus annotation to support the analysis of learner language. CALICO Journal, 26(3), 545–561. Google Scholar logo with link to Google Scholar
Dickinson, M., & Ragheb, M. (2009). Dependency annotation for learner corpora. In M. Passarotti, A. Przepiorkowski, S. Raynaud & F. Van Eynde (Eds.), Proceedings of the Eighth Workshop on Treebanks and Linguistic Theories (pp. 59–70). Milan: EDUCatt.Google Scholar logo with link to Google Scholar
Geertzen, J., Alexopoulou, T., & Korhonen, A. (2013). Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT). In R. T. Miller, K. I. Martin, C. M. Eddington, A. Henery, N. M. Miguel, A. Tseng, A. Tuninetti & D. Walter (Eds.), Proceedings of the 31st Second Language Research Forum: Building Bridges Between Disciplines. Somerville: Cascadilla Proceedings Project.Google Scholar logo with link to Google Scholar
Granger, S., Dagneaux, E., Meunier, F., & Paquot, M. (2009). The International Corpus of Learner English. Version 2. Handbook and CD-ROM. Louvain-la-Neuve: Presses Universitaires de Louvain.Google Scholar logo with link to Google Scholar
James, C. (2013). Errors in Language Learning and Use: Exploring Error Analysis. New York, NY: Addison Wesley Longman. Google Scholar logo with link to Google Scholar
Klein, D., & Manning, C. D. (2003a). Accurate unlexicalized parsing. In E. W. Hinrichs & D. Roth (Eds.), Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1 (pp. 423–430). Sapporo: ACL.Google Scholar logo with link to Google Scholar
(2003b). Fast exact inference with a factored model for natural language parsing. In S. Becker, S. Thrun, & K. Obermayer (Eds.), Advances in Neural Information Processing Systems 15 (pp. 3–10). Cambridge, MA: MIT Press.Google Scholar logo with link to Google Scholar
Kong, L., & Smith, N. A. (2014). An empirical comparison of parsing methods for stanford dependencies (arXiv preprint). Retrieved from [URL] (last accessed February 2018).
Korhonen, A. (2002). Semantically motivated subcategorization acquisition. In J. Pentheroudakis, N. Calzolari & A. Wu (Eds.), Proceedings of the ACL-02 Workshop on Unsupervised Lexical Acquisition-Volume 9 (pp. 51–58). Philadelphia, PA: ACL. Google Scholar logo with link to Google Scholar
Krivanek, J., & Meurers, D. (2011). Comparing rule-based and data-driven dependency parsing of learner language. In K. Gerdes, E. Hajičová & L. Wanner (Eds.), Proceedings of the First International Conference on Dependency Linguistics (Depling 2011) (pp. 310–317). Barcelona: IOS Press.Google Scholar logo with link to Google Scholar
Marcus, M. P., Marcinkiewicz, M. A., & Santorini, B. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2), 313–330.Google Scholar logo with link to Google Scholar
Martins, A. F. T., Almeida, M., & Smith, N. A. (2013). Turning on the Turbo: Fast third-order non-projective Turbo parsers. In H. Schuetze (Ed.), Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 617–622). Sofia: ACL.Google Scholar logo with link to Google Scholar
Nicholls, D. (2003). The Cambridge Learner Corpus: Error coding and analysis for lexicography and ELT. In A. Dawn, P. Rayson, A. Wilson & T. McEnery (Eds.), Proceedings of the Corpus Linguistics 2003 Conference (pp. 572–581). Lancaster: UCREL.Google Scholar logo with link to Google Scholar
Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryigit, G., Kübler, S., Marinov, S., & Marsi, E. (2007). MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(2), 95–135. Google Scholar logo with link to Google Scholar
Ott, N., & Ziai, R. (2010). Evaluating dependency parsing performance on German learner language. In M. Dickinson, K. Müürisep & M. Passarotti (Eds.), Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories (pp. 175–186). Tartu: NEALT.Google Scholar logo with link to Google Scholar
Paquot, M., & Plonsky, L. (2017). Quantitative research methods and study quality in learner corpus research. International Journal of Learner Corpus Research, 3(1), 61–94. Google Scholar logo with link to Google Scholar
Petrov, S., & Klein, D. (2007). Improved inference for unlexicalized parsing. In B. Carpenter, A. Stent & J. D. Williams (Eds.), Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL) (pp. 404–411). Rochester: ACL.Google Scholar logo with link to Google Scholar
Ragheb, M., & Dickinson, M. (2011). Avoiding the comparative fallacy in the annotation of learner corpora. In G. Granena, J. Koeth, S. Lee-Ellis, A. Lukyanchenko, G. P. Botana & E. Rhoades (Eds.), Selected Proceedings of the 2010 Second Language Research Forum: Reconsidering SLA Research, Dimensions, and Directions (pp. 114–124). Somerville, MA: Cascadilla Proceedings Project.Google Scholar logo with link to Google Scholar
(2013). Inter-annotator agreement for dependency annotation of learner language. In J. Tetreault, J. Burstein & C. Leacock (Eds.), Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications (pp. 169–179). Atlanta, GA: ACL.Google Scholar logo with link to Google Scholar
Rosen, A., Hana, J., Štindlová, B., & Feldman, A. (2014). Evaluating and automating the annotation of a learner corpus. Language Resources and Evaluation, 48(1), 65–92. Google Scholar logo with link to Google Scholar
Santorini, B. (1990). Part-of-speech tagging guidelines for the Penn Treebank Project (3rd revision, 2nd printing) (Technical report). Retrieved from [URL] (last accessed February 2018).
Tono, Y., & Díez-Bedmar, M. B. (2014). Focus on learner writing at the beginning and intermediate stages: The ICCI corpus. International Journal of Corpus Linguistics, 19(2), 163–177. Google Scholar logo with link to Google Scholar
Cited by (39)

Cited by 39 other publications

Geremia, Sara, Thomas Gaillat, Nicolas Ballier & Andrew J. Simpkin
2025. Exploring the cross-lingual influence of linguistic complexity in second language writing assessment. Assessing Writing 66  pp. 100951 ff. DOI logo
Murakami, Akira
2025. Towards more appropriate modelling of linguistic complexity measures: Beyond traditional regression models. Research Methods in Applied Linguistics 4:1  pp. 100182 ff. DOI logo
Sato, Masatoshi, Steven L. Thorne, Marije Michel, Theodora Alexopoulou & John Hellermann
2025. Language, people, classrooms, world: Blending disparate theories for united language education practices. The Modern Language Journal 109:S1  pp. 15 ff. DOI logo
Uchida, Satoru & Masashi Negishi
2025. Assigning CEFR-J levels to English learners’ writing: An approach using lexical metrics and generative AI. Research Methods in Applied Linguistics 4:2  pp. 100199 ff. DOI logo
Zhang, Yujie & Lawrence Jun Zhang
2025. Use of Dependency‐Annotated Learner Corpora in Measuring Syntactic Complexity for Granularity, Accuracy, Consistency, and Transparency: Implications for Research and Teaching. TESOL Quarterly 59:2  pp. 1050 ff. DOI logo
Alzahrani, Alaa & Lawrence Jun Zhang
2024. Utility of Kolmogorov complexity measures: Analysis of L2 groups and L1 backgrounds. PLOS ONE 19:4  pp. e0301806 ff. DOI logo
Bannò, Stefano & Marco Matassoni
2024. Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency. Speech Communication 157  pp. 103025 ff. DOI logo
Eguchi, Masaki & Kristopher Kyle
2024. Building custom NLP tools to annotate discourse-functional features for second language writing research: A tutorial. Research Methods in Applied Linguistics 3:3  pp. 100153 ff. DOI logo
Granger, Sylviane
2024. From early to future learner corpus research. International Journal of Learner Corpus Research 10:2  pp. 247 ff. DOI logo
Kim, Minjin & Xiaofei Lu
2024. L2 English speaking syntactic complexity: Data preprocessing issues, reliability of automated analysis, and the effects of proficiency, L1 background, and topic. The Modern Language Journal 108:1  pp. 270 ff. DOI logo
Kyle, Kristopher & Masaki Eguchi
2024. Evaluating NLP models with written and spoken L2 samples. Research Methods in Applied Linguistics 3:2  pp. 100120 ff. DOI logo
Lestari, Febriana
2024. Analysis of verb argument constructions (VACs) in L2 learners across proficiency levels: A corpus-based study in L1 Indonesian. Applied Corpus Linguistics 4:3  pp. 100097 ff. DOI logo
Ma, Hong, Jinglei Wang & Lianzhen He
2024. Linguistic Features Distinguishing Students’ Writing Ability Aligned with CEFR Levels. Applied Linguistics 45:4  pp. 637 ff. DOI logo
Shatz, Itamar, Theodora Alexopoulou & Akira Murakami
2024. The potential influence of cross-linguistic lexical similarity on lexical diversity in L2 English writing. Corpora 19:2  pp. 131 ff. DOI logo
Spina, Stefania, Irene Fioravanti, Luciana Forti & Fabio Zanda
2024. The CELI corpus: Design and linguistic annotation of a new online learner corpus. Second Language Research 40:2  pp. 457 ff. DOI logo
Vercellotti, MaryLou & Sean Hall
2024. Coding all clauses in L2 data: A call for consistency. Research Methods in Applied Linguistics 3:3  pp. 100132 ff. DOI logo
Xia, Detong, Mark A. Sulzer & Hye K. Pae
2024. Phrase-frames in business emails: a contrast between learners of business English and working professionals. Text & Talk 44:5  pp. 693 ff. DOI logo
Yan, Hengbin & Yinghui Li
2024. Constraction: a tool for the automatic extraction and interactive exploration of linguistic constructions. Linguistics Vanguard 9:1  pp. 215 ff. DOI logo
Berti, Barbara, Andrea Esuli & Fabrizio Sebastiani
2023. Unravelling interlanguage facts via explainable machine learning. Digital Scholarship in the Humanities 38:3  pp. 953 ff. DOI logo
Crossley, Scott & Langdon Holmes
Shatz, Itamar, Theodora Alexopoulou, Akira Murakami & Ramona Bongelli
2023. Examining the potential influence of crosslinguistic lexical similarity on word-choice transfer in L2 English. PLOS ONE 18:2  pp. e0281137 ff. DOI logo
Du, Xiangtao, Muhammad Afzaal & Hind Al Fadda
2022. Collocation Use in EFL Learners’ Writing Across Multiple Language Proficiencies: A Corpus-Driven Study. Frontiers in Psychology 13 DOI logo
Durrant, Philip
2022. Studying children's writing development with a corpus. Applied Corpus Linguistics 2:3  pp. 100026 ff. DOI logo
Gaillat, Thomas, Andrew Simpkin, Nicolas Ballier, Bernardo Stearns, Annanda Sousa, Manon Bouyé & Manel Zarrouk
2022. Predicting CEFR levels in learners of English: The use of microsystem criterial features in a machine learning approach. ReCALL 34:2  pp. 130 ff. DOI logo
McCallum, Lee & Philip Durrant
2022. Shaping Writing Grades, DOI logo
Murakami, Akira & Nick C. Ellis
2022. Effects of Availability, Contingency, and Formulaicity on the Accuracy of English Grammatical Morphemes in Second Language Writing. Language Learning 72:4  pp. 899 ff. DOI logo
Tan, Yi & Ute Römer
2022. Using phrase-frames to trace the language development of L1 Chinese learners of English. System 108  pp. 102844 ff. DOI logo
Xia, Detong, Haiyang Ai & Hye K. Pae
2022. “Please let me know”. International Journal of Learner Corpus Research 8:1  pp. 1 ff. DOI logo
Chen, Xiaobin, Theodora Alexopoulou & Ianthi Tsimpli
2021. Automatic extraction of subordinate clauses and its application in second language acquisition research. Behavior Research Methods 53:2  pp. 803 ff. DOI logo
Huang, Yan, Akira Murakami, Theodora Alexopoulou & Anna Korhonen
2021. Subcategorization frame identification for learner English. International Journal of Corpus Linguistics 26:2  pp. 187 ff. DOI logo
Kyle, Kristopher
2021. Natural language processing for learner corpus research. International Journal of Learner Corpus Research 7:1  pp. 1 ff. DOI logo
Rubin, Rachel
2021. Assessing the impact of automatic dependency annotation on the measurement of phraseological complexity in L2 Dutch. International Journal of Learner Corpus Research 7:1  pp. 131 ff. DOI logo
Sun, Kun & Xiaofei Lu
2021. Assessing Lexical Psychological Properties in Second Language Production: A Dynamic Semantic Similarity Approach. Frontiers in Psychology 12 DOI logo
Sun, Kun & Rong Wang
2021. Using the Relative Entropy of Linguistic Complexity to Assess L2 Language Proficiency Development. Entropy 23:8  pp. 1080 ff. DOI logo
Gilquin, Gaëtanelle
2020. Learner Corpora. In A Practical Handbook of Corpus Linguistics,  pp. 283 ff. DOI logo
Shatz, Itamar
2020. Refining and modifying the EFCAMDAT. International Journal of Learner Corpus Research 6:2  pp. 220 ff. DOI logo
Ballier, Nicolas, Thomas Gaillat, Andrew Simpkin, Bernardo Stearns, Manon Bouyé & Manel Zarrouk
2019. A Supervised Learning Model for the Automatic Assessment of Language Levels Based on Learner Errors. In Transforming Learning with Meaningful Technologies [Lecture Notes in Computer Science, 11722],  pp. 308 ff. DOI logo
[no author supplied]
2022. Automated Essay Scoring [Synthesis Lectures on Human Language Technologies, ], DOI logo

This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.

Mobile Menu Logo with link to supplementary files background Layer 1 prag Twitter_Logo_Blue