In:Dependency Linguistics: Recent advances in linguistic theory using dependency structures
Edited by Kim Gerdes, Eva Hajičová and Leo Wanner
[Linguistik Aktuell/Linguistics Today 215] 2014
► pp. 161–182
Dependency annotation of coordination for learner language
Published online: 1 October 2014
https://doi.org/10.1075/la.215.08dic
https://doi.org/10.1075/la.215.08dic
We present a strategy for dependency annotation of corpora of second language learners, dividing the annotation into different layers and separating linguistic constraints from realizations. Specifically, subcategorization information is required to compare to the annotation of realized dependencies, in order to fully capture learner innovations. Building from this, we outline dependency annotation for coordinate structures, detailing a number of constructions such as right node raising and the coordination of unlikes. We conclude that branching structures are preferable to treating the conjunction as et al. the head, as this avoids duplicating annotation.
References (44)
Abeillé, A. & Rambow, O. 2000. Tree adjoining grammar: An overview. In Tree Adjoining Grammars: Formalisms, Linguistic Analyses and Processing, A. Abeillé & O. Rambow (eds), 1–68. Stanford CA: CSLI.
Bardovi-Harlig, K. 1999. Examining the role of text type in L2 tense-aspect research: Broadening our horizons. In
Proceedings of the Third Pacific Second Language Research Forum
, Vol. 1, 129–138. Tokyo.
Buch-Kromann, M. 2009. Discontinuous Grammar. A Dependency-based Model of Human Parsing and Language Learning. Saarbrücken: VDM Verlag.
Buchholz, S. & Marsi, E. 2006. CoNLL-X shared task on multilingual dependency parsing. In
Proceedings of CoNLL-X
, 149–164. New York NY.
Debusmann, R., Duchier, D. & Kruijff, G.-M.M. 2004. Extensible dependency grammar: A new methodology. In
Proceedings of the COLING 2004. Workshop on Recent Advances in Dependency Grammar
, Geneva/SUI.
Deulofeu, J., Duffort, L., Gerdes, K., Kahane, S. & Pietrandrea, P. 2010. Depends on what the French say. Spoken corpus annotation with and beyond syntactic functions. In
Proceedings of the Fourth Linguistic Annotation Workshop
, 274–281. Uppsala.
Díaz Negrillo, A. & Fernández Domínguez, J. 2006. Error tagging systems for learner corpora. Revista Española de Lingüística Aplicada (RESLA) 19: 83–102.
Díaz Negrillo, A., Meurers, D., Valera, S. & Wunsch, H. 2010. Towards interlanguage POS annotation for effective learner corpora in SLA and FLT. Language Forum 36: 1–2.
Dickinson, M. & Ragheb, M. 2009. Dependency annotation for learner corpora. In
Proceedings of the TLT-8
, Milan, Italy.
Granger, S. 2003. Error-tagged learner corpora and CALL: A promising synergy. CALICO Journal 20(3): 465–480.
Hirschmann, H., Lüdeling, A., Rehbein, I., Reznicek, M. & Zeldes, A. 2010. Syntactic overuse and underuse: A study of a parsed learner corpus and its target hypothesis. Talk given at the Ninth Workshop on Treebanks and Linguistic Theory, December.
Johansson, R. & Nugues, P. 2007. Extended constituent-to-dependency conversion for English. In
Proceedings of NODALIDA 2007
. Tartu, Estonia.
Juffs, A. 2005. The influence of first language on the processing of wh-movement in English as a second language. Second Language Research 21(2): 121–151.
Kromann, M.T. 2003. The Danish dependency treebank and the underlying linguistic theory. In
Proceedings of TLT-03
, Växjö, Sweden.
Kübler, S., McDonald, R. & Nivre, J. 2009. Dependency parsing. In Synthesis Lectures on Human Language Technologies, G. Hirsts (ed.). San Rafael CA: Morgan & Claypool.
Levin, B. 1993. English Verb Classes and Alternations: A Preliminary Investigation. Chicago IL: University of Chicago Press.
Lüdeling, A., Walter, M., Kroymann, E. & Adolphs, P. 2005. Multi-level error annotation in learner corpora. In
Proceedings of Corpus Linguistics
, Birmingham.
McEnery, T., Xiao, R. & Tono, Y. 2006. Corpus-based Language Studies: An Advanced Resource Book. London: Routledge.
Mel’čuk, I. 1988. Dependency Syntax: Theory and Practice. Albany NY: State University of New York Press.
Nicholls, D. 2003. The Cambridge Learner Corpus. Error coding and analysis for lexicography and ELT. In
Proceedings of the Corpus Linguistics 2003. Conference (CL 2003)
, 572–581. Lancaster University.
Nilsson, J., Nivre, J. & Hall, J. 2007. Generalizing tree transformations for inductive dependency parsing. In
Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
, 968–975. Prague.
Nivre, J. 2005. Dependency Grammar and Dependency Parsing [MSI report 05133]. Växjö: University of Växjö, School of Mathematics and Systems Engineering.
Osborne, T. 2008. Major constituents and two dependency grammar constraints on sharing in coordination. Linguistics 46(6): 1109–1165.
Ott, N. & Ziai, R. 2010. Evaluating dependency parsing performance on German learner language. In
Proceedings of TLT-9
, Vol. 9, 175–186. Tartu: University of Tartu.
Pendar, N. & Chapelle, C. 2008. Investigating the promise of learner corpora: Methodological issues. CALICO Journal 25(2): 189–206.
Pienemann, M. 1992. Coala. A computational system for interlanguage analysis. Second Language Research 8(1): 58–92.
. 1998. Language Processing and Second Language Development: Processability Theory [Studies in Bilingualism 15]. Amsterdam: John Benjamins.
Pollard, C. & Sag, I.A. 1994. Head-Driven Phrase Structure Grammar. Chicago IL: The University of Chicago Press.
Ragheb, M. & Dickinson, M. 2011. Avoiding the comparative fallacy in the annotation of learner corpora. In
Selected Proceedings of the 2010 Second Language Research Forum: Reconsidering SLA Research, Dimensions, and Directions
, 114–124. Somerville MA: Cascadilla Proceedings Project.
. 2012. Defining Syntax for Learner Language Annotation. In
Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Poster Session
, 965–974. Mumbai, India.
Ross, J.R. 1967. Constraints on Variables in Syntax. Ph.D. dissertation, MIT.
Rozovskaya, A. & Roth, D. 2010. Annotating ESL errors: Challenges and rewards. In
Proceedings of the NAACL HLT 2010. Fifth Workshop on Innovative Use of NLP for Building Educational Applications
, 28–36. Los Angeles CA.
Sag, I.A., Gazdar, G., Wasow, T. & Weisler, S. 1985. Coordination and how to distinguish categories. Natural Language and Linguistic Theory 3: 117–171.
Sag, I.A. 2003. Coordination and underspecification. In Proceedings of the Ninth International Conference on HPSG, J. Bok Kim & S. Wechsler (eds). Stanford CA: CSLI.
Sagae, K., Davis, E., Lavie, A., MacWhinney, B. & Wintner, S. 2007. High-accuracy annotation and parsing of CHILDES transcripts. In
Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
, 25–32. Prague.
. 2010. Morphosyntactic annotation of CHILDES transcripts. Journal of Child Language 37(3): 705–729.
Sampson, G. 1995. English for the Computer: The SUSANNE Corpus and Analytic Scheme. Oxford: Clarendon Press.
Sgall, P., Panevová, J. & Hajičová, E. 2004. Deep syntactic annotation: Tectogrammatical representation and beyond. In Proceedings of the Workshop on Frontiers in Corpus Annotation, 32–38. Boston MA: ACL.
Steedman, M. & Baldridge, J. 2011. Combinatory categorial grammar. In Non-Transformational Syntax: Formal and Explicit Models of Grammar, R. Borsley & K. Borjars (eds). Chichester: Wiley-Blackwell.
Tetreault, J. & Chodorow, M. 2008. Native judgments of non-native usage: Experiments in preposition error detection. In
Proceedings of COLING-08
, 24–32. Manchester.
