In:The Documentarist Turn: From observable linguistic behaviour to typological generalizations
Edited by Sonja Riesberg, Uta Reinöhl and Birgit Hellwig
[Studies in Language Companion Series 240] 2026
► pp. 488–517
Chapter 19High animacy does not favour zero over pronouns – especially for objects
This content is being prepared for publication; it may be subject to changes.
Abstract
Since Givón (1983) and Ariel (1990), a large body of research continues to assume that zero anaphors reflect
a higher degree of accessibility than overt pronominal forms. However, when the features of animacy or person
are taken as its metric, existing case studies do not consistently confirm a correlation between high
accessibility and zero form, and some report the opposite tendency. In this chapter we bring a broader
cross-linguistic perspective to the question, leveraging data from a multilingual richly annotated spoken
language corpus covering 16 typologically diverse languages. We identify a surprisingly robust trend in
support of a reversal of expected accessibility-accounts: Overall, it is non-human referents rather than
human, and third person rather than first and second person, which favour zero expression. Our findings
provide cross-linguistic support for the constraint ‘avoid non-human pronouns’, found by Genetti and Crain (2003) for Nepali, and a tendency to overtly realise speech-act
participants as pronouns. Furthermore, we identify a significant effect of syntactic role, such that these
regularities are more clearly evident for direct objects than for transitive subjects. We explore possible
explanations for these results, noting similarities between our findings and well-documented patterns of
differential indexing, informativity and surprisal (Haspelmath
2021). More generally, these findings call for a re-assessment of the view that a single principle such
as accessibility can account for referential choice among all form types. For the choice between reduced
forms (pronouns and zero), distinct principles may be operative.
Article outline
- 1.Introduction
- 2.Theoretical background
- 2.1Accessibility and referential choice
- 2.2The paradox of reduced forms
- 2.3Interim summary
- 3.Research questions and data sources
- 4.Results
- 5.Discussion
- 6.Summary and outlook
Notes References Appendix
References (112)
Aissen, Judith. 2003. “Differential
Object Marking. Iconicity vs. Economy.” Natural Language and Linguistic
Theory 21 (3): 435–483.
. 2000. “The
Development of Person Agreement Markers: From Pronouns to Higher Accessibility
Markers.” In Usage-based Models of
Language, ed. by Suzanne Kemmer, and Michael Barlow, 197–261. Stanford: CSLI Publications.
Arnold, Jennifer E. 1998. Reference
Form and Discourse Patterns. Doctoral
dissertation. Stanford, CA: Stanford University.
2003. “Multiple
Constraints on Reference Form.” In Preferred
Argument Structure: Grammar as Architecture for Function, ed.
by John W. DuBois, Lorraine Kumpf, and William Ashby, 225 — 246. Amsterdam: John Benjamins.
2008. “Reference
Production. Production-Internal and Addressee-Oriented
Processes.” Language and Cognitive
Processes 23 (4): 495–527.
2010. “How
Speakers Refer. The Role of Accessibility.” Languages and Linguistics
Compass 4 (4): 187–203.
Arnold, Jennifer E., Anthony Losongco, Thomas Wasow, and Ryan Ginstrom. 2000. “Heaviness
vs. Newness. The Effect of Structural Complexity and Discourse Status on Constituent
Ordering.” Language 76 (1): 28–55.
Adibifar, Shirin. 2016. "Multi-CAST Persian." In Multi-CAST: Multilingual Corpus of Annotated Spoken Texts. Version 2507", ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg. ()
Barth, Danielle, and Nicholas Evans. 2017. The
Social Cognition Parallax Corpus (SCOPIC). Language Documentation &
Conservation Special Publication No. 12. [URL]
Barth, Danielle, Kira Davey, and Maria Matheas. 2023. “Multi-CAST
Matukar-Panau.” In Multi-CAST: Multilingual
Corpus of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Bickel, Balthasar. 2003. “Referential
Density in Discourse and Syntactic
Typology.” Language 79 (4): 708–736.
Bickel, Balthasar, Giorgio Iemmolo, Taras Zakharko, and Alena Witzlack-Makarevich. 2013. “Patterns
of Alignment in Verb Agreement.” In Languages
Across Boundaries. Studies in Memory of Anna Siewierska, ed.
by Dik Bakker, and Martin Haspelmath, 15–36. Berlin: Mouton de Gruyter.
Bisang, Walter. 2014. “Overt
and Hidden Complexity. Two Types of Complexity and Their
Implications.” Poznań Studies in Contemporary
Linguistics 50 (2): 127–143.
. 2015. “Hidden
Complexity. The Neglected Side of Complexity and its
Implications.” Linguistics
Vanguard 1 (1): 177–187.
Bittner, Dagmar. 2007. “Influence
of Animacy and Grammatical Role on Production and Comprehension of Intersentential Pronouns in German
L1-Acquisition.” ZAS Papers in
Linguistics 48: 103–138.
Bogomolova, Natasha, Dmitry Ganenkov, and Nils N. Schibor. 2021. “Multi-CAST
Tabasaran.” In Multi-CAST: Multilingual
Corpus of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Vogels, Jorrig, Alfons Maes, and Emil Krahmer. 2014. "Choosing Referring Expressions in Belgian and Netherlandic Dutch: Effects of Animacy."Lingua 145: 104–121.
Brickell, Timothy. 2016. Multi-CAST
Tondano. In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts ed. By Geoffrey Haig and Stefan Schnell. Bamberg: University of Bamberg.
Chafe, Wallace. 1976. “Givenness,
Contrastiveness, Definiteness, Subjects, Topics, and Point of
View.” In Subject and
Topic, ed. by Charles N. Li, 27–55. New York: Academic Press.
. 1980. The
Pear Stories. Cognitive, Cultural, and Linguistic Aspects of Narrative
Production. Norwood, NJ: Ablex.
. 1987. “Cognitive
Constraints on Information
Flow.” In Coherence and Grounding in
Discourse, ed. by Russel S. Tomlin, 21–51. Amsterdam: John Benjamins.
. 1994. Discourse,
Consciousness, and Time. The Flow and Displacement of Conscious Experience in Speaking and
Writing. Chicago, IL: The University of Chicago Press.
Clancy, Patricia M. 1980. “Referential
Choice in English and Japanese Narrative
Discourse.” In The Pear Stories. Cognitive,
Cultural, and Linguistic Aspects of Narrative Production, ed.
by Wallace Chafe, 127–202. Norwood, NJ: Ablex.
Clark, Herbert H., and Gregory L. Murphy. 1982. “Audience
Design in Meaning and Reference.” In Language
and Comprehension, ed. by Jean-François Le Ny, and Walter Kintsch, 287–299. Amsterdam, New York, Oxford: North-Holland.
2003. “Agreement:
Terms and Boundaries.” In The Role of
Agreement in Natural Language: TLS 5 Proceedings, ed.
by W. E. Griffin, 109–122.
2003. “Argument
Structure. Grammar in Use.” In Preferred
Argument Structure. Grammar as Architecture for Function, ed.
by John W. Du Bois, Lorraine Kumpff, and William J. Ashby, 11–60. Amsterdam: John Benjamins.
Duranti, Alessandro. 1994. From
Grammar to Politics: Linguistic Anthropology in a Western Samoan
Village. Oakland: University of California Press.
Ganenkov, Dmitry, and Nils N. Schiborr. 2025. “Multi-CAST
Chirag.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig and, Stefan Schnell. Bamberg: University of Bamberg.
Genetti, Carol, and Laura D. Crain. 2003. “Beyond
Preferred Argument Structure: Sentences, Pronouns and Given Referents in
Nepali.” In Preferred Argument Structure:
Grammar as Architecture for Function, ed. by John W. Du Bois, Lorraine E. Kumpf, and William J. Ashby, 197–223. Amsterdam: John Benjamins.
Gibson, Edward, Richard Futrell, Steven T. Piantadosi, Isabelle Dautriche, Kyle Mahowald, Leon Bergen, and Roger Levy. 2019. “How
Efficiency Shapes Human Language.” Trends in Cognitive
Science 23 (12): 389–407.
Givón, Talmy. 1983. “An
Introduction.” In Topic Continuity in
Discourse, ed. by Talmy Givón, 1–42. Amsterdam: John Benjamins.
. 2017. The
Story of
Zero. Amsterdam/Philadelphia: John Benjamins.
Grosz, Barbara J., Aravind K. Joshi, and Scott Weinstein. 1995. “Centering.
A Framework for Modelling the Local Coherence of
Discourse.” Computational
Linguistics 21 (2): 203–225.
Gundel, Jeanette K., Nancy Hedberg, and Ron Zacharski. 1993. “Cognitive
Status and the Form of Referring Expressions in
Discourse.” Language 69 (2): 274–307.
Hadjidas, Harris, and Maria Vollmer. 2015. “Multi-CAST
Cypriot Greek.” In Multi-CAST: Multilingual
Corpus of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Haig, Geoffrey. 2018. “The
Grammaticalization of Object Pronouns: Why Differential Object Indexing is an Attractor
State.” Linguistics 56 (4): 781–818.
Haig, Geoffrey, and Stefan Schnell. 2014. Annotations
Using GRAID: Grammatical Relations and Animacy in
Discourse. Bamberg: University of Bamberg.
. 2021. Multi-CAST.
Multilingual Corpus of Annotated Spoken Texts. Version
2507. Bamberg: University of Bamberg.
Haig, Geoffrey, Stefan Schnell, and Nils N. Schiborr. 2021. “Universals
of Reference in Discourse and Grammar: Evidence from the Multi-CAST Collection of Spoken
Corpora.” In Doing Corpus-Based Typology with
Spoken Language Data: State of the Art, ed. by Geoffrey Haig, Stefan Schnell, and Frank Seifart, 141–177. Honolulu, HI: University of Hawai’i Press. [URL]
Haig, Geoffrey, Maria Vollmer, and Hanna Thiele. 2019. “Multi-CAST
Northern Kurdish.” In Multi-CAST:
Multilingual Corpus of Annotated Spoken Texts, ed.
by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Harel, Zvi. 1992. “Yossi,
Believe Me as You Never Believed Me Before.” The Haaretz
Supplement, 13 Nov
1992, 20–25.
Haspelmath, Martin. 2013. “Argument
Indexing. A Conceptual Framework for the Syntactic Status of Bound Person
Forms.” In Languages across Boundaries.
Studies in Memory of Anna Siewierska, ed. by Dik Bakker, and Martin Haspelmath, 197–226. Berlin: Mouton de Gruyter.
. 2021. “Explaining
Grammatical Coding Asymmetries. Form-Frequency Correspondences and
Predictability.” Journal of
Linguistics 57 (3): 605–633.
Haude, Katharina, Marc Allassonière-Tang. to
Appear. “Lexical, Pronominal and Zero Argument Encoding in
Movima.” In Topicality and the Shaping of
Grammar, ed. by Claudine Chamoreau, and Enrique Palancer. Amsterdam: John Benjamins.
Himmelmann, Nikolaus P. 1997. Deiktikon,
Artikel, Nominalphrase. Zur Emergenz syntaktischer
Struktur. Tübingen: Niemeyer.
1999. “The
Lack of Zero Anaphora and Incipient Person Marking in Tagalog.” Oceanic
Linguistics 38 (2): 231–269.
2012. “Linguistic
Data Types and the Interface between Language Documentation and
Description.” Language Documentation &
Conservation 6: 187–207. [URL]
2014. “Asymmetries
in the Prosodic Parsing of Function Words: Another Look at the Suffixing
Preference.” Language 90 (4): 927–960.
Himmelmann, Nikolaus P., and Beatrice Primus. 2015. “Prominence
Beyond Prosody — A First
Approximation.” In Prominences in
Linguistics. Proceedings of the pS-prominenceS International Conference, ed.
by Amedeo De Dominicis, 38–58. Viterbo: DISUCOM Press.
Holmberg, Anders. 2009. “Null
Subject Parameters.” In Parametric
Variation, ed. by Theresa Bieberauer, Anders Holmberg, Ian Roberts, and Michelle Sheehan, 88–124. Cambridge: Cambridge University Press.
Horn, Lawrence. 1984. “Toward
a New Taxonomy for Pragmatic Inference: Q-Based and R-Based
Implicature.” In Meaning, Form, and Use in
Context: Linguistic Applications, ed. by D. Schiffrin, 11–42. Georgetown University Press.
Karttunen, Lauri. 1976. “Discourse
Referents.” In Syntax and Semantics 7. Notes
from the Linguistic Underground, ed. by James D. McCawley, 363–385. New York: Academic Press.
Kehler, Andrew. 2002. Coherence,
Reference, and the Theory of Grammar. PhD
dissertation. Stanford, CA: CSLI Publications.
. 2006. “Discourse
Coherence.” In Handbook of
Pragmatics, ed. by Laurence Horn and Gregory Ward, 241–265. Malden, MA: Blackwell.
Kehler, Andrew, and Hannah Rhode. 2013. “A
Probabilistic Reconciliation of Coherence-Driven and Centering-Driven Theories of Pronoun
Interpretation.” Theoretical
Linguistics 39 (1–2): 1–37.
Kehler, Andrew, Laura Kertz, Hannah Rohde, and Jeffrey L. Elman. 2008. “Coherence
and Coreference Revisited.” Journal of
Semantics 25 (1): 1–44.
Kimoto, Yukinori. 2019. “Multi-CAST
Arta.” In Multi-CAST: Multilingual Corpus of
Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Kurabe, Keita. 2021. “Multi-CAST
Jinghpaw.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Levy, Gideon. 1995. “That
I Should Have to Defend Arafat?” The Ha’aretz
Supplement. February 17,
1995, 14–16.
Levy, Roger, and T. Florian Jäger. 2007. “Speakers
Optimize Information Density through Syntactic
Reduction.” In Adavances in Neural
Information Processing Systems 19. Proceedings of the 2006 Conference, ed.
by Bernard Schölkopf, John Platt, and Thomas Hoffmann, 849–856. Cambridge, MA: MIT Press.
Linders, Guido M., and Stefan Schnell. Under
review. “Functional factors predict referential choice similarly
across languages: A cross-linguistic computational analysis.” Preprint
at PsyArXiv.
Meakins, Felicity. 2015. “Not
Obligatory. Bound Pronoun Variation in Gurindji and
Bilinarra.” Asia-Pacific Language
Variation 1 (2): 128–162.
Forker, Diana, and Nils N. Schiborr. 2019. "Multi-CAST Sanzhi Dargwa." In Multi-CAST: Multilingual Corpus of Annotated Spoken Texts. Version 2507", ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg. ()
Meng, Chenxi. 2019. “Multi-CAST
Tulil.” In Multi-CAST: Multilingual Corpus of
Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Mosel, Ulrike, and Stefan Schnell. 2015. “Multi-CAST
Teop.” In Multi-CAST: Multilingual Corpus of
Annotated Spoken Texts, ed. by Geoffrey Haig and Stefan Schnell. Bamberg: University of Bamberg.
Neeleman, Ad, and Kriszta Szendröi. 2007. “Radical
Pro Drop and the Morphology of Pronouns.” Linguistic
Inquiry 38 (4): 671–714.
Patterson, Clare, and Petra B. Schumacher. 2020. “The
Timing of Prominence Information during the Resolution of German Personal and Demonstrative
Pronouns.” Dialogue &
Discourse 11 (1): 1–39.
Prada Pérez, Ana De. 2015. “First
Person Singular Subject Pronoun Expression in Spanish in Contact with
Catalan.” In Subject Pronoun Expression in
Spanish: A Cross-Dialectal Perspective, ed. by Ana M. Carvalho, Rafael Orozco, and Naomi L. Shin, 121–142. Washington, DC: Georgetown University Press.
Rosenkvist, Henrik. 2009. “Null
Referential Subjects in Germanic Languages — An Overview.” Working
Papers in Scandinavian
Syntax 84: 151–180.
. 2018. “Null
Subject and Distinct Agreement in Modern
Germanic.” In Null Subjects in Generative
Grammar. A Synchronic and Diachronic Perspective, ed.
by Federica Cognola and Jan Casalincchio, 285–306. Oxford: Oxford University Press.
Same, Fahime. 2024. Referring
Expression Generation in Context. Combining Linguistic and Computational
Approaches. Berlin: Language Science Press.
Schiborr, Nils. N. 2015. “Multi-CAST
English.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Schiborr, Nils N. 2021. “MulticastR.
A Companion to the Multi-CAST Collection. R package version
2.0.0.” In Multi-CAST. Multilingual Corpus of
Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. [URL]
2023. Lexical
Anaphora. A Corpus-Based Typological Study of Referential
Choice. Bamberg: University of Bamberg Press.
2025. “Multi-CAST
Collection Overview.” In Multi-CAST:
Multilingual Corpus of Annotated Spoken Texts, ed.
by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Schiborr, Nils N., Stefan Schnell, and Hannah Thiele. 2018. RefIND
— Referent Indexing in Natural-language Discourse. Annotation guidelines
(v1.1). Bamberg: University of Bamberg.
Schnell, Stefan. 2015. “Multi-CAST
Vera’a.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
. 2018. “Whence
Subject-Verb Agreement? Investigating the Role of Topicality, Accessibility, and Frequency in Vera’a
Texts.” Linguistics 56 (4): 735–780.
. under
revision. “The Person Indexing Cycle. A Source-Oriented View from
Corpus-Based Typology.” Language and Linguistics
Compass.
Schnell, Stefan, and Danielle Barth. 2018. “Discourse
Motivations for Pronominal and Zero Objects Across Genres in
Vera’a.” Language Variation and
Change 30 (1): 51–81.
. 2020. “Expression
of Anaphoric Subjects in Vera’a. Functional and Structural Factors in the Choice between Pronoun and
Zero.” Language Variation and
Change 32 (3): 267–291.
Schnell, Stefan, Geoffrey Haig, Nils N. Schiborr, and Maria Vollmer. 2023. “Are
Referent Introductions Sensitive to Forward Planning in
Discourse?” In Discourse Phenomena in
Typological Perspective, ed. by Alessandra Barotto, and Simone Mattiola, 231–268. Amsterdam/Philadephia: John Benjamins.
Seifart, Frank, and Tai Hong. 2022. “Multi-CAST
Bora.” In Multi-CAST: Multilingual Corpus of
Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Seržant, Ilja, and George Moroz. 2022. “Universal
Attractors in Language Evolution Provide Evidence for the Kinds of Efficiency Pressures
Involved.” Humanities and Social Sciences
Communications 9 (58).
Shiohara, Asako. 2022. “Multi-CAST
Sumbawa.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Skopeteas, Stavros, Firmin Ahoua, Marie Laure Koffi Bla Adou, and Beatrice Koffi Mambo. 2024. “Resuming
Topics and Foci: Anyi, Baule and Microvariation in Kwa
Languages.” Journal of African Languages and
Linguistics 45 (1): 45–84.
Soares, Eduardo Correa, Philip Harold Miller, and Barbara Hemforth. 2020. “The
Effect of Semantic and Discourse Features on the Use of Null and Overt Subjects — A Quantitative Study
of Third Person Subjects in Brazilian Portuguese.” DELTA: Documentação
de Estudos em Lingüística Teórica e
Aplicada 36 (1): 1–38.
Stevenson, Rosemary J., Alistair Knott, Jon Oberlander, and Sharon McDonald. 2000. “Interpreting
Pronouns and Connectives. Interactions among Focussing, Thematic Roles, and Coherence
Relations.” Language and Cognitive
Processes 15 (3): 225–262.
Stoll, Sabine, and Balthasar Bickel. 2009. “How
Deep are Differences in Referential
Density?” In Crosslinguistic Approaches to
the Psychology of Language. Research in the Tradition of Dan Isaac Slobin, ed.
by Jiansheng Guo, Elena Lieven, Nancy Budwig, Susan Ervin-Tripp, Keiko Nakamura, and Şeyda Özçalışkan, 543–555. London: Psychology Press.
Thieberger, Nicholas, and Timothy Brickell. 2019. “Multi-CAST
Nafsan.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Tily, Harry, and Steven Piantadosi. 2007. “Refer
Efficiently. Use Less Informative Expressions for More Predictable
Meanings.” In Proceedings of the Workshop on
the Production of Referring Expressions: Bridging the Gap between Computational and Empirical
Approaches to Reference (PRE-CogSci 2009), ed. by Kees Van Deemter, Albert Gatt, Roger P. G. van Gompel, and Emiel Krahmer. Tilburg: University of Tilburg.
Torres Cacoullos, Rena, and Catherine E. Travis. 2019. “Variationist
Typology. Shared Probabilistic Constraints across (Non-)Null Subject
Languages.” Linguistics 57 (3): 653–692.
Visser, Eline. 2021. “Multi-CAST
Kalamang.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
. 2025. “Multi-CAST
Uruangnirin.” In Multi-CAST: Multilingual
Corpus of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
Vollmer, Maria. 2020. “Multi-CAST
Mandarin.” In Multi-CAST: Multilingual Corpus
of Annotated Spoken Texts, ed. by Geoffrey Haig, and Stefan Schnell. Bamberg: University of Bamberg.
. 2024. “Comparing
Zero and Referential Choice in Eight Languages with a Focus on Mandarin
Chinese.” Studies in
Language 48 (2): 351–389.
Von Heusinger, Klaus, and Petra B. Schumacher. 2019. “Discourse
Prominence: Definition and Application.” Journal of
Pragmatics 154: 117–127.
