In:The Documentarist Turn: From observable linguistic behaviour to typological generalizations
Edited by Sonja Riesberg, Uta Reinöhl and Birgit Hellwig
[Studies in Language Companion Series 240] 2026
► pp. 263–286
Chapter 11Reflections on literacy in language documentation
This content is being prepared for publication; it may be subject to changes.
Abstract
This chapter is concerned with improving our understanding of literacies through language
documentation. It addresses different types of writing practices and the use of documentation corpora in this regard.
While writing is often taken for granted for languages with a tradition in this modality, it can be seen as “intrusive
and alien” (Finnegan 2008: 22) in other contexts. Yet Himmelmann (cf. 2018: 36) points out that hardly any language community nowadays has not
encountered a written standard. Accordingly, this chapter argues to increase the efforts in documenting the rich
perspectives on and practices of literacy in linguistic communities worldwide. The rationale is that this may enhance
knowledge about literate structures in the languages documented as well as in linguistic theory.
Article outline
- 1.Introduction
- 2.Orality, literacy, and different ways of knowing
- 3.The rationale of a comprehensive documentation of writing practices
- 4.Different types of writing practices, examples and implications for their documentation
- 4.1Documenting existing writing practices
- 4.2Documenting writing practices that emerge through documentation
- 4.3Prospects of documenting writing practices that exist prior to or emerge through documentation
- 5.Conclusions
Acknowledgements Notes Abbreviations References
References (84)
Abdelhay, Ashraf, Yonas Mesfun Asfaha, and Kasper Juffermans. 2014. “African
Literacy Ideologies, Scripts and
Education.” In African Literacies. Ideologies,
Scripts, Education, ed. by Kasper Juffermans, Yonas Mesfun Asfaha, and Ashraf Abdelhay, 1–62. Newcastle upon Tyne: Cambridge Scholars.
Abercrombie, Amelia. 2018. “Language
Purism and Social Hierarchies: Making a Romani Standard in Prizren.” Language
in
Society 47 (5): 741–761.
Akinwalere, Ifedayo. 2023. “Indigenous
Language as a Factor for Sustainable
Development.” In Indigenous African Language
Media, ed. by Phillip Mpofu, Israel Ayinla Fadipe, and Thulani Tshabangu, 321–337 Singapore: Palgrave Macmillan.
Ameka, Felix K. 2015. “Unintended
Consequences of Methodological and Practical Responses to Language Endangerment in
Africa.” In Language Documentation and Endangerment
in Africa, ed. by James Essegbey, Brent Henderson, and Fiona McLaughlin, 1–15. (Culture
and Language Use: Studies in Anthropological Linguistics
17.) Amsterdam: Benjamins.
Barton, David, Mary Hamilton, and Roz Ivanič (eds.). 2000. Situated
Literacies. Theorising Reading and Writing in Context. (Literacies
1.) London: Routledge.
Bausi, Alessandro, Dmitry Bondarev, Nadine Bregler, Sara Chiarini, Giovanni Ciotti, Janine Droese, Eliana Dal Sasso, Michael Friedrich, Agnieszka Helman-Ważny, Michael Kohs, Leah Mascia, Ann Lauren Osthof, Malena Ratzke, Ondřej Škrabal, Szilvia Sövegjártó, Thies Staack, and Hanna Wimmer. 2023. Definition
of ‘Written Artefact’. (CSMC Occasional Paper 8.)
Bird, Steven. 2010. “A
Scalable Method for Preserving Oral Literature from Small
Languages.” In The Role of Digital Libraries in a
Time of Global Change, ed. by Gobinda Chowdhury, Chris Koo, and Jane Hunter, 5–14. Berlin: Springer.
Bondarev, Dmitry. 2021. “A
Typology of West African Ajami Manuscripts: Languages, Layout and Research
Perspectives.” In Exploring Written Artefacts:
Objects, Methods, and Concepts, ed. by Jörg B. Quenzer, 707–728. Berlin, Boston: De Gruyter Mouton.
. 2023. “Loose-leaf
Islamic Manuscripts of West Africa: Retention, Adaptation or
Invention?” In Tied and Bound: A Comparative View on
Manuscript Binding ed. by Alessandro Bausi, and Michael Friedrich, 285–299 Berlin, Boston: De Gruyter Mouton.
Bondarev, Dmitry, Nikolay Dobronravin, Darya Ogorodnikova, and Lameen Souag. 2019. “Identifying
Languages of Ajami Commentaries.” Zenodo.
Choudhary, Narayan, D. G. Rao. 2020. “The
LDC-IL Speech Corpora.” In Proceedings of the 23rd
Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech
Databases and Assessment Techniques, Yangon, Myanmar. 28–32.
Coulmas, Florian. 2013. Writing
and Society. An
Introduction. Cambridge: Cambridge University Press.
Courthiade, Marcel. 1985. “Between
Oral and Written Textuality: The Lila of the Young Romani Poets in
Kosovia.” Lacio
Drom 6: 2–20.
Dąbrowska, Ewa. 2020. “How
Writing Changes Language.” In Language Change: The
Impact of English as a Lingua Franca, ed. by Anna Mauranen, and Svetlana Vetchinnikova, 75–94. Cambridge: Cambridge University Press.
Eckert, Penelope. 2008. “Variation
and the Indexical Field.” Journal of
Sociolinguistics 12 (4): 453–476. . (Cited
after Zhao & Liu. 2021.)
Elbow, Peter. 2012. Vernacular
Eloquence: What Speech Can Bring to
Writing. Oxford: Oxford University Press.
Finnegan, Ruth. 2008. “Data
— but Data from What?” In Language Documentation and
Description 5: 13–28.
Foley, William A. 2003. “Genre, Register
and Language Documentation in Literate and Preliterate Communities.” Language
Documentation and
Description 1: 85–98.
Frye, Henrike, and Birgit Hellwig (eds.). 2022. Arluis araataing ngenamasiit [Children’s songs and
stories]. Plauen: Sachsendruck.
Good, Jeff. 2018. “Reflections
on the Scope of Language
Documentation.” In Reflections on Language
Documentation 20 Years after Himmelmann 1998, ed. by Bradley McDonnell, Andrea L. Berez-Kroeker, and Gary Holton, 13–21. (Language
Documentation and Conservation Special Publication
15.) Honolulu: University of Hawai’i Press. [URL]
Grenoble, Lenore A. & Lindsay J. Whaley. 1998. “Toward
a Typology of Language Endangerment.” In Endangered
Languages: Language Loss and Community Response, ed. by Lenore A. Grenoble and Lindsay J. Whaley. Cambridge: Cambridge University Press. 22–54.
Gutiérrez-Fresneda, Raúl, Teresa Pozo Rico, María José García Tárraga, and Elena Jiménez-Pérez. 2024. “Effect
of the Development of Precursor Skills for Literacy in Spanish on Learning to Read in
English.” The Journal of Educational
Research 117 (5): 284–293.
Handoko, Handoko. 2024. “Developing
the Corpus of Minangkabau Language: Insights, Challenges, and Future
Directions.” Jurnal
Arbitrer 11 (3): 413–429.
Hauk, Bryn. 2022. “Review
of Creating Orthographies for Endangered Languages, ed.
by Mari C. Jones, and Damien Mooney.” Language
Documentation and
Conservation 16: 25–31. [URL]
Heller, Monica. 2009. “Media,
the State and Linguistic Authority.” In Language
Ideologies and Media Discourse: Texts, Practices, Politics, ed.
by Sally Johnson, and Tommaso M. Milani, 277–282. London: Bloomsbury Academic. (Cited after Zhao & Liu. 2021.)
Hellwig, Birgit. 2019. A
Grammar of Qaqet. (Mouton Grammar Library, 79). Berlin, Boston: De Gruyter Mouton.
Hellwig, Birgit, and Dagmar Jung. 2020. “Child-directed
Language — and how it Informs the Documentation and Description of the Adult
Language.” Language Documentation and
Conservation 14: 188–214. [URL]
Himmelmann, Nikolaus P. 1998. “Documentary
and Descriptive
Linguistics.” Linguistics 36 (1): 161–195.
2006a. “The
Challenges of Segmenting Spoken
Language.” In Essentials of language
documentation, ed. by Jost Gippert, Nikolaus P. Himmelmann, and Ulrike Mosel, 253–274. Berlin, New York: De Gruyter Mouton.
2006b. “Language
Documentation: What is it and What is it Good
for?” In Essentials of language
documentation, ed. by Jost Gippert, Nikolaus P. Himmelmann, and Ulrike Mosel, 1–30. Berlin, New York: De Gruyter Mouton.
2012. “Linguistic
Data Types and the Interface between Language Documentation and
Description.” Language Documentation and
Conservation 6: 187–207. [URL]
2018. “Meeting the
Transcription Challenge.” In Reflections on Language
Documentation. 20 Years after Himmelmann 1998, ed. by Bradley McDonnell, Andrea L. Berez-Kroeker, and Gary Holton, 33–40. (Language
Documentation and Conservation Special Publication
15.) Honolulu: University of Hawai’i Press. [URL]
Jany, Carmen. 2017. “The
Role of New Technology and Social Media in Reversing Language Loss.” Speech,
Language and
Hearing 21 (2): 73–76.
Jones, Mari C., and Damien Mooney (eds.). 2020. Creating
Orthographies for Endangered
Languages. Cambridge: Cambridge University Press.
Klamer, Marian, and Francesca R. Moro. 2020. “What
is ‘Natural’ Speech? Comparing Free Narratives and Frog Stories in
Indonesia.” Language Documentation and
Conservation 14: 238–313. [URL]
Koch, Peter, and Wulf Oesterreicher. 1985. “Sprache
der Nähe — Sprache der Distanz. Mündlichkeit und Schriftlichkeit im Spannungsfeld von Sprachtheorie und
Sprachgeschichte.” Romanistisches
Jahrbuch 36 (1): 15–43.
Leroy, Lieve Mieke R., Van Phuc Le, Wouter Boesman, and Filip Lenaerts. 2021. “Talking
With Children in Multilingual Preschools in Central Vietnam.” Childhood
Education 97 (4): 24–33.
Levinson, Martin P. 2007. “Literacy in
English Gypsy Communities: Cultural Capital Manifested as Negative
Assets.” American Educational Research
Journal 44 (1): 5–39.
Lewis, M. Paul. 2023. “Promoting
Linguistic Human Rights Through Language
Documentation.” In The Handbook of Linguistic Human
Rights, ed. by Tove Skutnabb-Kangas, and Robert Phillipson, 613–621. Hoboken: Wiley Blackwell.
Lewis, M. Paul, and Gary F. Simons. 2016. Sustaining
Language Use: Perspectives on Community-Based Language
Development. Dallas: SIL International.
Lidubwi, Jackline U., and John O. Ndavula. 2024. “Revitalising
Endangered Languages through Social Media. A Case Study of Olunyore Language Preservation through Facebook in
Kenya.” In Decolonising Digital Media and
Indigenisation of Participatory Epistemologies, ed. by Fulufhelo Oscar Makananise, and Shumani Eric Madima, 163–174. London: Routledge.
Lüpke, Friederike. 2011. “Orthography
Development.” In The Cambridge Handbook of Endangered
Languages, ed. by Peter K. Austin, and Julia Sallabank, 312–336. (Cambridge
Handbooks in Language and
Linguistics.) Cambridge: Cambridge University Press.
. 2017. “Escaping
the Tyranny of Writing: West African Regimes of Writing as a Model for Multilingual
Literacy.” In Tyranny of Writing: Ideologies of the
Written Word, ed. by Constanze Weth, and Kasper Juffermans, 129–148. London: Bloomsbury Academic.
Maas, Utz. 2010. “Literat
und Orat. Grundbegriffe der Analyse geschriebener und Gesprochener
Sprache.” Grazer Linguistische
Studien 73: 21–150.
Macht, Alexandra. 2018. “From
Romanian ‘Soul’ to English ‘Heart’: Dilemmas of Cultural and Gender Representation in Translating Qualitative
Data.” Forum Qualitative
Sozialforschung 19 (2): Article
12.
Maitz, Péter, and Stephan Elspaß. 2011. “Zur
Sozialen und Sprachpolitischen Verantwortung der
Variationslinguistik.” In Dynamik des Dialekts —
Wandel und Variation, ed. by Elvira Glaser, Jürgen E. Schmidt, and Natascha Frey, 221–240. Stuttgart: Steiner.
Mattheier, Klaus J. 2003. “German.” In Germanic
Standardizations: Past to Present, ed. by Ana Deumert, 211–244. Amsterdam: Benjamins.
Mbodj-Pouye, Aïssatou. 2013. “Writing
the Self in Rural Mali: Domestic Archives and Genres of Personal
Writing.” Africa: The Journal of the International African
Institute 83: 205–226.
Milroy, James. 2001. “Language
Ideologies and the Consequences of Standardization.” Journal of
Sociolinguistics 5 (4): 530–555.
Minhas, Shahid, and Abiodun Salawu. 2024. “Preserving
and Promoting Indigenous Languages: Social Media Analysis of Punjabi and Setswana
Languages.” Journal of Asian and African
Studies 60 (7): 4109–4126.
Morey, Stephen. 2015. “Metadata
and Endangered Archives: Lessons from the Ahom Manuscripts
Project.” In From Dust to Digital: Ten Years of the
Endangered Archives Programme, ed. by Maja Kominko, 31–66. Cambridge: Open Book Publishers.
Mosel, Ulrike. 2015. “Putting
Oral Narratives into Writing — Experiences from a Language Documentation Project in Bouganville, Papua New
Guinea.” In Language Contact and Documentation —
Contacto lingüístico y documentación, ed. by Bernard Comrie, and Lucía Golluscio, 321–342. Berlin, Boston: De Gruyter Mouton.
Muradoglu, Saliha, Hanna Suominen, and Nicholas Evans. 2023. “A
Quest for Paradigm Coverage: The Story of
Nen.” In Proceedings of the Second Workshop on NLP
Applications to Field Linguistics, Dubrovnik, Croatia. 74–85.
Ong, Walter J. 2002 [1982]. Orality and
Literacy. The Technologizing of the
Word. London: Routledge.
Parker, James, and Diana Parker. 1974. “A
Tentative Phonology of Baining (Kakat
Dialect).” In Phonologies of four Papua New Guinea
Languages, ed. by Richard Loving, 5–43. (Workpapers
in Papua New Guinea Languages
4.) Ukarumpa: Summer Institute of Linguistics. (Cited after Hellwig. 2019.)
Piper, Benjamin, Stephanie Simmons Zuilkowski, Dunston Kwayumba, and Arbogast Oyanga. 2018. “Examining
the Secondary Effects of Mother-tongue Literacy Instruction in Kenya: Impacts on Student Learning in English,
Kiswahili, and Mathematics.” International Journal of Educational
Development 59: 110–127.
Reetz, Steffen. 2022. Code-Switching
among Qaqet-Speaking Adults in Kamanakam, East New Britain Province, Papua New
Guinea. PhD thesis, University of Cologne. [URL]
Riyanto, Edi Dwi, Syahrur Marta Dwisusilo, Shaffarullah Bin Abdul Rahman, and Nadya Afdholy. 2023. “Tengger
Communities Creativity and Language Preservation in Text-Speak on the WhatsApp Social Media
Platform.” International Journal of
Anthropology 38 (3–4): 263–279.
Schippling, Melanie. 2026. Speaking
of Writing Romani: Language Attitudes, Text Editing, and Variability. (IMPACT: Studies
in Language, Culture and Society
56.) Amsterdam: Benjamins.
Schlüter, Julia, and Katharina Deckert. 2025. “Will
AI-Assisted Writing Streamline Non-Dominant National Standard Englishes? A Systematic Case Study of
Prepositional Variation.” Presentation, Corpus
linguistics 2040: Which data, which methods, which models?, Leibniz-Institut für Deutsche Sprache
(IDS), Mannheim, 10–11 July
2025.
Schroeder, Christoph. 2014. “Türkische
Texte deutsch-türkisch bilingualer Schülerinnen und Schüler in
Deutschland.” Zeitschrift für Literaturwissenschaft und
Linguistik 44: 24–43.
Schultze-Berndt, Eva. 2006. “Linguistic
Annotation.” In Essentials of Language
Documentation, ed. by Jost Gippert, Nikolaus P. Himmelmann, and Ulrike Mosel, 213–252. Berlin, New York: De Gruyter Mouton.
Sebba, Mark. 2009. “Sociolinguistic
Approaches to Writing Systems Research.” Writing Systems
Research 1 (1): 35–49.
Seifart, Frank. 2006. “Orthography
development.” In Essentials of Language
Documentation, ed. by Jost Gippert, Nikolaus P. Himmelmann, and Ulrike Mosel, 275–300. Berlin, New York: De Gruyter Mouton.
. 2008. “On
the Representativeness of Language
Documentations.” In Language Documentation and
Description 5: 60–76. London: SOAS.
Seifart, Frank, Nicholas Evans, Harald Hammarström, and Stephen C. Levinson. 2018. “Language
Documentation Twenty-five Years
on.” Language 94: e324–e345.
Shan, Xinyang, Yuanyuan Xu, Yining Wang, Yin-Shan Lin, Yunshi Bao. 2025. “Cross-Cultural
Implications of Large Language Models: An Extended Comparative
Analysis.” In HCI International 2024 — Late Breaking
Papers. HCII 2024. Lecture Notes in Computer
Science, Vol. 15375, ed.
by Adela Coman, Simona Vasilache, Fiona Fui-Hoon Nah, Keng Leng Siau, June Wei, and George Margetis, 106–118. Cham: Springer.
Spencer, Piyapath T. 2024. “Documenting
Endangered Languages with LangDoc: A Wordlist-Based System and A Case Study on
Moklen.” In Proceedings of the Third Workshop on NLP
Applications to Field Linguistics, Bangkok, Thailand. 28–36.
Taylor, Stephen, and Marisa von Fintel. 2016. “Estimating
the Impact of Language of Instruction in South African Primary Schools: A Fixed Effects
Approach.” Economics of Education
Review 50: 75–89.
Tobar, Edoardo. 2019. “Documenting
Online Writing Practices: The Case of Nominal Plural Marking in Zamboanga
Chabacano.” In Documentation and Maintenance of
Contact Languages from South Asia to East Asia, ed. by Mário Pinharanda-Nunes, and Hugo Cardoso, 141–173. (Language
Documentation and Conservation Special Publication
19.) Honolulu: University of Hawai’i Press. [URL]
Toninato, Paola. 2014. Romani
Writing. Literacy, Literature and Identity Politics. (Routledge Research in Literacy
4.) New York: Routledge.
Van Wyk, Jacolynn, and Maria Louise Mostert. 2016. “The
Influence of Mother Tongue and Gender on the Acquisition of English (L2). The Case of Afrikaans in Windhoek
Schools, Namibia.” Cogent
Education 3 (1), 1–12.
Voßkamp, Patrick. 2010. Sprechen,
um zu schreiben. Mündliche Kommunikation im Lokaljournalismus. (ESS-KuLtur
2.) Duisburg: Universitätsverlag Rhein-Ruhr.
Walker, Katherine, Birgit Hellwig, and Dagmar Jung. This volume. “Transcription practices and (the documentation of metalinguistic knowledge).” In The Documentarist Turn: From Observable Linguistic Behaviour to Typological Generalizations, ed. by Sonja Riesberg, Uta Reinöhl, and Birgit Hellwig. Amsterdam: John Benjamins.
Walsh, Olivia. 2021. “Introduction:
In the Shadow of the Standard. Standard Language Ideology and Attitudes towards ‘Non-Standard’ Varieties and
Usages.” Journal of Multilingual and Multicultural
Development 42 (9): 773–782.
Witt, Roxanna-Lorraine. 2021. “Gadjé-Rassismus.
Ein analytischer Perspektivwechsel auf Kontinuitäten menschenfeindlicher Ideologien in weißer Kultur und
Identität.” In Rassismus. Macht. Vergessen. Von
München über den NSU bis Hanau: Symbolische und materielle Kämpfe entlang rechten
Terrors, ed. by Onur Suzan Nobrega, Matthias Quent, and Jonas Zipf, 125–144. Bielefeld: transcript.
Yang, Jianliang, Xiya Zhang, Kai Liang, Yuenan Liu. 2023. “Exploring
the Application of Large Language Models in Detecting and Protecting Personally Identifiable Information in
Archival Data: A Comprehensive Study.” In Proceedings
of the 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy. 2116–2123.
