In:Advances in Sign Language Corpus Linguistics
Edited by Ella Wehrmeyer
[Studies in Corpus Linguistics 108] 2023
► pp. 287–308
Chapter 11W(h)ither the ASL corpus?
Considering trends in signed corpus development
Published online: 3 April 2023
https://doi.org/10.1075/scl.108.11hoc
https://doi.org/10.1075/scl.108.11hoc
Abstract
In this chapter, we discuss logistic and ideological practices used by signed language corpus projects around the world to develop their corpora, starting with Johnston’s early call to develop signed language corpora in 2004. We then outline a brief history of existing American Sign Language (ASL) corpora or corpus-like collections, most of which are specialized and/or inaccessible. We unpack the lessons we have learned as we, North American linguists interested in signed language research and corpora, have sought to create a national-level corpus following the standards set by earlier projects. We conclude that our contexts are somewhat different, and describe projects that we think would be better suited for our ASL communities. In other words, though our dreams of having a single national-level ASL corpus have ‘withered’, we remain optimistic that corpus methods can be used to catalog and analyze the wealth of ASL signing data currently available.
Article outline
- 1.Introduction
- 2.Why ‘w(h)ither’?
- 3.The US context
- 3.1Cultural barriers to corpus work in the US
- 3.2Systemic barriers to corpus work in the US
- 3.3Material barriers to corpus work in the US
- 4.Attempts to establish North American ASL corpora
- 4.1Specialized corpora for ASL
- 4.2Existing corpus-like ASL collections
- 4.3Development of tools that work around the lack of corpus data
- 5.Considering and problematizing some signed language corpus concepts and practices
- 5.1‘Native’ signer bias
- 5.2Defining the limits of ASL
- 5.3‘Naturalistic’ data is authentic
- 6.Whither the development of an ASL corpus?
- 7.Going forward
Notes References
References (73)
Bellugi, Ursula & Fischer, Susan. 1972. A comparison of sign language and spoken language. Cognition 1(2): 173–200.
Berez-Kroeker, Andrea, Andreassen, Helene, Gawne, Lauren, Holton, Gary, Smythe Kung, Susan, Pulsifer, Peter, Collister, Lauren, The Data Citation and Attribution in Linguistics Group & the Linguistics Data Interest Group. 2018. The Austin principles of data citation in linguistics. Version 1.0. <[URL]> (30 June 2022).
Bragg, Danielle, Caselli, Naomi, Hochgesang, Julie, Huenerfauth, Matt, Katz-Hernandez, Leah, Koller, Oscar, Kushalnagar, Raja, Vogler, Christian & Ladner, Richard. 2021. The FATE landscape of sign language AI datasets: An interdisciplinary perspective. ACM Transactions on Accessible Computing 14(2): 1–45. (Special Issue: AI Fairness and People with Disabilities).
Bragg, Danielle, Koller, Oscar, Bellard, Mary, Berke, Larwan, Boudreault, Patrick, Braffort, Annelies & Caselli, Naomi. 2019. Sign language recognition, generation, and translation: An interdisciplinary perspective. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility, 16–31. New York NY: Association for Computing Machinery.
Branson, Jan & Miller, Don. 2006. Beyond ‘language’: Linguistic imperialism, sign languages and linguistic anthropology. In Disinventing and Reconstituting Languages, Sinfree Makoni & Alastair Pennycook (eds), 116–134. Bristol: Multilingual Matters.
. 2008. National sign languages and language policies. In Encyclopedia of Language and Education, Nancy H. Hornberger (ed.), 151–165. Dordrecht: Springer.
Bucholtz, Mary. 2003. Sociolinguistic nostalgia and the authentication of identity. Journal of Sociolinguistics 7(3): 398–416.
Caselli, Naomi K., Sevcikova Sehyr, Zed, Cohen-Goldberg, Ariel M. & Emmorey, Karen. 2016. ASL-LEX: A lexical database of American Sign Language. Behavior Research Methods 49(2): 784–801.
Charity Hudley, Anne H. & Mallinson, Christine. 2018. Dismantling ‘the master’s tools’. American Speech 93(3–4): 513–537.
Cheng, Lauretta, Burgess, Danielle, Vernooij, Natasha, Solís-Barroso, Cecilia, McDermott, Ashley & Namboodiripad, Savithry. 2021. The problematic concept of native speaker in psycholinguistics: replacing vague and harmful terminology with inclusive and accurate measures. Frontiers in Psychology 12(September): 715843.
Chen Pichler, Deborah, Hochgesang, Julie A., Simons, Doreen & Lillo-Martin, Diane. 2016. Community input on re-consenting for data sharing. In Proceedings of the 7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining (LREC 2016), Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Jette Kristoffersen, & Johanna Mesch (eds), 29–34. Paris: ELRA. <[URL]> (13 October 2022).
Costello, Brendan, Fernández, Javier & Landa, Alazne. 2008. The non-(existent) native signer: Sign language research in a small deaf population. In Sign Languages: Spinning and Unraveling the Past, Present and Future. TISLR9, Forty Five Papers and Three Posters from the 9th. Theoretical Issues in Sign Language Research Conference, Florianopolis, Brazil, December 2006, Ronice M. de Quadros (ed.), 77–94. <[URL]> (30 June 2022).
Cox, Christopher. 2011. Corpus linguistics and language documentation: Challenges for collaboration. In Corpus-Based Studies in Language Use, Language Learning, and Language Documentation [Language & Computers 73], John Newman, Harald Baayen & Sally Rice (eds), 239–264. Leiden: Brill.
Crasborn, Onno, Mesch, Johanna, Waters, Dafydd, Nonhebel, Annika, Van der Kooij, Els, Woll, Bencie & Bergman, Brita. 2007. Sharing sign language data online: Experiences from the ECHO project. International Journal of Corpus Linguistics 12(4): 535–562.
Crasborn, Onno, & Sloetjes, Han. 2008. Enhanced ELAN functionality for sign language corpora. In Proceedings of the 3rd Workshop on the Representation and Processing of Sign Languages: Construction and Exploitation of Sign Language Corpora (LREC 2008), Onno Crasborn, Thomas Hanke, Eleni Efthimiou, Inge Zwitserlood & Ernst Thoutenhoofd (eds), 39–43. Paris: ELRA. <[URL]> (27 March 2022).
Crasborn, Onno, Zwisterlood, Inge & Ros, Johan & Van Kampen, Annemieke. 2008. Corpus NGT. An open access digital corpus of movies with annotations of Sign Language of the Netherlands (video corpus). <[URL]> (30 June 2022).
De Meulder, Maartje, Krausneker, Verena, Turner, Graham & Conama, John. 2019. Sign language communities. In The Palgrave Handbook of Minority Languages and Communities, Gabrielle Hogan-Brun & Bernadette O’Rourke (eds), 207–232. London: Palgrave Macmillan.
De Meulder, Maartje, Kusters, Annelies, Moriarty, Erin & Murray, Joseph. 2019. Describe, don’t prescribe. The practice and politics of translanguaging in the context of deaf signers. Journal of Multilingual and Multicultural Development 40(10): 892–906.
Dobrin, Lise M., Austin, Peter K. & David, Nathan. 2007. Dying to be counted: The commodification of endangered languages in documentary linguistics. In Language Documentation and Description, Vol. 6, Peter K. Austin (ed.), 37–52. London. SOAS.
Dudis, Paul G., Hochgesang, Julie A., Shaw, Emily & Villanueva, Miako. 2020. Introduction to ‘Motivated Look at Indicating Verbs in ASL (MoLo)’ Project. Paper at the HDLS14, Virtual Conference, November.
Dudis, Paul G., Mathur, Gaurav & Mirus, Gene. 2009. Bringing a corpus in line with Deaf communities’ perspectives. Paper presented at the Sign Language Corpora: Linguistic Issues, DCAL, UCL London, July 24. <[URL]> (30 June 2022).
Emmorey, Karen. 2001. Language, Cognition, and the Brain: Insights from Sign Language Research. New York NY: Psychology Press.
Fenlon, Jordan & Hochgesang, Julie A. 2022. Introduction to signed language corpora. In Signed Language Corpora, Jordan Fenlon & Julie A. Hochgesang (eds), 1–17. Washington DC: Gallaudet University Press.
Fenlon, Jordan, & Erin Wilkinson. 2015. Sign languages in the world. In Sociolinguistics and Deaf Communities, Adam Schembri & Ceil Lucas (eds), 5–28. Cambridge: CUP.
Fisher, Jami, Hochgesang, Julie & Tamminga, Meredith. 2016. Examining variation in the absence of a ‘main’ ASL corpus: The case of the Philadelphia Signs Project. In Proceedings of the 7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining (LREC 2016), Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie Hochgesang, Jette Kristoffersen & Johanna Mesch (eds), 75–80. Paris: ELRA. <[URL]> (30 June 2022)
Fisher, Jami, Tamminga, Meredith & Hochgesang, Julie. 2020. Philly signs. Philadelphia Sign Project. March 2020. <[URL]> (30 June 2022).
Harris, Raychelle, Holmes, Heidi M. & Mertens, Donna M. 2009. Research ethics in sign language communities. Sign Language Studies 9(2): 104–131.
Henner, Jon & Robinson, Octavian. 2021. Unsettling languages, unruly bodyminds: Imaging a crip linguistics.
Hill, Joseph C. 2017. The importance of the sociohistorical context in sociolinguistics: The case of Black ASL. Sign Language Studies 18(1): 41–57.
Hill, Joseph C. & McCaskill, Carolyn. 2016. Reflections on the Black ASL Project. Sign Language Studies 17(1): 59–63.
Hochgesang, Julie A. 2020. SLAASh ID glossing principles, ASL Signbank and annotation conventions, Version 3.0. figshare.
2022. Managing sign language acquisition video data: A personal journey in the organization and representation of signed data. In The Open Handbook of Linguistic Data Management, Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller & Lauren B. Collister (eds), 367–383. Cambridge MA: MIT Press Direct.
Hochgesang, Julie A., Willow, Nic, Treviño, Rafael & Shaw, Emily. 2019. Gallaudet University documentation of ASL (GUDA) – Whither a corpus for ASL? Poster presented at Theoretical Issues of Sign Language Research (TISLR) 13, University of Hamburg, Germany, September 28.
Hornberger, Nancy H., De Korne, Haley & Weinberg, Miranda. 2016. Ways of talking (and acting) about language reclamation: An ethnographic perspective on learning Lenape in Pennsylvania. Journal of Language, Identity & Education 15(1): 44–58.
Hou, Lynn, Lepic, Ryan & Wilkinson, Erin. 2022. Managing sign language video data collected from the internet. In The Open Handbook of Linguistic Data Management, Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller & Lauren B. Collister, 471–480. Cambridge MA: MIT Press Open. .
Janzen, Terry. 2017. Composite utterances in a signed language: Topic constructions and perspective-taking in ASL. Cognitive Linguistics 28(3): 511–538.
. 2018. Know and understand in ASL: A usage-based study of grammaticalized topic constructions. In Functionalist and Usage-Based Approaches to the Study of Language: In Honor of Joan L. Bybee [Studies in Language Companion Series 192], K. Aaron Smith & Dawn Nordquist (eds), 59–97. Amsterdam: John Benjamins.
. 2019. Shared spaces, shared mind: Connecting past and present viewpoints in American Sign Language narratives. Cognitive Linguistics 30(2): 253–279.
Johnston, Trevor. 2004. W(h)ither the Deaf community? Population, genetics, and the future of Australian Sign Language. American Annals of the Deaf 148(5): 358–375. .
. 2008. Auslan corpus. <[URL]> (30 June 2022).
. 2010. From archive to corpus: Transcription and annotation in the creation of signed language corpora. International Journal of Corpus Linguistics 15(1): 106–131. .
Johnston, Trevor & Schembri, Adam. 2012. Corpus analysis of sign languages. In The Encyclopedia of Applied Linguistics, Carol Chapelle (ed.), online. Hoboken NJ: John Wiley & Sons.
Joze, Hamid, Vaezi, Reza & Koller, Oscar. 2018. MS-ASL: A large-scale data set and benchmark for understanding American Sign Language. <[URL]> (30 June 2022).
Konrad, Reiner, Hanke, Thomas, Langer, Gabriele, Blanck, Dolly, Bleicken, Julian, Hofmann, Ilona & Jeziorski, Olga. 2020. MEINE DGS – Annotiert. Öffentliches Korpus Der Deutschen Gebärdensprache, 3. Release / MY DGS – Annotated. Public Corpus of German Sign Language, 3rd Release. Universität Hamburg.
Kusters, Annelies, & Sahasrabudhe, Sujit. 2018. Language ideologies on the difference between gesture and sign. Language & Communication 60(May): 44–63.
Leahy, Anne & Brown, Peter R. 2020. Naming British Sign Language 1960–1975. Sign Language Studies 20(4): 691–698.
Leonard, Wesley Y. 2018. Reflections on (de)colonialism in language documentation. In Reflections on Language Documentation 20 Years after Himmelmann 1998 [Language Documentation & Conservation Special Publication No. 15], Bradley McDonnell, Andrea L. Berez-Kroeker & Gary Holton (eds), 55–65. Honolulu HI: University of Hawai’i.
Lewis, M. Paul & Simons, Gary F. 2010. Assessing endangerment: Expanding Fishman’s GIDS. Revue Roumaine de Linguistique LV(2): 103–20.
Lillo-Martin, Diane, & Chen Pichler, Deborah. 2008. Development of sign language acquisition corpora. In Proceedings of the 3rd Workshop on the Representation and Processing of Sign Languages: Construction and Exploitation of Sign Language (LREC 2008), Onno Crasborn, Thomas Hanke, Eleni Efthimiou, Inge Zwitserlood & Ernst Thoutenhoofd (eds), 129–133. Paris: ELRA. <[URL]> (30 June 2022).
Lucas, Ceil, Bayley, Robert & Valli, Clayton. 2001. Sociolinguistic Variation in American Sign Language [Sociolinguistics in Deaf Communities 7]. In collaboration with Mary Rose, Alyssa Wulf, Paul Dudis, Susan Schatz & Laura Sanheim. Washington DC: Gallaudet University Press.
Lu, Pengfei, & Huenerfauth, Matt. 2014. Collecting and evaluating the CUNY ASL Corpus for research on American Sign Language animation. Computer Speech & Language 28(3): 812–831.
Mayberry, Rachel I., Hall, Matthew L. & Zvaigzne, Meghan. 2014. Subjective frequency ratings for 432 ASL signs. Behavior Research Methods 46(2): 526–539.
McCaskill, Carolyn, Lucas, Ceil, Bayley, Robert & Hill, Joseph. 2011. The Hidden Treasure of Black ASL: Its History and Structure. In collaboration with Roxanne Dummett, Pamela Baldwin & Randall Hogue. Washington DC: Gallaudet University Press.
McEnery, Tony & Hardie, Andrew. 2011. Corpus Linguistics: Method, Theory and Practice [Cambridge Textbooks in Linguistics]. Cambridge: CUP.
Meurant, Laurence. 2015. Corpus LSFB. First digital open access corpus of movies and annotations of French Belgian Sign Language (LSFB). [URL].
Michaels, John. 1923. A Handbook of the Sign Language of the Deaf. Atlanta GA: Home Mission Board Southern Baptist Convention.
Morford, Jill & MacFarlane, James. 2003. Frequency characteristics of American Sign Language. Sign Language Studies 3(2): 213–225.
Murray, Joseph J. 2017. Academic and community interactions in the formation of Deaf Studies in the United States. In Innovations in Deaf Studies: The Role of Deaf Scholars, Annelies Kusters, Maartje De Meulder & Dai O’Brien (eds), 77–100. Oxford: OUP.
Neidle, Carol, Thangali, Ashwin & Sclaroff, Stan. 2012. Challenges in development of the American Sign Language Lexicon Video Dataset (ASLLVD) corpus. In Proceedings of the 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon (LREC 2012), Onno Crasborn, Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Jette Kristoffersen & Johanna Mesch (eds), 143–150. Paris: ELRA. <[URL]> (30 June 2022).
Occhino, Corrine, Fisher, Jami, Hill, Joseph, Hochgesang, Julie A., Shaw, Emily & Tamminga, Meredith. 2021. Report on on-going research: New trends in ASL variation documentation. Sign Language Studies 21(3): 350–377.
Pennycook, Alistair & Otsuji, Emi. 2016. Lingoing, language labels and metrolingual practices. Applied Linguistics Review 7(3): 259–277.
Rosa, Jonathan & Flores, Nelson. 2017. Unsettling race and language: Toward a raciolinguistic perspective. Language in Society 46(5): 621–647.
Roush, Daniel & Schilling, Amy. 2021. The Dennis Cokely American Freedom Speeches Parallel Corpus (1.0). Eastern Kentucky University Libraries. <[URL]> (30 June 2022).
Russell, Kevin, Wilkinson, Erin & Janzen, Terry. 2011. ASL sign lowering as undershoot: A corpus study. Laboratory Phonology 2(2).
Rutkowski, Paweł, Kuder, Anna, Filipczak, Joanna, Mostowski, Piotr, Łacheta, Joanna & Łozińska, Sylwia. 2017. The design and compilation of the Polish Sign Language (PJM) Corpus. In Different Faces of Sign Language Research, Paweł Rutkowski (ed.), 125–151. Warsaw: University of Warsaw, Faculty of Polish Studies.
Schembri, Adam, Fenlon, Jordan, Rentelis, Ramas, Sally, Reynolds & Cormier, Kearsy. 2013. Building the British Sign Language corpus. Language Documentation & Conservation 7: 136–154.
Schütze, Carson T., & Sprouse, Jon. 2013. Judgment data. In Research Methods in Linguistics, Robert Podesva & Devyani Sharma (eds), 27–50. Cambridge: CUP.
Sehyr, Zed S., Caselli, Naomi, Cohen-Goldberg, Ariel M. & Emmorey, Karen. 2021. The ASL-LEX 2.0 project: A database of lexical and phonological properties for 2,723 signs in American Sign Language. Journal of Deaf Studies and Deaf Education 26(2): 263–277.
Simms, Laurene, Moore, Elizabeth, Lawyer, Gloshanda, Harrison, Dominic, Rouse, Jenelle, McCaskill, Carolyn, Moges-Riedel, Rezenet, Hill, Joseph & McKay-Cody, Melanie. 2021. Is ASL too White? Zoom lecture presented at the Chief Bilingual Office and Chief Diversity Office Lecture Series, Gallaudet, February 17. <[URL]> (30 June 2022).
Supalla, Ted & Clark, Patricia. 2015. Sign Language Archaeology: Understanding the Historical Roots of American Sign Language. Washington DC: Gallaudet University Press.
Wilson, Robin. 2012. Scholarly publishing’s gender gap. Chronicle of Higher Education (October). <[URL]> (30 June 2022).
Cited by (2)
Cited by two other publications
Hou, Lynn & Savithry Namboodiripad
This list is based on CrossRef data as of 1 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
