Article published In: Interaction and Iconicity in the Evolution of Language:
Edited by Stefan Hartmann, Michael Pleyer, James Winters and Jordan Zlatev
[Interaction Studies 18:3] 2017
► pp. 376–401
Debunking two myths against vocal origins of language
Language is iconic and multimodal to the core
Published online: 8 December 2017
https://doi.org/10.1075/is.18.3.05per
https://doi.org/10.1075/is.18.3.05per
Gesture-first theories of language origins often raise two unsubstantiated arguments against vocal origins. First, they argue that great ape vocal behavior is highly constrained, limited to a fixed, species-typical repertoire of reflexive calls. Second, they argue that vocalizations lack any significant potential to ground meaning through iconicity, or resemblance between form and meaning. This paper reviews the considerable evidence that debunks these two “myths”. Accumulating evidence shows that the great apes exercise voluntary control over their vocal behavior, including their breathing apparatus, larynx, and supralaryngeal articulators. They are also able to learn new vocal behaviors, and even show some rudimentary ability for vocal imitation. In addition, an abundance of research demonstrates that the vocal modality affords rich potential for iconicity. People can understand iconicity in sound symbolism, and they can produce iconic vocalizations to communicate a diverse range of meanings. Thus, two of the primary arguments against vocal origins theories are not tenable. As an alternative, the paper concludes that the origins of language – going as far back as our last common ancestor with great apes – are rooted in iconicity in both gesture and vocalization.
Keywords: ape vocal behavior, iconicity, language evolution, vocalization, multimodality
Article outline
- 1.Introduction: Gestures versus vocalizations
- 2.Myth 1: Great ape vocal behavior is fixed and reflexive
- 2.1Voluntary control of vocal behavior
- 2.2Learning in vocal behavior
- 3.Myth 2: Vocalizations cannot be iconic
- 3.1Sound symbolism
- 3.2Iconic vocalizations
- 4.Conclusion: Language is iconic and multimodal to the core
- Notes
References
References (117)
Ackermann, H., Hage, S. R., & Ziegler, W. (2014). Brain mechanisms of acoustic communication in humans and nonhuman primates: An evolutionary perspective. Behavioral and Brain Sciences, 371, 529–604.
Ahlner, F., & Zlatev, J. (2011). Cross-modal iconicity: A cognitive semiotic approach to sound symbolism. Sign Systems Studies, 381, 298–348.
Arbib, M. A., Liebal, K., & Pika, S. (2008). Primate vocalization, gesture, and the evolution of human language. Current Anthropology, 491, 1053–1076.
Armstrong, D. F., & Wilcox, S. E. (2007). The gestural origin of language. Oxford ; New York: Oxford University Press.
Bender, R., & Bender, N. (2013). Brief communication: Swimming and diving behavior in apes (Pan troglodytes and Pongo pygmaeus): First documented report: Swimming and Diving behavior in Apes. American Journal of Physical Anthropology, 152(1), 156–162.
Bentley, M., & Varon, E. J. (1933). An accessory study of phonetic symbolism. American Journal of Psychology, 451, 76–86.
Blackwell, N. L., Perlman, M., & Fox Tree, J. E. (2015). Quotation as a multimodal construction. Journal of Pragmatics, 811, 1–7.
Blasi, D. E., Wichmann, S., Hammarstrom, H., Stadler, P. F., & Christiansen, M. H. (2016). Sound-meaning association biases evidenced across thousands of languages. Proceedings of the
National Academy of Sciences
.1131, 10818–10823.
Bonvillian, J. D., Orlansky, M. D., & Novack, L. L. (1983). Developmental milestones: Sign language acquisition and motor development. Child Development, 541, 1435–1445.
Bremner, A. J., Caparos, S., Davidoff, J., de Fockert, J., Linnell, K. J. & Spence, C. (2013) ‘‘Bouba’’ and ‘‘Kiki’’ in Namibia? A remote culture make similar shape–sound matches, but different shape–taste matches to Westerners. Cognition, 1261, 165–172.
Call, J., & Tomasello, M. (Eds.). (2007). The gestural communication of apes and monkeys. London: Lawrence Erlbaum Associates, Publishers.
Cartmill, E. A., Beilock, S., & Goldin-Meadow, S. (2012). A word in the hand: Action, gesture and mental representation in humans and non-human primates. Philosophical Transactions of the Royal Society B, 3671, 29–143.
Clark, N., Perlman, M., & Johansson Falck, M. (2014). Iconic pitch expresses vertical space. In Language and the creative mind (pp. 393–410). Stanford: SCLI Publications.
Corballis, M. C. (2003). From hand to mouth: The origins of language. Princeton, NJ; Woodstock: Princeton University Press.
Crockford, C., Herbinger, I., Vigilant, L., & Boesch, C. (2004). Wild chimpanzees produce group-specific calls: A case for vocal learning? Ethology, 1101, 221–243.
Cuskley, C. (2013). Mappings between linguistic sound and motion. Public Journal of Semiotics, 51, 39–62.
Dingemanse, M. (2012). Advances in the cross-linguistic study of ideophones: Advances in the cross-linguistic study of ideophones. Language and Linguistics Compass, 6(10), 654–672.
(2013). Ideophones and gesture in everyday speech. Gesture, 131, 146–165.
Dingemanse, M., Blasi, D. E., Lupyan, G., Christansen, M. H., & Monaghan, P. (2015). Arbitrariness, iconicity, and systematicity in language. Trends in Cognitive Sciences, 191, 603–615.
Douglas, P. H., & Moscovice, L. R. (2015). Pointing and pantomime in wild apes? Female bonobos use referential and iconic gestures to request genito-genital rubbing. Scientific Reports, 5, 13999.
Evans, N. & Levinson, S. C. (2009). The myth of language universals: Language diversity and its importance for cognitive science. Behavioral and Brain Sciences, 321, 429–448.
Fay, N., Arbib, M., & Garrod, S. (2013). How to Bootstrap a Human Communication System. Cognitive Science, 37(7), 1356–1367.
Fay, N., Lister, C. J., Ellison, T. M., & Goldin-Meadow, S. (2014). Creating a communication system from scratch: Gesture beats vocalization hands down. Frontiers in Psychology, 51, 1–12.
Feyereisen, P., & de Lannoy, J. D. (1991). Gestures and speech: Psychological investigations. Cambridge: Cambridge University Press.
Fouts, R. S., & Tukel, S. (1997). Next of kin: My conversations with chimpanzees. New York: William Morrow.
Furness, W. H. (1916). Observations on the mentality of chimpanzees and orang-utans. Proceedings of the
American Philosophical Society
, 551, 281–290.
Genty, E., & Zuberbühler, K. (2015). Iconic gesturing in bonobos. Communicative & Integrative Biology, 81, e992742.
Goldin-Meadow, S. (2003). The resilience of language: what gesture creation in deaf children can tell us about how all children learn language. New York, NY: Psychology Press.
Goodall, J. (1986) The chimpanzees of Gombe: Patterns of behavior. Cambridge: Harvard University Press.
Graham, K. E., Furuichi, T., & Byrne, R. W. (2016). The gestural repertoire of the wild bonobo (Pan paniscus): A mutually understood communication system. Animal Cognition.
Greenberg, J. H., & Jenkins, J. J. (1966). Studies in the psychological correlates of the sound system of American English. Word, 221, 207–242.
Hardus, M. E., Lameira, A. R., Van Schaik, C. P., & Wich, S. A. (2009). Tool use in wild orang-utans modifies sound production: A functionally deceptive innovation? Proceedings of the
Royal Society B
, 2761, 3689–3694.
Hewes, G. W. (1973). Primate communication and the gestural origin of language. Current Anthropology, 14(1/2), 5–24.
Hobaiter, C. & Byrne, R. W. (2011). The gestural repertoire of the wild chimpanzee. Animal Cognition, 141, 745–767.
Hirata, S., Ukita, J., & Kita, S. (2011). Implicit phonetic symbolism in voicing of consonants and visual lightness using Garner’s speeded classification task. Perceptual & Motor Skills, 1131, 929–940.
(1987). Refurbishing our foundations. Amsterdam: John Benjamins.
Hopkins, W. D., & Savage-Rumbaugh, S. (1991). Vocal communication as a function of differential rearing experiences in Pan paniscus: A preliminary report. International Journal of Primatology, 121, 559–583.
Imai, M., & Kita, S. (2014). The sound symbolism bootstrapping hypothesis for language acquisition and language evolution. Philosophical Transactions of the Royal Society B: Biological Sciences, 369(1651), 20130298.
Iverson, J. M., & Goldin-Meadow, S. (1998). Why people gesture when they speak. Nature, 3961, 228–229.
Jespersen, O. (1933). Symbolic value of the vowel i. In O. Jespersen (Ed.), Linguistica (pp. 283–303). Copenhagen: Levin & Munksgaard.
Joseph, J. E. (2015). Iconicity in Saussure’s linguistic work, and why it does not contradict the arbitrariness of the sign. Historiographia Linguistica, 421, 85–105.
Kearton, C. (1925). My friend Toto: The adventures of a chimpanzee, and the story of his journey from the Congo to London. London: Arrowsmith.
Kellogg, W. N., & Kellogg, L. A. (1933). The ape and the child: A study of environmental influence upon early behavior. Oxford: Whittelsey House.
Kelly, B. F., Leben, W., & Cohen, R. (2003). The meanings of consonants. Proceedings of the
29
th
Berkeley Linguistics Society
(pp. 245–253).
Klink, R. R. (2000). Creating brand names with meaning: The use of sound symbolism. Marketing Letters, 111, 5–20.
Lameira, A. R., Hardus, M. E., Kowalsky, B., de Vries, H., Spruijt, B. M., Sterck, E. H. M., Shumaker, R. W., & Wich, S. A. (2013). Orangutan (Pongo spp.) whistling and implications for the emergence of an open-ended call repertoire: A replication and extension. Journal of the Acoustical Society of America, 1341, 2326–2335.
Lameira, A. R., Hardus, M. E., Bartlett, A. M., Shumaker, R. W., Wich, S. A., & Menken, S. B. J. (2015). Speech-like rhythm in a voiced and voiceless orangutan call. PLoS ONE, 101, e116136.
Lameira, A. R., Hardus, M. E., Mielke, A., Wich, S. A., & Shumaker, R. W. (2016). Vocal fold control beyond the species-specific repertoire in an orang-utan. Scientific Reports, 6, 30315.
Lameira, A. R., Maddieson, I., & Zuberbühler, K. (2014). Primate feedstock for the evolution of consonants. Trends in Cognitive Sciences, 18(2), 60–62.
Laporte, M. N. C., & Zuberbühler, K. (2010). Vocal greeting behavior in wild chimpanzee females. Animal Behaviour, 801, 467–473.
Leavens, D. A. (2003). Integration of visual and vocal communication: Evidence for Miocene origins. Behavioral and Brain Sciences, 261, 232–233.
Leavens, D. A., Russell, J. L., & Hopkins, W. D. (2010). Multimodal communication by captive chimpanzees (Pan troglodytes). Animal Cognition, 131, 33–40.
Levinson, S. C., & Holler, J. (2014). The origin of human multi-modal communication. Philosophical Transactions Royal Society B, 3691, 20130302.
Lewis, M., & Frank, M. (2016). The length of words reflects their conceptual complexity. Cognition, 1531, 182–195.
Liebal, K., Waller, B., Burrows, A., & Slocombe, K. E. (2013). Primate communication: A multimodal approach. Cambridge: Cambridge University Press.
Lockwood, G., & Dingemanse, M. (2015). Iconicity in the lab: A review of behavioral, developmental, and neuroimaging research into sound-symbolism. Frontiers in Psychology, 6, 1246.
Lupyan, G., & Casasanto, D. (2014). Meaningless words promote meaningful categorization. Language and Cognition, 1–27.
MacLarnon, A. M., & Hewitt, G. P. (1999). The evolution of human speech: The role of enhanced breathing control. American Journal of Physical Anthropology, 1091, 341–363.
Maddieson, I. (2011). Consonant-vowel ratio. In The World Atlas of Language Stuctures Online. Max Planck Digital Library.
Marshall, A. J., Wrangham, R. W., & Arcadi, A. C. (1999). Does learning affect the structure of vocalizations in chimpanzees? Animal Behaviour, 581, 825–830.
McNeill, D. (2012). How language began: Gesture and speech in human evolution. Cambridge: Cambridge University Press.
Miles, L. (1990). The cognitive foundations for reference in a signing orangutan. In S. T. Parker & K. R. Gibson (Eds.), “Language” and intelligence in monkeys and apes (pp. 511–539). Cambridge: Cambridge University Press.
Mitani, J. C., & Gros-Louis, J. (1998). Chorusing and call convergence in chimpanzees: Tests of three hypotheses. Behaviour, 1351, 1041–1064.
Moos, A., Smith, R., Miller, S. R., & Simmons, D. R. (2014). Cross-modal associations in synaesthesia: vowel colours in the ear of the beholder. i-Perception, 51, 132–142.
Newman, S. S. (1933). Further experiments in phonetic symbolism. American Journal of Psychology, 451, 53–75.
Ohala, J. J. (1994). The frequency code underlies the sound symbolic use of voice pitch. In L. Hinton, J. Nichols, & J. J. Ohala (Eds.), Sound Symbolism (pp. 325–347). Cambridge: Cambridge University Press.
Owren, M. J., Amoss, R. T., & Rendall, D. (2011). Two organizing principles of vocal production: Implications for nonhuman and human primates. American Journal of Primatology, 731, 530–544.
Perlman, M., & Cain, A. A. (2014). Iconicity in vocalization, comparisons with gesture, and implications for theories on the evolution of language. Gesture, 14(3), 320–350.
Perlman, M., & Clark, N. (2015). Learned vocal and breathing behavior in an enculturated gorilla. Animal Cognition, 181, 1165–1179.
Perlman, M., Clark, N., & Johansson Falck, M. (2015). Iconic prosody in story reading. Cognitive Science, 39(6), 1348–1368.
Perlman, M., Dale, R., & Lupyan, G. (2015). Iconicity can ground the creation of vocal symbols. Royal Society Open Science, 2(8), 150152.
Perlman, M., & Gibbs, R. W. Jr. (2013). Pantomimic gestures reveal the sensorimotor imagery of a human-fostered gorilla. Journal of Mental Imagery, 371, 73–96.
Perlman, M., & Lupyan, G. (2015). The vocal iconicity challenge! Paper presented at
Protolang 4 conference
, Rome, Italy.
Perlman, M., Patterson, F. G., & Cohn, R. H. (2012). The human-fostered gorilla Koko shows breath control in play with wind instruments. Biolinguistics, 61, 433–444.
Perlman, M., Paul, J. Z., & Lupyan, G. (2015). Congenitally deaf children produce iconic vocalizations to communicate magnitude. In D. C. Noelle, R. Dale, A. S. Warlaumont, J. Yoshimi, T. Matlock, C. D. Jennings, & P. P. Maglio (Eds.), Proceedings of the 37th Annual Conference of the Cognitive Science Society (pp. 1853–1856). Austin, TX: Cognitive Science Society.
Perlman, M., Paul, J. Z. & Lupyan, G. (under review). Deaf and hearing Chinese children improvise iconic vocalizations that communicate magnitude to American listeners.
Perlman, M., Tanner, J. E., & King, B. J. (2012). A mother gorilla’s variable use of touch to guide her infant: Insights into iconicity and the relationship between gesture and action. In S. Pika & K. Liebal (Eds.), Developments in primate gesture research (pp. 55–72). Amsterdam: John Benjamins.
Perniss, P., & Vigliocco, G. (2014). The bridge of iconicity: From a world of experience to the experience of language. Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 20130300.
Perry, L. K., Perlman, M., & Lupyan, G. (2015). Iconicity in English and Spanish and its relation to lexical category and age of acquisition. PLoS ONE, 10(9), e0137147.
Pika, S. (2014). Chimpanzee grooming gestures and sounds: What might they tell us about how language evolved. In D. Dor, C. Knight, & J. Lewis (Eds.), The Social Origins of Language (pp. 128–140). Oxford: Oxford University Press.
Pisanki, K., Cartei, V., McGettigan, C., Raine, J., & Reby, D. (2016). Voice modulation: A window into the origins of human vocal control? Trends in Cognitive Sciences, 201, 304–318.
Ramachandran, V. S., & Hubbard, E. M. (2001). Synaesthesia: A window into perception, thought and language. Journal of Consciousness Studies, 81, 3–34.
Russon, A., & Andrews, K. (2010). Orangutan pantomime: Elaborating the message. Biology Letters, 7(4), 627–630.
Sandler, W., Meir, I., Padden, C., & Aronoff, M. (2005). The emergence of grammar: systematic structure in a new language. Proceediings of the National Aacademy of Sciences, 1021, 2661–2665.
Schel, A. M., Townsend, S. W., Machanda, Z., Zuberbühler, K, & Slocombe, K. E. (2013). Chimpanzee Alarm Call Production Meets Key Criteria for Intentionality. PLoS ONE, 8(10), e76674.
Senghas, A., Kita, S., & zuryek, A. (2004). Children creating core properties of language: evidence from an emergining sign language in Nicaragua. Science, 3051, 1779–1782.
Shintel, H., Nusbaum, H. C., & Okrent, A. (2006). Analog acoustic expression in speech communication. Journal of Memory and Language, 55(2), 167–177.
Simner, J., Cuskley, C., & Kirby, S. (2010). What sound does that taste? Cross-modal mappings across gustation and audition. Perception, 391, 553–569.
Taglialatela, J. P., Savage-Rumbaugh, S., & Baker, L. A. (2003). Vocal production by a language-competent Pan paniscus
. International Journal of Primatology, 241, 1–17.
Taglialatela, J. P., Russell J. L., Schaeffer J. A., Hopkins, W. D. (2011). Chimpanzee vocal signaling points to a multimodal origin of human language. PLoS One, 61, e18852.
Tanner, J. E., & Byrne, R. W. (1996). Representation of action through iconic gesture in a captive lowland gorilla. Current Anthropology, 371, 162–173.
Thompson, P. D., & Estes, Z. (2011). Sound symbolic naming of novel objects is a graded function. The Quarterly Journal of Experimental Psychology, 641, 2392–2404.
Van Schaik, C. P., Van Noordwijk, M. & Wich, S. A. (2006). Innovation in wild Bornean orangutans (Pongo pygmaeus wurmbii). Behaviour, 1431, 839–876.
Watson, S. K., Townsend, S. W., Schel, A. M., Wilke, C., Wallace, E. K., Cheng, L., West, V., & Slocombe, K. E. (2015) Vocal learning in the functionally referential food grunts of chimpanzees. Current Biology, 251, 495–499.
Wich, S. A., Swartz, K. B., Hardus, M. E., Lameira, A. R., Stromberg, E., & Shumaker, R. W. (2009). A case of spontaneous acquisition of a human sound by an orangutan. Primates, 501, 56–64.
Cited by (28)
Cited by 28 other publications
Snider, Robert, Petros Kaklamanis & Gareth Roberts
Nölle, Jonas M., Andres Karjus & Stefan Hartmann
Henlein, Alexander, Anastasia Bauer, Reetu Bhattacharjee, Aleksandra Ćwiek, Alina Gregori, Frank Kügler, Jens Lemanski, Andy Lücking, Alexander Mehler, Pilar Prieto, Paula G. Sánchez-Ramón, Job Schepens, Martin Schulte-Rüther, Stefan R. Schweinberger & Celina I. von Eiff
Pleyer, Michael, Ryan Lepic & Stefan Hartmann
Źywiczyński, Przemysław & Jordan Zlatev
Kilpatrick, Alexander, Aleksandra Ćwiek, Eleanor Lewis & Shigeto Kawahara
Markov, Ilia, Kseniia Kharitonova & Elena L. Grigorenko
Hodge, Gabrielle & Lindsay Ferrara
Holler, Judith
Leongómez, Juan David, Jan Havlíček & S. Craig Roberts
Mudd, Katie, Connie de Vos & Bart de Boer
Rasenberg, Marlou, Asli Özyürek, Sara Bögels & Mark Dingemanse
Dellert, Johannes, Niklas Erben Johansson, Johan Frid & Gerd Carling
Erben Johansson, Niklas, Jon W Carr & Simon Kirby
Hartmann, Stefan & Michael Pleyer
Ćwiek, Aleksandra, Susanne Fuchs, Christoph Draxler, Eva Liina Asu, Dan Dediu, Katri Hiovain, Shigeto Kawahara, Sofia Koutalidis, Manfred Krifka, Pärtel Lippus, Gary Lupyan, Grace E. Oh, Jing Paul, Caterina Petrone, Rachid Ridouane, Sabine Reiter, Nathalie Schümchen, Ádám Szalontai, Özlem Ünal-Logacev, Jochen Zeller, Bodo Winter & Marcus Perlman
Żywiczyński, Przemysław, Sławomir Wacewicz & Casey Lister
DINGEMANSE, MARK, MARCUS PERLMAN & PAMELA PERNISS
Macuch Silva, Vinicius, Judith Holler, Asli Ozyurek & Seán G. Roberts
Bergman, Thore J., Jacinta C. Beehner, Melissa C. Painter & Morgan L. Gustison
Fröhlich, Marlen, Christine Sievers, Simon W. Townsend, Thibaud Gruber & Carel P. van Schaik
Mertz, Justine, Annaëlle Surreault, Erica van de Waal & Jennifer Botting
Żywiczyński, Przemysław
Graham, Kirsty E., Catherine Hobaiter, James Ounsley, Takeshi Furuichi, Richard W. Byrne & Lars Chittka
This list is based on CrossRef data as of 17 march 2026. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
