Iconicity in vocalization, comparisons with gesture, and implications for theories on the evolution of language
Published online: 21 January 2016
https://doi.org/10.1075/gest.14.3.03per
Scholars have often reasoned that vocalizations are extremely limited in their potential for iconic expression, especially in comparison to manual gestures (e.g., Armstrong & Wilcox, 2007; Tomasello, 2008). As evidence for an alternative view, we first review the growing body of research related to iconicity in vocalizations, including experimental work on sound symbolism, cross-linguistic studies documenting iconicity in the grammars and lexicons of languages, and experimental studies that examine iconicity in the production of speech and vocalizations. We then report an experiment in which participants created vocalizations to communicate 60 different meanings, including 30 antonymic pairs. The vocalizations were measured along several acoustic properties, and these properties were compared between antonyms. Participants were highly consistent in the kinds of sounds they produced for the majority of meanings, supporting the hypothesis that vocalization has considerable potential for iconicity. In light of these findings, we present a comparison between vocalization and manual gesture, and examine the detailed ways in which each modality can function in the iconic expression of particular kinds of meanings. We further discuss the role of iconic vocalizations and gesture in the evolution of language since our divergence from the great apes. In conclusion, we suggest that human communication is best understood as an ensemble of kinesis and vocalization, not just speech, in which expression in both modalities spans the range from arbitrary to iconic.
Keywords: iconicity, language evolution, modality, sound symbolism, vocalization
References (110)
Ahlner, Felix & Jordan Zlatev (2011). Cross-modal iconicity: A cognitive semiotic approach to sound symbolism. Sign Systems Studies, 38, 298–348.
Alpher, Barry (2001). Ideophones in interaction with intonation and the expression of new information in some indigenous languages of Australia. In Erhard F.K. Voeltz & Christa Kilian-Hatz (Eds.), Ideophones (pp. 9–24). Amsterdam: John Benjamins.
Armstrong, David F., William C. Stokoe, & Sherman E. Wilcox (1995). Gesture and the nature of language. Cambridge: Cambridge University Press.
Armstrong, David F. & Sherman E. Wilcox (2007). The gestural origin of language. New York: Oxford University Press.
Banse, Rainer & Klaus R. Scherer (1996). Acoustic profiles in vocal emotion expression. Journal of Personality and Social Psychology, 70, 614–636.
Bentley, Madison & Edith J. Varon (1933). An accessory study of phonetic symbolism. American Journal of Psychology, 45, 76–86.
Boersma, Paul (2001). Praat, a system for doing phonetics by computer. Glot International, 5, 341–345.
Bolinger, Dwight (1986). Intonation and its parts: Melody in spoken English. Palo Alto, CA: Stanford University Press.
Bremner, Andrew J., Serge Caparos, Jules Davidoff, Jan de Fockert, Karina J. Linnell, & Charles Spence (2013). “Bouba” and “Kiki” in Namibia? A remote culture make similar shape-sound matches, but different shape-taste matches to Westerners. Cognition, 126, 165–172.
Brown, Roger W., Abraham H. Black, & Arnold E. Horowitz (1955). Phonetic symbolism in natural languages. Journal of Abnormal and Social Psychology, 50, 388–393.
Call, Josep & Michael Tomasello (Eds.) (2007). The gestural communication of apes and monkeys. London: Lawrence Erlbaum.
Cartmill, Erica A., Sian Beilock, & Susan Goldin-Meadow (2012). A word in the hand: Action, gesture and mental representation in humans and non-human primates. Philosophical Transactions of the Royal Society B, 367, 129–143.
Childs, G. Tucker (1994). African ideophones. In Leanne Hinton, Johanna Nichols, & John J. Ohala (Eds.), Sound symbolism (pp. 178–206). Cambridge: Cambridge University Press.
Clark, Herbert H. (2003). Pointing and placing. In Sotaro Kita (Ed.), Pointing. Where language, culture, and cognition meet (pp. 243–268). Hillsdale, NJ: Erlbaum.
Clark, Nathaniel, Marcus Perlman, & Marlene Johansson Falck (2014). The iconic use of pitch to express vertical space. In Barbara Dancygier, Mike Borkent, & Jennifer Hinnell (Eds.), Language and the creative mind (pp. 393–410). Stanford: CSLI Publications.
Corballis, Michael (2002). From hand to mouth: The origins of language. Princeton: Princeton University Press.
Corbett, Greville (1994). Gender and gender systems. In R. Asher (Ed.), The Encyclopedia of language and linguistics, Vol. 3 (pp. 1347–1353). Oxford: Pergamon Press.
Cosmides, Leda (1983). Invariances in the acoustic expression of emotion during speech. Journal of Experimental Psychology, 9, 864–881.
Cuskley, Christine (2013). Mappings between linguistic sound and motion. Public Journal of Semiotics, 5, 39–62.
Davis, R. (1961). The fitness of names to drawings: A cross-cultural study in Tanganyika. British Journal of Psychology, 52, 259–268.
de Boer, Bart & Marcus Perlman (2014). Physical mechanisms may be as important as brain mechanisms in evolution of speech. Behavioral and Brain Sciences, 37 (6), 552–553.
Diffloth, Gerard (1972). The notes on expressive meaning. In Paul M. Peranteau, Judith N. Levi, & Gloria C. Phares (Eds.), Papers from the Eighth Regional Meeting of Chicago Linguistic Society (pp. 440–447). Chicago: Chicago Linguistic Society.
Dingemanse, Mark (2012). Advances in the cross-linguistic study of ideophones. Language and Linguistics Compass, 6, 654–672.
Dingemanse, Mark, Francisco Torreira, & N.J. Enfield (2013). Is “Huh?” a universal word? Conversational infrastructure and the convergent evolution of linguistic items. PLoS ONE, 8, e78273.
Emmorey, Karen (2002). Language, cognition, and brain: Insights from sign language research. Hillsdale, NJ: Lawrence Erlbaum Associates.
Fay, Nicolas, Michael Arbib, & Simon Garrod (2013). How to bootstrap a human communication system. Cognitive Science, 37, 1356–1367.
Fay, Nicolas, Casey J. Lister, T. Mark Ellison, & Susan Goldin-Meadow (2014). Creating a communication system from scratch: Gesture beats vocalization hands down. Frontiers in Psychology, 5, 1–12.
Feld, Steven (1996). Waterfalls of song: an acoustemology of place resounding in Bosavi, Papua New Guinea. In Keith H. Basso & Steven Feld (Eds.), Senses of Place (pp. 91–135). Santa Fe, NM: School of American Research Advanced Seminar Series.
Gebels, Gustav (1969). An investigation of phonetic symbolism in different cultures. Journal of Verbal Learning & Verbal Behavior, 8, 310–312.
Goldin-Meadow, Susan (2003). The resilience of language: What gesture creation in deaf children can tell us about how children learn language. New York, NY: Psychology Press.
Greenberg, Joseph H. (1966). Some universals of language with special reference to the order of meaningful constituents. In Joseph Greenberg (Ed.), Universals of language (pp. 73–113). Cambridge, MA: MIT Press.
Greenberg, Joseph H. & James J. Jenkins (1966). Studies in the psychological correlates of the sound system of American English. Word, 22, 207–242.
Hardus, Madeleine E., Adriano R. Lameira, Carel P. Van Schaik, & Serge A. Wich (2009). Tool use in wild orang-utans modifies sound production: A functionally deceptive innovation? Proceedings of the Royal Society B, 276, 3689–3694.
Hewes, Gordon W. (1973). Primate communication and the gestural origins of language. Current Anthropology, 14, 5–24.
Hopkins, William D., Jared P. Taglialatela, & David A. Leavens (2007). Chimpanzees differentially produce novel vocalizations to capture the attention of a human. Animal Behaviour, 73, 281–286.
Imai, Mutsumi & Sotaro Kita (2014). The sound symbolism bootstrapping hypothesis for language acquisition and language evolution. Philosophical Transactions of the Royal Society B: Biological Sciences, 369 (1651), 20130298.
Imai, Mutsumi, Sotaro Kita, Miho Nagumo, & Hiroyuki Okada (2008). Sound symbolism facilitates early verb learning. Cognition, 109, 54–65.
Jakobson, Roman & Linda R. Waugh (1979). The sound shape of language. Bloomington, IN: Indiana University Press.
Jespersen, Otto (1933). Symbolic value of the vowel i. In Otto Jespersen (Ed.), Linguistica (pp. 283–303). Copenhagen: Levin & Munksgaard.
Kantartzis, Katerina, Mutsumi Imai, & Sotaro Kita (2011). Japanese sound-symbolism facilitates word learning in English-speaking children. Cognitive Science, 35, 575–586.
Kelly, Barbara F., William Leben, & Robert Cohen (2003). The meanings of consonants. Proceedings of the 29th Berkeley Linguistics Society (pp. 245–253).
Kendon, Adam (1991). Some considerations for a theory of language origins. Man (N.S.), 26, 602–619.
Kendon, Adam (2009). Language’s matrix. Gesture, 9, 352–372.
Kendon, Adam (2014). Semiotic diversity in utterance production and the concept of ‘language’. Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 20130293.
Klink, Richard R. (2000). Creating brand names with meaning: The use of sound symbolism. Marketing Letters, 11, 5–20.
Kovic, Vanja, Kim Plunkett, & Gert Westermann (2010). The shape of words in the brain. Cognition, 114, 19–28.
LaPolla, Randy J. (1994). An investigation of phonetic symbolism as it relates to Mandarin Chinese. In Leanne Hinton, Johanna Nichols, & John J. Ohala (Eds.), Sound symbolism. Cambridge: Cambridge University Press.
Lewis, Jerome (2009). As well as words: Congo Pygmy hunting, mimicry, and play. In Rudolf Botha & Chris Knight (Eds.), The cradle of language (pp. 236–256). Oxford, UK: Oxford University Press.
Liddell, Scott K. (2003). Grammar, gesture, and meaning in American Sign Language. Cambridge: Cambridge University Press.
Lupyan, Gary & Daniel Casasanto (2014). Meaningless words promote meaningful categorization. Language and Cognition, FirstView, 1–27.
Lupyan, Gary & Rick Dale (2015). The role of adaptation in understanding linguistic diversity. In Randy LaPolla & Rik De Busser (Eds.), The shaping of language: The relationship between the structures of languages and their social, cultural, historical, and natural environments (pp. 289–316).
Massaro, Dominic W. (1998). Perceiving talking faces: From speech perception to a behavioral principle. Cambridge, MA: MIT Press.
Maurer, Daphne, Thanujeni Pathman, & Catherine J. Mondloch (2006). The shape of boubas: sound-shape correspondences in toddlers and adults. Developmental Science, 9, 316–322.
McGregor, William (2001). Ideophones as the source of verbs in northern Australian languages. In Erhard F.K. Voeltz & Christa Kilian-Hatz (Eds.), Ideophones (pp. 205–221). Amsterdam: John Benjamins.
McNeill, David (2012). How language began: Gesture and speech in human evolution. Cambridge: Cambridge University Press.
Mikone, Eve (2001). Ideophones in the Balto-Finnic languages. In Erhard F.K. Voeltz & Christa Kilian-Hatz (Eds.), Ideophones (pp. 223–233). Amsterdam: John Benjamins.
Monaghan, Padraic, Richard C. Shillcock, Morten H. Christiansen, & Simon Kirby (2014). How arbitrary is language? Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 20130299.
Newman, Stanley S. (1933). Further experiments in phonetic symbolism. American Journal of Psychology, 45, 53–75.
Nuckolls, Janis B. (1996). Sounds like life: Sound-symbolic grammar, performance, and cognition in Pastaza Quechua. New York: Oxford University Press.
Nuckolls, Janis B. (2004). To be or not to be ideophonically impoverished. In Wai Fong Chiang, Elaine Chun, Laura Mahalingappa, & Siri Mehus (Eds.), SALSA XI: Proceedings of the Eleventh Annual Symposium about Language and Society (pp. 131–142). Austin: University of Texas.
Nygaard, Lynne, Debora Herold, & Laura Namy (2009). The semantics of prosody: Acoustic and perceptual correlates to word meaning. Cognitive Science, 33, 127–146.
Ohala, John J. (1994). The frequency code underlies the sound symbolic use of voice pitch. In Leanne Hinton, Johanna Nichols, & John J. Ohala (Eds.), Sound symbolism (pp. 325–347). Cambridge: Cambridge University Press.
Oswalt, Robert (1994). Inanimate imitatives in English. In Leanne Hinton, Johanna Nichols, & John J. Ohala (Eds.), Sound symbolism (pp. 293–306). Cambridge: Cambridge University Press.
Parise, Cesare V. & Francesco Pavani (2011). Evidence of sound symbolism in simple vocalizations. Experimental Brain Research, 214, 373–380.
Peirce, Charles S. (1955). Logic as semiotic: The theory of signs. In Justus Buchler (Ed.), Philosophical writings of Peirce (pp. 99–119). New York: Dover.
Perlman, Marcus (2010). Talking fast: The use of speech rate as iconic gesture. In Fey Parrill, Mark Turner, & Vera Tobin (Eds.), Meaning, form, and body (pp. 245–262). Stanford: CSLI Publications.
Perlman, Marcus & Nathaniel Clark (2015). Learned vocal and breathing behavior in an enculturated gorilla. Animal Cognition, 18, 1165–1179.
Perlman, Marcus, Francine G. Patterson, & Ronald H. Cohn (2012a). The human-fostered gorilla Koko shows breath control in play with wind instruments. Biolinguistics, 6, 433–444.
Perlman, Marcus, Joanne E. Tanner, & Barbara J. King (2012b). A mother gorilla’s variable use of touch to guide her infant: Insights into iconicity and the relationship between gesture and action. In Simona Pika & Katja Liebal (Eds.), Developments in primate gesture research (pp. 55–72). Amsterdam: John Benjamins.
Perlman, Marcus & Raymond W. Gibbs Jr. (2013). Pantomimic gestures reveal the sensorimotor imagery of a human-fostered gorilla. Journal of Mental Imagery, 37, 73–96.
Perlman, Marcus, Rick Dale, & Gary Lupyan (2014). Iterative vocal charades: The emergence of conventions in vocal communication. In Erica A. Cartmill, Sean Roberts, Heidi Lyn, & Hannah Cornish (Eds.), The evolution of language: Proceedings of the 10th International Conference (EVOLANG10). New Jersey: World Scientific.
Perlman, Marcus, Nathaniel Clark, & Marlene Johansson Falck (2015). Iconic prosody in story reading. Cognitive Science, 39 (6), 1348–1368.
Perniss, Pamela & Gabriella Vigliocco (2014). The bridge of iconicity: From a world of experience to the experience of language. Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 20130300.
Pinker, Steven & Ray Jackendoff (2005). What’s special about the human language faculty? Cognition, 95, 201–236.
Pitcher, Benjamin J., Alex Mesoudi, & Alan G. McElligott (2013). Sex-biased sound symbolism in English-language first names. PLoS ONE, 8, e64825.
Ramachandran, Vilayanur S. & Edward M. Hubbard (2001). Synaesthesia: A window into perception, thought and language. Journal of Consciousness Studies, 8, 3–34.
Rhodes, Richard (1994). Aural images. In Leanne Hinton, Johanna Nichols, & John J. Ohala (Eds.), Sound symbolism (pp. 276–292). Cambridge: Cambridge University Press.
Rummer, Ralf, Judith Schweppe, René Schlegelmilch, & Martine Grice (2014). Mood is linked to vowel type: The role of articulatory movements. Emotion, 14, 246–250.
Sandler, Wendy (2013). Vive la différence: Sign language and spoken language in language evolution. Language and Cognition, 5, 189–203.
Sapir, Edward (1929). A study in phonetic symbolism. Journal of Experimental Psychology, 12, 225–239.
Sauter, Disa A., Frank Eisner, Paul Ekman, & Sophie K. Scott (2010). Cross-cultural recognition of basic emotions through nonverbal emotional vocalizations. Proceedings of the National Academy of Sciences, 107, 2408–2412.
Schultze-Berndt, Eva (2001). Ideophone-like characteristics of uninflected predicates in Jaminjung (Australia). In Erhard F.K. Voeltz & Christa Kilian-Hatz (Eds.), Ideophones (pp. 355–373). Amsterdam: John Benjamins.
Shintel, Hadas & Howard C. Nusbaum (2007). The sound of motion in spoken language: Visual information conveyed by acoustic properties of speech. Cognition, 105, 681–690.
Shintel, Hadas & Howard C. Nusbaum (2008). Moving to the speed of sound: Context modulation of the effect of acoustic properties of speech. Cognitive Science, 32, 1063–1074.
Shintel, Hadas, Howard C. Nusbaum, & Arika Okrent (2006). Analog acoustic expression in speech communication. Journal of Memory and Language, 55, 167–177.
Steklis, Horst D. & Stevan Harnad (1976). From hand to mouth: Some critical stages in the evolution of language. In Stevan Harnad, Horst D. Steklis, & Jane B. Lancaster (Eds.), Origins and evolution of language and speech. Annals of the New York Academy of Sciences, 280, 445–455.
Tanz, Christine (1971). Sound symbolism in words relating to proximity and distance. Language and Speech, 14, 266–276.
Taub, Sarah (2001). Language from the body: Iconicity and metaphor in American Sign Language. Cambridge: Cambridge University Press.
Taylor, Kevin J. (2007). KA-BOOM! A dictionary of comic book words, symbols & onomatopoeia. USA: Lulu.com.
Thompson, Patrick D. & Zachary Estes (2011). Sound symbolic naming of novel objects is a graded function. The Quarterly Journal of Experimental Psychology, 64, 2392–2404.
Ultan, R. (1978). Size-sound symbolism. In Joseph H. Greenberg (Ed.), Universals of human language, Vol. 2: Phonology (pp. 527–568). Stanford, CA: Stanford University Press.
Urban, Matthias (2011). Conventional sound symbolism in terms for organs of speech: A cross-linguistic study. Folia Linguistica, 45, 199–214.
Van Schaik, Carel P., Marc Ancrenaz, Gwendolyn Borgen, Birute Galdikas, Cheryl D. Knott, Ian Singleton, Akira Suzuki, Sri Suci Utami, & Michelle Merrill (2003). Orangutan cultures and the evolution of material culture. Science, 299, 102–105.
Waugh, Linda (2000). Against arbitrariness: Imitation and motivation revived. In Patrizia Violi (Ed.), Phonosymbolism and poetic language (pp. 25–56). Turnhout, Belgium: Brepols.
Watson, Richard L. (2001). A comparison of some Southeast Asian ideophones with some African ideophones. In Erhard F.K. Voeltz & Christa Kilian-Hatz (Eds.), Ideophones (pp. 385–405). Amsterdam: John Benjamins.
