Article published In: The Evolution of Grounded Communication
Edited by Luc Steels
[Evolution of Communication 4:1] 2001
► pp. 33–56
Learning visually grounded words and syntax of natural spoken language
Published online: 29 April 2002
https://doi.org/10.1075/eoc.4.1.04roy
https://doi.org/10.1075/eoc.4.1.04roy
Properties of the physical world have shaped human evolutionary design and given rise to physically grounded mental representations. These grounded representations provide the foundation for higher level cognitive processes including language. Most natural language processing machines to date lack grounding. This paper advocates the creation of physically grounded language learning machines as a path toward scalable systems which can conceptualize and communicate about the world in human-like ways. As steps in this direction, two experimental language acquisition systems are presented.
The first system, CELL, is able to learn acoustic word forms and associated shape and color categories from fluent untranscribed speech paired with video camera images. In evaluations, CELL has successfully learned from spontaneous infant-directed speech. A version of CELL has been implemented in a robotic embodiment which can verbally interact with human partners.
The second system, DESCRIBER, acquires a visually-grounded model of natural language which it uses to generate spoken descriptions of objects in visual scenes. Input to DESCRIBER’s learning algorithm consists of computer generated scenes paired with natural language descriptions produced by a human teacher. DESCRIBER learns a three-level language model which encodes syntactic and semantic properties of phrases, word classes, and words. The system learns from a simple ‘show-and-tell’ procedure, and once trained, is able to generate semantically appropriate, contextualized, and syntactically well-formed descriptions of objects in novel scenes.
Cited by (17)
Cited by 17 other publications
Liu, Rui, Yibei Guo, Runxiang Jin & Xiaoli Zhang
Heath, Scott, David Ball & Janet Wiles
Mingo, Jack Mario & Ricardo Aler
Rasheed, Nadia & Shamsudin H. M. Amin
Mukerjee, Amitabha & Madan Mohan Dabbeeru
Tikhanoff, Vadim, Angelo Cangelosi & Giorgio Metta
Bauckhage, C., S. Wachsmuth, M. Hanheide, S. Wrede, G. Sagerer, G. Heidemann & H. Ritter
Knowles, Michael John & Stefan Wermter
MCCLAIN, MATTHEW & STEPHEN LEVINSON
Wachsmuth, Sven, Sebastian Wrede & Marc Hanheide
Jamieson, M., S. Dickinson, S. Stevenson & S. Wachsmuth
Jung-Hoon Hwang, KangWoo Lee & Dong-Soo Kwon
Bauckhage, C., M. Hanheide, S. Wrede & G. Sagerer
Heidemann, Gunther, Ingo Bax & Holger Bekel
Steels, Luc
Steels, Luc
This list is based on CrossRef data as of 9 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
