TY - JOUR
T1 - Goal-oriented behavior sequence generation based on semantic commands using multiple timescales recurrent neural network with initial state correction
AU - Jeong, Sungmoon
AU - Park, Yunjung
AU - Mallipeddi, Rammohan
AU - Tani, Jun
AU - Lee, Minho
PY - 2014/4/10
Y1 - 2014/4/10
N2 - In this paper, to build an autonomous robot, we propose a novel scheme for a goal-oriented behavior sequence generation in tasks involving multiple objects. The scheme includes three major functions: (1) visual attention for target object localization; (2) automatic initial state correction based on experience using simple reinforcement learning, and (3) a suitable behavior sequence generation method based on multiple timescales recurrent neural networks (MTRNN). The proposed scheme systematically combines the three different major functions so that the autonomous bi-pad robot can automatically execute tasks involving multiple objects based on high level semantic commands given by human supervisor. The selective attention model continuously catches the visual environment to understand the current states of robot and perceive the relationship between current states of robot and the environment (depth perception and localization of a target object). If the current state is different from the initial state (depth perception and localization of a target object), the robot automatically adjust its current state to the initial state by integrating visual attention and simple reinforcement learning. After correcting the initial state of the robot, the behavior sequence generation functions can successfully generate suitable behavior timing signals, by integrating visual attention and MTRNN, based on the high level semantic commands given by human supervisor. Experimental results show that the proposed scheme can successfully generate suitable behavior timing, for a robot to autonomously achieve the tasks involving multiple objects, such as searching, approaching and hitting the target object using its arm.
AB - In this paper, to build an autonomous robot, we propose a novel scheme for a goal-oriented behavior sequence generation in tasks involving multiple objects. The scheme includes three major functions: (1) visual attention for target object localization; (2) automatic initial state correction based on experience using simple reinforcement learning, and (3) a suitable behavior sequence generation method based on multiple timescales recurrent neural networks (MTRNN). The proposed scheme systematically combines the three different major functions so that the autonomous bi-pad robot can automatically execute tasks involving multiple objects based on high level semantic commands given by human supervisor. The selective attention model continuously catches the visual environment to understand the current states of robot and perceive the relationship between current states of robot and the environment (depth perception and localization of a target object). If the current state is different from the initial state (depth perception and localization of a target object), the robot automatically adjust its current state to the initial state by integrating visual attention and simple reinforcement learning. After correcting the initial state of the robot, the behavior sequence generation functions can successfully generate suitable behavior timing signals, by integrating visual attention and MTRNN, based on the high level semantic commands given by human supervisor. Experimental results show that the proposed scheme can successfully generate suitable behavior timing, for a robot to autonomously achieve the tasks involving multiple objects, such as searching, approaching and hitting the target object using its arm.
KW - Autonomous robot
KW - Behavior sequence generation
KW - Robot tasks involving multiple objects
KW - Semantic commands for robot
KW - Visual attention shifts
UR - http://www.scopus.com/inward/record.url?scp=84893717942&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2013.03.050
DO - 10.1016/j.neucom.2013.03.050
M3 - Article
AN - SCOPUS:84893717942
SN - 0925-2312
VL - 129
SP - 67
EP - 77
JO - Neurocomputing
JF - Neurocomputing
ER -