Generation of efficient and user-friendly queries for helper robots to detect target objects

被引:12
作者
Kurnia, Rahmadi [1 ]
Hossain, Altab [1 ]
Nakamura, Akio [1 ]
Kuno, Yoshinori [1 ]
机构
[1] Saitama Univ, Dept Informat & Comp Sci, Sakura Ku, Saitama 3388570, Japan
关键词
robot vision; dialog generation; object features; image segmentation; human-robot interface;
D O I
10.1163/156855306776985559
中图分类号
TP24 [机器人技术];
学科分类号
080202 [机械电子工程]; 1405 [智能科学与技术];
摘要
We are developing a helper robot that carries out tasks ordered by users through speech. The robot needs a vision system to recognize the objects appearing in the orders. However, conventional vision systems cannot recognize objects in complex scenes. They may find many objects and cannot determine which is the target. This paper proposes a method of using a conversation with the user to solve this problem. The robot asks a question to which the user can easily answer and whose answer can efficiently reduce the number of candidate objects. It considers the characteristics of features used for object identification such as the ease for humans to specify them by word, generating a user-friendly and efficient sequence of questions. Experimental results show that the robot can detect target objects by asking the questions generated by the method.
引用
收藏
页码:499 / 517
页数:19
相关论文
共 22 条
[1]
BERRY GA, 1998, P WORKSH PERC US INT, P67
[2]
MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[3]
Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[4]
Cremers A., 1998, Multimodal Human-Computer Communication. Systems, Techniques and Experiments, P279
[5]
Ehrenmann M, 2002, IEEE ROMAN 2002, PROCEEDINGS, P460, DOI 10.1109/ROMAN.2002.1045665
[6]
HANAFIAH ZM, 2004, P C HUM FACT COMP SY, P1321
[7]
Hans M, 2002, IEEE ROMAN 2002, PROCEEDINGS, P380, DOI 10.1109/ROMAN.2002.1045652
[8]
INAMURA T, 2004, P INT C INT ROB SYST, P2861
[9]
Human robot interaction through integrating visual auditory information with relaxation method [J].
Kawaji, T ;
Okada, K ;
Inaba, M ;
Inoue, H .
PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS, 2003, :323-328
[10]
KOMATANI K, 2002, P COLING, P481