RECOVERING THE BASIC STRUCTURE OF HUMAN ACTIVITIES FROM NOISY VIDEO-BASED SYMBOL STRINGS

被引:15
作者
Kitani, Kris M. [1 ]
Sato, Yoichi [1 ]
Sugimoto, Akihiro [2 ]
机构
[1] Univ Tokyo, Inst Ind Sci, Tokyo 1538505, Japan
[2] Natl Inst Informat, Tokyo 1018430, Japan
关键词
Activity learning; stochastic context-free grammars; grammatical inference; minimum description length principle; model selection; noise;
D O I
10.1142/S0218001408006776
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years stochastic context-free grammars have been shown to be effective in modeling human activities because of the hierarchical structures they represent. However, most of the research in this area has yet to address the issue of learning the activity grammars from a noisy input source, namely, video. In this paper, we present a framework for identifying noise and recovering the basic activity grammar from a noisy symbol string produced by video. We identify the noise symbols by finding the set of non-noise symbols that optimally compresses the training data, where the optimality of compression is measured using an MDL criterion. We show the robustness of our system to noise and its effectiveness in learning the basic structure of human activity, through experiments with artificial data and a real video sequence from a local convenience store.
引用
收藏
页码:1621 / 1646
页数:26
相关论文
共 17 条
[1]   TOWARDS A GENERAL-THEORY OF ACTION AND TIME [J].
ALLEN, JF .
ARTIFICIAL INTELLIGENCE, 1984, 23 (02) :123-154
[2]   Introduction to the special section on video surveillance [J].
Collins, RT ;
Lipton, AJ ;
Kanade, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :745-746
[3]  
GHANEM N, 2004, 2 IEEE WORKSH EV MIN, P112
[4]   Recognition of visual activities and interactions by stochastic parsing [J].
Ivanov, YA ;
Bobick, AF .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :852-872
[5]  
Kitani K. M., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P239
[6]  
Minnen D, 2003, PROC CVPR IEEE, P626
[7]  
Moore D, 2002, EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, P770
[8]   On-line and off-line heuristics for inferring hierarchies of repetitions in sequences [J].
Nevill-Manning, CG ;
Witten, IH .
PROCEEDINGS OF THE IEEE, 2000, 88 (11) :1745-1755
[9]   Identifying hierarchical structure in sequences: A linear-time algorithm [J].
NevillManning, CG ;
Witten, IH .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1997, 7 :67-82
[10]  
OGALE AS, 2005, P WORKSH DYN VIS