Compressed domain video indexing techniques using DCT and motion vector information in MPEG video

被引：52

作者：

Kobla, V

Doermann, D

Lin, KID

Faloutsos, C

机构：

来源：

STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V | 1997年 / 3022卷

关键词：

compressed domain analysis; video indexing; video retrieval; FastMap; MPEG; DCT;

D O I：

10.1117/12.263408

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Development of various multimedia applications hinges on the availability of fast and efficient storage, brews browsing, indexing, and retrieval techniques. Given that video is typically stored efficiently in a compressed format, if we can analyze the compressed representation directly, we can avoid the costly overhead of decompressing and operating at the pixel level. Compressed domain parsing of video has been presented in earlier work where a video clip is divided into shots, subshots, and scenes.(9,11) In this paper, we describe key frame selection, feature extraction, and indexing and retrieval techniques that are directly applicable to MPEG compressed video. We develop a frame-type independent representation of the various types of frames present in an MPEG video in which all frames can be considered equivalent. Features are derived from the available DCT, macroblock, and motion vector information and mapped to a low-dimensional space where they can be accessed with standard database techniques. The spatial information is used as primacy index while the temporal information is used to enhance the robustness of the system during the retrieval process. The techniques presented enable fast archiving, indexing, and retrieval of video. Our operational prototype typically takes a fraction of a second to retrieve similar video scenes from our database, with over 95% success.

引用

页码：200 / 211

页数：2