DCT-based motion estimation

被引:29
作者
Koc, UV
Liu, KJR
机构
[1] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
[2] Univ Maryland, Dept Elect Engn, College Pk, MD 20742 USA
[3] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
discrete cosine transform; motion estimation; shift measurement; time delay estimation; video coding;
D O I
10.1109/83.701146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose novel discrete cosine transform (DCT) pseudophase techniques to estimate shift/delay between two one-dimensional (1-D) signals directly from their DCT coefficients by computing the pseudophase shift hidden in DCT and then employing the sinusoidal orthogonal principles, applicable to signal delay estimation remote sensing. Under the two-dimensional (2-D) translational motion model, we further extend the pseudophase techniques to the DCT-based motion estimation (DXT-ME) algorithm for 2-D signals/images. The DXT-ME algorithm has certain advantages over the commonly used full search block-matching approach (BKM-ME) for application to video coding despite certain limitations. In addition to its robustness in a noisy environment and low computational complexity, O(M-2) for an M x M search range in comparison to the O(N-2 . M-2) complexity of BKM-ME for an N x N block, its ability to estimate motion completely in DCT domain makes possible the fully DCT-based motion-compensated video coder structure, which has only one major component in the feedback loop instead of three as in the conventional hybrid video coder design, and thus results in a higher system throughput. Furthermore, combination of the DCT and motion estimation units can provide space for further optimization of the overall coder. In addition, the DXT-ME algorithm has solely highly parallel local operations and this property makes feasible parallel implementation suitable for very large scale integration (VLSI) design. Simulation on a number of video sequences is presented with comparison to BKM-ME and other fast block search algorithms for video coding applications even though DXT-ME is completely different from any block search algorithms.
引用
收藏
页码:948 / 965
页数:18
相关论文
共 41 条
[1]   ON THE COMPUTATION OF MOTION FROM SEQUENCES OF IMAGES - A REVIEW [J].
AGGARWAL, JK ;
NANDHAKUMAR, N .
PROCEEDINGS OF THE IEEE, 1988, 76 (08) :917-935
[2]  
[Anonymous], IEEE T CIRCUITS SYST
[3]  
*CCITT, 1990, H261 CCITT
[4]  
*CCITT, 1993, MPEG1 CCITT
[5]   MANIPULATION AND COMPOSITING OF MC-DCT COMPRESSED VIDEO [J].
CHANG, SF ;
MESSERSCHMITT, DG .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1995, 13 (01) :1-11
[6]   Real-Time Parallel and Fully Pipelined Two-Dimensional DCT Lattice Structures with Application to HDTV Systems [J].
Chiu, Ching-Te ;
Liu, K. J. Ray .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1992, 2 (01) :25-37
[7]   FOURIER PHASE-SHIFT LOCATION ESTIMATION OF UNFOCUSED OPTICAL POINT-SPREAD FUNCTIONS [J].
FILLARD, JP ;
LUSSERT, JM ;
CASTAGNE, M ;
MTIMET, H .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 1994, 6 (04) :281-287
[8]   THE CROSS-SEARCH ALGORITHM FOR MOTION ESTIMATION [J].
GHANBARI, M .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1990, 38 (07) :950-953
[9]   MOTION-COMPENSATING PREDICTION WITH FRACTIONAL-PEL ACCURACY [J].
GIROD, B .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1993, 41 (04) :604-612
[10]  
Heeger D. J., 1987, Proceedings of the First International Conference on Computer Vision (Cat. No.87CH2465-3), P181