Supporting content-based searches on time series via approximation

被引:32
作者
Wang, CZ [1 ]
Wang, XS [1 ]
机构
[1] George Mason Univ, Dept Informat & Software Engn, Fairfax, VA 22030 USA
来源
12TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS | 2000年
关键词
D O I
10.1109/SSDM.2000.869779
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fast retrieval of time series in terms of their contents is important in many application domains. This paper studies database techniques supporting fast searches for time series whose contents are similar to what users specify. The content types studied include shapes, trends, cyclic components, autocorrelation functions and partial autocorrelation functions. Due to the complex nature of the similarity searches involving such contents, traditional database techniques usually cannot provide a fast response when the involved data volume is high. This paper hence proposes to answer such content-based queries using appropriate approximation techniques. The paper then introduces two specific approximation methods, one is wavelet based and the other line-fitting based. Finally, the paper reports some experiments conducted on a stock price data set as well as a synthesised random walk data set, and shows that both approximation methods significantly reduce the query processing time without introducing intolerable errors.
引用
收藏
页码:69 / 81
页数:13
相关论文
共 33 条
[1]  
AGRAWAL R, 1995, P 21 VLDB C ZUR SWIT
[2]  
[Anonymous], 4 INT C FDN DAT ORG
[3]  
[Anonymous], P ACM SIGMOD INT C M
[4]  
[Anonymous], P 21 INT C VER LARG
[5]  
[Anonymous], P ACM SIG MOD INT C
[6]  
ASRAR G, 1995, 1995 MTPE EOS REFERE
[7]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[8]  
BOZKAYA T, 1997, P ACM SIGMOD INT C M, P357
[9]  
Ciaccia P, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P426
[10]  
Diggle P. J., 1990, TIME SERIES