Finding leading indicators for disease outbreaks: Filtering, cross-correlation, and caveats

被引:20
作者
Bloom, Ronald M.
Buckeridge, David L.
Cheng, Karen E.
机构
[1] McGill Univ, Dept Epidemiol & Biostat, Montreal, PQ H3A 1A3, Canada
[2] Appl Res Associates Inc, Albuquerque, NM USA
关键词
D O I
10.1197/jamia.M2178
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioterrorism and emerging infectious diseases such as influenza have spurred research into rapid outbreak detection. One primary thrust of this research has been to identify data sources that provide early indication of a disease outbreak by being leading indicators relative to other established data sources. Researchers tend to rely on the sample cross-correlation function (CCF) to quantify the association between two data sources. There has been, however, little consideration by medical informatics researchers of the influence of methodological choices on the ability of the CCF to identify a lead-lag relationship between time series. We draw on experience from the econometric and environmental health communities, and we use simulation to demonstrate that the sample CCF is highly prone to bias. Specifically, long-scale phenomena tend to overwhelm the CCF, obscuring phenomena at shorter wave lengths. Researchers seeking lead-lag relationships in surveillance data must therefore stipulate the scale length of the features of interest (e.g., short-scale spikes versus long-scale seasonal fluctuations) and then filter the data appropriately-to diminish the influence of other features, which may mask the features of interest. Otherwise, conclusions drawn from the sample CCF of bi-variate time-series data will inevitably be ambiguous and often altogether misleading.
引用
收藏
页码:76 / 85
页数:10
相关论文
共 36 条
[1]   Time-series studies of particulate matter [J].
Bell, ML ;
Samet, JM ;
Dominici, F .
ANNUAL REVIEW OF PUBLIC HEALTH, 2004, 25 :247-280
[2]  
Bendat J.S., 2000, RANDOM DATA ANAL MEA
[3]  
Box G. E., 1976, TIME SERIES ANAL FOR
[4]   Identifying pediatric age groups for influenza vaccination using a real-time regional surveillance system [J].
Brownstein, JS ;
Kleinman, KP ;
Mandl, KD .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2005, 162 (07) :686-693
[5]  
Buckeridge David L, 2005, MMWR Suppl, V54, P109
[6]   Algorithms for rapid outbreak detection: a research synthesis [J].
Buckeridge, DL ;
Burkom, H ;
Campbell, M ;
Hogan, WR ;
Moore, AW .
JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (02) :99-113
[7]  
BURNSIDE C, 1999, J MONETARY ECON, V41, P513
[8]   Sales of over-the-counter remedies as an early warning system for winter bed crises [J].
Davies, GR ;
Finch, RG .
CLINICAL MICROBIOLOGY AND INFECTION, 2003, 9 (08) :858-863
[9]  
Dominici Francesca, 2004, Res Rep Health Eff Inst, P3
[10]  
Goldberg Mark S, 2003, Rev Environ Health, V18, P269