Detecting and quantifying causal associations in large nonlinear time series datasets

被引:547
作者
Runge, Jakob [1 ,2 ]
Nowack, Peer [2 ,3 ,4 ]
Kretschmer, Marlene [5 ,9 ]
Flaxman, Seth [4 ,6 ]
Sejdinovic, Dino [7 ,8 ]
机构
[1] German Aerosp Ctr, Inst Data Sci, D-07745 Jena, Germany
[2] Imperial Coll, Grantham Inst, London SW7 2AZ, England
[3] Imperial Coll, Dept Phys, Blackett Lab, London SW7 2AZ, England
[4] Imperial Coll, Data Sci Inst, London SW7 2AZ, England
[5] Potsdam Inst Climate Impact Res, D-14473 Potsdam, Germany
[6] Imperial Coll, Dept Math, London SW7 2AZ, England
[7] Alan Turing Inst Data Sci, London NW1 3DB, England
[8] Univ Oxford, Dept Stat, Oxford OX1 3LB, England
[9] Univ Reading, Dept Meteorol, Whiteknights Rd, Reading RG6 6BG, Berks, England
基金
英国工程与自然科学研究理事会;
关键词
GRANGER-CAUSALITY; SOUTHERN OSCILLATION; CONSISTENCY; REGRESSION; DISCOVERY; COUPLINGS; INFERENCE; SELECTION; FEEDBACK; LASSO;
D O I
10.1126/sciadv.aau4996
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields.
引用
收藏
页数:15
相关论文
共 84 条
[1]  
Alexander MA, 2002, J CLIMATE, V15, P2205, DOI 10.1175/1520-0442(2002)015<2205:TABTIO>2.0.CO
[2]  
2
[3]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P171
[4]   A robust method for detecting interdependences: application to intracranially recorded EEG [J].
Arnhold, J ;
Grassberger, P ;
Lehnertz, K ;
Elger, CE .
PHYSICA D-NONLINEAR PHENOMENA, 1999, 134 (04) :419-430
[5]   A contribution to attribution of recent global warming by out-of-sample Granger causality analysis [J].
Attanasio, Alessandro ;
Pasini, Antonello ;
Triacca, Umberto .
ATMOSPHERIC SCIENCE LETTERS, 2012, 13 (01) :67-72
[6]  
Bach FR, 2008, J MACH LEARN RES, V9, P1179
[7]   Granger causality for state-space models [J].
Barnett, Lionel ;
Seth, Anil K. .
PHYSICAL REVIEW E, 2015, 91 (04)
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   Complex brain networks: graph theoretical analysis of structural and functional systems [J].
Bullmore, Edward T. ;
Sporns, Olaf .
NATURE REVIEWS NEUROSCIENCE, 2009, 10 (03) :186-198
[10]   On posterior consistency in nonparametric regression problems [J].
Choi, Taeryon ;
Schervish, Mark J. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (10) :1969-1987