Randomness and degrees of irregularity

被引:370
作者
Pincus, S [1 ]
Singer, BH [1 ]
机构
[1] PRINCETON UNIV,OFF POPULAT RES,PRINCETON,NJ 08544
关键词
approximate entropy; computable; maximally random sequences; normal numbers; chaos;
D O I
10.1073/pnas.93.5.2083
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The fundamental question ''Are sequential data random?'' arises in myriad contexts, often with severe data length constraints. Furthermore, there is frequently a critical need to delineate nonrandom sequences in terms of closeness to randomness-e.g., to evaluate the efficacy of therapy in medicine, We address both these issues from a computable framework via a quantification of regularity, ApEn (approximate entropy), defining maximal randomness for sequences of arbitrary length, indicating the applicability to sequences as short as N = 5 points. An infinite sequence formulation of randomness is introduced that retains the operational (and computable) features of the finite case. In the infinite sequence setting, we indicate how the ''foundational'' definition of independence in probability theory, and the definition of normality in number theory, reduce to limit theorems without rates of convergence, from which we utilize ApEn to address rates of convergence (of a deficit from maximal randomness), refining the aforementioned concepts in a computationally essential manner. Representative applications among many are indicated to assess (i) random number generation output; (ii) well-shuffled arrangements; and (iii) (the quality of) bootstrap replicates.
引用
收藏
页码:2083 / 2088
页数:6
相关论文
共 42 条
[1]   SHUFFLING CARDS AND STOPPING-TIMES [J].
ALDOUS, D ;
DIACONIS, P .
AMERICAN MATHEMATICAL MONTHLY, 1986, 93 (05) :333-348
[2]  
[Anonymous], 1933, Grundbegriffe der Wahrscheinlichkeitstheorie, Ergebnisse der Mathematik
[3]  
BAILEY RA, 1983, BIOMETRIKA, V70, P183
[4]   NOTE ON THE USE OF SHERMAN STATISTIC AS A TEST FOR RANDOMNESS [J].
BARTHOLOMEW, DJ .
BIOMETRIKA, 1954, 41 (3-4) :556-558
[5]   EVOLUTIONARY SELECTION AGAINST CHANGE IN MANY ALU REPEAT SEQUENCES INTERSPERSED THROUGH PRIMATE GENOMES [J].
BRITTEN, RJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (13) :5992-5996
[6]   THEORY OF PROGRAM SIZE FORMALLY IDENTICAL TO INFORMATION-THEORY [J].
CHAITIN, GJ .
JOURNAL OF THE ACM, 1975, 22 (03) :329-340
[7]   ON LENGTH OF PROGRAMS FOR COMPUTING FINITE BINARY SEQUENCES [J].
CHAITIN, GJ .
JOURNAL OF THE ACM, 1966, 13 (04) :547-+
[8]  
COVER TM, 1989, ANN PROBAB, V17, P863
[9]  
Diaconis P., 1984, STAT APPL NEW DIRECT, P205
[10]  
Diaconis P., 1983, Adv. Appl. Math., V4, P175, DOI 10.1016/0196-8858(83)90009-X