THE COMPLEXITY OF INFORMATION EXTRACTION

被引:10
作者
ABUMOSTAFA, YS [1 ]
机构
[1] CALTECH, DEPT COMP SCI, PASADENA, CA 91125 USA
关键词
COMPUTER METATHEORY - Boolean Functions - DECISION THEORY AND ANALYSIS - PROBABILITY - Random Processes;
D O I
10.1109/TIT.1986.1057209
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to determine the difficulty of decision problems based on natural data, decision problems are characterized by four measures defined on a Boolean function f of N variables: the implementation cost C(f), the randomness R(f), the deterministic entropy H(f), and the complexity K(f). It is shown that: 1) C(f) approximately equals R(f) approximately equals H(f) approximately equals K(f), all measured in bits; 2) decision problems based on natural data are partially random (in the Kolmogorov sense) and have low entropy with respect to their dimensionality, and the relations between the four measures translate to lower and upper bounds on the cost of solving these problems; and 3) allowing small errors in the implementation of f saves a lot in the low-entropy case but saves nothing in the high-entropy case. If f is partially structured, the implementation cost is reduced substantially.
引用
收藏
页码:513 / 525
页数:13
相关论文
共 28 条
[11]  
KNUTH DE, 1966, FIBONACCI QUART, V4
[12]  
Kohavi Z., 1978, SWITCHING FINITE AUT
[13]   3 APPROACHES TO QUANTITATIVE DEFINITION OF INFORMATION [J].
KOLMOGOROV, AN .
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1968, 2 (02) :157-+
[14]  
Liu C. L., 1968, INTRO COMBINATORIAL
[15]  
LUPANOV O, 1960, PROBL CYBERN, V3, P782
[16]   DEFINITION OF RANDOM SEQUENCES [J].
MARTINLOF, P .
INFORMATION AND CONTROL, 1966, 9 (06) :602-+
[17]  
MCELIECE RJ, 1977, THEORY INFORMATION C
[18]  
Mead C., 1980, INTRO VLSI SYSTEMS
[19]  
Peatman J. B., 1980, DIGITAL HARDWARE DES
[20]  
Peterson WW, 1972, ERROR CORRECTING COD