COMMON PROPERTIES OF SOME MULTIATTRIBUTE FILE SYSTEMS

被引:29
作者
LIN, WC
LEE, RCT
DU, HC
机构
[1] Rex Company, Olivette Agent, Taipei, Taiwan Republic of China
[2] Institute of Computer and Decision Sciences, National Tsing Hua University, Hsinchu Taiwan, Republic of China
[3] Department of Computer Science, University of Washington
关键词
clustering; Index Terms-Best match; multidimensional directory; multikey hashing; multikey sorting; partial match; partial match pattern; shortest spanning path; string homomorphism hashing; symbolic error correcting codes;
D O I
10.1109/TSE.1979.234172
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper results from an attempt to unify several different file system design theories. We define a term “partial match pattern” and show that in order to produce file systems optimal with respect to partial match patterns, both the multikey hashing (MKH) method [16] and the multidimensional directory (MDD) method [11] must be in such a form that the number of subdivisions is the same for all domains of keys. We show the conditions for the string homomorphism hashing (SHH) method [15], the MKH method, and the MDD method to be equivalent to one another. We define the so-called Cartesian product files and show that if all records are present, the records in a Cartesian product file form a shortest spanning path in which the Hamming distance between every pair of consecutive records is 1. Thus the SHH method, the MKH method, the MDD method, and the multi-key sorting (MKS) method [10] are linked together. Finally, we show that for both partial and best match queries, the file systems exhibit a common characteristic: Similar records are grouped together. Copyright © 1979 by The Institute of Electrical and Electronics Engineers, Inc.
引用
收藏
页码:160 / 174
页数:15
相关论文
共 19 条
[1]  
Bentley J.L., Multidimensional binary search threes used for associative searching, Commun. Ass. Comput. Mach., 18, pp. 509-516, (1975)
[2]  
Burkhard W.A., Keller R.M., Some approaches to best-match match file searching, Commun. Ass. Comput. Mach., 16, pp. 230-236, (1973)
[3]  
Duda R., Hart P., Pattern Recognition and Scene Analysis, (1973)
[4]  
Friedman J.H., Baskett F., Shustek L.J., An algorithm for finding nearest neighbors, IEEE Trans. Comput., C-24, pp. 1000-1006, (1975)
[5]  
Friedman J.H., Bentley J.L., Finkel R.A., An algorithm for finding best matches in logarithmic expected time, ACM Trans. Mathematical Software, 3, pp. 209-226, (1977)
[6]  
Fukunaga K., Introduction to Statistical Pattern Recognition, (1972)
[7]  
Fukunaga K., Narendra P.M., A branch and bound algorithm for computing k-nearest neighbors, IEEE Trans. Comput., C-24, pp. 750-753, (1975)
[8]  
Hsieh C.T., Lee R.C.T., Application of symbolic error correcting code for nearest neighbor searching, 1976 Proc. Nat. Comput. Symp. Republic of China, pp. 6.7-6.14, (1976)
[9]  
Lee R.C.T., Chin Y.H., Chang S.C., Application of principal component analysis to multi-key searching, IEEE Trans. Software Eng., SE-2, pp. 185-193, (1976)
[10]  
Lee R.C.T., Tseng S.H., Multi-key sorting, 1977 Proc. Int. Comput. Symp., 2, pp. 737-753, (1977)