COMPRESSION OF WISWESSER LINE NOTATIONS USING VARIETY GENERATION

被引:9
作者
COOPER, D [1 ]
LYNCH, MF [1 ]
机构
[1] UNIV SHEFFIELD, POSTGRAD SCH LIBRARIANSHIP & INFORMAT SCI, SHEFFIELD S10 2TN, S YORKSHIRE, ENGLAND
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1979年 / 19卷 / 03期
关键词
D O I
10.1021/ci60019a011
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The use of variety generation for reversible text compression is described briefly, and it is shown how the technique may be applied to compress Wiswesser Line Notations. The notations may be compressed, using 8-bit codes to represent variable-length character strings, to occupy an average of just under 3.6 bits per original character, an improvement of just over 55% on a fixed-length representation using 8 bits per character. This is similar to the amount of compression given by the same technique on natural language texts. © 1979, American Chemical Society. All rights reserved.
引用
收藏
页码:165 / 169
页数:5
相关论文
共 14 条
[1]  
ASH JE, 1975, CHEM INFORMATION SYS
[2]  
BARTON IJ, 1974, INFORMATICS I, P154
[3]  
EMLY MA, 1978, THESIS U SHEFFIELD
[4]   A METHOD FOR THE CONSTRUCTION OF MINIMUM-REDUNDANCY CODES [J].
HUFFMAN, DA .
PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1952, 40 (09) :1098-1101
[5]   VARIETY GENERATION - REINTERPRETATION OF SHANNONS MATHEMATICAL-THEORY OF COMMUNICATION, AND ITS IMPLICATIONS FOR INFORMATION-SCIENCE [J].
LYNCH, MF .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1977, 28 (01) :19-25
[6]  
METCALFE GN, 1977, FILE COMPRESSION REP
[7]   COMPARISON OF ALGORITHMS FOR DATA BASE COMPRESSION BY USE OF FRAGMENTS AS LANGUAGE ELEMENTS [J].
SCHUEGRAF, EJ ;
HEAPS, HS .
INFORMATION STORAGE AND RETRIEVAL, 1974, 10 (9-10) :309-319
[8]  
SCHUEGRAF EJ, 1977, CAN J INFORM SCI, V2, P93
[9]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (03) :379-423
[10]  
Smith E.G., 1975, WISWESSER LINE FORMU