Characteristic sequences for DNA primary sequence

被引:71
作者
He, PA [1 ]
Wang, J
机构
[1] Dalian Univ Technol, Dept Appl Math, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Coll Adv Sci & Technol, Dalian 116024, Peoples R China
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci010131z
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A DNA sequence can be identified with a word over an alphabet. N = {A, C, G, T}. Characteristic sequences of a DNA sequence are given in term of classifications of bases of nucleic acids. Using the characteristic sequences, we construct a set of 2 x 2 matrices to represent DNA primary sequences, which are based on counting of the frequency of occurrence of all (0,1) triplets of characteristic sequences. Furthermore, the leading eigenvalues of these matrices are computed and considered as invariants for the DNA primary sequences. Similarity and dissimilarity analysis based on the characteristic sequences are given for eight exon-1 genes of beta-globin about eight species.
引用
收藏
页码:1080 / 1085
页数:6
相关论文
共 10 条
[1]  
HAMORI E, 1983, J BIOL CHEM, V258, P1318
[2]  
LEONG PM, 1995, COMPUT APPL BIOSCI, V12, P503
[3]   On the similarity of DNA primary sequences [J].
Randic, M ;
Vracko, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (03) :599-606
[4]   On the characterization of DNA primary sequences by triplet of nucleic acid bases [J].
Randic, M ;
Guo, XF ;
Basak, SC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (03) :619-626
[5]   On 3-D graphical representation of DNA primary sequences and their numerical characterization [J].
Randic, M ;
Vracko, M ;
Nandy, A ;
Basak, SC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (05) :1235-1244
[6]   Condensed representation of DNA primary sequences [J].
Randic, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (01) :50-56
[7]   On characterization of DNA primary sequences by a condensed matrix [J].
Randic, M .
CHEMICAL PHYSICS LETTERS, 2000, 317 (1-2) :29-34
[8]   Indexing scheme and similarity measures for macromolecular sequences [J].
Raychaudhury, C ;
Nandy, A .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (02) :243-247
[9]   A symmetrical theory of DNA sequences and its applications [J].
Zhang, CT .
JOURNAL OF THEORETICAL BIOLOGY, 1997, 187 (03) :297-306
[10]   Z-CURVES, AN INTUTIVE TOOL FOR VISUALIZING AND ANALYZING THE DNA-SEQUENCES [J].
ZHANG, R ;
ZHANG, CT .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1994, 11 (04) :767-782