Statistical analysis of large DNA sequences using distribution of DNA words

Cited by: 4
Authors
Chaudhuri, P [1]
Das, S [1]
Affiliations
[1] Indian Stat Inst, Theoret Stat & Mat Unit, Kolkata 700035, W Bengal, India
Source
CURRENT SCIENCE | 2001 / Vol. 80 / No. 09
Keywords
DOI
Not available
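Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09
Abstract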
Conventional sequence alignment techniques, designed for comparing and analysing relatively small DNA sequences of nearly equal size, are not applicable to data consisting of large sequences of widely varying sizes. In this article, DNA sequences are analysed using the distributions of DNA words. DNA word frequencies are simple yet effective statistical tools for capturing information about structural patterns, and they can reveal biologically significant features in DNA sequences. Our analysis demonstrates how such simple statistical summaries of large DNA data sets make it possible to detect the structural signature of a genome and to identify phylogenetic relationships among species, as reflected in the variation of word distributions in their DNA sequences.
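As an illustration of the idea of a DNA word distribution, the following minimal Python sketch counts overlapping words of length k and compares two sequences of different lengths. The function names `word_distribution` and `l1_distance`, the default `k=3`, and the choice of the L1 distance are assumptions made for this example; the abstract does not specify the word lengths or the dissimilarity measure used in the article.

```python
from collections import Counter
from itertools import product


def word_distribution(seq, k=3):
    """Relative frequencies of all overlapping length-k words over A, C, G, T.

    Hypothetical helper for illustration; windows containing any other
    symbol (for example N) are ignored.
    """
    seq = seq.upper()
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Enumerate all 4**k possible words so that absent words get frequency 0.
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[w] for w in words)
    if total == 0:
        raise ValueError("sequence contains no valid word of length k")
    return {w: counts[w] / total for w in words}


def l1_distance(p, q):
    """Sum of absolute differences between two word distributions.

    Assumed dissimilarity measure, chosen only for simplicity.
    """
    return sum(abs(p[w] - q[w]) for w in p)


if __name__ == "__main__":
    # Two toy sequences of different lengths; no alignment is required.
    a = word_distribution("ACGTACGTACGTTTGA", k=2)
    b = word_distribution("ACGTACGAACGTACGTTTGACCA", k=2)
    print(round(l1_distance(a, b), 4))
```

Because each sequence is reduced to a fixed-length vector of 4^k frequencies, sequences of very different sizes can be compared directly, which is the property the abstract contrasts with alignment-based methods.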
Pages: 1161-1166
Number of pages: 6