Novel techniques of graphical representation and analysis of DNA sequences - A review

被引:87
作者
Roy, A
Raychaudhury, C
Nandy, A
机构
[1] Indian Inst Chem Biol, Comp Div, Kolkata 700032, W Bengal, India
[2] Indian Inst Chem Biol, Appl Biochem Div, Kolkata 700032, W Bengal, India
关键词
sequence visualization; graphical representation; graphical analysis;
D O I
10.1007/BF02728525
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The advent of automated DNA sequencing techniques has led to an explosive growth in the number and length of DNAs sequenced from different organisms. While this has resulted in a large accumulation of data in the DNA databases, it has also called for the development of suitable techniques for rapid viewing and analysis of the data. Over the last few years several methods have been proposed that address these issues and represent a DNA sequence in a compact graphical form in one-, two-or three-dimensions that can be expanded as necessary to help visualize the patterns in gene sequences and aid in in-depth analysis. Graphical techniques have been found to be useful in highlighting focal and global base dominances, to identify regions of extensive repetitive sequences, differentiate between coding and non-coding regions, and to be indicative of evolutionary divergences. Analysis with graphical methods have also provided insights into new structures in DNA sequences such as fractals and long range correlations, and some measures have been developed that help quantify the visual patterns. This review presents a comprehensive study of the graphical representation methods and their applications in viewing and analysing long DNA sequences and evaluates the merits of each of these from a practical viewpoint with prescriptions on domains of applicability of each method. A discussion on the comparative merits and demerits of the various methods and possible future developments have also been included.
引用
收藏
页码:55 / 71
页数:17
相关论文
共 64 条
[1]  
BARANIDHARAN S, 1994, INT J GENOME RES, V1, P309
[2]   SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS [J].
BERG, OG ;
VONHIPPEL, PH .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) :723-743
[3]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[4]   DNA-SEQUENCE SELECTION BY EYE [J].
BURDON, MG .
NATURE, 1984, 312 (5992) :313-313
[5]   GENOME ANALYSIS - A NEW APPROACH FOR VISUALIZATION OF SEQUENCE ORGANIZATION IN GENOMES [J].
BURMA, PK ;
RAJ, A ;
DEB, JK ;
BRAHMACHARI, SK .
JOURNAL OF BIOSCIENCES, 1992, 17 (04) :395-411
[6]   LONG-RANGE CORRELATIONS IN DNA [J].
CHATZIDIMITRIOUDREISMANN, CA ;
LARHAMMAR, D .
NATURE, 1993, 361 (6409) :212-213
[7]   SEQUENCE LANDSCAPES [J].
CLIFT, B ;
HAUSSLER, D ;
MCCONNELL, R ;
SCHNEIDER, TD ;
STORMO, GD .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :141-158
[8]   MATHEMATICAL CHARACTERIZATION OF CHAOS GAME REPRESENTATION - NEW ALGORITHMS FOR NUCLEOTIDE-SEQUENCE ANALYSIS [J].
DUTTA, C ;
DAS, J .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 228 (03) :715-719
[9]   AN IMPROVED METHOD OF TESTING FOR EVOLUTIONARY HOMOLOGY [J].
FITCH, WM .
JOURNAL OF MOLECULAR BIOLOGY, 1966, 16 (01) :9-&
[10]   WHOLE-GENOME RANDOM SEQUENCING AND ASSEMBLY OF HAEMOPHILUS-INFLUENZAE RD [J].
FLEISCHMANN, RD ;
ADAMS, MD ;
WHITE, O ;
CLAYTON, RA ;
KIRKNESS, EF ;
KERLAVAGE, AR ;
BULT, CJ ;
TOMB, JF ;
DOUGHERTY, BA ;
MERRICK, JM ;
MCKENNEY, K ;
SUTTON, G ;
FITZHUGH, W ;
FIELDS, C ;
GOCAYNE, JD ;
SCOTT, J ;
SHIRLEY, R ;
LIU, LI ;
GLODEK, A ;
KELLEY, JM ;
WEIDMAN, JF ;
PHILLIPS, CA ;
SPRIGGS, T ;
HEDBLOM, E ;
COTTON, MD ;
UTTERBACK, TR ;
HANNA, MC ;
NGUYEN, DT ;
SAUDEK, DM ;
BRANDON, RC ;
FINE, LD ;
FRITCHMAN, JL ;
FUHRMANN, JL ;
GEOGHAGEN, NSM ;
GNEHM, CL ;
MCDONALD, LA ;
SMALL, KV ;
FRASER, CM ;
SMITH, HO ;
VENTER, JC .
SCIENCE, 1995, 269 (5223) :496-512