Calign: aligning sequences with restricted affine gap penalties

被引:3
作者
Chao, KM [1 ]
机构
[1] Providence Univ, Dept Comp Sci & Informat Management, Taichung 43309, Taiwan
关键词
D O I
10.1093/bioinformatics/15.4.298
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Given a genomic DNA sequence, it is still an open problem to determine its coding regions, i.e. the region consisting of exons and introns. The comparison of cDNA and genomic DNA helps the understanding of coding regions. For such an application, it might be adequate to use the restricted affine gap penalties which penalize long gaps with a constant penalty. Results: Several techniques developed for solving the approximate string-matching problem are employed to yield efficient algorithms for computing the optimal alignment with restricted affine gap penalties. In particular, efficient algorithms can be derived based on the suffix automaton with failure transitions an on the diagonalwise monotonicity of the cost tables. We have implemented the above methods in C on Sun workstations running SunOS Unix. Preliminary experiments show that these approaches are very promising for aligning a cDNA sequence with a genomic DNA sequence.
引用
收藏
页码:298 / 304
页数:7
相关论文
共 36 条
[11]   Gene recognition via spliced sequence alignment [J].
Gelfand, MS ;
Mironov, AA ;
Pevzner, PA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (17) :9061-9066
[12]  
GOTOH O, 1990, B MATH BIOL, V52, P359, DOI 10.1016/S0092-8240(05)80216-2
[13]   AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES [J].
GOTOH, O .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) :705-708
[14]   GLOBIN GENE SERVER - A PROTOTYPE E-MAIL DATABASE SERVER FEATURING EXTENSIVE MULTIPLE ALIGNMENTS AND DATA COMPILATION FOR ELECTRONIC GENETIC-ANALYSIS [J].
HARDISON, R ;
CHAO, KM ;
SCHWARTZ, S ;
STOJANOVIC, N ;
GANETSKY, M ;
MILLER, W .
GENOMICS, 1994, 21 (02) :344-353
[15]  
HUANG XQ, 1994, COMPUT APPL BIOSCI, V10, P227
[16]   AN APPROXIMATE STRING-MATCHING ALGORITHM [J].
KIM, JY ;
SHAWETAYLOR, J .
THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) :107-117
[17]  
LANDAU GM, 1988, COMPUT APPL BIOSCI, V4, P19
[18]  
Lewin B., 1994, GENES
[19]   An O(ND) Difference Algorithm and Its Variations [J].
Myers, Eugene W. .
ALGORITHMICA, 1986, 1 (1-4) :251-266
[20]  
MYERS EW, 1989, ACM T PROGR LANG SYS, V11, P33, DOI 10.1145/59287.59290