Calign: aligning sequences with restricted affine gap penalties

被引:3
作者
Chao, KM [1 ]
机构
[1] Providence Univ, Dept Comp Sci & Informat Management, Taichung 43309, Taiwan
关键词
D O I
10.1093/bioinformatics/15.4.298
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Given a genomic DNA sequence, it is still an open problem to determine its coding regions, i.e. the region consisting of exons and introns. The comparison of cDNA and genomic DNA helps the understanding of coding regions. For such an application, it might be adequate to use the restricted affine gap penalties which penalize long gaps with a constant penalty. Results: Several techniques developed for solving the approximate string-matching problem are employed to yield efficient algorithms for computing the optimal alignment with restricted affine gap penalties. In particular, efficient algorithms can be derived based on the suffix automaton with failure transitions an on the diagonalwise monotonicity of the cost tables. We have implemented the above methods in C on Sun workstations running SunOS Unix. Preliminary experiments show that these approaches are very promising for aligning a cDNA sequence with a genomic DNA sequence.
引用
收藏
页码:298 / 304
页数:7
相关论文
共 36 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   FAST STRING-MATCHING WITH MISMATCHES [J].
BAEZAYATES, RA ;
GONNET, GH .
INFORMATION AND COMPUTATION, 1994, 108 (02) :187-199
[4]  
Chao KM, 1997, COMPUT APPL BIOSCI, V13, P75
[5]  
CHAO KM, 1995, COMPUT APPL BIOSCI, V11, P147
[6]   LINEAR-SPACE ALGORITHMS THAT BUILD LOCAL ALIGNMENTS FROM FRAGMENTS [J].
CHAO, KM ;
MILLER, W .
ALGORITHMICA, 1995, 13 (1-2) :106-134
[7]   SPEEDING-UP 2 STRING-MATCHING ALGORITHMS [J].
CROCHEMORE, M ;
CZUMAJ, A ;
GASIENIEC, L ;
JAROMINEK, S ;
LECROQ, T ;
PLANDOWSKI, W ;
RYTTER, W .
ALGORITHMICA, 1994, 12 (4-5) :247-267
[8]   ANALYSIS OF THE ESCHERICHIA-COLI GENOME - DNA-SEQUENCE OF THE REGION FROM 84.5 TO 86.5 MINUTES [J].
DANIELS, DL ;
PLUNKETT, G ;
BURLAND, V ;
BLATTNER, FR .
SCIENCE, 1992, 257 (5071) :771-778
[9]   A FAST ALGORITHM FOR STRING-MATCHING WITH MISMATCHES [J].
DERMOUCHE, A .
INFORMATION PROCESSING LETTERS, 1995, 55 (02) :105-110
[10]  
FORREST GL, 1991, MOL PHARMACOL, V40, P502