Aligning a DNA sequence with a protein sequence

被引:25
作者
Zhang, Z [1 ]
Pearson, WR [1 ]
Miller, W [1 ]
机构
[1] UNIV VIRGINIA,DEPT BIOCHEM,CHARLOTTESVILLE,VA 22908
关键词
D O I
10.1089/cmb.1997.4.339
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We develop several algorithms for the problem of aligning DNA sequence with a protein sequence, Our methods account for frameshift errors, but not for introns in the DNA sequence, Thus, they are particularly appropriate for comparing a cDNA sequence that suffers from sequencing errors with an amino acid sequence or a protein sequence database, We describe algorithms for computing optimal alignments for several definitions of DNA-protein alignment, verify sufficient conditions for equivalence of certain definitions, describe techniques for efficient implementation, and discuss experience with these ideas in a new release of the FASTA suite of database-searching programs.
引用
收藏
页码:339 / 349
页数:11
相关论文
共 11 条
[1]  
[Anonymous], P RECOMB 97 1 INT C
[2]  
CHAO KM, 1992, COMPUT APPL BIOSCI, V8, P481
[3]   AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES [J].
GOTOH, O .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) :705-708
[4]  
Guan XJ, 1996, COMPUT APPL BIOSCI, V12, P31
[5]   AN ALGORITHM COMBINING DNA AND PROTEIN ALIGNMENT [J].
HEIN, J .
JOURNAL OF THEORETICAL BIOLOGY, 1994, 167 (02) :169-174
[6]  
HEIN J, 1994, J MOL EVOL, V38, P310, DOI 10.1007/BF00176094
[7]   LINEAR SPACE ALGORITHM FOR COMPUTING MAXIMAL COMMON SUBSEQUENCES [J].
HIRSCHBERG, DS .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :341-343
[8]  
Huang XQ, 1996, COMPUT APPL BIOSCI, V12, P497
[9]  
Knecht L, 1995, LECT NOTES COMPUT SC, V937, P215
[10]   IMPROVED TOOLS FOR BIOLOGICAL SEQUENCE COMPARISON [J].
PEARSON, WR ;
LIPMAN, DJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (08) :2444-2448