Sequence alignment with tandem duplication

被引:45
作者
Benson, G
机构
关键词
D O I
10.1089/cmb.1997.4.351
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Algorithm development for comparing and aligning biological sequences has, until recently, been based on the SI model of mutational events which assumes that modification of sequences proceeds through any of the operations of Substitution, insertion or deletion (the latter two collectively termed indels). While this model has worked fairly well, it has long been apparent that other mutational events occur. In this paper, we introduce a new model, the DSI model which includes another common mutational event, tandem duplication. Tandem duplication produces tandem repeats which are common in DNA, making up perhaps 10% of the human genome. They are responsible for some human diseases and may serve a multitude of functions in DNA regulation and evolution, Using the DSI model, we develop new exact and heuristic algorithms for comparing and aligning DNA sequences when they contain tandem repeats.
引用
收藏
页码:351 / 367
页数:17
相关论文
共 32 条
[1]   Minisatellite diversity supports a recent African origin for modern humans [J].
Armour, JAL ;
Anttinen, T ;
May, CA ;
Vega, EE ;
Sajantila, A ;
Kidd, JR ;
Kidd, KK ;
Bertranpetit, J ;
Paabo, S ;
Jeffreys, AJ .
NATURE GENETICS, 1996, 13 (02) :154-160
[2]   A METHOD FOR FAST DATABASE SEARCH FOR ALL K-NUCLEOTIDE REPEATS [J].
BENSON, G ;
WATERMAN, MS .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4828-4836
[3]  
BENSON G, 1995, THEORETICAL COMPUTER, V15, P357
[4]  
BENSON G, 1996, UNPUB ALGORITHM FIND
[5]   Friedreich's ataxia: Autosomal recessive disease caused by an intronic GAA triplet repeat expansion [J].
Campuzano, V ;
Montermini, L ;
Molto, MD ;
Pianese, L ;
Cossee, M ;
Cavalcanti, F ;
Monros, E ;
Rodius, F ;
Duclos, F ;
Monticelli, A ;
Zara, F ;
Canizares, J ;
Koutnikova, H ;
Bidichandani, SI ;
Gellera, C ;
Brice, A ;
Trouillas, P ;
DeMichele, G ;
Filla, A ;
DeFrutos, R ;
Palau, F ;
Patel, PI ;
DiDonato, S ;
Mandel, JL ;
Cocozza, S ;
Koenig, M ;
Pandolfo, M .
SCIENCE, 1996, 271 (5254) :1423-1427
[6]   GENETIC-VARIATION AT 5 TRIMERIC AND TETRAMERIC TANDEM REPEAT LOCI IN 4 HUMAN-POPULATION GROUPS [J].
EDWARDS, A ;
HAMMOND, HA ;
JIN, L ;
CASKEY, CT ;
CHAKRABORTY, R .
GENOMICS, 1992, 12 (02) :241-253
[7]  
FISCHETTI VA, 1992, LECT NOTES COMPUT SC, V644, P111
[8]   AN UNSTABLE TRIPLET REPEAT IN A GENE RELATED TO MYOTONIC MUSCULAR-DYSTROPHY [J].
FU, YH ;
PIZZUTI, A ;
FENWICK, RG ;
KING, J ;
RAJNARAYAN, S ;
DUNNE, PW ;
DUBEL, J ;
NASSER, GA ;
ASHIZAWA, T ;
DEJONG, P ;
WIERINGA, B ;
KORNELUK, R ;
PERRYMAN, MB ;
EPSTEIN, HF ;
CASKEY, CT .
SCIENCE, 1992, 255 (5049) :1256-1258
[9]   AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES [J].
GOTOH, O .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) :705-708
[10]   ENHANCED GENE-EXPRESSION BY THE POLY(DT-DG) . POLY(DC-DA) SEQUENCE [J].
HAMADA, H ;
SEIDMAN, M ;
HOWARD, BH ;
GORMAN, CM .
MOLECULAR AND CELLULAR BIOLOGY, 1984, 4 (12) :2622-2630