A novel approach to the detection of genomic approximate tandem repeats in the levenshtein metric

被引:15
作者
Domanic, Nevzat Onur [1 ]
Preparata, Franco P. [1 ]
机构
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
关键词
algorithms; genomic repeat; tandem repeat; repeat finding; Hamming metric; Levenshtein metric; TRIPLET REPEAT; GENE; ALGORITHM; REPETITIONS; SEQUENCES; DISEASE; ASSOCIATION; SEARCH;
D O I
10.1089/cmb.2007.0018
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
An efficient algorithm for detecting approximate tandem repeats in genomic sequences is presented. The algorithm is based on innovative statistical criteria to detect candidate regions which may include tandem repeats; these regions are subsequently verified by alignments based on dynamic programming. No prior information about the period size or pattern is needed. Also, the algorithm is virtually capable of detecting repeats with any period. An implementation of the algorithm is compared with the two state-of-the-art tandem repeats detection tools to demonstrate its effectiveness both on natural and synthetic data. The algorithm is available at www.cs.brown.edu/people/domanic/tandem/.
引用
收藏
页码:873 / 891
页数:19
相关论文
共 45 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]  
[Anonymous], GENES
[3]   OPTIMAL PARALLEL DETECTION OF SQUARES IN STRINGS [J].
APOSTOLICO, A .
ALGORITHMICA, 1992, 8 (04) :285-319
[4]   OPTIMAL OFF-LINE DETECTION OF REPETITIONS IN A STRING [J].
APOSTOLICO, A ;
PREPARATA, FP .
THEORETICAL COMPUTER SCIENCE, 1983, 22 (03) :297-315
[5]   TOWARD A UNIFIED APPROACH TO GENETIC-MAPPING OF EUKARYOTES BASED ON SEQUENCE TAGGED MICROSATELLITE SITES [J].
BECKMANN, JS ;
SOLLER, M .
BIO-TECHNOLOGY, 1990, 8 (10) :930-932
[6]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[7]   A SPACE EFFICIENT ALGORITHM FOR FINDING THE BEST NONOVERLAPPING ALIGNMENT SCORE [J].
BENSON, G .
THEORETICAL COMPUTER SCIENCE, 1995, 145 (1-2) :357-369
[8]   HIGH-RESOLUTION OF HUMAN EVOLUTIONARY TREES WITH POLYMORPHIC MICROSATELLITES [J].
BOWCOCK, AM ;
RUIZLINARES, A ;
TOMFOHRDE, J ;
MINCH, E ;
KIDD, JR ;
CAVALLISFORZA, LL .
NATURE, 1994, 368 (6470) :455-457
[9]  
Butler J.M., 2001, FORENSIC DNA TYPING
[10]   Friedreich's ataxia: Autosomal recessive disease caused by an intronic GAA triplet repeat expansion [J].
Campuzano, V ;
Montermini, L ;
Molto, MD ;
Pianese, L ;
Cossee, M ;
Cavalcanti, F ;
Monros, E ;
Rodius, F ;
Duclos, F ;
Monticelli, A ;
Zara, F ;
Canizares, J ;
Koutnikova, H ;
Bidichandani, SI ;
Gellera, C ;
Brice, A ;
Trouillas, P ;
DeMichele, G ;
Filla, A ;
DeFrutos, R ;
Palau, F ;
Patel, PI ;
DiDonato, S ;
Mandel, JL ;
Cocozza, S ;
Koenig, M ;
Pandolfo, M .
SCIENCE, 1996, 271 (5254) :1423-1427