The degenerate primer design problem: Theory and applications

被引:58
作者
Linhart, C [1 ]
Shamir, R [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
关键词
degenerate primers for PCR; complexity; NP-hardness; approximation algorithms; olfactory receptor genes;
D O I
10.1089/cmb.2005.12.431
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A PCR primer sequence is called degenerate if some of its positions have several possible bases. The degeneracy of the primer is the number of unique sequence combinations it contains. We study the problem of designing a pair of primers with prescribed degeneracy that match a maximum number of given input sequences. Such problems occur when studying a family of genes that is known only in part, or is known in a related species. We prove that various simplified versions of the problem are hard, show the polynomiality of some restricted cases, and develop approximation algorithms for one variant. Based on these algorithms, we implemented a program called HYDEN for designing highly degenerate primers for a set of genomic sequences. We report on the success of the program in several applications, one of which is an experimental scheme for identifying all human olfactory receptor ( OR) genes. In that project, HYDEN was used to design primers with degeneracies up to 10(10) that amplified with high specificity many novel genes of that family, tripling the number of OR genes known at the time.
引用
收藏
页码:431 / 456
页数:26
相关论文
共 42 条
[1]  
[Anonymous], 1985, BIOCHEM J, V229, P281
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[3]  
BAILEY TL, 1995, MACH LEARN, V21, P51, DOI 10.1007/BF00993379
[4]  
Blum M., 1973, Journal of Computer and System Sciences, V7, P448, DOI 10.1016/S0022-0000(73)80033-9
[5]   A NOVEL MULTIGENE FAMILY MAY ENCODE ODORANT RECEPTORS - A MOLECULAR-BASIS FOR ODOR RECOGNITION [J].
BUCK, L ;
AXEL, R .
CELL, 1991, 65 (01) :175-187
[6]   Finding motifs using random projections [J].
Buhler, J ;
Tompa, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :225-242
[7]  
Clark MD, 1999, METHOD ENZYMOL, V303, P205
[8]  
DOI K, 1997, P 8 WORKSH GEN INF G, P43
[9]   Selecting the median [J].
Dor, D ;
Zwick, U .
SIAM JOURNAL ON COMPUTING, 1999, 28 (05) :1722-1758
[10]   DEFOG: A practical scheme for deciphering families of genes [J].
Fuchs, T ;
Malecova, B ;
Linhart, C ;
Sharan, R ;
Khen, M ;
Herwig, R ;
Shmulevich, D ;
Elkon, R ;
Steinfath, M ;
O'Brien, JK ;
Radelof, U ;
Lehrach, H ;
Lancet, D ;
Shamir, R .
GENOMICS, 2002, 80 (03) :295-302