Performance comparison of algorithms for finding transcription factor binding sites

被引:12
作者
Sinha, S [1 ]
Tompa, M [1 ]
机构
[1] Rockefeller Univ, New York, NY 10021 USA
来源
THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS | 2003年
关键词
D O I
10.1109/BIBE.2003.1188949
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We compare the accuracy of three motif-finding algorithms for the discovery of novel transcription factor binding sites among co-regulated genes. One of the algorithms (YMF) uses a motif model tailored for binding sites and an enumerative search of the motif space, while the other two (MEME and AlignACE) use a more general motif model and local search techniques. The comparison is done on synthetic data with planted motifs, as well as on real data sets of co-regulated genes from the yeast S. cerevisiae. More often than not, the enumerative algorithm is found to be more accurate than the other two on the yeast data sets, though there is a noticeable exclusivity in the accuracy of the different algorithms. The experiments on synthetic data reveal, not surprisingly, that each algorithm outperforms the others when motifs are planted according to its motif model.
引用
收藏
页码:214 / 220
页数:7
相关论文
共 13 条
[1]  
Blanchette M, 2001, Bioinformatics, V17 Suppl 1, pS30
[2]  
BLANCHETTE M, 2003, 3 IEEE S BIOINF BIOE
[3]   Finding motifs using random projections [J].
Buhler, J ;
Tompa, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :225-242
[4]   Identifying DNA and protein patterns with statistically significant alignments of multiple sequences [J].
Hertz, GZ ;
Stormo, GD .
BIOINFORMATICS, 1999, 15 (7-8) :563-577
[5]   DETECTING SUBTLE SEQUENCE SIGNALS - A GIBBS SAMPLING STRATEGY FOR MULTIPLE ALIGNMENT [J].
LAWRENCE, CE ;
ALTSCHUL, SF ;
BOGUSKI, MS ;
LIU, JS ;
NEUWALD, AF ;
WOOTTON, JC .
SCIENCE, 1993, 262 (5131) :208-214
[6]  
Pevzner P A, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P269
[7]   Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation [J].
Roth, FP ;
Hughes, JD ;
Estep, PW ;
Church, GM .
NATURE BIOTECHNOLOGY, 1998, 16 (10) :939-945
[8]   Discovery of novel transcription factor binding sites by statistical overrepresentation [J].
Sinha, S ;
Tompa, M .
NUCLEIC ACIDS RESEARCH, 2002, 30 (24) :5549-5560
[9]  
Sinha S, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P344
[10]  
Timothy L. B., 1994, P 2 INT C INT SYST M, P28