Quantifying similarity between motifs

Cited by: 1364
Authors
Gupta, Shobhit
Stamatoyannopoulos, John A.
Bailey, Timothy L.
Noble, William Stafford
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
[3] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98105 USA
DOI
10.1186/gb-2007-8-2-r24
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
A common question within the context of de novo motif discovery is whether a newly discovered, putative motif resembles any previously discovered motif in an existing database. To answer this question, we define a statistical measure of motif-motif similarity, and we describe an algorithm, called Tomtom, for searching a database of motifs with a given query motif. Experimental simulations demonstrate the accuracy of Tomtom's E values and its effectiveness in finding similar motifs.
Pages: 9