CIS:: compound importance sampling method for protein-DNA binding site p-value estimation

被引:17
作者
Barash, Y
Elidan, G
Kaplan, T
Friedman, N [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
[2] Hebrew Univ Jerusalem, Hadassah Med Sch, IL-91120 Jerusalem, Israel
基金
以色列科学基金会;
关键词
D O I
10.1093/bioinformatics/bti041
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A key aspect of transcriptional regulation is the binding of transcription factors to sequence-specific binding sites that allow them to modulate the expression of nearby genes. Given models of such binding sites, one can scan regulatory regions for putative binding sites and construct a genome-wide regulatory network. In such genome-wide scans, it is crucial to control the amount of false positive predictions. Recently, several works demonstrated the benefits of modeling dependencies between positions within the binding site. Yet, computing the statistical significance of putative binding sites in this scenario remains a challenge. Results: We present a general, accurate and efficient method for computing p-values of putative binding sites that is applicable to a large class of probabilistic binding site and background models. We demonstrate the accuracy of the method on synthetic and real-life data.
引用
收藏
页码:596 / 600
页数:5
相关论文
共 15 条
[1]  
[Anonymous], 1979, Monte Carlo Methods, DOI DOI 10.1007/978-94-009-5819-7
[2]   Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[3]  
Barash Y., 2003, P 7 ANN INT C COMP M, P28
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Chen MH, 1997, ANN STAT, V25, P1563
[6]  
Gelman A, 1998, STAT SCI, V13, P163
[7]   Determination of local statistical significance of patterns in Markov sequences with application to promoter element identification [J].
Huang, HY ;
Kao, MCH ;
Zhou, XH ;
Liu, JS ;
Wong, WH .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (01) :1-14
[8]  
Jensen FV., 1996, INTRO BAYESIAN NETWO INTRO BAYESIAN NETWO
[9]   A non-parametric model for transcription factor binding sites [J].
King, OD ;
Roth, FP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (19) :E116
[10]   Transcriptional regulatory networks in Saccharomyces cerevisiae [J].
Lee, TI ;
Rinaldi, NJ ;
Robert, F ;
Odom, DT ;
Bar-Joseph, Z ;
Gerber, GK ;
Hannett, NM ;
Harbison, CT ;
Thompson, CM ;
Simon, I ;
Zeitlinger, J ;
Jennings, EG ;
Murray, HL ;
Gordon, DB ;
Ren, B ;
Wyrick, JJ ;
Tagne, JB ;
Volkert, TL ;
Fraenkel, E ;
Gifford, DK ;
Young, RA .
SCIENCE, 2002, 298 (5594) :799-804