RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors

被引:44
作者
Chen, Xiaoyu
Hughes, Timothy R.
Morris, Quaid [1 ]
机构
[1] Univ Toronto, Banting & Best Dept Med Res, Toronto, ON, Canada
[2] Univ Toronto, Dept Med Genet & Microbiol, Toronto, ON, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[4] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
10.1093/bioinformatics/btm224
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The sequence specificity of DNA-binding proteins is typically represented as a position weight matrix in which each base position contributes independently to relative affinity. Assessment of the accuracy and broad applicability of this representation has been limited by the lack of extensive DNA- binding data. However, new microarray techniques, in which preferences for all possible K-mers are measured, enable a broad comparison of both motif representation and methods for motif discovery. Here, we consider the problem of accounting for all of the binding data in such experiments, rather than the highest affinity binding data. We introduce the RankMotif++, an algorithm designed for finding motifs whenever sequences are associated with a semi- quantitative measure of protein-DNA-binding affinity. RankMotif++ learns motif models by maximizing the likelihood of a set of binding preferences under a probabilistic model of how sequence binding affinity translates into binding preference observations. Because RankMotif++ makes few assumptions about the relationship between binding affinity and the semi-quantitative readout, it is applicable to a wide variety of experimental assays of DNA-binding preference. Results: By several criteria, RankMotif++ predicts binding affinity better than two widely used motif finding algorithms (MDScan, MatrixREDUCE) or more recently developed algorithms (PREGO, Seed and Wobble), and its performance is comparable to a motif model that separately assigns affinities to 8-mers. Our results validate the PWM model and provide an approximation of the precision and recall that can be expected in a genomic scan.
引用
收藏
页码:I72 / I79
页数:8
相关论文
共 23 条
  • [1] Bailey TL., 1994, P 2 INT C INT SYST M, V2, P28
  • [2] SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS
    BERG, OG
    VONHIPPEL, PH
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) : 723 - 743
  • [3] Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities
    Berger, Michael F.
    Philippakis, Anthony A.
    Qureshi, Aaron M.
    He, Fangxue S.
    Estep, Preston W., III
    Bulyk, Martha L.
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (11) : 1429 - 1435
  • [4] Identifying transcription factor functions and targets by phenotypic activation
    Chua, Gordon
    Morris, Quaid D.
    Sopko, Richelle
    Robinson, Mark D.
    Ryan, Owen
    Chan, Esther T.
    Frey, Brendan J.
    Andrews, Brenda J.
    Boone, Charles
    Hughes, Timothy R.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (32) : 12045 - 12050
  • [5] Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE
    Foat, Barrett C.
    Morozov, Alexandre V.
    Bussemaker, Harmen J.
    [J]. BIOINFORMATICS, 2006, 22 (14) : E141 - E149
  • [6] Profiling condition-specific, genome-wide regulation of mRNA stability in yeast
    Foat, BC
    Houshmandi, SS
    Olivas, WM
    Bussemaker, HJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (49) : 17675 - 17680
  • [7] Explicit equilibrium modeling of transcription-factor binding and gene regulation
    Granek, JA
    Clarke, ND
    [J]. GENOME BIOLOGY, 2005, 6 (10)
  • [8] DIP-chip: Rapid and accurate determination of DNA-binding specificity
    Liu, X
    Noll, DM
    Lieb, JD
    Clarke, ND
    [J]. GENOME RESEARCH, 2005, 15 (03) : 421 - 427
  • [9] Liu X, 2001, Pac Symp Biocomput, P127
  • [10] Whole-genome comparison of Leu3 binding in vitro and in vivo reveals the importance of nucleosome occupancy in target site selection
    Liu, Xiao
    Lee, Cheol-Koo
    Granek, Joshua A.
    Clarke, Neil D.
    Lieb, Jason D.
    [J]. GENOME RESEARCH, 2006, 16 (12) : 1517 - 1528