Information about the binding preferences of many transcription factors is known and characterized by a sequence binding motif. However, determining regions of the genome in which a transcription factor binds based on its motif is a challenging problem, particularly in species with large genomes, since there are often many sequences containing matches to the motif but are not bound. Several rules based on sequence conservation or location, relative to a transcription start site, have been proposed to help differentiate true binding sites from random ones. Other evidence sources may also be informative for this task. We developed a method for integrating multiple evidence sources using logistic regression classifiers. Our method works in two steps. First, we infer a score quantifying the general binding preferences of transcription factor binding at all locations based on a large set of evidence features, without using any motif specific information. Then, we combined this general binding preference score with motif information for specific transcription factors to improve prediction of regions bound by the factor. Using cross-validation and new experimental data we show that, surprisingly, the general binding preference can be highly predictive of true locations of transcription factor binding even when no binding motif is used. When combined with motif information our method outperforms previous methods for predicting locations of true binding.
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Lupien, Mathieu
Eeckhoute, Jerome
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Eeckhoute, Jerome
Meyer, Clifford A.
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Meyer, Clifford A.
Wang, Qianben
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Wang, Qianben
Zhang, Yong
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Zhang, Yong
Li, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Li, Wei
Carroll, Jason S.
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Carroll, Jason S.
Liu, X. Shirley
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
机构:
Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USAPenn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Miller, Webb
Rosenbloom, Kate
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Rosenbloom, Kate
Hardison, Ross C.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Hardison, Ross C.
Hou, Minmei
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Hou, Minmei
Taylor, James
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Taylor, James
Raney, Brian
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Raney, Brian
Burhans, Richard
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Burhans, Richard
King, David C.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
King, David C.
Baertsch, Robert
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Baertsch, Robert
Blankenberg, Daniel
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Blankenberg, Daniel
Pond, Sergei L. Kosakovsky
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Pond, Sergei L. Kosakovsky
Nekrutenko, Anton
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Nekrutenko, Anton
Giardine, Belinda
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Giardine, Belinda
Harris, Robert S.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Harris, Robert S.
Diekhans, Svitlana Tyekucheva Mark
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Diekhans, Svitlana Tyekucheva Mark
Diekhans, Mark
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Diekhans, Mark
Pringle, Thomas H.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Pringle, Thomas H.
Murphy, William J.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Murphy, William J.
Lesk, Arthur
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lesk, Arthur
Weinstock, George M.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Weinstock, George M.
Lindblad-Toh, Kerstin
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lindblad-Toh, Kerstin
Gibbs, Richard A.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Gibbs, Richard A.
Lander, Eric S.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lander, Eric S.
Siepel, Adam
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Siepel, Adam
Haussler, David
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Haussler, David
Kent, W. James
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Lupien, Mathieu
Eeckhoute, Jerome
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Eeckhoute, Jerome
Meyer, Clifford A.
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Meyer, Clifford A.
Wang, Qianben
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Wang, Qianben
Zhang, Yong
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Zhang, Yong
Li, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Li, Wei
Carroll, Jason S.
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
Harvard Univ, Sch Med, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
Carroll, Jason S.
Liu, X. Shirley
论文数: 0引用数: 0
h-index: 0
机构:
Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USADana Farber Canc Inst, Dept Med Oncol, Div Mol & Cellular Oncol, Boston, MA 02115 USA
机构:
Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USAPenn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Miller, Webb
Rosenbloom, Kate
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Rosenbloom, Kate
Hardison, Ross C.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Hardison, Ross C.
Hou, Minmei
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Hou, Minmei
Taylor, James
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Taylor, James
Raney, Brian
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Raney, Brian
Burhans, Richard
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Burhans, Richard
King, David C.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
King, David C.
Baertsch, Robert
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Baertsch, Robert
Blankenberg, Daniel
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Blankenberg, Daniel
Pond, Sergei L. Kosakovsky
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Pond, Sergei L. Kosakovsky
Nekrutenko, Anton
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Nekrutenko, Anton
Giardine, Belinda
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Giardine, Belinda
Harris, Robert S.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Harris, Robert S.
Diekhans, Svitlana Tyekucheva Mark
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Diekhans, Svitlana Tyekucheva Mark
Diekhans, Mark
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Diekhans, Mark
Pringle, Thomas H.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Pringle, Thomas H.
Murphy, William J.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Murphy, William J.
Lesk, Arthur
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lesk, Arthur
Weinstock, George M.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Weinstock, George M.
Lindblad-Toh, Kerstin
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lindblad-Toh, Kerstin
Gibbs, Richard A.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Gibbs, Richard A.
Lander, Eric S.
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Lander, Eric S.
Siepel, Adam
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Siepel, Adam
Haussler, David
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Haussler, David
Kent, W. James
论文数: 0引用数: 0
h-index: 0
机构:Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA