Mining ChIP-chip data for transcription factor and cofactor binding sites

被引:54
作者
Smith, AD
Sumazin, P
Das, D
Zhang, MQ
机构
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[2] Portland State Univ, Dept Comp Sci, Portland, OR 97207 USA
关键词
D O I
10.1093/bioinformatics/bti1043
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identification of single motifs and motif pairs that can be used to predict transcription factor localization in ChIP-chip data, and gene expression in tissue-specific microarray data. Results: We describe methodology to identify de novo individual and interacting pairs of binding site motifs from ChIP-chip data, using an algorithm that integrates localization data directly into the motif discovery process. We combine matrix-enumeration based motif discovery with multivariate regression to evaluate candidate motifs and identify motif interactions. When applied to the HNF localization data in liver and pancreatic islets, our methods produce motifs that are either novel or improved known motifs. All motif pairs identified to predict localization are further evaluated according to how well they predict expression in liver and islets and according to how conserved are the relative positions of their occurrences. We find that interaction models of HNF1 and CDP motifs provide excellent prediction of both HNF1 localization and gene expression in liver. Our results demonstrate that ChIP-chip data can be used to identify interacting binding site motifs. Availability: Motif discovery programs and analysis tools are available on request from the authors. Contact: asmith@cshl.edu.
引用
收藏
页码:I403 / I412
页数:10
相关论文
共 40 条
[1]   A NEW BIPARTITE DNA-BINDING DOMAIN - COOPERATIVE INTERACTION BETWEEN THE CUT REPEAT AND HOMEO DOMAIN OF THE CUT HOMEO PROTEINS [J].
ANDRES, V ;
CHIARA, MD ;
MAHDAVI, V .
GENES & DEVELOPMENT, 1994, 8 (02) :245-257
[2]   The nuclear matrix protein CDP represses hepatic transcription of the human cholesterol-7α hydroxylase gene [J].
Antes, TJ ;
Chen, J ;
Cooper, AD ;
Levy-Wilson, B .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (34) :26649-26660
[3]   Finding motifs using random projections [J].
Buhler, J ;
Tompa, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :225-242
[4]   Regulatory element detection using correlation with expression [J].
Bussemaker, HJ ;
Li, H ;
Siggia, ED .
NATURE GENETICS, 2001, 27 (02) :167-171
[5]   The enhanceosome and transcriptional synergy [J].
Carey, M .
CELL, 1998, 92 (01) :5-8
[6]   Integrating regulatory motif discovery and genome-wide expression analysis [J].
Conlon, EM ;
Liu, XS ;
Lieb, JD ;
Liu, JS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (06) :3339-3344
[7]   Interacting models of cooperative gene regulation [J].
Das, D ;
Banerjee, N ;
Zhang, MQ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (46) :16234-16239
[8]  
DAS D, 2005, UNPUB ADAPTIVELY INF
[9]   Transcriptional up-regulation of the delayed early gene HRS/SRp40 during liver regeneration -: Interactions among YY1, GA-binding proteins, and mitogenic signals [J].
Du, KY ;
Leu, JI ;
Peng, Y ;
Taub, R .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (52) :35208-35215
[10]  
ESKIN E, 2004, P 8 ANN INT C COMP M, P115