Finding regulatory DNA motifs using alignment-free evolutionary conservation information

被引:30
作者
Gordan, Raluca [1 ]
Narlikar, Leelavati [1 ]
Hartemink, Alexander J. [1 ]
机构
[1] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
SACCHAROMYCES-CEREVISIAE; TRANSCRIPTION FACTORS; BINDING SPECIFICITY; GENE-EXPRESSION; GIBBS SAMPLER; YEAST; SEQUENCE; ELEMENTS; GENOMES; PROTEIN;
D O I
10.1093/nar/gkp1166
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do.
引用
收藏
页码:e90.1 / e90.12
页数:12
相关论文
共 54 条
[1]   A Library of Yeast Transcription Factor Motifs Reveals a Widespread Function for Rsc3 in Targeting Nucleosome Exclusion at Promoters [J].
Badis, Gwenael ;
Chan, Esther T. ;
van Bakel, Harm ;
Pena-Castillo, Lourdes ;
Tillo, Desiree ;
Tsui, Kyle ;
Carlson, Clayton D. ;
Gossett, Andrea J. ;
Hasinoff, Michael J. ;
Warren, Christopher L. ;
Gebbia, Marinella ;
Talukder, Shaheynoor ;
Yang, Ally ;
Mnaimneh, Sanie ;
Terterov, Dimitri ;
Coburn, David ;
Yeo, Ai Li ;
Yeo, Zhen Xuan ;
Clarke, Neil D. ;
Lieb, Jason D. ;
Ansari, Aseem Z. ;
Nislow, Corey ;
Hughes, Timothy R. .
MOLECULAR CELL, 2008, 32 (06) :878-887
[2]  
Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
[3]   FootPrinter: a program designed for phylogenetic footprinting [J].
Blanchette, M ;
Tompa, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3840-3842
[4]   Integration of external signaling pathways with the core transcriptional network in embryonic stem cells [J].
Chen, Xi ;
Xu, Han ;
Yuan, Ping ;
Fang, Fang ;
Huss, Mikael ;
Vega, Vinsensius B. ;
Wong, Eleanor ;
Orlov, Yuriy L. ;
Zhang, Weiwei ;
Jiang, Jianming ;
Loh, Yuin-Han ;
Yeo, Hock Chuan ;
Yeo, Zhen Xuan ;
Narang, Vipin ;
Govindarajan, Kunde Ramamoorthy ;
Leong, Bernard ;
Shahab, Atif ;
Ruan, Yijun ;
Bourque, Guillaume ;
Sung, Wing-Kin ;
Clarke, Neil D. ;
Wei, Chia-Lin ;
Ng, Huck-Hui .
CELL, 2008, 133 (06) :1106-1117
[5]   Genome-wide regulatory complexity in yeast promoters: Separation of functionally conserved and neutral sequence [J].
Chin, CS ;
Chuang, JH ;
Li, H .
GENOME RESEARCH, 2005, 15 (02) :205-213
[6]   Regulation of mating and filamentation genes by two distinct Ste12 complexes in Saccharomyces cerevisiae [J].
Chou, Song ;
Lane, Shelley ;
Liu, Haoping .
MOLECULAR AND CELLULAR BIOLOGY, 2006, 26 (13) :4794-4805
[7]  
Clark A., 2003, PROPOSAL DROSOPHILA
[8]   Evolution of genes and genomes on the Drosophila phylogeny [J].
Clark, Andrew G. ;
Eisen, Michael B. ;
Smith, Douglas R. ;
Bergman, Casey M. ;
Oliver, Brian ;
Markow, Therese A. ;
Kaufman, Thomas C. ;
Kellis, Manolis ;
Gelbart, William ;
Iyer, Venky N. ;
Pollard, Daniel A. ;
Sackton, Timothy B. ;
Larracuente, Amanda M. ;
Singh, Nadia D. ;
Abad, Jose P. ;
Abt, Dawn N. ;
Adryan, Boris ;
Aguade, Montserrat ;
Akashi, Hiroshi ;
Anderson, Wyatt W. ;
Aquadro, Charles F. ;
Ardell, David H. ;
Arguello, Roman ;
Artieri, Carlo G. ;
Barbash, Daniel A. ;
Barker, Daniel ;
Barsanti, Paolo ;
Batterham, Phil ;
Batzoglou, Serafim ;
Begun, Dave ;
Bhutkar, Arjun ;
Blanco, Enrico ;
Bosak, Stephanie A. ;
Bradley, Robert K. ;
Brand, Adrianne D. ;
Brent, Michael R. ;
Brooks, Angela N. ;
Brown, Randall H. ;
Butlin, Roger K. ;
Caggese, Corrado ;
Calvi, Brian R. ;
de Carvalho, A. Bernardo ;
Caspi, Anat ;
Castrezana, Sergio ;
Celniker, Susan E. ;
Chang, Jean L. ;
Chapple, Charles ;
Chatterji, Sourav ;
Chinwalla, Asif ;
Civetta, Alberto .
NATURE, 2007, 450 (7167) :203-218
[9]   Finding functional features in Saccharomyces genomes by phylogenetic footprinting [J].
Cliften, P ;
Sudarsanam, P ;
Desikan, A ;
Fulton, L ;
Fulton, B ;
Majors, J ;
Waterston, R ;
Cohen, BA ;
Johnston, M .
SCIENCE, 2003, 301 (5629) :71-76
[10]   Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis [J].
Cliften, PF ;
Hillier, LW ;
Fulton, L ;
Graves, T ;
Miner, T ;
Gish, WR ;
Waterston, RH ;
Johnston, M .
GENOME RESEARCH, 2001, 11 (07) :1175-1186