MSCAN:: identification of functional clusters of transcription factor binding sites

被引:50
作者
Alkema, WBL
Johansson, Ö
Lagergren, J
Wasserman, WW [1 ]
机构
[1] Univ British Columbia, British Columbia Childrens Hosp, Dept Med Genet, Ctr Mol Med & Therapeut, Vancouver, BC V5Z 4H4, Canada
[2] Karolinska Inst, Ctr Genom & Bioinformat, SE-17177 Stockholm, Sweden
[3] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[4] KTH, Stockholm Bioinformat Ctr, SE-10044 Stockholm, Sweden
[5] KTH, Dept Numer Anal & Comp Sci, SE-10044 Stockholm, Sweden
基金
加拿大健康研究院;
关键词
D O I
10.1093/nar/gkh387
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identification of functional transcription factor binding sites in genomic sequences is notoriously difficult. The critical problem is the low specificity of predictions, which directly reflects the low target specificity of DNA binding proteins. To overcome the noise produced in predictions of individual binding sites, a new generation of algorithms achieves better predictive specificity by focusing on locally dense clusters of binding sites. MSCAN is a leading method for binding site cluster detection that determines the significance of observed sites while correcting for local compositional bias of sequences. The algorithm is highly flexible, applying any set of input binding models to the analysis of a user-specified sequence. From the user's perspective, a key feature of the system is that no reference data sets of regulatory sequences from co-regulated genes are required to train the algorithm. The output from MSCAN consists of an ordered list of sequence segments that contain potential regulatory modules. We have chosen the features in MSCAN such that sequence and matrix retrieval is highly facilitated, resulting in a web server that is intuitive to use. MSCAN is available at http://mscan.cgb.ki.se/cgi-bin/MSCAN.
引用
收藏
页码:W195 / W198
页数:4
相关论文
共 13 条
[1]   Computational detection of cis-regulatory modules [J].
Aerts, Stein ;
Van Loo, Peter ;
Thijs, Gert ;
Moreau, Yves ;
De Moor, Bart .
BIOINFORMATICS, 2003, 19 :II5-II14
[2]   Searching for statistically significant regulatory modules [J].
Bailey, Timothy L. ;
Noble, William Stafford .
BIOINFORMATICS, 2003, 19 :II16-II25
[3]   Ensembl 2004 [J].
Birney, E ;
Andrews, D ;
Bevan, P ;
Caccamo, M ;
Cameron, G ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Durbin, R ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, H ;
Iyer, V ;
Kahari, A ;
Jekosch, K ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, C ;
Clamp, M ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D468-D470
[4]  
Davidson E. H., 2001, Genomic regulatory systems: development and evolution
[5]   Cluster-Buster: finding dense clusters of motifs in DNA sequences [J].
Frith, MC ;
Li, MC ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3666-3668
[6]   Identification of functional clusters of transcription factor binding motifs in genome sequences: the MSCAN algorithm [J].
Johansson, Oe. ;
Alkema, W. ;
Wasserman, W. W. ;
Lagergren, J. .
BIOINFORMATICS, 2003, 19 :i169-i176
[7]   TFBS: Computational framework for transcription factor binding site analysis [J].
Lenhard, B ;
Wasserman, WW .
BIOINFORMATICS, 2002, 18 (08) :1135-1136
[8]   TRANSFAC®:: transcriptional regulation, from patterns to profiles [J].
Matys, V ;
Fricke, E ;
Geffers, R ;
Gössling, E ;
Haubrock, M ;
Hehl, R ;
Hornischer, K ;
Karas, D ;
Kel, AE ;
Kel-Margoulis, OV ;
Kloos, DU ;
Land, S ;
Lewicki-Potapov, B ;
Michael, H ;
Münch, R ;
Reuter, I ;
Rotert, S ;
Saxel, H ;
Scheer, M ;
Thiele, S ;
Wingender, E .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :374-378
[9]   SCORE: A computational approach to the identification of cis-regulatory modules and target genes in whole-genome sequence data [J].
Rebeiz, M ;
Reeves, NL ;
Posakony, JW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (15) :9888-9893
[10]   JASPAR:: an open-access database for eukaryotic transcription factor binding profiles [J].
Sandelin, A ;
Alkema, W ;
Engström, P ;
Wasserman, WW ;
Lenhard, B .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D91-D94