DBDtaxonomically broad transcription factor predictions: new content and functionality

被引:214
作者
Wilson, Derek [1 ]
Charoensawan, Varodom [1 ]
Kummerfeld, Sarah K. [2 ]
Teichmann, Sarah A. [1 ]
机构
[1] MRC, Mol Biol Lab, Cambridge CB2 0QH, England
[2] Stanford Univ, Med Ctr, Dept Dev Biol, Stanford, CA 94305 USA
基金
英国医学研究理事会;
关键词
D O I
10.1093/nar/gkm964
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA-binding domain (DBD) is a database of predicted sequence-specific DNA-binding transcription factors (TFs) for all publicly available proteomes. The proteomes have increased from 150 in the initial version of DBD to over 700 in the current version. All predicted TFs must contain a significant match to a hidden Markov model representing a sequence-specific DNA-binding domain family. Access to TF predictions is provided through http://transcriptionfactor.org, where new search options are now provided such as searching by gene names in model organisms, searching for all proteins in a particular DBD family and specific organism. We illustrate the application of this type of search facility by contrasting trends of DBD family occurrence throughout the tree of life, highlighting the clear partition between eukaryotic and prokaryotic DBD expansions. The website content has been expanded to include dedicated pages for each TF containing domain assignment details, gene names, links to external databases and links to TFs with similar domain arrangements. We compare the increase in number of predicted TFs with proteome size in eukaryotes and prokaryotes. Eukaryotes follow a slower rate of increase in TFs than prokaryotes, which could be due to the presence of splice variants or an increase in combinatorial control.
引用
收藏
页码:D88 / D92
页数:5
相关论文
共 24 条
[1]   FlyTF:: a systematic review of site-specific transcription factors in the fruit fly Drosophila melanogaster [J].
Adryan, Boris ;
Teichmann, Sarah A. .
BIOINFORMATICS, 2006, 22 (12) :1532-1533
[2]   Convergent evolution of gene networks by single-gene duplications in higher eukaryotes [J].
Amoutzias, GD ;
Robertson, DL ;
Oliver, SG ;
Bornberg-Bauer, E .
EMBO REPORTS, 2004, 5 (03) :274-279
[3]   Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains [J].
Balaji, S ;
Babu, MM ;
Iyer, LM ;
Aravind, L .
NUCLEIC ACIDS RESEARCH, 2005, 33 (13) :3994-4006
[4]   A kingdom-level phylogeny of eukaryotes based on combined protein data [J].
Baldauf, SL ;
Roger, AJ ;
Wenk-Siefert, I ;
Doolittle, WF .
SCIENCE, 2000, 290 (5493) :972-977
[5]   EDGEdb:: a transcription factor-DNA Interaction database for the analysis of C-elegans differential gene expression [J].
Barrasa, M. Inmaculada ;
Vaglio, Philippe ;
Cavasino, Fabien ;
Jacotot, Laurent ;
Walhout, Albertha J. M. .
BMC GENOMICS, 2007, 8 (1)
[6]  
Bulyk ML, 2004, GENOME BIOL, V5
[7]   Activation of the ethylene gas response pathway in Arabidopsis by the nuclear protein ETHYLENE-INSENSITIVE3 and related proteins [J].
Chao, QM ;
Rothenberg, M ;
Solano, R ;
Roman, G ;
Terzaghi, W ;
Ecker, JR .
CELL, 1997, 89 (07) :1133-1144
[8]   Hox9 genes and vertebrate limb specification [J].
Cohn, MJ ;
Patel, K ;
Krumlauf, R ;
Wilkinson, DG ;
Clarke, JDW ;
Tickle, C .
NATURE, 1997, 387 (6628) :97-101
[9]   Enhanced protein domain discovery by using language modeling techniques from speech recognition [J].
Coin, L ;
Bateman, A ;
Durbin, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (08) :4516-4520
[10]  
*DROS COMP GEN SEQ, 2007, IN PRESS NATURE