Coding limits on the number of transcription factors

Cited by: 60
Authors
Itzkovitz, Shalev
Tlusty, Tsvi
Alon, Uri [1]
Affiliations
[1] Weizmann Inst Sci, Dept Mol Cell Biol, IL-76100 Rehovot, Israel
[2] Weizmann Inst Sci, Dept Phys Complex Syst, IL-76100 Rehovot, Israel
DOI
10.1186/1471-2164-7-239
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Transcription factor proteins bind specific DNA sequences to control the expression of genes. They contain DNA-binding domains that belong to several super-families, each with a specific mechanism of DNA binding. The total number of transcription factors encoded in a genome increases with the number of genes in the genome. Here, we examined the number of transcription factors from each super-family in diverse organisms.
Results: We find that the number of transcription factors from most super-families appears to be bounded. For example, the number of winged helix factors does not generally exceed 300, even in very large genomes. The maximal number of transcription factors from each super-family seems to correlate with the number of DNA bases effectively recognized by the binding mechanism of that super-family. Coding theory predicts that such upper bounds on the number of transcription factors should exist, in order to minimize cross-binding errors between transcription factors. This theory further predicts that factors with similar binding sequences should tend to have similar biological effects, so that errors due to mis-recognition have minimal impact. We present evidence that transcription factors with similar binding sequences tend to regulate genes with similar biological functions, supporting this prediction.
Conclusion: The present study suggests limits on the transcription factor repertoire of cells, and suggests coding constraints that might apply more generally to the mapping between binding sites and biological function.
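The coding-theory argument in the abstract can be illustrated with a textbook sphere-packing (Hamming) bound. The sketch below is only an illustration and is not the model used in the paper: it assumes binding sequences are strings of n effectively recognized bases over a 4-letter alphabet, and that cross-binding is avoided when any two sequences differ in at least `min_distance` positions. The function names and the choice of `min_distance` are hypothetical.

```python
from math import comb


def hamming_ball_size(n: int, radius: int, alphabet: int = 4) -> int:
    """Count length-n sequences within Hamming distance `radius` of a fixed sequence."""
    return sum(comb(n, k) * (alphabet - 1) ** k for k in range(radius + 1))


def sphere_packing_bound(n: int, min_distance: int, alphabet: int = 4) -> int:
    """Upper bound on the number of length-n sequences that pairwise differ
    in at least `min_distance` positions (classical Hamming bound)."""
    # Balls of this radius around distinct codewords cannot overlap.
    radius = (min_distance - 1) // 2
    return alphabet ** n // hamming_ball_size(n, radius, alphabet)


if __name__ == "__main__":
    # Assumed illustration values: n effective bases, minimum distance 3.
    for n in (4, 6, 8, 10):
        print(n, sphere_packing_bound(n, min_distance=3))
```

Under these assumptions the bound grows with n, which is consistent with the abstract's observation that super-families recognizing more bases tend to reach larger maximal sizes.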
Pages: 15