Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains

被引:367
作者
Balaji, S [1 ]
Babu, MM [1 ]
Iyer, LM [1 ]
Aravind, L [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
关键词
D O I
10.1093/nar/gki709
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The comparative genomics of apicomplexans, such as the malarial parasite Plasmodium, the cattle parasite Theileria and the emerging human parasite Cryptosporidium, have suggested an unexpected paucity of specific transcription factors (TFs) with DNA binding domains that are closely related to those found in the major families of TFs from other eukaryotes. This apparent lack of specific TFs is paradoxical, given that the apicomplexans show a complex developmental cycle in one or more hosts and a reproducible pattern of differential gene expression in course of this cycle. Using sensitive sequence profile searches, we show that the apicomplexans possess a lineage-specific expansion of a novel family of proteins with a version of the AP2 (Apetala2)integrase DNA binding domain, which is present in numerous plant TFs. About 20-27 members of this apicomplexan AP2 (ApiAP2) family are encoded in different apicomplexan genomes, with each protein containing one to four copies of the AP2 DNA binding domain. Using gene expression data from Plasmodium falciparum, we show that guilds of ApiAP2 genes are expressed in different stages of intraerythrocytic development. By analogy to the plant AP2 proteins and based on the expression patterns, we predict that the ApiAP2 proteins are likely to function as previously unknown specific TFs in the apicomplexans and regulate the progression of their developmental cycle. In addition to the ApiAP2 family, we also identified two other novel families of AP2 DNA binding domains in bacteria and transposons. Using structure similarity searches, we also identified divergent versions of the AP2-integrase DNA binding domain fold in the DNA binding region Of the PI-Scel homing endonuclease and the C-terminal domain of the pleckstrin homology (PH) domain-like modules of eukaryotes. Integrating these findings, we present a reconstruction of the evolutionary scenario of the AP2-integrase DNA binding domain fold, which suggests that it underwent multiple independent combinations with different types of mobile endonucleases or recombinases. It appears that the eukaryotic versions have emerged from versions of the domain associated with mobile elements, followed by independent lineage-specific expansions, which accompanied their recruitment to transcription regulation functions.
引用
收藏
页码:3994 / 4006
页数:13
相关论文
共 71 条
[1]   A novel mode of DNA recognition by a β-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA [J].
Allen, MD ;
Yamasaki, K ;
Ohme-Takagi, M ;
Tateno, M ;
Suzuki, M .
EMBO JOURNAL, 1998, 17 (18) :5484-5496
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The BED finger, a novel DNA-binding domain in chromatin-boundary-element-binding proteins and transposases [J].
Aravind, L .
TRENDS IN BIOCHEMICAL SCIENCES, 2000, 25 (09) :421-423
[4]   Plasmodium biology:: Genomic gleanings [J].
Aravind, L ;
Iyer, LM ;
Wellems, TE ;
Miller, LH .
CELL, 2003, 115 (07) :771-785
[5]   The many faces of the helix-turn-helix domain: Transcription regulation and beyond [J].
Aravind, L ;
Anantharaman, V ;
Balaji, S ;
Babu, MM ;
Iyer, LM .
FEMS MICROBIOLOGY REVIEWS, 2005, 29 (02) :231-262
[6]   DNA-binding proteins and evolution of transcription regulation in the archaea [J].
Aravind, L ;
Koonin, EV .
NUCLEIC ACIDS RESEARCH, 1999, 27 (23) :4658-4670
[7]   AT-hook motifs identified in a wide variety of DNA binding proteins [J].
Aravind, L ;
Landsman, D .
NUCLEIC ACIDS RESEARCH, 1998, 26 (19) :4413-4421
[8]   Origin of multicellular eukaryotes - insights from proteome comparisons [J].
Aravind, L ;
Subramanian, G .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 1999, 9 (06) :688-694
[9]   Temperature, template topology, and factor requirements of archaeal transcription [J].
Bell, SD ;
Jaxel, C ;
Nadal, M ;
Kosa, PF ;
Jackson, SP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (26) :15218-15222
[10]   Transcription and translation in Archaea: A mosaic of eukaryal and bacterial features [J].
Bell, SD ;
Jackson, SP .
TRENDS IN MICROBIOLOGY, 1998, 6 (06) :222-228