Structural biology sheds light on the puzzle genomic ORFans

被引:40
作者
Siew, N
Fischer, D [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, Bioinformat Grp, IL-84105 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Chem, IL-84105 Beer Sheva, Israel
[3] SUNY Buffalo, Buffalo Ctr Excellence Bioinformat Comp Sci & Eng, Buffalo, NY 14203 USA
关键词
genomic ORFans; evolution; structural biology;
D O I
10.1016/j.jmb.2004.06.073
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genomic ORFans are orphan open reading frames (ORFs) with no significant sequence similarity to other ORFs. ORFans comprise 20-30% of the ORFs of most completely sequenced genomes. Because nothing can be learnt about ORFans via sequence homology, the functions and evolutionary origins of ORFans remain a mystery. Furthermore, because relatively few ORFans have been experimentally characterized, it has been suggested that most ORFans are not likely to correspond to functional, expressed proteins, but rather to spurious ORFs, pseudo-genes or to rapidly evolving proteins with non-essential roles. As a snapshot view of current ORFan structural studies, we searched for ORFans among proteins whose three-dimensional structures have been recently determined. We find that functional and structural studies of ORFans are not as underemphasized as previously suggested. These recently determined structures correspond to ORFans from all Kingdoms of life, and include proteins that have previously been functionally characterized, as well as structural genomics targets of unknown function labeled as "hypothetical proteins". This suggests that many of the ORFans in the databases are likely to correspond to expressed, functional (and even essential) proteins. Furthermore, the recently determined structures include examples of the various types of ORFans, suggesting that the functions and evolutionary origins of ORFans are diverse. Although this survey sheds some light on the ORFan mystery, further experimental studies are required to gain a better understanding of the role and origins of the tens of thousands of ORFans awaiting characterization. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:369 / 373
页数:5
相关论文
共 46 条
[1]   Reverse transcriptase-polymerase chain reaction validation of 25 "orphan" genes from Escherichia coli K-12 MG1655 [J].
Alimi, JP ;
Poirot, O ;
Lopez, F ;
Claverie, JM .
GENOME RESEARCH, 2000, 10 (07) :959-966
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Birth and death of orphan genes in Rickettsia [J].
Amiri, H ;
Davids, W ;
Andersson, SGE .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (10) :1575-1587
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   A tour of structural genomics [J].
Brenner, SE .
NATURE REVIEWS GENETICS, 2001, 2 (10) :801-809
[6]   Structure prediction meta server [J].
Bujnicki, JM ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
BIOINFORMATICS, 2001, 17 (08) :750-751
[7]   Bacterial Genomes as new gene homes:: The genealogy of ORFans in E-coli [J].
Daubin, V ;
Ochman, H .
GENOME RESEARCH, 2004, 14 (06) :1036-1042
[8]   An evolutionary analysis of orphan genes in Drosophila [J].
Domazet-Loso, T ;
Tautz, D .
GENOME RESEARCH, 2003, 13 (10) :2213-2219
[9]   A novel class of cysteine protease inhibitors:: Solution structure of staphostatin A from Staphylococcus aureus [J].
Dubin, G ;
Krajewski, M ;
Popowicz, G ;
Stec-Niemczyk, J ;
Bochtler, M ;
Potempa, J ;
Dubin, A ;
Holak, TA .
BIOCHEMISTRY, 2003, 42 (46) :13449-13456
[10]   The yeast genome project: What did we learn? [J].
Dujon, B .
TRENDS IN GENETICS, 1996, 12 (07) :263-270