An evolutionary analysis of orphan genes in Drosophila

被引:186
作者
Domazet-Loso, T [1 ]
Tautz, D [1 ]
机构
[1] Univ Cologne, Inst Genet, D-50931 Cologne, Germany
关键词
D O I
10.1101/gr.1311003
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Orphan genes are protein-coding regions that have no recognizable homolog in distantly related species. A substantial fraction of coding regions in any genome sequenced consists of orphan genes, but the evolutionary and functional significance of orphan genes is not understood. We present a reanalysis of the Drosophila melanogaster proteome that shows that there are still between 26% and 29% of all proteins without a significant match with noninsect sequences, and that these orphans are underrepresented in genetic screens. To analyze the characteristics of orphan genes in Drosophila, we used sequence comparisons between cDNAs retrieved from two Drosophila yakuba libraries and their corresponding A melanogaster orthologs. We find that a cDNA library from adults yields twice as many orphan genes as Such a library from embryos. The orphan genes evolve oil average more than three times faster than nonorphan genes, although the width of the evolutionary rate distribution is similar for the two classes. In particular, some orphan genes show very low substitution rates that are comparable to otherwise highly conserved genes. We propose a model suggesting that orphans may be involved in the evolution of adaptive traits, and that slow-evolving orphan genes may be particularly interesting candidate genes for identifying lineage-specific adaptations.
引用
收藏
页码:2213 / 2219
页数:7
相关论文
共 37 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]  
Ashburner M, 1999, GENETICS, V153, P179
[4]  
Carroll S.B., 2001, DNA DIVERSITY
[5]   Bioinformatics and the discovery of gene function [J].
Casari, G ;
deDaruvar, A ;
Sander, C ;
Schneider, R .
TRENDS IN GENETICS, 1996, 12 (07) :244-245
[6]  
Comeron JM, 1998, GENETICS, V150, P767
[7]   The yeast genome project: What did we learn? [J].
Dujon, B .
TRENDS IN GENETICS, 1996, 12 (07) :263-270
[8]  
Dunn KA, 2001, GENETICS, V157, P295
[9]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[10]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185