Orphans and new gene origination, a structural and evolutionary perspective

被引:18
作者
Light, Sara [1 ,2 ]
Basile, Walter [1 ,2 ]
Elofsson, Arne [1 ,2 ,3 ]
机构
[1] Stockholm Univ, Sci Life Lab, SE-17121 Solna, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
[3] Swedish eSci Res Ctr SeRC, Stockholm, Sweden
基金
瑞典研究理事会;
关键词
DE-NOVO; SEGMENTAL DUPLICATIONS; EXPANSION; FAMILIES; PROTEINS; TRANSCRIPTION; SELECTION; SEQUENCE; DATABASE; DOMAINS;
D O I
10.1016/j.sbi.2014.05.006
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The frequency of de novo creation of proteins has been debated. Early it was assumed that de novo creation should be extremely rare and that the vast majority of all protein coding genes were created in early history of life. However, the early genomics era lead to the insight that protein coding genes do appear to be lineage-specific. Today, with thousands of completely sequenced genomes, this impression remains. It has even been proposed that the creation of novel genes, a continuous process where most de novo genes are short-lived, is as frequent as gene duplications. There exist reports with strongly indicative evidence for de novo gene emergence in many organisms ranging from Bacteria, sometimes generated through bacteriophages, to humans, where orphans appear to be overexpressed in brain and testis. In contrast, research on protein evolution indicates that many very distantly related proteins appear to share partial homology. Here, we discuss recent results on de novo gene emergence, as well as important technical challenges limiting our ability to get a definite answer to the extent of de novo protein creation.
引用
收藏
页码:73 / 83
页数:11
相关论文
共 67 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] A galaxy of folds
    Alva, Vikram
    Remmert, Michael
    Biegert, Andreas
    Lupas, Andrei N.
    Soeding, Johannes
    [J]. PROTEIN SCIENCE, 2010, 19 (01) : 124 - 130
  • [3] [Anonymous], 2004, NUCLEIC ACIDS RES, DOI DOI 10.1093/nar/gkh131
  • [4] InterPro - an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, L
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. BIOINFORMATICS, 2000, 16 (12) : 1145 - 1150
  • [5] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [6] Evidence for de novo evolution of testis-expressed genes in the Drosophila yakuba Drosophila erecta clade
    Begun, David J.
    Lindfors, Heather A.
    Kern, Andrew D.
    Jones, Corbin D.
    [J]. GENETICS, 2007, 176 (02) : 1131 - 1137
  • [7] DOMAIN SWAPPING - ENTANGLING ALLIANCES BETWEEN PROTEINS
    BENNETT, MJ
    CHOE, S
    EISENBERG, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (08) : 3127 - 3131
  • [8] Domain rearrangements in protein evolution
    Björklund, ÅK
    Ekman, D
    Light, S
    Frey-Skött, J
    Elofsson, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2005, 353 (04) : 911 - 923
  • [9] Expansion of protein domain repeats
    Bjorklund, Asa K.
    Ekman, Diana
    Elofsson, Arne
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (08) : 959 - 970
  • [10] Brown C, 2002, EMBED SYST PROGRAM, V15, P55