High GC content causes orphan proteins to be intrinsically disordered

Cited by: 38
Authors
Basile, Walter [1, 2]
Sachenkova, Oxana [1, 2]
Light, Sara [1, 2, 3]
Elofsson, Arne [1, 2, 4]
Affiliations
[1] Stockholm Univ, Sci Life Lab, Solna, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, Stockholm, Sweden
[3] Linkoping Univ, BILS, Linkoping, Sweden
[4] Kungliga Tekniska Hogskolan, SeRC, Stockholm, Sweden
Funding
Swedish Research Council
Keywords
DOMAIN REARRANGEMENTS; GENES; PREDICTION; EMERGENCE; RECOGNITION; DIVERSITY; DYNAMICS; REGIONS; MODEL;
DOI
10.1371/journal.pcbi.1005375
Chinese Library Classification
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
De novo creation of protein-coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should, for instance, not aggregate. Therefore, although the creation of short ORFs could be truly random, their fixation should be subject to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila, young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge, no valid explanation for this difference has been proposed. To solve this riddle, we studied structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in properties between proteins of different ages. However, when we take GC content into account, we note that it could explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder-promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead, these proteins largely resemble random proteins given a particular GC level. During evolution, the properties of a protein change faster than the GC level, causing the relationship between disorder and GC to gradually weaken.
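The abstract's claim that GC content is correlated with codons for disorder-promoting amino acids can be illustrated with a small calculation. The sketch below is not the authors' method. It assumes that each base in a random ORF is drawn independently, with G and C each having probability GC/2 and A and T each having probability (1 - GC)/2. It also assumes a commonly used set of disorder-promoting residues (A, R, G, Q, S, P, E, K), which is an illustrative choice and not taken from this record. The function name `expected_disorder_fraction` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: an illustrative set of disorder-promoting residues,
# not the disorder measure used in the paper.
DISORDER_PROMOTING = set("ARGQSPEK")


def expected_disorder_fraction(gc):
    """Expected share of disorder-promoting residues in a random ORF.

    ASSUMPTION: bases are independent; G and C each occur with
    probability gc / 2, A and T each with probability (1 - gc) / 2.
    Stop codons are excluded, so the result is conditioned on sense codons.
    """
    if not 0.0 <= gc <= 1.0:
        raise ValueError("gc must lie between 0 and 1")
    base_prob = {
        "G": gc / 2, "C": gc / 2,
        "A": (1 - gc) / 2, "T": (1 - gc) / 2,
    }
    sense_total = 0.0
    disorder_total = 0.0
    for codon, aa in CODON_TABLE.items():
        if aa == "*":
            continue  # skip stop codons
        p = base_prob[codon[0]] * base_prob[codon[1]] * base_prob[codon[2]]
        sense_total += p
        if aa in DISORDER_PROMOTING:
            disorder_total += p
    return disorder_total / sense_total


if __name__ == "__main__":
    for gc in (0.3, 0.4, 0.5, 0.6, 0.7):
        print(f"GC = {gc:.1f}: {expected_disorder_fraction(gc):.3f}")
```

Under these assumptions the expected fraction rises with GC content, because many codons for the chosen residues (for example GCN for alanine, GGN for glycine, CCN for proline) are GC-rich. This is consistent with the abstract's reading that orphan proteins resemble random proteins at a given GC level.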
Pages: 19