Detection of orphan domains in Drosophila using "hydrophobic cluster analysis"

被引:22
作者
Bitard-Feildel, Tristan [1 ]
Heberlein, Magdalena [1 ]
Bornberg-Bauer, Erich [1 ]
Callebaut, Isabelle [2 ]
机构
[1] Univ Munster, Inst Evolut & Biodivers, D-48149 Munster, Germany
[2] Univ Paris 06, Sorbonne Univ, Museum Natl Hist Nat, IMPMC,UMR CNRS 7590,IRD UMR 206, F-75005 Paris, France
关键词
Domain detection; Protein domain; Intrinsically disordered domain; Domain evolution; EVOLUTIONARY ANALYSIS; SECONDARY STRUCTURES; MOLECULAR-BASIS; GENES; DYNAMICS; PROTEINS; EMERGENCE; ORIGINS; BINDING;
D O I
10.1016/j.biochi.2015.02.019
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Introduction: Comparative genomics has become an important strategy in life science research. While many genes, and the proteins they code for, can be well characterized by assigning orthologs, a significant amount of proteins or domains remain obscure "orphans". Some orphans are overlooked by current computational methods because they rapidly diverged, others emerged relatively recently (de novo). Recent research has demonstrated the importance of orphans, and of de novo proteins and domains for development of new phenotypic traits and adaptation. New approaches for detecting novel domains are thus of paramount importance. Results: The hydrophobic cluster analysis (HCA) method delineates globular-like domains from the information of a protein sequence and thereby allows bypassing some of the established methods limitations based on conserved sequence similarity. In this study, HCA is tested for orphan domain detection on 12 Drosophila genomes. After their detection, the oprhan domains are classified into two categories, depending on their presence/absence in distantly related species. The two categories show significantly different physico-chemical properties when compared to previously characterized domains from the Pfam database. The newly detected domains have a higher degree of intrinsic disorder and a particular hydrophobic cluster composition. The older the domains are, the more similar their hydrophobic cluster content is to the cluster content of Pfam domains. The results suggest that, over time, newly created domains acquire a canonical set of hydrophobic clusters but conserve some features of intrinsically disordered regions. Conclusion: Our results agree with previous findings on orphan domains and suggest that the physicochemical properties of domains change over evolutionary long time scale. The presented HCA-based method is able to detect domains with unusual properties without relying on prior knowledge, such as the availability of homologs. Therefore, the method has large potential for complementing existing strategies to annotate genomes, and for better understanding how molecular features emerge. (C) 2015 Elsevier B.V. and Societe Francaise de Biochimie et Biologie Moleculaire (SFBBM). All rights reserved.
引用
收藏
页码:244 / 253
页数:10
相关论文
共 66 条
  • [41] Quantification and functional analysis of modular protein evolution in a dense phylogenetic tree
    Moore, Andrew D.
    Grath, Sonja
    Schueler, Andreas
    Huylmans, Ann K.
    Bornberg-Bauer, Erich
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2013, 1834 (05): : 898 - 907
  • [42] The Dynamics and Evolutionary Potential of Domain Loss and Emergence
    Moore, Andrew D.
    Bornberg-Bauer, Erich
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (02) : 787 - 796
  • [43] Molecular basis for G-actin binding to RPEL motifs from the serum response factor coactivator MAL
    Mouilleron, Stephane
    Guettler, Sebastian
    Langer, Carola A.
    Treisman, Richard
    McDonald, Neil Q.
    [J]. EMBO JOURNAL, 2008, 27 (23) : 3198 - 3208
  • [44] Evolution: Dynamics of De Novo Gene Emergence
    Neme, Rafik
    Tautz, Diethard
    [J]. CURRENT BIOLOGY, 2014, 24 (06) : R238 - R240
  • [45] Park SY, 2012, CELL CYCLE, V11, P761, DOI [10.4161/cc.11.4.19209, 10.4161/cc.21168]
  • [46] MACSE: Multiple Alignment of Coding SEquences Accounting for Frameshifts and Stop Codons
    Ranwez, Vincent
    Harispe, Sebastien
    Delsuc, Frederic
    Douzery, Emmanuel J. P.
    [J]. PLOS ONE, 2011, 6 (09):
  • [47] Remmert M, 2012, NAT METHODS, V9, P173, DOI [10.1038/NMETH.1818, 10.1038/nmeth.1818]
  • [48] Twilight zone of protein sequence alignments
    Rost, B
    [J]. PROTEIN ENGINEERING, 1999, 12 (02): : 85 - 94
  • [49] Protein structures sustain evolutionary drift
    Rost, B
    [J]. FOLDING & DESIGN, 1997, 2 (03): : S19 - S24
  • [50] The Evolution of Human Cells in Terms of Protein Innovation
    Sardar, Adam J.
    Oates, Matt E.
    Fang, Hai
    Forrest, Alistair R. R.
    Kawaji, Hideya
    Gough, Julian
    Rackham, Owen J. L.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (06) : 1364 - 1374