Detection of orphan domains in Drosophila using "hydrophobic cluster analysis"

被引:22
作者
Bitard-Feildel, Tristan [1 ]
Heberlein, Magdalena [1 ]
Bornberg-Bauer, Erich [1 ]
Callebaut, Isabelle [2 ]
机构
[1] Univ Munster, Inst Evolut & Biodivers, D-48149 Munster, Germany
[2] Univ Paris 06, Sorbonne Univ, Museum Natl Hist Nat, IMPMC,UMR CNRS 7590,IRD UMR 206, F-75005 Paris, France
关键词
Domain detection; Protein domain; Intrinsically disordered domain; Domain evolution; EVOLUTIONARY ANALYSIS; SECONDARY STRUCTURES; MOLECULAR-BASIS; GENES; DYNAMICS; PROTEINS; EMERGENCE; ORIGINS; BINDING;
D O I
10.1016/j.biochi.2015.02.019
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Introduction: Comparative genomics has become an important strategy in life science research. While many genes, and the proteins they code for, can be well characterized by assigning orthologs, a significant amount of proteins or domains remain obscure "orphans". Some orphans are overlooked by current computational methods because they rapidly diverged, others emerged relatively recently (de novo). Recent research has demonstrated the importance of orphans, and of de novo proteins and domains for development of new phenotypic traits and adaptation. New approaches for detecting novel domains are thus of paramount importance. Results: The hydrophobic cluster analysis (HCA) method delineates globular-like domains from the information of a protein sequence and thereby allows bypassing some of the established methods limitations based on conserved sequence similarity. In this study, HCA is tested for orphan domain detection on 12 Drosophila genomes. After their detection, the oprhan domains are classified into two categories, depending on their presence/absence in distantly related species. The two categories show significantly different physico-chemical properties when compared to previously characterized domains from the Pfam database. The newly detected domains have a higher degree of intrinsic disorder and a particular hydrophobic cluster composition. The older the domains are, the more similar their hydrophobic cluster content is to the cluster content of Pfam domains. The results suggest that, over time, newly created domains acquire a canonical set of hydrophobic clusters but conserve some features of intrinsically disordered regions. Conclusion: Our results agree with previous findings on orphan domains and suggest that the physicochemical properties of domains change over evolutionary long time scale. The presented HCA-based method is able to detect domains with unusual properties without relying on prior knowledge, such as the availability of homologs. Therefore, the method has large potential for complementing existing strategies to annotate genomes, and for better understanding how molecular features emerge. (C) 2015 Elsevier B.V. and Societe Francaise de Biochimie et Biologie Moleculaire (SFBBM). All rights reserved.
引用
收藏
页码:244 / 253
页数:10
相关论文
共 66 条
  • [31] Domain architecture conservation in orthologs
    Forslund, Kristoffer
    Pekkari, Isabella
    Sonnhammer, Erik L. L.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [32] HYDROPHOBIC CLUSTER-ANALYSIS - AN EFFICIENT NEW WAY TO COMPARE AND ANALYZE AMINO-ACID-SEQUENCES
    GABORIAUD, C
    BISSERY, V
    BENCHETRIT, T
    MORNON, JP
    [J]. FEBS LETTERS, 1987, 224 (01): : 149 - 155
  • [33] The folding and evolution of multidomain proteins
    Han, Jung-Hoon
    Batey, Sarah
    Nickson, Adrian A.
    Teichmann, Sarah A.
    Clarke, Jane
    [J]. NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2007, 8 (04) : 319 - 330
  • [34] Non-intertwined binary patterns of hydrophobic/nonhydrophobic amino acids are considerably better markers of regular secondary structures than nonconstrained patterns
    Hennetin, J
    Le Tuan, K
    Canard, L
    Colloc'h, N
    Mornon, JP
    Callebaut, I
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 51 (02) : 236 - 244
  • [35] Origins, evolution, and phenotypic impact of new genes
    Kaessmann, Henrik
    [J]. GENOME RESEARCH, 2010, 20 (10) : 1313 - 1326
  • [36] Nature of the protein universe
    Levitt, Michael
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (27) : 11079 - 11084
  • [37] New Gene Evolution: Little Did We Know
    Long, Manyuan
    VanKuren, Nicholas W.
    Chen, Sidi
    Vibranovski, Maria D.
    [J]. ANNUAL REVIEW OF GENETICS, VOL 47, 2013, 47 : 307 - 333
  • [38] CDD: conserved domains and protein three-dimensional structure
    Marchler-Bauer, Aron
    Zheng, Chanjuan
    Chitsaz, Farideh
    Derbyshire, Myra K.
    Geer, Lewis Y.
    Geer, Renata C.
    Gonzales, Noreen R.
    Gwadz, Marc
    Hurwitz, David I.
    Lanczycki, Christopher J.
    Lu, Fu
    Lu, Shennan
    Marchler, Gabriele H.
    Song, James S.
    Thanki, Narmada
    Yamashita, Roxanne A.
    Zhang, Dachuan
    Bryant, Stephen H.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D348 - D352
  • [39] Prediction of Protein Binding Regions in Disordered Proteins
    Meszaros, Balint
    Simon, Istvan
    Dosztanyi, Zsuzsanna
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (05)
  • [40] Arrangements in the modular evolution of proteins
    Moore, Andrew D.
    Bjorklund, Asa K.
    Ekrnan, Diana
    Bornberg-Bauer, Erich
    Elofsson, Arne
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2008, 33 (09) : 444 - 451