Domain architecture conservation in orthologs

被引:42
作者
Forslund, Kristoffer [1 ,2 ]
Pekkari, Isabella [1 ,2 ]
Sonnhammer, Erik L. L. [1 ,2 ,3 ]
机构
[1] Stockholm Bioinformat Ctr, Sci Life Lab, S-17121 Solna, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, Stockholm, Sweden
[3] Swedish eSci Res Ctr, Stockholm, Sweden
来源
BMC BIOINFORMATICS | 2011年 / 12卷
关键词
SEQUENCE; DUPLICATION; PARALOGS; PROTEINS; SEARCH; PFAM;
D O I
10.1186/1471-2105-12-326
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results: The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions: On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance.
引用
收藏
页数:14
相关论文
共 39 条
  • [1] Alexeyenko Andrey, 2006, Drug Discov Today Technol, V3, P137, DOI 10.1016/j.ddtec.2006.06.002
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] [Anonymous], CHI SQUARED TEST
  • [4] InParanoid 6:: eukaryotic ortholog clusters with inparalogs
    Berglund, Ann-Charlotte
    Sjolund, Erik
    Ostlund, Gabriel
    Sonnhammer, Erik L. L.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D263 - D266
  • [5] Domain rearrangements in protein evolution
    Björklund, ÅK
    Ekman, D
    Light, S
    Frey-Skött, J
    Elofsson, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2005, 353 (04) : 911 - 923
  • [6] Expansion of protein domain repeats
    Bjorklund, Asa K.
    Ekman, Diana
    Elofsson, Arne
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (08) : 959 - 970
  • [7] The evolution of protein domain families
    Buljan, Marija
    Bateman, Alex
    [J]. BIOCHEMICAL SOCIETY TRANSACTIONS, 2009, 37 : 751 - 755
  • [8] Two rounds of whole genome duplication in the ancestral vertebrate
    Dehal, P
    Boore, JL
    [J]. PLOS BIOLOGY, 2005, 3 (10) : 1700 - 1708
  • [9] Orthology and functional conservation in eukaryotes
    Dolinski, Kara
    Botstein, David
    [J]. ANNUAL REVIEW OF GENETICS, 2007, 41 : 465 - 507
  • [10] A probabilistic model of local sequence alignment that simplifies statistical significance estimation
    Eddy, Sean R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (05)