Comprehensive analysis of co-occurring domain sets in yeast proteins

被引:14
作者
Cohen-Gihon, Inbar
Nussinov, Ruth
Sharan, Roded [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sackler Fac Med, Dept Human Genet, Sackler Inst Mol Med, IL-69978 Tel Aviv, Israel
[3] NCI, SAIC Frederick Inc, Nanobiol Program, Ctr Canc Res, Frederick, MD 21702 USA
关键词
D O I
10.1186/1471-2164-8-161
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Protein domains are fundamental evolutionary units of protein architecture, composing proteins in a modular manner. Combinations of two or more, possibly non- adjacent, domains are thought to play specific functional roles within proteins. Indeed, while the number of potential co- occurring domain sets ( CDSs) is very large, only a few of these occur in nature. Here we study the principles governing domain content of proteins, using yeast as a model species. Results: We design a novel representation of proteins and their constituent domains as a protein-domain network. An analysis of this network reveals 99 CDSs that occur in proteins more than expected by chance. The identified CDSs are shown to preferentially include ancient domains that are conserved from bacteria or archaea. Moreover, the protein sets spanned by these combinations were found to be highly functionally coherent, significantly match known protein complexes, and enriched with protein- protein interactions. These observations serve to validate the biological significance of the identified CDSs. Conclusion: Our work provides a comprehensive list of co- occurring domain sets in yeast, and sheds light on their function and evolution.
引用
收藏
页数:10
相关论文
共 46 条
[1]  
Apic G, 2001, Bioinformatics, V17 Suppl 1, pS83
[2]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[3]   PRINTS and its automatic supplement, prePRINTS [J].
Attwood, TK ;
Bradley, P ;
Flower, DR ;
Gaulton, A ;
Maudling, N ;
Mitchell, AL ;
Moulton, G ;
Nordle, A ;
Paine, K ;
Taylor, P ;
Uddin, A ;
Zygouri, C .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :400-402
[4]   Evolution of substrate specificities in the P-type ATPase superfamily [J].
Axelsen, KB ;
Palmgren, MG .
JOURNAL OF MOLECULAR EVOLUTION, 1998, 46 (01) :84-101
[5]   Calmodulin signaling via the IQ motif [J].
Bähler, M ;
Rhoads, A .
FEBS LETTERS, 2002, 513 (01) :107-113
[6]   Analysis of domain correlations in yeast protein complexes [J].
Betel, Doron ;
Isserlin, Ruth ;
Hogue, Christopher W. V. .
BIOINFORMATICS, 2004, 20 :55-62
[7]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[8]   Evolution of the protein repertoire [J].
Chothia, C ;
Gough, J ;
Vogel, C ;
Teichmann, SA .
SCIENCE, 2003, 300 (5626) :1701-1703
[9]   Modular genes with metazoan-specific domains have increased tissue specificity [J].
Cohen-Gihon, I ;
Lancet, D ;
Yanai, I .
TRENDS IN GENETICS, 2005, 21 (04) :210-213
[10]   What are DNA sequence motifs? [J].
D'haeseleer, P .
NATURE BIOTECHNOLOGY, 2006, 24 (04) :423-425