Protein Expansion Is Primarily due to Indels in Intrinsically Disordered Regions

Cited by: 63
Authors
Light, Sara [1]
Sagit, Rauan [1]
Sachenkova, Oxana [1]
Ekman, Diana [1]
Elofsson, Arne [2]
Affiliations
[1] Stockholm Univ, Dept Biochem & Biophys, Sci Life Lab, Solna, Sweden
[2] Stockholm Univ, Swedish E Sci Res Inst, Dept Biochem & Biophys, Sci Life Lab, Solna, Sweden
Funding
Swedish Research Council
Keywords
disordered proteins; insertions and deletions; indels; protein evolution; protein structure; UNSTRUCTURED PROTEINS; TANDEM REPEATS; CODING REGIONS; PREDICTION; EVOLUTION; DNA; CHARACTERIZE; INSERTIONS; SEQUENCES; DELETIONS;
DOI
10.1093/molbev/mst157
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Proteins evolve not only through point mutations but also by insertion and deletion events, which affect the length of the protein. It is well known that such indel events most frequently occur in surface-exposed loops. However, detailed analysis of indel events in distantly related and fast-evolving proteins is hampered by the difficulty involved in correctly aligning such sequences. Here, we circumvent this problem by first only analyzing homologous proteins based on length variation rather than pairwise alignments. Using this approach, we find a surprisingly strong relationship between difference in length and difference in the number of intrinsically disordered residues, where up to three quarters of the length variation can be explained by changes in the number of intrinsically disordered residues. Further, we find that disorder is common in both insertions and deletions. A more detailed analysis reveals that indel events do not induce disorder but rather that already disordered regions accrue indels, suggesting that there is a lowered selective pressure for indels to occur within intrinsically disordered regions.
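The abstract's claim that length variation can be "explained" by changes in disorder can be illustrated with a small sketch. It assumes the coefficient of determination (R^2) of a least-squares line as the measure; the abstract does not say which statistic the authors used, and the protein pairs below are invented purely for illustration, not data from the paper.

```python
from statistics import mean


def r_squared(x, y):
    """Coefficient of determination for a least-squares line y ~ a + b*x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("no variation in one of the variables")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical homologous pairs (NOT from the paper):
# (length_a, disordered_residues_a, length_b, disordered_residues_b)
pairs = [
    (320, 40, 410, 115),
    (250, 10, 262, 18),
    (500, 120, 455, 90),
    (180, 5, 240, 30),
    (610, 200, 700, 280),
]

# Per-pair differences; no sequence alignment is needed for either quantity.
length_diff = [lb - la for la, da, lb, db in pairs]
disorder_diff = [db - da for la, da, lb, db in pairs]

# Fraction of the variance in length difference accounted for by
# the difference in the number of disordered residues.
explained = r_squared(disorder_diff, length_diff)
print(f"Fraction of length variation explained: {explained:.2f}")
```

Because both quantities are simple per-protein counts, a comparison of this kind works without pairwise alignments, which is the difficulty the abstract says the approach is meant to avoid.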
Pages: 2645-2653
Page count: 9