SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent

被引:79
作者
Davey, Norman E. [1 ]
Shields, Denis C. [1 ]
Edwards, Richard J. [1 ]
机构
[1] Univ Coll Dublin, Conway Inst Biomol & Biomed Sci, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
D O I
10.1093/nar/gkl486
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many important interactions of proteins are facilitated by short, linear motifs (SLiMs) within a protein's primary sequence. Our aim was to establish robust methods for discovering putative functional motifs. The strongest evidence for such motifs is obtained when the same motifs occur in unrelated proteins, evolving by convergence. In practise, searches for such motifs are often swamped by motifs shared in related proteins that are identical by descent. Prediction of motifs among sets of biologically related proteins, including those both with and without detectable similarity, were made using the TEIRESIAS algorithm. The number of motif occurrences arising through common evolutionary descent were normalized based on treatment of BLAST local alignments. Motifs were ranked according to a score derived from the product of the normalized number of occurrences and the information content. The method was shown to significantly outperform methods that do not discount evolutionary relatedness, when applied to known SLiMs from a subset of the eukaryotic linear motif (ELM) database. An implementation of Multiple Spanning Tree weighting outperformed two other weighting schemes, in a variety of settings.
引用
收藏
页码:3546 / 3554
页数:9
相关论文
共 35 条
  • [1] Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
  • [2] The universal protein resource (UniProt)
    Bairoch, A
    Apweiler, R
    Wu, CH
    Barker, WC
    Boeckmann, B
    Ferro, S
    Gasteiger, E
    Huang, HZ
    Lopez, R
    Magrane, M
    Martin, MJ
    Natale, DA
    O'Donovan, C
    Redaschi, N
    Yeh, LSL
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D154 - D159
  • [3] BAUMGARTNER JW, 1994, J BIOL CHEM, V269, P29094
  • [4] Approaches to the automatic discovery of patterns in biosequences
    Brazma, A
    Jonassen, I
    Eidhammer, I
    Gilbert, D
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (02) : 279 - 305
  • [5] Multiple sequence alignment with the Clustal series of programs
    Chenna, R
    Sugawara, H
    Koike, T
    Lopez, R
    Gibson, TJ
    Higgins, DG
    Thompson, JD
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3497 - 3500
  • [6] Role of the LXCXE binding site in Rb function
    Dahiya, A
    Gavin, MR
    Luo, RX
    Dean, DC
    [J]. MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (18) : 6799 - 6805
  • [7] INDUCTION OF HEPATITIS-A VIRUS-NEUTRALIZING ANTIBODY BY A VIRUS-SPECIFIC SYNTHETIC PEPTIDE
    EMINI, EA
    HUGHES, JV
    PERLOW, DS
    BOGER, J
    [J]. JOURNAL OF VIROLOGY, 1985, 55 (03) : 836 - 839
  • [8] Fisher AL, 1996, MOL CELL BIOL, V16, P2670
  • [9] YEAST PROHORMONE PROCESSING ENZYME (KEX2 GENE-PRODUCT) IS A CA-2+-DEPENDENT SERINE PROTEASE
    FULLER, RS
    BRAKE, A
    THORNER, J
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (05) : 1434 - 1438
  • [10] Protein C-mannosylation: Facts and questions
    Furmanek, A
    Hofsteenge, J
    [J]. ACTA BIOCHIMICA POLONICA, 2000, 47 (03) : 781 - 789