PALI - a database of Phylogeny and ALIgnment of homologous protein structures

Cited by: 73
Authors
Balaji, S
Sujatha, S
Kumar, SSC
Srinivasan, N [1]
Affiliations
[1] Indian Inst Sci, Mol Biophys Unit, Bangalore 560012, Karnataka, India
[2] Indian Inst Technol, Dept Biotechnol, Kharagpur 721302, W Bengal, India
Keywords
DOI
10.1093/nar/29.1.61
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and comprises 604 families of homologous proteins involving 2739 protein domain structures, with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment), and all the members in the family are also aligned using simultaneous superposition (multiple alignment). The structural alignments are performed largely automatically using the program STAMP (version 4.2), with manual intervention especially in the cases of distantly related proteins. Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5): one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence-based and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 'orphans' (single-member families). Using the web interface involving PSI-BLAST and PHYLIP, it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure.
The database, with the web-interfaced search and dendrogram generation tools, can be accessed at http://pauling.mbu.iisc.ernet.in/~pali.
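The all-against-all pairing within a family and the range query over pairwise similarities described in the abstract can be illustrated with a minimal sketch. It is not PALI's implementation: the domain names, the record fields (`seq_identity`, `struct_dissim`) and every numeric value below are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def all_pairs(members):
    """Return every unordered pair of family members, n*(n-1)/2 in total."""
    return list(combinations(members, 2))


def pairs_in_range(records, key, low, high):
    """Select pair records whose value for `key` lies in [low, high]."""
    return [r for r in records if low <= r[key] <= high]


# Hypothetical family of four domains (names are placeholders).
family = ["dom_A", "dom_B", "dom_C", "dom_D"]
pairs = all_pairs(family)

# Invented per-pair measures: percent sequence identity and a
# structural dissimilarity score. These are not values from PALI.
seq_identity = [62.0, 35.0, 18.0, 38.5, 21.0, 27.0]
struct_dissim = [0.4, 1.1, 2.3, 0.9, 2.0, 1.6]

records = [
    {"pair": p, "seq_identity": s, "struct_dissim": d}
    for p, s, d in zip(pairs, seq_identity, struct_dissim)
]

# Query: pairs with 20-40% sequence identity.
mid_identity = pairs_in_range(records, "seq_identity", 20.0, 40.0)
# Query: pairs that are structurally close (dissimilarity at most 1.0).
close_structs = pairs_in_range(records, "struct_dissim", 0.0, 1.0)

for r in mid_identity:
    print(r["pair"], r["seq_identity"], r["struct_dissim"])
```

A family of n members yields n(n-1)/2 pairwise alignments, so the four-member example produces six pair records. Filtering on one measure while printing the other is one way to inspect how much structural variation occurs at a given level of sequence similarity.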
Pages: 61-65
Page count: 5