Searching protein structure databases with DaliLite v.3

被引:881
作者
Holm, L. [1 ]
Kaariainen, S. [2 ]
Rosenstrom, P. [2 ]
Schenkel, A. [2 ]
机构
[1] Univ Helsinki, Dept Biol & Environm Sci, FIN-00014 Helsinki, Finland
[2] Univ Helsinki, Inst Biotechnol, FIN-00014 Helsinki, Finland
基金
芬兰科学院;
关键词
D O I
10.1093/bioinformatics/btn507
中图分类号
Q5 [生物化学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
The Red Queen said, 'It takes all the running you can do, to keep in the same place.' Lewis Carrol Motivation: Newly solved protein structures are routinely scanned against structures already in the Protein Data Bank (PDB) using Internet servers. In favourable cases, comparing 3D structures may reveal biologically interesting similarities that are not detectable by comparing sequences. The number of known structures continues to grow exponentially. Sensitive-thorough but slow search algorithms are challenged to deliver results in a reasonable time, as there are now more structures in the PDB than seconds in a day. The brute-force solution would be to distribute the individual comparisons on a massively parallel computer. A frugal solution, as implemented in the Dali server, is to reduce the total computational cost by pruning search space using prior knowledge about the distribution of structures in fold space. This note reports paradigm revisions that enable maintaining such a knowledge base up-to-date on a PC.
引用
收藏
页码:2780 / 2781
页数:2
相关论文
共 7 条
[1]
The global trace graph, a novel paradigm for searching protein sequence databases [J].
Heger, Andreas ;
Mallick, Swapan ;
Wilton, Christopher ;
Holm, Liisa .
BIOINFORMATICS, 2007, 23 (18) :2361-2367
[2]
SEARCHING PROTEIN-STRUCTURE DATABASES HAS COME OF AGE [J].
HOLM, L ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 19 (03) :165-173
[3]
DaliLite workbench for protein structure comparison [J].
Holm, L ;
Park, J .
BIOINFORMATICS, 2000, 16 (06) :566-567
[4]
Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions [J].
Krissinel, E ;
Henrick, K .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2004, 60 :2256-2268
[5]
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences [J].
Li, Weizhong ;
Godzik, Adam .
BIOINFORMATICS, 2006, 22 (13) :1658-1659
[6]
SCOP - A STRUCTURAL CLASSIFICATION OF PROTEINS DATABASE FOR THE INVESTIGATION OF SEQUENCES AND STRUCTURES [J].
MURZIN, AG ;
BRENNER, SE ;
HUBBARD, T ;
CHOTHIA, C .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 247 (04) :536-540
[7]
Sensitivity and selectivity in protein structure comparison [J].
Sierk, ML ;
Pearson, WR .
PROTEIN SCIENCE, 2004, 13 (03) :773-785