REPRESENTATIVE SELECTION OF PROTEINS BASED ON NUCLEAR FAMILIES

Citations: 11
Authors
BOBERG, J
SALAKOSKI, T
VIHINEN, M
Affiliations
[1] UNIV TURKU,DEPT BIOCHEM,SF-20500 TURKU,FINLAND
[2] UNIV TURKU,DEPT COMP SCI,SF-20500 TURKU,FINLAND
[3] KAROLINSKA INST,NOVUM,CTR STRUCT BIOCHEM,S-14157 HUDDINGE,SWEDEN
Source
PROTEIN ENGINEERING | 1995 / Vol. 8 / No. 5
Keywords
COMPLETE LINKAGE CLUSTERING; NOISE ELIMINATION; PDB FAMILIES; REPRESENTATIVE SELECTION
DOI
10.1093/protein/8.5.501
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The selection of unbiased representatives from a large database is complicated by the requirement for the chosen entries to be not only genuinely different from each other but also typical for the family of related entries. A method satisfying this 2-fold objective was developed by equipping complete linkage clustering with a novel noise elimination procedure to deal with overlapping cluster structure. A total of 200 nuclear families of truly related Brookhaven Protein Data Bank structures were generated, from which any entry can be chosen to represent its family.
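As a rough illustration of the clustering step named in the abstract, the following is a minimal sketch of plain complete linkage clustering with a distance cutoff. It does not reproduce the authors' noise elimination procedure, which this record does not describe. The function name `complete_linkage`, the toy distance matrix `dist`, and the cutoff `0.3` are all assumptions made for the example.

```python
from itertools import combinations


def complete_linkage(dist, threshold):
    """Agglomerative clustering with complete linkage (hypothetical helper).

    dist is a symmetric matrix of pairwise distances; merging stops once the
    closest pair of clusters is farther apart than threshold.
    """
    clusters = [[i] for i in range(len(dist))]

    def linkage(a, b):
        # Complete linkage: distance between the two most dissimilar members.
        return max(dist[i][j] for i in a for j in b)

    while len(clusters) > 1:
        # Find the pair of clusters with the smallest complete-linkage distance.
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[x], clusters[y]) > threshold:
            break
        # x < y always holds, so deleting index y leaves index x valid.
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters


# Toy distances between five entries (made-up values, not PDB data).
dist = [
    [0.00, 0.10, 0.20, 0.90, 0.80],
    [0.10, 0.00, 0.15, 0.85, 0.90],
    [0.20, 0.15, 0.00, 0.95, 0.70],
    [0.90, 0.85, 0.95, 0.00, 0.10],
    [0.80, 0.90, 0.70, 0.10, 0.00],
]

families = complete_linkage(dist, threshold=0.3)
# Any member may stand for its family; here the first one is taken.
representatives = [family[0] for family in families]
print(families, representatives)
```

Because complete linkage bounds the largest distance inside each cluster, every member lies within the cutoff of every other member, which is consistent with the abstract's statement that any entry can be chosen to represent its family.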
Pages: 501-503
Page count: 3