A global taxonomy of loops in globular proteins

被引:80
作者
Kwasigroch, JM
Chomilier, J
Mornon, JP
机构
[1] UNIV PARIS 06, LAB MINERAL CRISTALLOG, F-75252 PARIS 05, FRANCE
[2] UNIV PARIS 07, CNRS URA 009, F-75252 PARIS 05, FRANCE
关键词
loops; proteins; structural biology; molecular modeling; taxonomy;
D O I
10.1006/jmbi.1996.0363
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A bank of loops from three to eight amino acid residues long has been constituted. On the basis of statistical analysis of occurrences of conformations and residue, loops could be divided into two parts: the side residues directly bonded to the secondary structure flanking element, and the inner part. The conformations of the side residues are correlated to the nature of their neighboring flanks, while the inner residues adopt conformations uncorrelated from one residue to the next; thus they are unrelated to the flanks. Two zones in the Ramachandran plot are important: alpha(L) and beta(p). In particular, the high occurrence of alpha(L), mainly occupied by glycine residues, is necessary to induce flexibility and thus allow loops to comply with the geometrical constraints of the flanks. An algorithm of clustering has been used to aggregate loops of the same length within families of similar 3D structures. At each position in each cluster, sequence and conformational signatures have been deduced if the occurrence of a residue (or a conformation) is higher than an equiprobable distribution over all clusters. The result is that some positions favor particular amino acids and conformations, which are typical of a cluster although not unique. This is an indication of a relation between structure and sequence in loops. A taxonomy is proposed that classifies the various clusters. It relies on two terms: the mean distance between the first and last C-alpha in one cluster and, perpendicular to this line, the distance to the center of gravity of the cluster. It is noteworthy that the differently populated clusters represented in such 2D plots can be separated. Thus, although the conformations of loops in globular proteins could cover a continuum, it has been possible to cluster them into a limited number of well populated families and superfamilies. This basic feature of protein architecture could be further exploited to better predict their geometry. (C) 1996 Academic Press Limited
引用
收藏
页码:855 / 872
页数:18
相关论文
共 60 条
  • [1] APPROACHING A COMPLETE CLASSIFICATION OF PROTEIN SECONDARY STRUCTURE
    ADZHUBEI, AA
    EISENMENGER, F
    TUMANYAN, VG
    ZINKE, M
    BRODZINSKI, S
    ESIPOVA, NG
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1987, 5 (03) : 689 - 704
  • [2] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [3] STRUCTURE OF ANTIBODY HYPERVARIABLE LOOPS REPRODUCED BY A CONFORMATIONAL SEARCH ALGORITHM
    BRUCCOLERI, RE
    HABER, E
    NOVOTNY, J
    [J]. NATURE, 1988, 335 (6190) : 564 - 568
  • [4] Casari G, 1995, STRUCT BIOL, V2, P171
  • [5] IDENTIFICATION, CLASSIFICATION, AND ANALYSIS OF BETA-BULGES IN PROTEINS
    CHAN, AWE
    HUTCHINSON, EG
    HARRIS, D
    THORNTON, JM
    [J]. PROTEIN SCIENCE, 1993, 2 (10) : 1574 - 1590
  • [6] EASY ADAPTATION OF PROTEIN-STRUCTURE TO SEQUENCE
    CHELVANAYAGAM, G
    ROY, G
    ARGOS, P
    [J]. PROTEIN ENGINEERING, 1994, 7 (02): : 173 - 184
  • [7] MODELING THE POLYPEPTIDE BACKBONE WITH SPARE PARTS FROM KNOWN PROTEIN STRUCTURES
    CLAESSENS, M
    VANCUTSEM, E
    LASTERS, I
    WODAK, S
    [J]. PROTEIN ENGINEERING, 1989, 2 (05): : 335 - 345
  • [8] ORIGINS OF STRUCTURAL DIVERSITY WITHIN SEQUENTIALLY IDENTICAL HEXAPEPTIDES
    COHEN, BI
    PRESNELL, SR
    COHEN, FE
    [J]. PROTEIN SCIENCE, 1993, 2 (12) : 2134 - 2145
  • [9] COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT
    COLLOCH, N
    ETCHEBEST, C
    THOREAU, E
    HENRISSAT, B
    MORNON, JP
    [J]. PROTEIN ENGINEERING, 1993, 6 (04): : 377 - 382
  • [10] BETA-BREAKERS - AN APERIODIC SECONDARY STRUCTURE
    COLLOCH, N
    COHEN, FE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 221 (02) : 603 - 613