A New Clustering of Antibody CDR Loop Conformations

被引:278
作者
North, Benjamin [1 ]
Lehmann, Andreas [1 ]
Dunbrack, Roland L., Jr. [1 ]
机构
[1] Fox Chase Canc Ctr, Inst Canc Res, Philadelphia, PA 19111 USA
基金
美国国家卫生研究院;
关键词
antibody structure; canonical loop conformations; affinity propagation; SEQUENCE CULLING SERVER; PROTEIN-STRUCTURE; STRUCTURAL CLASSIFICATION; HYPERVARIABLE REGIONS; CANONICAL STRUCTURES; MULTIPLE TEMPLATES; CRYSTAL-STRUCTURE; IMMUNOGLOBULINS; PREDICTION; INFORMATION;
D O I
10.1016/j.jmb.2010.10.030
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Previous analyses of the complementarity-determining regions (CDRs) of antibodies have focused on a small number of "canonical" conformations for each loop. This is primarily the result of the work of Chothia and coworkers, most recently in 1997. Because of the widespread utility of antibodies, we have revisited the clustering of conformations of the six CDR loops with the much larger amount of structural information currently available. In this work, we were careful to use a high-quality data set by eliminating low-resolution structures and CDRs with high B-factors or high conformational energies. We used a distance function based on directional statistics and an effective clustering algorithm with affinity propagation. With this data set of over 300 nonredundant antibody structures, we were able to cover 28 CDR length combinations (e.g., L1 length 11, or "L1-11" in our CDR-length nomenclature) for L1, L2, L3, H1, and H2. The Chothia analysis covered only 20 CDR-lengths. Only four of these had more than one conformational cluster, of which two could easily be distinguished by gene source (mouse/human; kappa/lambda) and one could easily be distinguished purely by the presence and the positions of Pro residues (L3-9). Thus, using the Chothia analysis does not require the complicated set of "structure-determining residues" that is often assumed. Of our 28 CDR-lengths, 15 have multiple conformational clusters, including 10 for which the Chothia analysis had only one canonical class. We have a total of 72 clusters for non-H3 CDRs; approximately 85% of the non-H3 sequences can be assigned to a conformational cluster based on gene source and/or sequence. We found that earlier predictions of "bulged" versus "nonbulged" conformations based on the presence or the absence of anchor residues Arg/Lys94 and Asp101 of H3 have not held up, since all four combinations lead to a majority of conformations that are bulged. Thus, the earlier analyses have been significantly enhanced by the increased data. We believe that the new classification will lead to improved methods for antibody structure prediction and design. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:228 / 256
页数:29
相关论文
共 39 条
[1]   Standard conformations for the canonical structures of immunoglobulins [J].
AlLazikani, B ;
Lesk, AM ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (04) :927-948
[2]   SACS - Self-maintaining database of antibody crystal structure information [J].
Allcorn, LC ;
Martin, ACR .
BIOINFORMATICS, 2002, 18 (01) :175-181
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   Systematic analysis of the effect of multiple templates on the accuracy of comparative models of protein structure [J].
Chakravarty, Suvobrata ;
Godbole, Sucheta ;
Zhang, Bing ;
Berger, Seth ;
Sanchez, Roberto .
BMC STRUCTURAL BIOLOGY, 2008, 8
[5]   CANONICAL STRUCTURES FOR THE HYPERVARIABLE REGIONS OF IMMUNOGLOBULINS [J].
CHOTHIA, C ;
LESK, AM .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (04) :901-917
[6]   CONFORMATIONS OF IMMUNOGLOBULIN HYPERVARIABLE REGIONS [J].
CHOTHIA, C ;
LESK, AM ;
TRAMONTANO, A ;
LEVITT, M ;
SMITHGILL, SJ ;
AIR, G ;
SHERIFF, S ;
PADLAN, EA ;
DAVIES, D ;
TULIP, WR ;
COLMAN, PM ;
SPINELLI, S ;
ALZARI, PM ;
POLJAK, RJ .
NATURE, 1989, 342 (6252) :877-883
[7]   WebLogo: A sequence logo generator [J].
Crooks, GE ;
Hon, G ;
Chandonia, JM ;
Brenner, SE .
GENOME RESEARCH, 2004, 14 (06) :1188-1190
[8]  
Durbin R., 1998, Biological sequence analysis: probabilistic models of proteins and nucleic acids
[9]  
Eddy Sean R, 2009, Genome Inform, V23, P205
[10]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763