Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding

Cited by: 372
Authors
Petrescu, AJ
Milac, AL
Petrescu, SM
Dwek, RA
Wormald, MR
Affiliations
[1] Univ Oxford, Dept Biochem, Oxford Glycobiol Inst, Oxford OX1 3QU, England
[2] Romanian Acad, Inst Biochem, Bucharest 77700 17, Romania
Funding
Wellcome Trust (UK);
Keywords
glycan-protein linkage; N-glycosylation sites; occupancy; protein folding; X-ray diffraction;
DOI
10.1093/glycob/cwh008
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular biology];
Discipline classification code
071010 ; 081704 ;
Abstract
We recently reported statistical analysis of structural data on glycosidic linkages. Here we extend this analysis to the glycan-protein linkage, and the peptide primary, secondary, and tertiary structures around N-glycosylation sites. We surveyed 506 glycoproteins in the Protein Data Bank crystallographic database, giving 2592 glycosylation sequons (1683 occupied) and generated a database of 626 nonredundant sequons with 386 occupied. Deviations in the expected amino acid composition were seen around occupied asparagines, particularly an increased occurrence of aromatic residues before the asparagine and threonine at position +2. Glycosylation alters the asparagine side chain torsion angle distribution and reduces its flexibility. There is an elevated probability of finding glycosylation sites in which secondary structure changes. An 11-class taxonomy was developed to describe protein surface geometry around glycosylation sites. Thirty-three percent of the occupied sites are on exposed convex surfaces, 10% in deep recesses and 20% on the edge of grooves with the glycan filling the cleft. A surprisingly large number of glycosylated asparagine residues have a low accessibility. The incidence of aromatic amino acids brought into close contact with the glycan by the folding process is higher than their normal levels on the surface or in the protein core. These data have significant implications for control of sequon occupancy and evolutionary selection of glycosylation sites and are discussed in relation to mechanisms of protein fold stabilization and regional quality control of protein folding. Hydrophobic protein-glycan interactions and the low accessibility of glycosylation sites in folded proteins are common features and may be critical in mediating these functions.
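As an illustration of the sequon counts quoted in the abstract, the following is a minimal sketch of how candidate N-glycosylation sequons might be located in a protein sequence and how an occupancy fraction follows from the reported numbers. It assumes the conventional Asn-X-Ser/Thr definition with X not proline, which the abstract itself does not spell out; the function names `find_sequons` and `occupied_fraction` are hypothetical and are not taken from the paper.

```python
import re

# Assumption: a sequon is Asn-X-Ser/Thr with X != Pro (the conventional
# definition; the abstract does not state which definition was used).
# A lookahead is used so that overlapping sequons (e.g. "NNTT") are all found.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str) -> list[int]:
    """Return 0-based indices of the Asn residue of each candidate sequon."""
    return [m.start() for m in SEQUON.finditer(sequence.upper())]


def occupied_fraction(occupied: int, total: int) -> float:
    """Fraction of sequons that are occupied (glycosylated)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return occupied / total


if __name__ == "__main__":
    # Toy sequence, not a real protein: N at index 5 is followed by Pro
    # and is therefore skipped.
    print(find_sequons("MNGTANPSNAS"))
    # Counts taken from the abstract: 386 occupied of 626 nonredundant sequons.
    print(f"{occupied_fraction(386, 626):.3f}")
```

With the abstract's figures, this gives roughly 62% occupancy for the nonredundant set (386 of 626) and roughly 65% for the full survey (1683 of 2592). Whether a sequon is actually occupied cannot be inferred from sequence alone; in the study it was read from the crystal structures.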
Pages: 103-114
Number of pages: 12