NetOglyc: Prediction of mucin type O-glycosylation sites based on sequence context and surface accessibility

被引:443
作者
Hansen, JE
Lund, O
Tolstrup, N
Gooley, AA
Williams, KL
Brunak, S
机构
[1] Tech Univ Denmark, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[2] Macquarie Univ, Sch Biol Sci, Sydney, NSW 2109, Australia
关键词
mucin type O-glycosylation; specificity; GalNAc transferase; prediction; neural networks; gp120; SIV; HIV;
D O I
10.1023/A:1006960004440
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The specificities of the UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases which link the carbohydrate GalNAc to the side-chain of certain serine and threonine residues in mucin type glycoproteins, are presently unknown. The specificity seems to be modulated by sequence context, secondary structure and surface accessibility. The sequence context of glycosylated threonines was found to differ from that of serine, and the sites were found to cluster. Non-clustered sites had a sequence context different from that of clustered sites. Charged residues were disfavoured at position -1 and +3. A jury of artificial neural networks was trained to recognize the sequence context and surface accessibility of 299 known and verified mucin type O-glycosylation sites extracted from O-GLYCBASE. The cross-validated NetOglyc network system correctly found 83% of the glycosylated and 90% of the non-glycosylated serine and threonine residues in independent test sets, thus proving more accurate than matrix statistics and vector projection methods. Predictions of O-glycosylation sites in the envelope glycoprotein gp120 from the primate lentiviruses HIV-1, HIV-2 and SIV are presented. The most conserved O-glycosylation signals in these evolutionary-related glycoproteins were found in their first hypervariable loop, V1. However, the strain variation for HIV-1 gp120 was significant. A computer server, available through WWW or E-mail, has been developed for prediction of mucin type O-glycosylation sites in proteins based on the amino acid sequence. The server addresses are http://www.cbs.dtu.dk/services/NetOGlyc/ and netOglyc@cbs.dtu.dk.
引用
收藏
页码:115 / 130
页数:16
相关论文
共 136 条
[1]   NATURAL HUMAN INTERFERON-ALPHA-2 IS O-GLYCOSYLATED [J].
ADOLF, GR ;
KALSNER, I ;
AHORN, H ;
MAURERFOGY, I ;
CANTELL, K .
BIOCHEMICAL JOURNAL, 1991, 276 :511-518
[2]   THE HUMAN MUC2 MUCIN APOPROTEIN APPEARS TO DIMERIZE BEFORE O-GLYCOSYLATION AND SHARES EPITOPES WITH THE INSOLUBLE MUCIN OF RAT SMALL-INTESTINE [J].
ASKER, N ;
BAECKSTROM, D ;
AXELSSON, MAB ;
CARLSTEDT, I ;
HANSSON, GC .
BIOCHEMICAL JOURNAL, 1995, 308 :873-880
[3]   The SWISS-PROT protein sequence data bank and its new supplement TREMBL [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1996, 24 (01) :21-25
[4]   THE PIR-INTERNATIONAL DATABASES [J].
BARKER, W ;
GEORGE, DG ;
MEWES, HW ;
PFEIFFER, F ;
TSUGITA, A .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3089-3092
[5]   cDNA cloning and expression of a novel human UDP-N-acetyl-alpha-D-galactosamine - Polypeptide N-acetylgalactosaminyltransferase, GalNAc-T3 [J].
Bennett, EP ;
Hassan, H ;
Clausen, H .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1996, 271 (29) :17006-17012
[6]  
BENNETT HPJ, 1986, INT J PEPT PROT RES, V27, P306
[7]   HUMAN-IMMUNODEFICIENCY-VIRUS TYPE-1 ENVELOPE GLYCOPROTEIN IS MODIFIED BY O-LINKED OLIGOSACCHARIDES [J].
BERNSTEIN, HB ;
TUCKER, SP ;
HUNTER, E ;
SCHUTZBACH, JS ;
COMPANS, RW .
JOURNAL OF VIROLOGY, 1994, 68 (01) :463-468
[8]   CHARACTERIZATION OF ANTISERA DISTINGUISHING CARBOHYDRATE STRUCTURES IN THE BETA-CARBOXYL-TERMINAL REGION OF HUMAN CHORIONIC-GONADOTROPIN [J].
BIRKEN, S ;
AGOSTO, G ;
AMR, S ;
NISULA, B ;
COLE, L ;
LEWIS, J ;
CANFIELD, R .
ENDOCRINOLOGY, 1988, 122 (05) :2054-2063
[9]   Cleavage site analysis in picornaviral polyproteins: Discovering cellular targets by neural networks [J].
Blom, N ;
Hansen, J ;
Blaas, D ;
Brunak, S .
PROTEIN SCIENCE, 1996, 5 (11) :2203-2216
[10]   HUMAN C1BAR INHIBITOR - PRIMARY STRUCTURE, CDNA CLONING, AND CHROMOSOMAL LOCALIZATION [J].
BOCK, SC ;
SKRIVER, K ;
NIELSEN, E ;
THOGERSEN, HC ;
WIMAN, B ;
DONALDSON, VH ;
EDDY, RL ;
MARRINAN, J ;
RADZIEJEWSKA, E ;
HUBER, R ;
SHOWS, TB ;
MAGNUSSON, S .
BIOCHEMISTRY, 1986, 25 (15) :4292-4301