LINUCS:: LInear Notation for Unique Description of Carbohydrate Sequences

被引:83
作者
Bohne-Lang, A
Lang, E
Förster, T
von der Lieth, CW
机构
[1] Deutsch Krebsforschungszentrum, Cent Spect Dept R0400, D-69120 Heidelberg, Germany
[2] Univ Appl Sci Darmstadt, Dept Informat & Knowledge Management, D-64295 Darmstadt, Germany
关键词
carbohydrate sequence; canonical description; glycodatabase; glycobioinformatics;
D O I
10.1016/S0008-6215(01)00230-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The use of proteomics databases has become indispensable for daily work of molecular biologists, but this situation has not yet been achieved for carbohydrate applications. One obvious reason is that existing data collections are only rarely annotated and no cross-linking to other resources exists. The existence of a generally accepted linear, canonical description for carbohydrates which can be readily processed by computers will enable efficient automatic cross-linking of distributed carbohydrate data collections by serving as a unique and unambiguous database access key. Various possibilities to derive a canonical notation are discussed. They can be divided into attempts that require structure description alone and alternatives that profit from the fact that a preferred graph direction (non-reducing to reducing end) exists within the structure. To open a fruitful discussion among glycoscientists a possible solution is presented where the reducing monosaccharide unit is selected as graph root and linkage information is used to define the priority of the various branches. A Web interface (http://www.dkfz.de/spec/linucs/) has been created that directly converts the commonly used extended representation of complex carbohydrates into the preferred canonical description or into its inverted form. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 16 条
  • [1] Albersheim P, 1991, Glycobiology, V1, P113, DOI 10.1093/glycob/1.2.113
  • [2] W3-SWEET: Carbohydrate modeling by Internet
    Bohne, A
    Lang, E
    von der Lieth, CW
    [J]. JOURNAL OF MOLECULAR MODELING, 1998, 4 (01) : 33 - 43
  • [3] Cooper CA, 1999, ELECTROPHORESIS, V20, P3589, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3589::AID-ELPS3589>3.0.CO
  • [4] 2-M
  • [5] GlycoSuiteDB: a new curated relational database of glycoprotein glycan structures and their biological sources
    Cooper, CA
    Harrison, MJ
    Wilkins, MR
    Packer, NH
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 332 - 335
  • [6] COUTINHO PM, 1999, CARBOHYDRATE ACTIVE
  • [7] THE COMPLEX CARBOHYDRATE STRUCTURE DATABASE
    DOUBET, S
    BOCK, K
    SMITH, D
    DARVILL, A
    ALBERSHEIM, P
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1989, 14 (12) : 475 - 477
  • [8] CARBBANK
    DOUBET, S
    ALBERSHEIM, P
    [J]. GLYCOBIOLOGY, 1992, 2 (06) : 505 - 505
  • [9] 3D connectivity indices in QSPR/QSAR studies
    Estrada, E
    Molina, E
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (03): : 791 - 797
  • [10] O-GLYCBASE version 2.0: A revised database of O-glycosylated proteins
    Hansen, JE
    Lund, O
    Rapacki, K
    Brunak, S
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (01) : 278 - 282