Application of a new probabilistic model for recognizing complex patterns in glycans

被引:9
作者
Aoki, Kiyoko F. [1 ]
Ueda, Nobuhisa [1 ]
Yamaguchi, Atsuko [1 ]
Kanehisa, Minoru [1 ]
Akutsu, Tatsuya [1 ]
Mamitsuka, Hiroshi [1 ]
机构
[1] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Uji, Kyoto 6110011, Japan
关键词
D O I
10.1093/bioinformatics/bth916
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The study of carbohydrate sugar chains, or glycans, has been one of slow progress mainly due to the difficulty in establishing standard methods for analyzing their structures and biosynthesis. Glycans are generally tree structures that are more complex than linear DNA or protein sequences, and evidence shows that patterns in glycans may be present that spread across siblings and into further regions that are not limited by the edges in the actual tree structure itself. Current models were not able to capture such patterns. Results: We have applied a new probabilistic model, called probabilistic sibling-dependent tree Markov model (PSTMM), which is able to inherently capture such complex patterns of glycans. Not only is the ability to capture such patterns important in itself, but this also implies that PSTMM is capable of performing multiple tree structure alignments efficiently. We prove through experimentation on actual glycan data that this new model is extremely useful for gaining insight into the hidden, complex patterns of glycans, which are so crucial for the development and functioning of higher level organisms. Furthermore, we also show that this model can be additionally utilized as an innovative approach to multiple tree alignment, which has not been applied to glycan chains before. This extension on the usage of PSTMM may be a major step forward for not only the structural analysis of glycans, but it may consequently prove useful for discovering clues into their function.
引用
收藏
页码:6 / 14
页数:9
相关论文
共 22 条
[1]  
Aoki Kiyoko F, 2003, Genome Inform, V14, P134
[2]   Bayesian gene/species tree reconciliation and orthology analysis using MCMC [J].
Arvestad, Lars ;
Berglund, Ann-Charlotte ;
Lagergren, Jens ;
Sennblad, Bengt .
BIOINFORMATICS, 2003, 19 :i7-i15
[3]   Chemical glycobiology [J].
Bertozzi, CR ;
Kiessling, LL .
SCIENCE, 2001, 291 (5512) :2357-2364
[4]   Fast recovery of evolutionary trees with thousands of nodes [J].
Csúrös, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :277-297
[5]  
Diligenti M, 2003, IEEE T PATTERN ANAL, V25, P519, DOI 10.1109/TPAMI.2003.1190578
[6]  
DRICKAMER K, 1988, J BIOL CHEM, V263, P9557
[7]  
Durbin R., 1998, BIOL SEQUENCE ANAL
[8]   The hierarchical hidden Markov model: Analysis and applications [J].
Fine, S ;
Singer, Y ;
Tishby, N .
MACHINE LEARNING, 1998, 32 (01) :41-62
[9]  
Friedman N., 1998, Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference (1998), P129
[10]   Local similarity in RNA secondary structures [J].
Höchsmann, M ;
Töller, T ;
Giegerich, R ;
Kurtz, S .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :159-168