A graph based algorithm for generating EST consensus sequences

被引：14

作者：

Malde, K ^{[1
]}

Coward, E

Jonassen, I

机构：

[1] Univ Bergen, Dept Informat, N-5020 Bergen, Norway

[2] Univ Bergen, Bergen Ctr Computat Sci, Computat Biol Unit, N-5020 Bergen, Norway

来源：

BIOINFORMATICS | 2005年 / 21卷 / 08期

关键词：

D O I：

10.1093/bioinformatics/bti184

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Motivation: EST sequences constitute an abundant, yet error prone resource for computational biology. Expressed sequences are important in gene discovery and identification, and they are also crucial for the discovery and classification of alternative splicing. An important challenge when processing EST sequences is the reconstruction of mRNA by assembling EST clusters into consensus sequences. Results: In contrast to the more established assembly tools, we propose an algorithm that constructs a graph over sequence fragments of fixed size, and produces consensus sequences as traversals of this graph. We provide a tool implementing this algorithm, and perform an experiment where the consensus sequences produced by our implementation, as well as by currently available tools, are compared to mRNA. The results show that our proposed algorithm in a majority of the cases produces consensus of higher quality than the established sequence assemblers and at a competitive speed.

引用

页码：1371 / 1375

页数：5

共 19 条

[1]

Batzoglou S, 2002, GENOME RES, V12, P177, DOI 10.1101/gr.208902

[2] Common intervals and sorting by reversals: a marriage of necessity [J].