An improved sequence assembly program

Cited by: 90
Author
Huang, XQ
Affiliation
[1] Department of Computer Science, Michigan Technological University, Houghton
DOI
10.1006/geno.1996.0155
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
We describe a number of improvements to the CAP sequence assembly program. These improvements include the development of methods for solving the problem caused by simple repetitive sequences, for automatically editing fragment alignments and consensus sequences, and for identifying chimeric fragments. The improved program (CAP2) assembled each of seven data sets, six of which contain repetitive sequences of very strong similarity, into a single sequence. As an example, CAP2 assembled a set of 1467 fragments into a single sequence of 73,328 bp that has only eight differences from the original sequence. The effects of fragment length, coverage, and error rate on the performance of CAP2 were evaluated using artificial data sets. © 1996 Academic Press, Inc.
Pages: 21-31
Page count: 11