Consed: A graphical tool for sequence finishing

被引:2809
作者
Gordon, D [1 ]
Abajian, C [1 ]
Green, P [1 ]
机构
[1] Univ Washington, Dept Mol Biotechnol, Seattle, WA 98195 USA
关键词
D O I
10.1101/gr.8.3.195
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequencing of large clones or small genomes is generally done by the shotgun approach (Anderson et al. 1982). This has two phases: (1) a shotgun phase in which a number of reads are generated from random subclones and assembled into contigs followed by (2) a directed, or finishing phase in which the assembly is inspected for correctness and for various kinds of data anomalies (such as contaminant reads, unremoved vector sequence, and chimeric or deleted reads), additional data are collected to close gaps and resolve low quality regions, and editing is performed to correct assembly or base-calling errors. Finishing is currently a bottleneck in large-scale sequencing efforts, and throughput gains will depend both on reducing the need for human intervention and making it as efficient as possible. We have developed a finishing tool, consed, which attempts to implement these principles. A distinguishing feature relative to other programs is the use of error probabilities from our programs phred and phrap as an objective criterion to guide the entire Finishing process. More information is available at http: // www.genome.washington.edu/consed/consed.html.
引用
收藏
页码:195 / 202
页数:8
相关论文
共 10 条
[1]   COMPLETE SEQUENCE OF BOVINE MITOCHONDRIAL-DNA - CONSERVED FEATURES OF THE MAMMALIAN MITOCHONDRIAL GENOME [J].
ANDERSON, S ;
DEBRUIJN, MHL ;
COULSON, AR ;
EPERON, IC ;
SANGER, F ;
YOUNG, IG .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 156 (04) :683-717
[2]   A new DNA sequence assembly program [J].
Bonfield, JK ;
Smith, KF ;
Staden, R .
NUCLEIC ACIDS RESEARCH, 1995, 23 (24) :4992-4999
[3]   PREDICTING DNA DUPLEX STABILITY FROM THE BASE SEQUENCE [J].
BRESLAUER, KJ ;
FRANK, R ;
BLOCKER, H ;
MARKY, LA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (11) :3746-3750
[4]  
Dear S, 1992, DNA Seq, V3, P107, DOI 10.3109/10425179209034003
[5]   A SEQUENCE ASSEMBLY AND EDITING PROGRAM FOR EFFICIENT MANAGEMENT OF LARGE PROJECTS [J].
DEAR, S ;
STADEN, R .
NUCLEIC ACIDS RESEARCH, 1991, 19 (14) :3907-3911
[6]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[7]   PolyPhred: Automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing [J].
Nickerson, DA ;
Tobe, VO ;
Taylor, SL .
NUCLEIC ACIDS RESEARCH, 1997, 25 (14) :2745-2751
[8]   OPTIMIZATION OF THE ANNEALING TEMPERATURE FOR DNA AMPLIFICATION INVITRO [J].
RYCHLIK, W ;
SPENCER, WJ ;
RHOADS, RE .
NUCLEIC ACIDS RESEARCH, 1990, 18 (21) :6409-6412
[9]   IDENTIFICATION OF COMMON MOLECULAR SUBSEQUENCES [J].
SMITH, TF ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1981, 147 (01) :195-197
[10]  
Swindell S R, 1997, Methods Mol Biol, V70, P75, DOI 10.1385/0-89603-358-9:75