Base-calling of automated sequencer traces using phred. I. Accuracy assessment

Cited by: 3819
Authors
Ewing, B
Hillier, L
Wendl, MC
Green, P [1]
Affiliations
[1] Univ Washington, Dept Mol Biotechnol, Seattle, WA 98195 USA
[2] Washington Univ, Sch Med, Genome Sequencing Ctr, St Louis, MO 63108 USA
Source
GENOME RESEARCH | 1998 / Vol. 8 / Issue 03
Keywords
DOI
10.1101/gr.8.3.175
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The availability of massive amounts of DNA sequence information has begun to revolutionize the practice of biology. As a result, current large-scale sequencing output, while impressive, is not adequate to keep pace with growing demand and, in particular, is far short of what will be required to obtain the 3-billion-base human genome sequence by the target date of 2005. To reach this goal, improved automation will be essential, and it is particularly important that human involvement in sequence data processing be significantly reduced or eliminated. Progress in this respect will require both improved accuracy of the data processing software and reliable accuracy measures to reduce the need for human involvement in error correction and make human review more efficient. Here, we describe one step toward that goal: a base-calling program for automated sequencer traces, phred, with improved accuracy. phred appears to be the first base-calling program to achieve a lower error rate than the ABI software, averaging 40%-50% fewer errors in the data sets examined independent of position in read, machine running conditions, or sequencing chemistry.
Pages: 175 - 185
Page count: 11