An optimized protocol for analysis of EST sequences

Cited by: 99
Authors
Liang, F [1]
Holt, I [1]
Pertea, G [1]
Karamycheva, S [1]
Salzberg, SL [1]
Quackenbush, J [1]
Affiliation
[1] Inst Genome Res, Rockville, MD 20850 USA
DOI
10.1093/nar/28.18.3657
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The vast body of Expressed Sequence Tag (EST) data in the public databases provides an important resource for comparative and functional genomics studies and an invaluable tool for the annotation of genomic sequences. We have developed a rigorous protocol for reconstructing the sequences of transcribed genes from EST and gene sequence fragments. A key element in developing this protocol has been the evaluation of a number of sequence assembly programs to determine which most faithfully reproduce transcript sequences from EST data. The TIGR Gene Indices constructed using this protocol for human, mouse, rat and a variety of other plant and animal models have demonstrated their utility in a range of applications and are freely available to the scientific research community.
Pages: 3657-3665
Page count: 9