The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species

被引:353
作者
Quackenbush, J [1 ]
Cho, J [1 ]
Lee, D [1 ]
Liang, F [1 ]
Holt, I [1 ]
Karamycheva, S [1 ]
Parvizi, B [1 ]
Pertea, G [1 ]
Sultana, R [1 ]
White, J [1 ]
机构
[1] Inst Genom Res, Rockville, MD 20850 USA
关键词
D O I
10.1093/nar/29.1.159
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http:// www.tigr.org/tdb/tgi.shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.
引用
收藏
页码:159 / 164
页数:6
相关论文
共 12 条
[1]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   ESTABLISHING A HUMAN TRANSCRIPT MAP [J].
BOGUSKI, MS ;
SCHULER, GD .
NATURE GENETICS, 1995, 10 (04) :369-371
[4]   Alternative gene form discovery and candidate gene selection from gene indexing projects [J].
Burke, J ;
Wang, H ;
Hide, W ;
Davison, DB .
GENOME RESEARCH, 1998, 8 (03) :276-290
[5]   DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1970, 19 (02) :99-&
[6]   CAP3: A DNA sequence assembly program [J].
Huang, XQ ;
Madan, A .
GENOME RESEARCH, 1999, 9 (09) :868-877
[7]   A tool for analyzing and annotating genomic sequences [J].
Huang, XQ ;
Adams, MD ;
Zhou, H ;
Kerlavage, AR .
GENOMICS, 1997, 46 (01) :37-45
[8]   An optimized protocol for analysis of EST sequences [J].
Liang, F ;
Holt, I ;
Pertea, G ;
Karamycheva, S ;
Salzberg, SL ;
Quackenbush, J .
NUCLEIC ACIDS RESEARCH, 2000, 28 (18) :3657-3665
[9]   Gene Index analysis of the human genome estimates approximately 120,000 genes [J].
Liang, F ;
Holt, I ;
Pertea, G ;
Karamycheva, S ;
Salzberg, SL ;
Quackenbush, J .
NATURE GENETICS, 2000, 25 (02) :239-240
[10]   Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana [J].
Lin, XY ;
Kaul, SS ;
Rounsley, S ;
Shea, TP ;
Benito, MI ;
Town, CD ;
Fujii, CY ;
Mason, T ;
Bowman, CL ;
Barnstead, M ;
Feldblyum, TV ;
Buell, CR ;
Ketchum, KA ;
Lee, J ;
Ronning, CM ;
Koo, HL ;
Moffat, KS ;
Cronin, LA ;
Shen, M ;
Pai, G ;
Van Aken, S ;
Umayam, L ;
Tallon, LJ ;
Gill, JE ;
Adams, MD ;
Carrera, AJ ;
Creasy, TH ;
Goodman, HM ;
Somerville, CR ;
Copenhaver, GP ;
Preuss, D ;
Nierman, WC ;
White, O ;
Eisen, JA ;
Salzberg, SL ;
Fraser, CM ;
Venter, JC .
NATURE, 1999, 402 (6763) :761-+