Cross-referencing eukaryotic genomes: TIGR orthologous gene alignments (TOGA)

被引:113
作者
Lee, Y [1 ]
Sultana, R [1 ]
Pertea, G [1 ]
Cho, J [1 ]
Karamycheva, S [1 ]
Tsai, J [1 ]
Parvizi, B [1 ]
Cheung, F [1 ]
Antonescu, V [1 ]
White, J [1 ]
Holt, I [1 ]
Liang, F [1 ]
Quackenbush, J [1 ]
机构
[1] Inst Genom Res, Rockville, MD 20850 USA
关键词
D O I
10.1101/gr.212002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Comparative genomics promises to rapidly accelerate the identification and functional classification of biologically important human genes. We developed the TIGR Orthologous Gene Alignment (TOGA; http://www.tigr.or,-/tdb/to,-a/toga.shtml) database to provide a cross-reference between fully and partially sequenced eukaryotic transcribed sequences. Starting with the assembled expressed Sequence tag (EST) and gene sequences that comprise the 28 TIGR Gene Indices, we used high-stringency pair-wise sequence searches and a reflexive, transitive closure process to associate sequence-specific best hits, generating 32,652 tentative ortholog groups (TOGs). This has allowed us to identify putative orthologs and paralogs for known genes, as well as those that exist only as uncharacterized ESTs and to provide links to additional information including genome sequence and mapping data. TOGA provides an important new resource for the analysis of gene function in eukaryotes. In addition, an analysis of the most widely represented sequences can begin to provide insight into eukaryotic biological processes.
引用
收藏
页码:493 / 502
页数:10
相关论文
共 50 条
  • [41] Sherr CJ, 2000, CANCER RES, V60, P3689
  • [42] CDK inhibitors:: positive and negative regulators of G1-phase progression
    Sherr, CJ
    Roberts, JM
    [J]. GENES & DEVELOPMENT, 1999, 13 (12) : 1501 - 1512
  • [43] Smith DF, 1998, PHARMACOL REV, V50, P493
  • [44] Proteins shared by the transcription and translation machines
    Squires, CL
    Zaporojets, D
    [J]. ANNUAL REVIEW OF MICROBIOLOGY, 2000, 54 : 775 - 798
  • [45] A genomic perspective on protein families
    Tatusov, RL
    Koonin, EV
    Lipman, DJ
    [J]. SCIENCE, 1997, 278 (5338) : 631 - 637
  • [46] The COG database: a tool for genome-scale analysis of protein functions and evolution
    Tatusov, RL
    Galperin, MY
    Natale, DA
    Koonin, EV
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 33 - 36
  • [47] Regulation of the G2/M transition by p53
    Taylor, WR
    Stark, GR
    [J]. ONCOGENE, 2001, 20 (15) : 1803 - 1815
  • [48] CLUSTAL-W - IMPROVING THE SENSITIVITY OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT THROUGH SEQUENCE WEIGHTING, POSITION-SPECIFIC GAP PENALTIES AND WEIGHT MATRIX CHOICE
    THOMPSON, JD
    HIGGINS, DG
    GIBSON, TJ
    [J]. NUCLEIC ACIDS RESEARCH, 1994, 22 (22) : 4673 - 4680
  • [49] The sequence of the human genome
    Venter, JC
    Adams, MD
    Myers, EW
    Li, PW
    Mural, RJ
    Sutton, GG
    Smith, HO
    Yandell, M
    Evans, CA
    Holt, RA
    Gocayne, JD
    Amanatides, P
    Ballew, RM
    Huson, DH
    Wortman, JR
    Zhang, Q
    Kodira, CD
    Zheng, XQH
    Chen, L
    Skupski, M
    Subramanian, G
    Thomas, PD
    Zhang, JH
    Miklos, GLG
    Nelson, C
    Broder, S
    Clark, AG
    Nadeau, C
    McKusick, VA
    Zinder, N
    Levine, AJ
    Roberts, RJ
    Simon, M
    Slayman, C
    Hunkapiller, M
    Bolanos, R
    Delcher, A
    Dew, I
    Fasulo, D
    Flanigan, M
    Florea, L
    Halpern, A
    Hannenhalli, S
    Kravitz, S
    Levy, S
    Mobarry, C
    Reinert, K
    Remington, K
    Abu-Threideh, J
    Beasley, E
    [J]. SCIENCE, 2001, 291 (5507) : 1304 - +
  • [50] p53: Death star
    Vousden, KH
    [J]. CELL, 2000, 103 (05) : 691 - 694