Automated Gene Ontology annotation for anonymous sequence data

Cited by: 67
Authors
Hennig, S [1]
Groth, D [1]
Lehrach, H [1]
Affiliations
[1] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
DOI
10.1093/nar/gkg582
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular biology]
Subject classification codes
071010; 081704
Abstract
Gene Ontology (GO) is the most widely accepted attempt to construct a unified and structured vocabulary for the description of genes and their products in any organism. Annotation by GO terms is performed in most current genome projects; besides its generality, it has the advantage of being very convenient for computer-based classification methods. However, direct use of GO in small sequencing projects is not easy, especially for species not commonly represented in public databases. We present a software package (GOblet) that performs annotation based on GO terms for anonymous cDNA or protein sequences. It uses the species-independent GO structure and vocabulary, together with a series of protein databases collected from various sites, to perform a detailed GO annotation by sequence similarity searches. The sensitivity and the reference protein sets can be selected by the user. GOblet runs automatically and is available as a public service on our web server. The paper also addresses the reliability of automated GO annotations by using a reference set of more than 6000 human proteins. The GOblet server is accessible at http://goblet.molgen.mpg.de.
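The abstract describes annotation by similarity only in outline, so the following is a minimal sketch of the general idea and not GOblet's actual implementation. It assumes similarity hits are already available as (reference ID, E-value) pairs and that GO terms for the reference proteins are held in a plain dictionary. The function name `transfer_go_terms`, the `max_evalue` parameter (standing in for the user-selectable sensitivity), and the reference IDs are hypothetical.

```python
from collections import defaultdict


def transfer_go_terms(hits, go_annotations, max_evalue=1e-10):
    """Collect GO terms from reference proteins whose hits pass an E-value cutoff.

    hits: iterable of (reference_id, evalue) pairs for one query sequence.
    go_annotations: dict mapping reference_id -> iterable of GO term IDs.
    Returns a dict mapping each GO term to the sorted reference IDs supporting it.
    """
    support = defaultdict(set)
    for ref_id, evalue in hits:
        if evalue > max_evalue:
            continue  # hit is too weak for the chosen sensitivity
        for term in go_annotations.get(ref_id, ()):
            support[term].add(ref_id)
    return {term: sorted(refs) for term, refs in support.items()}


# Toy data: the reference IDs and E-values are made up for illustration.
hits = [("refA", 1e-50), ("refB", 1e-12), ("refC", 0.5)]
go_annotations = {
    "refA": ["GO:0004672", "GO:0005524"],
    "refB": ["GO:0005524"],
    "refC": ["GO:0003677"],
}
result = transfer_go_terms(hits, go_annotations, max_evalue=1e-10)
```

Here `refC` falls below the cutoff, so its term is not transferred, while `GO:0005524` is supported by two reference proteins. Keeping the supporting IDs alongside each term is one simple way to let a user judge how well an automated annotation is backed.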
Pages: 3712-3715
Page count: 4