A procedure for assessing GO annotation consistency

被引:34
作者
Dolan, ME
Ni, L
Camon, E
Blake, JA
机构
[1] Jackson Lab, Mouse Genome Informat, Bar Harbor, ME 04609 USA
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
关键词
D O I
10.1093/bioinformatics/bti1019
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The Gene Ontology (GO) is widely used to annotate molecular attributes of genes and gene products. Multiple groups undertaking functional annotations of genomes contribute their annotation sets to the GO database resource and these data are subsequently used in comparative functional analysis research. Although GO curators adhere to the same protocols and standards while assigning GO annotations, the specific procedure followed by each annotation group can vary. Since differences in application of annotation standards would dilute the effectiveness of comparative analysis, methods for assessing annotation consistency are essential. The development of methodologies that are broadly applicable for the assessment of GO annotation consistency is an important issue for the comparative genomics community. Results: We have developed a methodology for assessing the consistency of GO annotations provided by different annotation groups. The method is completely general and can be applied to compare any two sets of GO annotations. This is the first attempt to assess cross-species GO annotation consistency. Our method compares annotation sets utilizing the hierarchical structure of the GO to compare GO annotations between orthologous gene pairs. The method produces a report on the annotation consistency and inconsistency for each orthologous pair. We present results obtained by comparing GO annotations for mouse and human gene sets.
引用
收藏
页码:I136 / I143
页数:8
相关论文
共 13 条
  • [1] Ashburner M, 2001, GENOME RES, V11, P1425
  • [2] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [3] Connecting sequence and biology in the laboratory mouse
    Baldarelli, RM
    Hill, DP
    Blake, JA
    Adachi, J
    Furuno, M
    Bradt, D
    Corbani, LE
    Cousins, S
    Frazer, KS
    Qi, D
    Yang, LL
    Ramachandran, S
    Reed, D
    Zhu, YX
    Kasukawa, T
    Ringwald, M
    King, BL
    Maltais, LJ
    McKenzie, LM
    Schriml, LM
    Maglott, D
    Church, DM
    Pruitt, K
    Eppig, JT
    Richardson, JE
    Kadin, JA
    Bult, CJ
    [J]. GENOME RESEARCH, 2003, 13 (6B) : 1505 - 1519
  • [4] MGD: the Mouse Genome Database
    Blake, JA
    Richardson, JE
    Bult, RJ
    Kadin, JA
    Eppig, JT
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 193 - 195
  • [5] The Mouse Genome Database (MGD): integrating biology with the genome
    Bult, CJ
    Blake, JA
    Richardson, JE
    Kadin, JA
    Eppig, JT
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D476 - D481
  • [6] The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology
    Camon, E
    Magrane, M
    Barrell, D
    Lee, V
    Dimmer, E
    Maslen, J
    Binns, D
    Harte, N
    Lopez, R
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D262 - D266
  • [7] The Gene Ontology (GO) database and informatics resource
    Harris, MA
    Clark, J
    Ireland, A
    Lomax, J
    Ashburner, M
    Foulger, R
    Eilbeck, K
    Lewis, S
    Marshall, B
    Mungall, C
    Richter, J
    Rubin, GM
    Blake, JA
    Bult, C
    Dolan, M
    Drabkin, H
    Eppig, JT
    Hill, DP
    Ni, L
    Ringwald, M
    Balakrishnan, R
    Cherry, JM
    Christie, KR
    Costanzo, MC
    Dwight, SS
    Engel, S
    Fisk, DG
    Hirschman, JE
    Hong, EL
    Nash, RS
    Sethuraman, A
    Theesfeld, CL
    Botstein, D
    Dolinski, K
    Feierbach, B
    Berardini, T
    Mundodi, S
    Rhee, SY
    Apweiler, R
    Barrell, D
    Camon, E
    Dimmer, E
    Lee, V
    Chisholm, R
    Gaudet, P
    Kibbe, W
    Kishore, R
    Schwarz, EM
    Sternberg, P
    Gwinn, M
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D258 - D261
  • [8] Strategies for biological annotation of mammalian systems: Implementing gene ontologies in mouse genome informatics
    Hill, DP
    Davis, AP
    Richardson, JE
    Corradi, JP
    Ringwald, M
    Eppig, JT
    Blake, JA
    [J]. GENOMICS, 2001, 74 (01) : 121 - 128
  • [9] Revised nomenclature for the mammalian long-chain acyl-CoA synthetase gene family
    Mashek, DG
    Bornfeldt, KE
    Coleman, RA
    Berger, J
    Bernlohr, DA
    Black, P
    DiRusso, CC
    Farber, SA
    Guo, W
    Hashimoto, N
    Khodiyar, V
    Kuypers, FA
    Maltais, LJ
    Nebert, DW
    Renieri, A
    Schaffer, JE
    Stahl, A
    Watkins, PA
    Vasiliou, V
    Yamamoto, TT
    [J]. JOURNAL OF LIPID RESEARCH, 2004, 45 (10) : 1958 - 1961
  • [10] Comparison of cytochrome P450 (CYP) genes from the mouse and human genomes, including nomenclature recommendations for genes, pseudogenes and alternative-splice variants
    Nelson, DR
    Zeldin, DC
    Hoffman, SMG
    Maltais, LJ
    Wain, HM
    Nebert, DW
    [J]. PHARMACOGENETICS, 2004, 14 (01): : 1 - 18