On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report

被引:70
作者
Thomas, Paul D. [1 ]
Wood, Valerie [2 ,3 ]
Mungall, Christopher J. [4 ]
Lewis, Suzanna E. [4 ]
Blake, Judith A. [5 ]
机构
[1] Univ So Calif, Dept Prevent Med, Div Bioinformat, Los Angeles, CA 90089 USA
[2] Univ Cambridge, Cambridge Syst Biol Ctr, Cambridge, England
[3] Univ Cambridge, Dept Biochem, Cambridge CB2 1QW, England
[4] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Genom Div, Berkeley, CA 94720 USA
[5] Jackson Lab, Bar Harbor, ME 04609 USA
基金
美国国家卫生研究院;
关键词
DATABASE;
D O I
10.1371/journal.pcbi.1002386
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the "functional similarity'' between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture'' (or, more properly, the "ortholog functional conservation hypothesis''). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption'' (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis.
引用
收藏
页数:7
相关论文
共 18 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   The GOA database in 2009-an integrated Gene Ontology Annotation resource [J].
Barrell, Daniel ;
Dimmer, Emily ;
Huntley, Rachael P. ;
Binns, David ;
O'Donovan, Claire ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D396-D403
[3]   The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium [J].
Berardini, Tanya Z. ;
Li, Donghui ;
Huala, Eva ;
Bridges, Susan ;
Burgess, Shane ;
McCarthy, Fiona ;
Carbon, Seth ;
Lewis, Suzanna E. ;
Mungall, Christopher J. ;
Abdulla, Amina ;
Wood, Valerie ;
Feltrin, Erika ;
Valle, Giorgio ;
Chisholm, Rex L. ;
Fey, Petra ;
Gaudet, Pascale ;
Kibbe, Warren ;
Basu, Siddhartha ;
Bushmanova, Yulia ;
Eilbeck, Karen ;
Siegele, Deborah A. ;
McIntosh, Brenley ;
Renfro, Daniel ;
Zweifel, Adrienne ;
Hu, James C. ;
Ashburner, Michael ;
Tweedie, Susan ;
Alam-Faruque, Yasmin ;
Apweiler, Rolf ;
Auchinchloss, Andrea ;
Bairoch, Amos ;
Barrell, Daniel ;
Binns, David ;
Blatter, Marie-Claude ;
Bougueleret, Lydie ;
Boutet, Emmanuel ;
Breuza, Lionel ;
Bridge, Alan ;
Browne, Paul ;
Chan, Wei Mun ;
Coudert, Elizabeth ;
Daugherty, Louise ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Estreicher, Anne ;
Famiglietti, Livia ;
Ferro-Rojas, Serenella ;
Feuermann, Marc ;
Foulger, Rebecca ;
Gruaz-Gumowski, Nadine .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D331-D335
[4]   The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics [J].
Blake, Judith A. ;
Bult, Carol J. ;
Kadin, James A. ;
Richardson, Joel E. ;
Eppig, Janan T. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D842-D848
[5]   Protein Evolution by Molecular Tinkering: Diversification of the Nuclear Receptor Superfamily from a Ligand-Dependent Ancestor [J].
Bridgham, Jamie T. ;
Eick, Geeta N. ;
Larroux, Claire ;
Deshpande, Kirti ;
Harms, Michael J. ;
Gauthier, Marie E. A. ;
Ortlund, Eric A. ;
Degnan, Bernard M. ;
Thornton, Joseph W. .
PLOS BIOLOGY, 2010, 8 (10)
[6]   Evolution of hormone-receptor complexity by molecular exploitation [J].
Bridgham, JT ;
Carroll, SM ;
Thornton, JW .
SCIENCE, 2006, 312 (5770) :97-101
[7]   Estrogen receptors and human disease [J].
Deroo, BJ ;
Korach, KS .
JOURNAL OF CLINICAL INVESTIGATION, 2006, 116 (03) :561-570
[8]   DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1970, 19 (02) :99-&
[9]   When orthologs diverge between human and mouse [J].
Gharib, Walid H. ;
Robinson-Rechavi, Marc .
BRIEFINGS IN BIOINFORMATICS, 2011, 12 (05) :436-441
[10]   Gene Ontology annotations: what they mean and where they come from [J].
Hill, David P. ;
Smith, Barry ;
McAndrews-Hill, Monica S. ;
Blake, Judith A. .
BMC BIOINFORMATICS, 2008, 9 (Suppl 5)