Resolving the Ortholog Conjecture: Orthologs Tend to Be Weakly, but Significantly, More Similar in Function than Paralogs

被引:153
作者
Altenhoff, Adrian M. [1 ,2 ]
Studer, Romain A. [2 ,3 ,4 ]
Robinson-Rechavi, Marc [2 ,3 ]
Dessimoz, Christophe [1 ,2 ,5 ]
机构
[1] ETH, Dept Comp Sci, Zurich, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] Univ Lausanne, Dept Ecol & Evolut, Lausanne, Switzerland
[4] UCL, Div Biosci, Inst Struct & Mol Biol, London, England
[5] EMBL European Bioinformat Inst, Cambridge, England
基金
瑞士国家科学基金会;
关键词
CONSERVATION; DATABASE; SEQUENCE; GENES;
D O I
10.1371/journal.pcbi.1002514
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
The function of most proteins is not determined experimentally, but is extrapolated from homologs. According to the "ortholog conjecture'', or standard model of phylogenomics, protein function changes rapidly after duplication, leading to paralogs with different functions, while orthologs retain the ancestral function. We report here that a comparison of experimentally supported functional annotations among homologs from 13 genomes mostly supports this model. We show that to analyze GO annotation effectively, several confounding factors need to be controlled: authorship bias, variation of GO term frequency among species, variation of background similarity among species pairs, and propagated annotation bias. After controlling for these biases, we observe that orthologs have generally more similar functional annotations than paralogs. This is especially strong for sub-cellular localization. We observe only a weak decrease in functional similarity with increasing sequence divergence. These findings hold over a large diversity of species; notably orthologs from model organisms such as E. coli, yeast or mouse have conserved function with human proteins.
引用
收藏
页数:10
相关论文
共 30 条
[1]
OMA 2011: orthology inference among 1000 complete genomes [J].
Altenhoff, Adrian M. ;
Schneider, Adrian ;
Gonnet, Gaston H. ;
Dessimoz, Christophe .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D289-D294
[2]
Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods [J].
Altenhoff, Adrian M. ;
Dessimoz, Christophe .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
[3]
The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[4]
The GOA database in 2009-an integrated Gene Ontology Annotation resource [J].
Barrell, Daniel ;
Dimmer, Emily ;
Huntley, Rachael P. ;
Binns, David ;
O'Donovan, Claire ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D396-D403
[5]
Predicting function: From genes to genomes and back [J].
Bork, P ;
Dandekar, T ;
Diaz-Lazcoz, Y ;
Eisenhaber, F ;
Huynen, M ;
Yuan, YP .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 283 (04) :707-725
[6]
The what, where, how and why of gene ontology-a primer for bioinformaticians [J].
du Plessis, Louis ;
Skunca, Nives ;
Dessimoz, Christophe .
BRIEFINGS IN BIOINFORMATICS, 2011, 12 (06) :723-735
[7]
Phylogenomics: Improving functional predictions for uncharacterized genes by evolutionary analysis [J].
Eisen, JA .
GENOME RESEARCH, 1998, 8 (03) :163-167
[8]
Ensembl 2012 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Gordon, Leo ;
Hendrix, Maurice ;
Hourlier, Thibaut ;
Johnson, Nathan ;
Kaehaeri, Andreas K. ;
Keefe, Damian ;
Keenan, Stephen ;
Kinsella, Rhoda ;
Komorowska, Monika ;
Koscielny, Gautier ;
Kulesha, Eugene ;
Larsson, Pontus ;
Longden, Ian ;
McLaren, William ;
Muffato, Matthieu ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Riat, Harpreet Singh ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sobral, Daniel ;
Tang, Y. Amy ;
Taylor, Kieron ;
Trevanion, Stephen ;
Vandrovcova, Jana ;
White, Simon ;
Wilson, Mark ;
Wilder, Steven P. ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Durbin, Richard ;
Fernandez-Suarez, Xose M. ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D84-D90
[9]
Domain architecture conservation in orthologs [J].
Forslund, Kristoffer ;
Pekkari, Isabella ;
Sonnhammer, Erik L. L. .
BMC BIOINFORMATICS, 2011, 12
[10]
The Gene Ontology's Reference Genome Project: A Unified Framework for Functional Annotation across Species [J].
Gaudet, Pascale ;
Chisholm, Rex ;
Berardini, Tanya ;
Dimmer, Emily ;
Engel, Stacia R. ;
Fey, Petra ;
Hill, David P. ;
Howe, Doug ;
Hu, James C. ;
Huntley, Rachael ;
Khodiyar, Varsha K. ;
Kishore, Ranjana ;
Li, Donghui ;
Lovering, Ruth C. ;
McCarthy, Fiona ;
Ni, Li ;
Petri, Victoria ;
Siegele, Deborah A. ;
Tweedie, Susan ;
Van Auken, Kimberly ;
Wood, Valerie ;
Basu, Siddhartha ;
Carbon, Seth ;
Dolan, Mary ;
Mungall, Christopher J. ;
Dolinski, Kara ;
Thomas, Paul ;
Ashburner, Michael ;
Blake, Judith A. ;
Cherry, J. Michael ;
Lewis, Suzanna E. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)