eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences

被引:1698
作者
Huerta-Cepas, Jaime [1 ]
Szklarczyk, Damian [2 ,3 ]
Forslund, Kristoffer [1 ]
Cook, Helen [4 ]
Heller, Davide [2 ,3 ]
Walter, Mathias C. [5 ]
Rattei, Thomas [6 ]
Mende, Daniel R. [7 ]
Sunagawa, Shinichi [1 ]
Kuhn, Michael [8 ]
Jensen, Lars Juhl [4 ]
von Mering, Christian [2 ,3 ]
Bork, Peer [1 ,9 ,10 ,11 ]
机构
[1] European Mol Biol Lab, Struct & Computat Biol Unit, Heidelberg, Germany
[2] Univ Zurich, Inst Mol Life Sci, CH-8057 Zurich, Switzerland
[3] SIB, Bioinformat Syst Biol Grp, CH-8057 Zurich, Switzerland
[4] Univ Copenhagen, Fac Hlth & Med Sci, Novo Nordisk Fdn, Ctr Prot Res, DK-2200 Copenhagen N, Denmark
[5] German Res Ctr Environm Hlth GmbH, Helmholtz Zentrum Munchen, Inst Bioinformat & Syst Biol, D-85764 Neuherberg, Germany
[6] Univ Vienna, Dept Microbiol & Ecosyst Sci, CUBE Div Computat Syst Biol, A-1090 Vienna, Austria
[7] Univ Hawaii, Daniel K Inouye Ctr Microbial Oceanog Res & Educ, Honolulu, HI 96822 USA
[8] Max Planck Inst Mol Cell Biol & Genet, D-01307 Dresden, Germany
[9] Univ Heidelberg Hosp, Germany Mol Med Partnership Unit MMPU, D-69117 Heidelberg, Germany
[10] European Mol Biol Lab, D-69117 Heidelberg, Germany
[11] Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
基金
欧洲研究理事会;
关键词
PROTEIN-SEQUENCE; DATABASE; GENES; TREE; REPRESENTATION; DUPLICATION; PERFORMANCE; SOFTWARE; CLUSTERS; CATALOG;
D O I
10.1093/nar/gkv1248
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
eggNOG is a public resource that provides Orthologous Groups (OGs) of proteins at different taxonomic levels, each with integrated and summarized functional annotations. Developments since the latest public release include changes to the algorithm for creating OGs across taxonomic levels, making nested groups hierarchically consistent. This allows for a better propagation of functional terms across nested OGs and led to the novel annotation of 95 890 previously uncharacterized OGs, increasing overall annotation coverage from 67% to 72%. The functional annotations of OGs have been expanded to also provide Gene Ontology terms, KEGG pathways andSMART/Pfam domains for each group. Moreover, eggNOG now provides pairwise orthology relationships within OGs based on analysis of phylogenetic trees. We have also incorporated a framework for quickly mapping novel sequences to OGs based on precomputed HMM profiles. Finally, eggNOG version 4.5 incorporates a novel data set spanning 2605 viral OGs, covering 5228 proteins from 352 viral proteomes. All data are accessible for bulk downloading, as a web-service, and through a completely redesigned web interface. The new access points provide faster searches and a number of new browsing and visualization capabilities, facilitating the needs of both experts and less experienced users. eggNOG v4.5 is available at http://eggnog.embl.de.
引用
收藏
页码:D286 / D293
页数:8
相关论文
共 53 条
[1]
The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements [J].
Altenhoff, Adrian M. ;
Skunca, Nives ;
Glover, Natasha ;
Train, Clement-Marie ;
Sueki, Anna ;
Pilizota, Ivana ;
Gori, Kevin ;
Tomiczek, Bartlomiej ;
Mueller, Steven ;
Redestig, Henning ;
Gonnet, Gaston H. ;
Dessimoz, Christophe .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D240-D249
[2]
BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]
SIMAP-the database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage [J].
Arnold, Roland ;
Goldenberg, Florian ;
Mewes, Hans-Werner ;
Rattei, Thomas .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D279-D284
[4]
Following Gene Duplication, Paralog Interference Constrains Transcriptional Circuit Evolution [J].
Baker, Christopher R. ;
Hanson-Smith, Victor ;
Johnson, Alexander D. .
SCIENCE, 2013, 342 (6154) :104-108
[5]
UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[6]
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[7]
OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups [J].
Chen, Feng ;
Mackey, Aaron J. ;
Stoeckert, Christian J., Jr. ;
Roos, David S. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D363-D368
[8]
Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[9]
Ensembl 2015 [J].
Cunningham, Fiona ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah E. ;
Janacek, Sophie H. ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Parker, Anne ;
Patricio, Mateus ;
Perry, Emily ;
Pignatelli, Miguel ;
Riat, Harpreet Singh ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Harrow, Jennifer ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Ruffier, Magali ;
Searle, Stephen M. J. ;
Spudich, Giulietta ;
Trevanion, Stephen J. ;
Yates, Andy ;
Zerbino, Daniel R. ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D662-D669
[10]
Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)