BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences

Cited by: 2695
Authors
Kanehisa, Minoru [1]
Sato, Yoko [2]
Morishima, Kanae [1]
Affiliations
[1] Kyoto Univ, Inst Chem Res, Kyoto 6110011, Japan
[2] Fujitsu Kyushu Syst Ltd, Healthcare Solut Dept, Hakata Ku, Fukuoka 8120007, Japan
Funding
Japan Science and Technology Agency
Keywords
genome annotation; metagenome analysis; taxonomic composition; KEGG Orthology; KEGG pathway mapping; RAST SERVER; ANNOTATION; PROTEIN
DOI
10.1016/j.jmb.2015.11.006
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
BlastKOALA and GhostKOALA are automatic annotation servers for genome and metagenome sequences, which perform KO (KEGG Orthology) assignments to characterize individual gene functions and reconstruct KEGG pathways, BRITE hierarchies and KEGG modules to infer high-level functions of the organism or the ecosystem. Both servers are made freely available at the KEGG Web site (http://www.kegg.jp/blastkoala/). In BlastKOALA, the KO assignment is performed by a modified version of the internally used KOALA algorithm after the BLAST search against a non-redundant dataset of pangenome sequences at the species, genus or family level, which is generated from the KEGG GENES database by retaining the KO content of each taxonomic category. In GhostKOALA, which utilizes more rapid GHOSTX for database search and is suitable for metagenome annotation, the pangenome dataset is supplemented with Cd-hit clusters including those for viral genes. The result files may be downloaded and manipulated for further KEGG Mapper analysis, such as comparative pathway analysis using multiple BlastKOALA results. © 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
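As a rough illustration of the kind of downstream comparison the abstract mentions, the sketch below compares the KO content of two annotation results. It assumes each result is a plain two-column, tab-separated list (gene identifier, then a K number, with the second column left empty when no KO was assigned); that layout, the function names, and the sample identifiers are assumptions made for this example and are not taken from the article.

```python
from collections import defaultdict


def parse_ko_list(text):
    """Parse 'gene<TAB>KO' lines into {gene: set of K numbers}.

    Assumed layout (not confirmed by the abstract): one gene per line,
    with an empty or missing second column when no KO was assigned.
    """
    assignments = defaultdict(set)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        fields = line.split("\t")
        gene = fields[0].strip()
        assignments[gene]  # register the gene even if it has no KO
        if len(fields) > 1 and fields[1].strip():
            assignments[gene].add(fields[1].strip())
    return dict(assignments)


def ko_set(assignments):
    """Collect every K number that appears in one annotation result."""
    return set().union(*assignments.values())


def compare(a, b):
    """Split the KO content of two results into shared and unique parts."""
    kos_a, kos_b = ko_set(a), ko_set(b)
    return {
        "shared": kos_a & kos_b,
        "only_a": kos_a - kos_b,
        "only_b": kos_b - kos_a,
    }


# Hypothetical sample data, used only to exercise the functions.
SAMPLE_A = "gene1\tK00001\ngene2\ngene3\tK00844\n"
SAMPLE_B = "g1\tK00844\ng2\tK01810\n"

result = compare(parse_ko_list(SAMPLE_A), parse_ko_list(SAMPLE_B))
for label in ("shared", "only_a", "only_b"):
    print(label, sorted(result[label]))
```

Working with sets of K numbers rather than gene identifiers keeps the comparison independent of how each genome names its genes, which is the property that makes KO-based results comparable across organisms.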
Pages: 726-731
Page count: 6