ToppGene Suite for gene list enrichment analysis and candidate gene prioritization

Cited by: 2226
Authors
Chen, Jing [2]
Bardes, Eric E. [1]
Aronow, Bruce J. [1,3]
Jegga, Anil G. [1,3]
Affiliations
[1] Cincinnati Childrens Hosp, Med Ctr, Div Biomed Informat, Cincinnati, OH USA
[2] Univ Cincinnati, Dept Environm Hlth, Cincinnati, OH USA
[3] Univ Cincinnati, Coll Med, Dept Pediat, Cincinnati, OH USA
Funding
National Institutes of Health (NIH), USA;
Keywords
PROTEIN INTERACTION NETWORK; GENOME-WIDE ASSOCIATION; HUMAN-DISEASE GENES; CROHNS-DISEASE; ULCERATIVE-COLITIS; IDENTIFICATION; RESOURCE; SEQUENCE; DATABASE; LOCI;
DOI
10.1093/nar/gkp427
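Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
070307 [Chemical Biology]; 071010 [Biochemistry and Molecular Biology];
Abstract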
ToppGene Suite (http://toppgene.cchmc.org; this web site is free and open to all users and does not require a login to access) is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome. The protein-protein interaction network (PPIN)-based disease candidate gene prioritization uses social and Web networks analysis algorithms (extended versions of the PageRank and HITS algorithms, and the K-Step Markov method). We demonstrate the utility of ToppGene Suite using 20 recently reported GWAS-based gene-disease associations (including novel disease genes) representing five diseases. ToppGene ranked 19 of 20 (95%) candidate genes within the top 20%, while ToppNet ranked 12 of 16 (75%) candidate genes among the top 20%.
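The network-based prioritization described in the abstract can be illustrated with a minimal sketch of PageRank with priors, one of the algorithm families the abstract names. This is a generic textbook version and not ToppNet's actual implementation: the function names, the toy network, the hypothetical gene labels G1 to G6, and the restart probability `beta=0.3` are all assumptions made for illustration. Known disease genes act as seeds, and candidate genes are ranked by how much random-walk probability reaches them from those seeds.

```python
def pagerank_with_priors(edges, seeds, beta=0.3, iters=100, tol=1e-10):
    """Score nodes of an undirected network by a random walk that
    restarts at the seed nodes with probability beta (assumed value)."""
    # Build adjacency sets for an undirected graph.
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    nodes = sorted(neighbors)

    # Prior: uniform over the seed genes that are present in the network.
    seeds_in = [s for s in seeds if s in neighbors]
    if not seeds_in:
        raise ValueError("no seed gene is present in the network")
    prior = {n: (1.0 / len(seeds_in) if n in seeds_in else 0.0) for n in nodes}

    score = dict(prior)
    for _ in range(iters):
        new = {}
        for n in nodes:
            # Probability flowing in from each neighbor, split by its degree.
            walk = sum(score[m] / len(neighbors[m]) for m in neighbors[n])
            new[n] = (1.0 - beta) * walk + beta * prior[n]
        delta = sum(abs(new[n] - score[n]) for n in nodes)
        score = new
        if delta < tol:
            break
    return score


def rank_candidates(edges, seeds, candidates, beta=0.3):
    """Return candidates sorted from highest to lowest score."""
    score = pagerank_with_priors(edges, seeds, beta=beta)
    return sorted(candidates, key=lambda g: score.get(g, 0.0), reverse=True)


# Hypothetical toy interaction network; gene labels are placeholders.
TOY_EDGES = [("G1", "G2"), ("G1", "G3"), ("G2", "G3"),
             ("G3", "G4"), ("G4", "G5"), ("G5", "G6")]
TOY_SEEDS = ["G1", "G2"]

ranking = rank_candidates(TOY_EDGES, TOY_SEEDS, ["G6", "G4", "G3", "G5"])
print(ranking)
```

In this toy network the candidates closest to the seed genes receive the most probability mass, so they rank first. A larger `beta` concentrates scores more tightly around the seeds, while a smaller value lets the walk explore more of the network.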
Pages: W305-W311
Page count: 7