BayGO:: Bayesian analysis of ontology term enrichment in microarray data

被引:44
作者
Vêncio, RZN
Koide, T
Gomes, SL
Pereira, CAD
机构
[1] Univ Sao Paulo, BIOINFO USP Nucleo Pesquisas Bioinformat, BR-05508090 Sao Paulo, Brazil
[2] Hosp Israelita Albert Einstein, Inst Israelita Ensino & Pesquisa Albert Einstein, BR-05651901 Sao Paulo, Brazil
[3] Univ Sao Paulo, Inst Quim, Dept Bioquim, BR-05508000 Sao Paulo, Brazil
[4] Univ Sao Paulo, Inst Matemat & Estatist, Dept Estatist, BR-05508090 Sao Paulo, Brazil
关键词
D O I
10.1186/1471-2105-7-86
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The search for enriched ( aka over-represented or enhanced) ontology terms in a list of genes obtained from microarray experiments is becoming a standard procedure for a system-level analysis. This procedure tries to summarize the information focussing on classification designs such as Gene Ontology, KEGG pathways, and so on, instead of focussing on individual genes. Although it is well known in statistics that association and significance are distinct concepts, only the former approach has been used to deal with the ontology term enrichment problem. Results: BayGO implements a Bayesian approach to search for enriched terms from microarray data. The R source-code is freely available at http://blasto.iq.usp.br/similar to tkoide/BayGO in three versions: Linux, which can be easily incorporated into pre-existent pipelines; Windows, to be controlled interactively; and as a web-tool. The software was validated using a bacterial heat shock response dataset, since this stress triggers known system-level responses. Conclusion: The Bayesian model accounts for the fact that, eventually, not all the genes from a given category are observable in microarray data due to low intensity signal, quality filters, genes that were not spotted and so on. Moreover, BayGO allows one to measure the statistical association between generic ontology terms and differential expression, instead of working only with the common significance analysis.
引用
收藏
页数:11
相关论文
共 32 条
[1]   FatiGO:: a web tool for finding significant associations of Gene Ontology terms with groups of genes [J].
Al-Shahrour, F ;
Díaz-Uriarte, R ;
Dopazo, J .
BIOINFORMATICS, 2004, 20 (04) :578-580
[2]  
[Anonymous], 1983, MEASURES ASS
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[5]   ON THE BAYESIAN-ANALYSIS OF CATEGORICAL-DATA - THE PROBLEM OF NONRESPONSE [J].
BASU, D ;
PEREIRA, CAD .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1982, 6 (04) :345-362
[6]   GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[7]   Characterizing gene sets with FuncAssociate [J].
Berriz, GF ;
King, OD ;
Bryant, B ;
Sander, C ;
Roth, FP .
BIOINFORMATICS, 2003, 19 (18) :2502-2504
[8]   ClutrFree: cluster tree visualization and interpretation [J].
Bidaut, G ;
Ochs, MF .
BIOINFORMATICS, 2004, 20 (16) :2869-2871
[9]   GO::TermFinder - open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes [J].
Boyle, EI ;
Weng, SA ;
Gollub, J ;
Jin, H ;
Botstein, D ;
Cherry, JM ;
Sherlock, G .
BIOINFORMATICS, 2004, 20 (18) :3710-3715
[10]  
Camon EB, 2005, BMC BIOINFORMATICS, V6, DOI 10.1186/1471-2105-6-S1-S17