The Firegoose: two-way integration of diverse data from different bioinformatics web resources with desktop applications

Cited by: 25
Authors
Bare, J. Christopher [1 ]
Shannon, Paul T. [1 ]
Schmid, Amy K. [1 ]
Baliga, Nitin S. [1 ]
Affiliations
[1] Inst Syst Biol, Seattle, WA 98103 USA
DOI
10.1186/1471-2105-8-456
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to reformat data files, convert between naming systems, or perform ongoing maintenance of local copies of public databases. Opportunities for new ways of combining and re-using data are arising as a result of the increasing use of web protocols to transmit structured data.
Results: The Firegoose, an extension to the Mozilla Firefox web browser, enables data transfer between web sites and desktop tools. As a component of the Gaggle integration framework, Firegoose can also exchange data with Cytoscape, the R statistical package, Multiexperiment Viewer (MeV), and several other popular desktop software tools. Firegoose adds the capability to easily use local data to query KEGG, EMBL STRING, DAVID, and other widely used bioinformatics web sites. Query results from these web sites can be transferred to desktop tools for further analysis with a few clicks. Firegoose acquires data from the web by screen scraping, microformats, embedded XML, or web services. We define a microformat, which allows structured information compatible with the Gaggle to be embedded in HTML documents. We demonstrate the capabilities of this software by performing an analysis of the genes activated in the microbe Halobacterium salinarum NRC-1 in response to anaerobic environments. Starting with microarray data, we explore functions of differentially expressed genes by combining data from several public web resources and construct an integrated view of the cellular processes involved.
Conclusion: The Firegoose incorporates Mozilla Firefox into the Gaggle environment and enables interactive sharing of data between diverse web resources and desktop software tools without maintaining local copies. Additional web sites can be incorporated easily into the framework using the scripting platform of the Firefox browser. Performing data integration in the browser allows the excellent search and navigation capabilities of the browser to be used in combination with powerful desktop tools.
Pages: 12