Mayday - integrative analytics for expression data

被引:89
作者
Battke, Florian [1 ]
Symons, Stephan [1 ]
Nieselt, Kay [1 ]
机构
[1] Univ Tubingen, Ctr Bioinformat Tubingen, D-72076 Tubingen, Germany
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
MICROARRAY DATA-ANALYSIS; STREPTOMYCES-COELICOLOR; GENES; TOOL; DATABASES; CONSENSUS; SEQUENCE; BIOLOGY;
D O I
10.1186/1471-2105-11-121
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: DNA Microarrays have become the standard method for large scale analyses of gene expression and epigenomics. The increasing complexity and inherent noisiness of the generated data makes visual data exploration ever more important. Fast deployment of new methods as well as a combination of predefined, easy to apply methods with programmer's access to the data are important requirements for any analysis framework. Mayday is an open source platform with emphasis on visual data exploration and analysis. Many built-in methods for clustering, machine learning and classification are provided for dissecting complex datasets. Plugins can easily be written to extend Mayday's functionality in a large number of ways. As Java program, Mayday is platform-independent and can be used as Java WebStart application without any installation. Mayday can import data from several file formats, database connectivity is included for efficient data organization. Numerous interactive visualization tools, including box plots, profile plots, principal component plots and a heatmap are available, can be enhanced with metadata and exported as publication quality vector files. Results: We have rewritten large parts of Mayday's core to make it more efficient and ready for future developments. Among the large number of new plugins are an automated processing framework, dynamic filtering, new and efficient clustering methods, a machine learning module and database connectivity. Extensive manual data analysis can be done using an inbuilt R terminal and an integrated SQL querying interface. Our visualization framework has become more powerful, new plot types have been added and existing plots improved. Conclusions: We present a major extension of Mayday, a very versatile open-source framework for efficient micro array data analysis designed for biologists and bioinformaticians. Most everyday tasks are already covered. The large number of available plugins as well as the extension possibilities using compiled plugins and ad-hoc scripting allow for the rapid adaption of Mayday also to very specialized data exploration. Mayday is available at http://microarray-analysis.org.
引用
收藏
页数:10
相关论文
共 31 条
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]  
*AP DERB, OP SOURC REL DAT 200
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2) [J].
Bentley, SD ;
Chater, KF ;
Cerdeño-Tárraga, AM ;
Challis, GL ;
Thomson, NR ;
James, KD ;
Harris, DE ;
Quail, MA ;
Kieser, H ;
Harper, D ;
Bateman, A ;
Brown, S ;
Chandra, G ;
Chen, CW ;
Collins, M ;
Cronin, A ;
Fraser, A ;
Goble, A ;
Hidalgo, J ;
Hornsby, T ;
Howarth, S ;
Huang, CH ;
Kieser, T ;
Larke, L ;
Murphy, L ;
Oliver, K ;
O'Neil, S ;
Rabbinowitsch, E ;
Rajandream, MA ;
Rutherford, K ;
Rutter, S ;
Seeger, K ;
Saunders, D ;
Sharp, S ;
Squares, R ;
Squares, S ;
Taylor, K ;
Warren, T ;
Wietzorrek, A ;
Woodward, J ;
Barrell, BG ;
Parkhill, J ;
Hopwood, DA .
NATURE, 2002, 417 (6885) :141-147
[5]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[6]   Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments [J].
Breitling, R ;
Armengaud, P ;
Amtmann, A ;
Herzyk, P .
FEBS LETTERS, 2004, 573 (1-3) :83-92
[7]  
Caspi R, 2008, NUCLEIC ACIDS RES, V36, pD623, DOI [10.1093/nar/gkm900, 10.1093/nar/gkt1103]
[8]   The Protein Identifier Cross-Referencing (PICR) service:: reconciling protein identifiers across multiple source databases [J].
Cote, Richard G. ;
Jones, Philip ;
Martens, Lennart ;
Kerrien, Samuel ;
Reisinger, Florian ;
Lin, Quan ;
Leinonen, Rasko ;
Apweiler, Rolf ;
Hermjakob, Henning .
BMC BIOINFORMATICS, 2007, 8 (1) :401
[9]   Mayday - a microarray data analysis workbench [J].
Dietzsch, J ;
Gehlenborg, N ;
Nieselt, K .
BIOINFORMATICS, 2006, 22 (08) :1010-1012
[10]   EMMA 2-A MAGE-compliant system for the collaborative analysis and integration of microarray data [J].
Dondrup, Michael ;
Albaum, Stefan P. ;
Griebel, Thasso ;
Henckel, Kolja ;
Juenemann, Sebastian ;
Kahlke, Tim ;
Kleindt, Christiane K. ;
Kuester, Helge ;
Linke, Burkhard ;
Mertens, Dominik ;
Mittard-Runte, Virginie ;
Neuweger, Heiko ;
Runte, Kai J. ;
Tauch, Andreas ;
Tille, Felix ;
Puehler, Alfred ;
Goesmann, Alexander .
BMC BIOINFORMATICS, 2009, 10