EMMA 2-A MAGE-compliant system for the collaborative analysis and integration of microarray data

被引:53
作者
Dondrup, Michael [1 ]
Albaum, Stefan P.
Griebel, Thasso [2 ]
Henckel, Kolja [1 ]
Juenemann, Sebastian
Kahlke, Tim [3 ]
Kleindt, Christiane K. [1 ,4 ]
Kuester, Helge [6 ]
Linke, Burkhard
Mertens, Dominik
Mittard-Runte, Virginie
Neuweger, Heiko [1 ]
Runte, Kai J.
Tauch, Andreas [4 ]
Tille, Felix
Puehler, Alfred [4 ,5 ]
Goesmann, Alexander
机构
[1] Univ Bielefeld, Ctr Biotechnol, Int NRW Grad Sch Bioinformat & Genome Res, D-33594 Bielefeld, Germany
[2] Univ Jena, Lehrstuhl Bioinformat, D-07743 Jena, Germany
[3] Univ Tromso, Fac Med, Dept Mol Biotechnol, Inst Med Biol, N-9037 Tromso, Norway
[4] Univ Bielefeld, Inst Genome Res & Syst Biol, D-33594 Bielefeld, Germany
[5] Univ Bielefeld, Fac Biol, D-33594 Bielefeld, Germany
[6] Leibniz Univ Hannover, Inst Plant Genet, Unit Plant Genom 4, D-30419 Hannover, Germany
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
GENE-EXPRESSION DATA; CORYNEBACTERIUM-GLUTAMICUM; REGULATORY NETWORK; TRANSCRIPTOME; PLATFORM; IDENTIFICATION; NORMALIZATION; BIOCONDUCTOR; ONTOLOGY; STORAGE;
D O I
10.1186/1471-2105-10-50
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Understanding transcriptional regulation by genome-wide microarray studies can contribute to unravel complex relationships between genes. Attempts to standardize the annotation of microarray data include the Minimum Information About a Microarray Experiment (MIAME) recommendations, the MAGE-ML format for data interchange, and the use of controlled vocabularies or ontologies. The existing software systems for microarray data analysis implement the mentioned standards only partially and are often hard to use and extend. Integration of genomic annotation data and other sources of external knowledge using open standards is therefore a key requirement for future integrated analysis systems. Results: The EMMA 2 software has been designed to resolve shortcomings with respect to full MAGE-ML and ontology support and makes use of modern data integration techniques. We present a software system that features comprehensive data analysis functions for spotted arrays, and for the most common synthesized oligo arrays such as Agilent, Affymetrix and NimbleGen. The system is based on the full MAGE object model. Analysis functionality is based on R and Bioconductor packages and can make use of a compute cluster for distributed services. Conclusion: Our model-driven approach for automatically implementing a full MAGE object model provides high flexibility and compatibility. Data integration via SOAP-based web-services is advantageous in a distributed client-server environment as the collaborative analysis of microarray data is gaining more and more relevance in international research consortia. The adequacy of the EMMA 2 software design and implementation has been proven by its application in many distributed functional genomics projects. Its scalability makes the current architecture suited for extensions towards future transcriptomics methods based on high-throughput sequencing approaches which have much higher computational requirements than microarrays.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
[Anonymous], SOAP Specifications
[2]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[3]   Standardizing global gene expression analysis between laboratories and across platforms [J].
Bammler, T ;
Beyer, RP ;
Bhattacharya, S ;
Boorman, GA ;
Boyles, A ;
Bradford, BU ;
Bumgarner, RE ;
Bushel, PR ;
Chaturvedi, K ;
Choi, D ;
Cunningham, ML ;
Dengs, S ;
Dressman, HK ;
Fannin, RD ;
Farun, FM ;
Freedman, JH ;
Fry, RC ;
Harper, A ;
Humble, MC ;
Hurban, P ;
Kavanagh, TJ ;
Kaufmann, WK ;
Kerr, KF ;
Jing, L ;
Lapidus, JA ;
Lasarev, MR ;
Li, J ;
Li, YJ ;
Lobenhofer, EK ;
Lu, X ;
Malek, RL ;
Milton, S ;
Nagalla, SR ;
O'Malley, JP ;
Palmer, VS ;
Pattee, P ;
Paules, RS ;
Perou, CM ;
Phillips, K ;
Qin, LX ;
Qiu, Y ;
Quigley, SD ;
Rodland, M ;
Rusyn, I ;
Samson, LD ;
Schwartz, DA ;
Shi, Y ;
Shin, JL ;
Sieber, SO ;
Slifer, S .
NATURE METHODS, 2005, 2 (05) :351-356
[4]   NCBI GEO: archive for high-throughput functional genomic data [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Marshall, Kimberly A. ;
Phillippy, Katherine H. ;
Sherman, Patti M. ;
Muertter, Rolf N. ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D885-D890
[5]   CoryneRegNet: An ontology-based data warehouse of corynebacterial transcription factors and regulatory networks [J].
Baumbach, J ;
Brinkrolf, K ;
Czaja, LF ;
Rahmann, S ;
Tauch, A .
BMC GENOMICS, 2006, 7 (1)
[6]  
Becker A, 2005, GENOMES AND GENOMICS OF NITROGEN-FIXING ORGANISMS, P169
[7]  
BEKEL T, 2009, J BIOTECHNOLOGY
[8]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[9]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[10]   The LacI/GalR family transcriptional regulator UriR negatively controls uridine utilization of Corynebacterium glutamicum by binding to catabolite-responsive element (cre)-like sequences [J].
Brinkrolf, Karina ;
Ploeger, Svenja ;
Solle, Sandra ;
Brune, Iris ;
Nentwich, Svenia S. ;
Hueser, Andrea T. ;
Kalinowski, Joern ;
Puehler, Alfred ;
Tauch, Andreas .
MICROBIOLOGY-SGM, 2008, 154 :1068-1081