openBIS: a flexible framework for managing and analyzing complex data in biology research

被引:94
作者
Bauch, Angela [1 ]
Adamczyk, Izabela [1 ]
Buczek, Piotr [1 ]
Elmer, Franz-Josef [1 ]
Enimanev, Kaloyan [1 ]
Glyzewski, Pawel [1 ]
Kohler, Manuel [1 ]
Pylak, Tomasz [1 ]
Quandt, Andreas [3 ]
Ramakrishnan, Chandrasekhar [1 ]
Beisel, Christian [2 ]
Malmstroem, Lars [3 ]
Aebersold, Ruedi [3 ,4 ]
Rinn, Bernd [1 ]
机构
[1] Swiss Fed Inst Technol, Ctr Informat Sci & Databases, Dept Biosyst Sci & Engn, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Quantitat Genom Facil, Dept Biosyst Sci & Engn, Zurich, Switzerland
[3] Swiss Fed Inst Technol, Dept Biol, Inst Mol Syst Biol, Zurich, Switzerland
[4] Univ Zurich, Fac Sci, CH-8006 Zurich, Switzerland
来源
BMC BIOINFORMATICS | 2011年 / 12卷
关键词
PLATFORM; INTEGRATION; MICROARRAY; SYSTEM;
D O I
10.1186/1471-2105-12-468
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Modern data generation techniques used in distributed systems biology research projects often create datasets of enormous size and diversity. We argue that in order to overcome the challenge of managing those large quantitative datasets and maximise the biological information extracted from them, a sound information system is required. Ease of integration with data analysis pipelines and other computational tools is a key requirement for it. Results: We have developed openBIS, an open source software framework for constructing user-friendly, scalable and powerful information systems for data and metadata acquired in biological experiments. openBIS enables users to collect, integrate, share, publish data and to connect to data processing pipelines. This framework can be extended and has been customized for different data types acquired by a range of technologies. Conclusions: openBIS is currently being used by several SystemsX.ch and EU projects applying mass spectrometric measurements of metabolites and proteins, High Content Screening, or Next Generation Sequencing technologies. The attributes that make it interesting to a large research community involved in systems biology projects include versatility, simplicity in deployment, scalability to very large data, flexibility to handle any biological data type and extensibility to the needs of any research domain.
引用
收藏
页数:19
相关论文
共 34 条
[1]  
Altintas I, P FUT GRID DAT ENV 2
[2]  
[Anonymous], SOFTWARE ENG PRACTIT
[3]   Silencing chromatin: comparing modes and mechanisms [J].
Beisel, Christian ;
Paro, Renato .
NATURE REVIEWS GENETICS, 2011, 12 (02) :123-135
[4]   Data management: it starts at the bench [J].
Chaussabel, Damien ;
Ueno, Hideki ;
Banchereau, Jacques ;
Quinn, Charles .
NATURE IMMUNOLOGY, 2009, 10 (12) :1225-1227
[5]  
CODD EF, 1970, COMMUN ACM, V13, P377, DOI 10.1145/357980.358007
[6]   Guidelines for the effective use of entity-attribute-value modeling for biomedical databases [J].
Dinu, Valentin ;
Nadkarni, Prakash .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2007, 76 (11-12) :769-779
[7]   Polycomb preferentially targets stalled promoters of coding and noncoding transcripts [J].
Enderle, Daniel ;
Beisel, Christian ;
Stadler, Michael B. ;
Gerstung, Moritz ;
Athri, Prashanth ;
Paro, Renato .
GENOME RESEARCH, 2011, 21 (02) :216-226
[8]   MIMAS 3.0 is a Multiomics Information Management and Annotation System [J].
Gattiker, Alexandre ;
Hermida, Leandro ;
Liechti, Robin ;
Xenarios, Ioannis ;
Collin, Olivier ;
Rougemont, Jacques ;
Primig, Michael .
BMC BIOINFORMATICS, 2009, 10
[9]   PERSISTENCE AND LOOSE COUPLING IN LIVING SYSTEMS [J].
GLASSMAN, RB .
BEHAVIORAL SCIENCE, 1973, 18 (02) :83-98
[10]   Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences [J].
Goecks, Jeremy ;
Nekrutenko, Anton ;
Taylor, James .
GENOME BIOLOGY, 2010, 11 (08)