openBIS: a flexible framework for managing and analyzing complex data in biology research

Cited by: 94
Authors
Bauch, Angela [1]
Adamczyk, Izabela [1]
Buczek, Piotr [1]
Elmer, Franz-Josef [1]
Enimanev, Kaloyan [1]
Glyzewski, Pawel [1]
Kohler, Manuel [1]
Pylak, Tomasz [1]
Quandt, Andreas [3]
Ramakrishnan, Chandrasekhar [1]
Beisel, Christian [2]
Malmstroem, Lars [3]
Aebersold, Ruedi [3, 4]
Rinn, Bernd [1]
Affiliations
[1] Swiss Fed Inst Technol, Ctr Informat Sci & Databases, Dept Biosyst Sci & Engn, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Quantitat Genom Facil, Dept Biosyst Sci & Engn, Zurich, Switzerland
[3] Swiss Fed Inst Technol, Dept Biol, Inst Mol Syst Biol, Zurich, Switzerland
[4] Univ Zurich, Fac Sci, CH-8006 Zurich, Switzerland
Source
BMC BIOINFORMATICS | 2011 / Vol. 12
Keywords
PLATFORM; INTEGRATION; MICROARRAY; SYSTEM;
DOI
10.1186/1471-2105-12-468
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Modern data generation techniques used in distributed systems biology research projects often create datasets of enormous size and diversity. We argue that in order to overcome the challenge of managing those large quantitative datasets and maximise the biological information extracted from them, a sound information system is required. Ease of integration with data analysis pipelines and other computational tools is a key requirement for it.

Results: We have developed openBIS, an open source software framework for constructing user-friendly, scalable and powerful information systems for data and metadata acquired in biological experiments. openBIS enables users to collect, integrate, share, publish data and to connect to data processing pipelines. This framework can be extended and has been customized for different data types acquired by a range of technologies.

Conclusions: openBIS is currently being used by several SystemsX.ch and EU projects applying mass spectrometric measurements of metabolites and proteins, High Content Screening, or Next Generation Sequencing technologies. The attributes that make it interesting to a large research community involved in systems biology projects include versatility, simplicity in deployment, scalability to very large data, flexibility to handle any biological data type and extensibility to the needs of any research domain.
Pages: 19