Adaptable data management for systems biology investigations

被引:14
作者
Boyle, John [1 ]
Rovira, Hector [1 ]
Cavnor, Chris [1 ]
Burdick, David [1 ]
Killcoyne, Sarah [1 ]
Shmulevich, Ilya [1 ]
机构
[1] Inst Syst Biol, Seattle, WA 98103 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
Data Management System; Aggregation System; Content Management System; Rich Functionality; XPATH Query;
D O I
10.1186/1471-2105-10-79
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Within research each experiment is different, the focus changes and the data is generated from a continually evolving barrage of technologies. There is a continual introduction of new techniques whose usage ranges from in-house protocols through to high-throughput instrumentation. To support these requirements data management systems are needed that can be rapidly built and readily adapted for new usage. Results: The adaptable data management system discussed is designed to support the seamless mining and analysis of biological experiment data that is commonly used in systems biology (e.g. ChIP-chip, gene expression, proteomics, imaging, flow cytometry). We use different content graphs to represent different views upon the data. These views are designed for different roles: equipment specific views are used to gather instrumentation information; data processing oriented views are provided to enable the rapid development of analysis applications; and research project specific views are used to organize information for individual research experiments. This management system allows for both the rapid introduction of new types of information and the evolution of the knowledge it represents. Conclusion: Data management is an important aspect of any research enterprise. It is the foundation on which most applications are built, and must be easily extended to serve new functionality for new scientific areas. We have found that adopting a three-tier architecture for data management, built around distributed standardized content repositories, allows us to rapidly develop new applications to support a diverse user community.
引用
收藏
页数:16
相关论文
共 16 条
[1]  
AEA B, 2001, NAT GENET, P365
[2]  
BACHMAN C, 1974, ACM SIGMOD, V6, P16
[3]   Systems biology driven software design for the research enterprise [J].
Boyle, John ;
Cavnor, Christopher ;
Killcoyne, Sarah ;
Shmulevich, Ilya .
BMC BIOINFORMATICS, 2008, 9 (1)
[4]  
Etzold T, 1996, METHOD ENZYMOL, V266, P114
[5]  
FISCHER G, 1999, P CMCD, V4, P83
[6]   The Open Microscopy Environment (OME) Data Model and XML file: open tools for informatics and quantitative analysis in biological imaging [J].
Goldberg, IG ;
Allan, C ;
Burel, JM ;
Creager, D ;
Falconi, A ;
Hochheiser, H ;
Johnston, J ;
Mellen, J ;
Sorger, PK ;
Swedlow, JR .
GENOME BIOLOGY, 2005, 6 (05)
[7]   DiscoveryLink: A system for integrated access to life sciences data sources [J].
Haas, LM ;
Schwarz, PM ;
Kodali, P ;
Kotlar, E ;
Rice, JE ;
Swope, WC .
IBM SYSTEMS JOURNAL, 2001, 40 (02) :489-511
[8]   The digital code of DNA [J].
Hood, L ;
Galas, D .
NATURE, 2003, 421 (6921) :444-448
[9]   A software chasm: Software engineering and scientific computing [J].
Kelly, Diane F. .
IEEE SOFTWARE, 2007, 24 (06) :120-+
[10]  
KICZALES GLJ, 1997, EUR C OBJ ORIENT PRO