cPath: open source software for collecting, storing, and querying biological pathways

被引:73
作者
Cerami, Ethan G.
Bader, Gary D.
Gross, Benjamin E.
Sander, Chris
机构
[1] Mem Sloan Kettering Canc Ctr, Computat Biol Ctr, New York, NY 10021 USA
[2] Univ Toronto, Banting & Best Dept Med Res, Terrence Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 3E1, Canada
关键词
D O I
10.1186/1471-2105-7-497
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biological pathways, including metabolic pathways, protein interaction networks, signal transduction pathways, and gene regulatory networks, are currently represented in over 220 diverse databases. These data are crucial for the study of specific biological processes, including human diseases. Standard exchange formats for pathway information, such as BioPAX, CellML, SBML and PSI-MI, enable convenient collection of this data for biological research, but mechanisms for common storage and communication are required. Results: We have developed cPath, an open source database and web application for collecting, storing, and querying biological pathway data. cPath makes it easy to aggregate custom pathway data sets available in standard exchange formats from multiple databases, present pathway data to biologists via a customizable web interface, and export pathway data via a web service to third-party software, such as Cytoscape, for visualization and analysis. cPath is software only, and does not include new pathway information. Key features include: a built-in identifier mapping service for linking identical interactors and linking to external resources; built-in support for PSI-MI and BioPAX standard pathway exchange formats; a web service interface for searching and retrieving pathway data sets; and thorough documentation. The cPath software is freely available under the LGPL open source license for academic and commercial use. Conclusion: cPath is a robust, scalable, modular, professional-grade software platform for collecting, storing, and querying biological pathways. It can serve as the core data handling component in information systems for pathway visualization, analysis and modeling.
引用
收藏
页数:9
相关论文
共 42 条
[1]   PIANA: protein interactions and network analysis [J].
Aragues, R ;
Jaeggi, D ;
Oliva, B .
BIOINFORMATICS, 2006, 22 (08) :1015-1017
[2]   Pathguide: a Pathway Resource List [J].
Bader, Gary D. ;
Cary, Michael P. ;
Sander, Chris .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D504-D506
[3]   BIOZON: a system for unification, management and analysis of heterogeneous biological data [J].
Birkland, A ;
Yona, G .
BMC BIOINFORMATICS, 2006, 7 (1)
[4]   Coming soon: a global grid for cancer research [J].
Bouchie, A .
NATURE BIOTECHNOLOGY, 2004, 22 (09) :1071-1073
[5]   Cyberinfrastructure: Empowering a "third way" in biomedical research [J].
Buetow, KH .
SCIENCE, 2005, 308 (5723) :821-824
[6]  
CAMPAGNE F, 2004, SCI STKE, P111
[7]   Pathway information for systems biology [J].
Cary, MP ;
Bader, GD ;
Sander, C .
FEBS LETTERS, 2005, 579 (08) :1815-1820
[8]  
Fielding R.T., 2000, INFORM COMPUTER SCI
[9]  
GUDGIN M, 2003, SOAP VERSION 1 2 1
[10]   Modelling the molecular circuitry of cancer [J].
Hahn, WC ;
Weinberg, RA .
NATURE REVIEWS CANCER, 2002, 2 (05) :331-341