BioGraphE: high-performance bionetwork analysis using the Biological Graph Environment

被引:12
作者
Chin, George, Jr. [1 ]
Chavarria, Daniel G. [1 ]
Nakamura, Grant C. [2 ]
Sofia, Heidi J. [3 ]
机构
[1] Pacific NW Natl Lab, High Performance Comp Grp, Computat Sci & Math Div, Richland, WA 99352 USA
[2] Pacific NW Natl Lab, Informat Analyt Dept, Computat & Stat Analyt Div, Richland, WA 99352 USA
[3] Pacific NW Natl Lab, Computat Biol & Bioinformat Dept, Computat Sci & Math Div, Richland, WA 99352 USA
关键词
D O I
10.1186/1471-2105-9-S6-S6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Graphs and networks are common analysis representations for biological systems. Many traditional graph algorithms such as k-clique, k-coloring, and subgraph matching have great potential as analysis techniques for newly available data in biology. Yet, as the amount of genomic and bionetwork information rapidly grows, scientists need advanced new computational strategies and tools for dealing with the complexities of the bionetwork analysis and the volume of the data. Results: We introduce a computational framework for graph analysis called the Biological Graph Environment (BioGraphE), which provides a general, scalable integration platform for connecting graph problems in biology to optimized computational solvers and high-performance systems. This framework enables biology researchers and computational scientists to identify and deploy network analysis applications and to easily connect them to efficient and powerful computational software and hardware that are specifically designed and tuned to solve complex graph problems. In our particular application of BioGraphE to support network analysis in genome biology, we investigate the use of a Boolean satisfiability solver known as Survey Propagation as a core computational solver executing on standard high-performance parallel systems, as well as multi-threaded architectures. Conclusion: In our application of BioGraphE to conduct bionetwork analysis of homology networks, we found that BioGraphE and a custom, parallel implementation of the Survey Propagation SAT solver were capable of solving very large bionetwork problems at high rates of execution on different high-performance computing platforms.
引用
收藏
页数:10
相关论文
共 16 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], 2005, Proceedings of the 2nd conference on Computing frontiers, DOI DOI 10.1145/1062261.1062268
[3]  
[Anonymous], 2003, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, SC'03
[4]  
[Anonymous], 2007, CF 07
[5]   On the architectural requirements for efficient execution of graph algorithms [J].
Bader, DA ;
Cong, GJ ;
Feo, J .
2005 International Conference on Parallel Processsing, Proceedings, 2005, :547-556
[6]   Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray MTA-2 [J].
Bader, David A. ;
Madduri, Kamesh .
2006 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2006, :523-530
[7]   Survey propagation:: An algorithm for satisfiability [J].
Braunstein, A ;
Mézard, M ;
Zecchina, R .
RANDOM STRUCTURES & ALGORITHMS, 2005, 27 (02) :201-226
[8]  
CAMPBELL EA, MOL CELL IN PRESS
[9]   Parallel data intensive computing in scientific and commercial applications [J].
Cannataro, M ;
Talia, D ;
Srimani, PK .
PARALLEL COMPUTING, 2002, 28 (05) :673-704
[10]   Survey-propagation decimation through distributed local computations -: art. no. P11016 [J].
Chavas, J ;
Furtlehner, C ;
Mézard, M ;
Zecchina, R .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005, :300-325