X!!Tandem, an improved method for running X!Tandem in parallel on collections of commodity computers

被引:103
作者
Bjornson, Robert D. [1 ,2 ]
Carriero, Nicholas J. [1 ,2 ]
Colangelo, Christopher [1 ]
Shifinan, Mark [3 ,6 ]
Cheung, Kei-Hoi [2 ,3 ,4 ,6 ]
Miller, Perry L. [3 ,5 ,6 ]
Williams, Kenneth [1 ,7 ]
机构
[1] Yale Univ, Keck Biotechnol Resource Lab, New Haven, CT 06520 USA
[2] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[3] Yale Univ, Ctr Med Informat, New Haven, CT 06520 USA
[4] Yale Univ, Dept Genet, New Haven, CT 06520 USA
[5] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[6] Yale Univ, Dept Anesthesiol, New Haven, CT 06520 USA
[7] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
关键词
proteomics; protein identification; parallel database search; X!Tandem; tandem mass spectrometry; parallel X!Tandem; MPI;
D O I
10.1021/pr0701198
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The widespread use of mass spectrometry for protein identification has created a demand for computationally efficient methods of matching mass spectrometry data to protein databases. A search using X!Tandem, a popular and representative program, can require hours or days to complete, particularly when missed cleavages and post-translational modifications are considered. Existing techniques for accelerating X!Tandem by employing parallelism are unsatisfactory for a variety of reasons. The paper describes a parallelization of X!Tandem, called X!!Tandem, that shows excellent speedups on commodity hardware and produces the same results as the original program. Furthermore, the parallelization technique used is unusual and potentially useful for parallelizing other complex programs.
引用
收藏
页码:293 / 299
页数:7
相关论文
共 14 条
[11]   General framework for developing and evaluating database scoring algorithms using the TANDEM search engine [J].
MacLean, Brendan ;
Eng, Jimmy K. ;
Beavis, Ronald C. ;
McIntosh, Martin .
BIOINFORMATICS, 2006, 22 (22) :2830-2832
[12]  
Perkins DN, 1999, ELECTROPHORESIS, V20, P3551, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO
[13]  
2-2
[14]  
Snir M., 1998, MPI: The Complete Reference