Improved ranking functions for protein and modification-site identifications

被引:23
作者
Bern, Marshall [1 ]
Goldberg, David [1 ]
机构
[1] Xerox Corp, Palo Alto Res Ctr, Palo Alto, CA 94304 USA
关键词
algorithms; combinatorial optimization; combinatorics; dynamic programming; mass spectroscopy;
D O I
10.1089/cmb.2007.0119
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
There are a number of computational tools for assigning identifications to peptide tandem mass spectra, but only a few tools, most notably ProteinProphet, for the crucial next step of integrating peptide identifications into higher-level identifications, such as proteins or modification sites. Here we describe a new program called ComByne for scoring and ranking higher-level identifications. Unlike other identification integration tools, ComByne corrects for protein lengths; it also makes use of more information, such as retention times and spectrum-to-spectrum corroborations. We compare ComByne to existing algorithms on several complex biological samples, including a sample of mouse blood plasma spiked with known concentrations of human proteins. On our samples, the combination of ComByne with our database search tool ByOnic is more sensitive than the combinations of Mascot with ProteinProphet and SEQUEST with DTASelect, with over 40% more proteins identified at 1% false discovery rate. A Web interface to our software is at http://bio.parc.xerox.com.
引用
收藏
页码:705 / 719
页数:15
相关论文
共 38 条
[1]   Toward a human blood serum proteome - Analysis by multidimensional separation coupled with mass spectrometry [J].
Adkins, JN ;
Varnum, SM ;
Auberry, KJ ;
Moore, RJ ;
Angell, NH ;
Smith, RD ;
Springer, DL ;
Pounds, JG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (12) :947-955
[2]   The human plasma proteome - A nonredundant list developed by combination of four separate sources [J].
Anderson, NL ;
Polanski, M ;
Pieper, R ;
Gatlin, T ;
Tirumalai, RS ;
Conrads, TP ;
Veenstra, TD ;
Adkins, JN ;
Pounds, JG ;
Fagan, R ;
Lobley, A .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (04) :311-326
[3]   Protein identification by spectral networks analysis [J].
Bandeira, Nuno ;
Tsur, Dekel ;
Frank, Ari ;
Pevzner, Pavel A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) :6140-6145
[4]   De novo analysis of peptide tandem mass spectra by spectral graph partitioning [J].
Bern, M ;
Goldberg, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :364-378
[5]   Reporting protein identification data - The next generation of guidelines [J].
Bradshaw, RA ;
Burlingame, AL ;
Carr, S ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2006, 5 (05) :787-788
[6]   Serum proteomics profiling - a young technology begins to mature [J].
Coombes, KR ;
Morris, JRS ;
Hu, JH ;
Edmonson, SR ;
Baggerly, KA .
NATURE BIOTECHNOLOGY, 2005, 23 (03) :291-292
[7]   A method for reducing the time required to match protein sequences with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2003, 17 (20) :2310-2316
[8]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[9]   Unimod: Protein modifications for mass spectrometry [J].
Creasy, DM ;
Cottrell, JS .
PROTEOMICS, 2004, 4 (06) :1534-1536
[10]  
Creasy DM, 2002, PROTEOMICS, V2, P1426, DOI 10.1002/1615-9861(200210)2:10<1426::AID-PROT1426>3.0.CO