PepNovo: De novo peptide sequencing via probabilistic network modeling

被引:460
作者
Frank, A [1 ]
Pevzner, P [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
关键词
D O I
10.1021/ac048788h
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
We present a novel scoring method for de novo, interpretation of peptides from tandem mass spectrometry data. Our scoring method uses a probabilistic network whose structure reflects the chemical and physical rules that govern the peptide fragmentation. We use a likelihood ratio hypothesis test to determine whether the peaks observed in the mass spectrum are more likely to have been produced under our fragmentation model than under a model that treats peaks as random events. We tested our de novo algorithm PepNovo on ion trap data and achieved results that are superior to popular de novo peptide sequencing algorithms.
引用
收藏
页码:964 / 973
页数:10
相关论文
共 40 条
[1]  
Bafna V, 2001, Bioinformatics, V17 Suppl 1, pS13
[2]  
Bafna V., 2003, P 7 ANN INT C COMP M, P9
[3]   FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[4]   Cleavage N-terminal to proline: Analysis of a database of peptide tandem mass spectra [J].
Breci, LA ;
Tabb, DL ;
Yates, JR ;
Wysocki, VH .
ANALYTICAL CHEMISTRY, 2003, 75 (09) :1963-1971
[5]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[6]  
Colinge J, 2003, LECT N BIOINFORMAT, V2812, P25
[7]   OLAV: Towards high-throughput tandem mass spectrometry data identification [J].
Colinge, J ;
Masselot, A ;
Giron, M ;
Dessingy, T ;
Magnin, J .
PROTEOMICS, 2003, 3 (08) :1454-1463
[8]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[9]  
Day RM, 2004, 2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, P505
[10]   Intensity-based protein identification by machine learning from a library of tandem mass spectra [J].
Elias, JE ;
Gibbons, FD ;
King, OD ;
Roth, FP ;
Gygi, SP .
NATURE BIOTECHNOLOGY, 2004, 22 (02) :214-219