Informatics for protein identification by mass spectrometry

被引:80
作者
Johnson, RS
Davis, MT
Taylor, JA
Patterson, SD
机构
[1] Amgen Corp, Mol Sci, Seattle, WA 98119 USA
[2] Amgen Corp, Bioinformat Dept, Seattle, WA 98119 USA
关键词
peptide mass fingerprinting; tandem mass spectrometry; peptide sequencing; protein identification; database search; de novo sequencing; bioinformatics; proteomics; homology search;
D O I
10.1016/j.ymeth.2004.08.014
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High throughput protein analysis (i.e., proteomics) first became possible when sensitive peptide mass mapping techniques were developed, thereby allowing for the possibility of identifying and cataloging most 2D gel electrophoresis spots. Shortly thereafter a few groups pioneered the idea of identifying proteins by using peptide tandem mass spectra to search protein sequence databases. Hence, it became possible to identify proteins from very complex mixtures. One drawback to these latter techniques is that it is not entirely straightforward to make matches using tandem mass spectra of peptides that are modified or have sequences that differ slightly from what is present in the sequence database that is being searched. This has been part of the motivation behind automated de novo sequencing programs that attempt to derive a peptide sequence regardless of its presence in a sequence database. The sequence candidates thus generated are then subjected to homology-based database search programs (e.g., BLAST or FASTA). These homology search programs, however, were not developed with mass spectrometry in mind, and it became necessary to make minor modifications such that mass spectrometric ambiguities can be taken into account when comparing query and database sequences. Finally, this review will discuss the important issue of validating protein identifications. All of the search programs will produce a top ranked answer; however, only the credulous are willing to accept them carte blanche. (c) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:223 / 236
页数:14
相关论文
共 71 条
[1]   Toward a human blood serum proteome - Analysis by multidimensional separation coupled with mass spectrometry [J].
Adkins, JN ;
Varnum, SM ;
Auberry, KJ ;
Moore, RJ ;
Angell, NH ;
Smith, RD ;
Springer, DL ;
Pounds, JG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (12) :947-955
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   The human plasma proteome - A nonredundant list developed by combination of four separate sources [J].
Anderson, NL ;
Polanski, M ;
Pieper, R ;
Gatlin, T ;
Tirumalai, RS ;
Conrads, TP ;
Veenstra, TD ;
Adkins, JN ;
Pounds, JG ;
Fagan, R ;
Lobley, A .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (04) :311-326
[4]  
[Anonymous], 1978, Atlas of protein sequence and structure
[5]   INTRAMOLECULAR [O-18] ISOTOPIC EXCHANGE IN THE GAS-PHASE OBSERVED DURING THE TANDEM MASS-SPECTROMETRIC ANALYSIS OF PEPTIDES [J].
BALLARD, KD ;
GASKELL, SJ .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1992, 114 (01) :64-71
[6]   Improving large-scale proteomics by clustering of mass spectrometry data [J].
Beer, I ;
Barnea, E ;
Ziv, T ;
Admon, A .
PROTEOMICS, 2004, 4 (04) :950-960
[7]   DETERMINATION OF AMINO ACID SEQUENCE IN OLIGOPEPTIDES BY COMPUTER INTERPRETATION OF THEIR HIGH-RESOLUTION MASS SPECTRA [J].
BIEMANN, K ;
CONE, C ;
WEBSTER, BR ;
ARSENAUL.GP .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1966, 88 (23) :5598-&
[8]  
CARR SA, 2000, MASS SPECTROMETRY BI, P553
[9]  
Choudhary JS, 2001, PROTEOMICS, V1, P651, DOI 10.1002/1615-9861(200104)1:5<651::AID-PROT651>3.0.CO
[10]  
2-N