A High-Confidence Human Plasma Proteome Reference Set with Estimated Concentrations in PeptideAtlas

被引:370
作者
Farrah, Terry [1 ]
Deutsch, Eric W. [1 ]
Omenn, Gilbert S. [1 ,2 ]
Campbell, David S. [1 ]
Sun, Zhi [1 ]
Bletz, Julie A. [1 ]
Mallick, Parag [3 ]
Katz, Jonathan E. [3 ]
Malmstroem, Johan [4 ]
Ossola, Reto [4 ]
Watts, Julian D. [1 ]
Lin, Biaoyang [5 ,6 ]
Zhang, Hui [7 ]
Moritz, Robert L. [1 ]
Aebersold, Ruedi [4 ]
机构
[1] Inst Syst Biol, Seattle, WA 98109 USA
[2] Univ Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[3] Univ So Calif, Dept Med, Los Angeles, CA USA
[4] ETH Swiss Fed Inst Technol, Dept Biol, Inst Mol Syst Biol, Zurich, Switzerland
[5] Swedish Med Ctr, Swedish Neurosci Inst, Seattle, WA USA
[6] Zhejiang Univ, Syst Biol Div, Zhejiang California Int Nanosyst Inst ZCNI, Hangzhou 310003, Zhejiang, Peoples R China
[7] Johns Hopkins Univ, Dept Pathol, Baltimore, MD USA
基金
美国国家卫生研究院; 欧洲研究理事会; 瑞士国家科学基金会;
关键词
TANDEM MASS-SPECTROMETRY; DYNAMIC-RANGE CHARACTERIZATION; HUMAN SERUM PROTEOME; QUANTITATIVE PROTEOMICS; HYDRAZIDE CHEMISTRY; SHOTGUN PROTEOMICS; STATISTICAL-MODEL; ABSOLUTE PROTEIN; PILOT PHASE; DATABASE;
D O I
10.1074/mcp.M110.006353
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Human blood plasma can be obtained relatively noninvasively and contains proteins from most, if not all, tissues of the body. Therefore, an extensive, quantitative catalog of plasma proteins is an important starting point for the discovery of disease biomarkers. In 2005, we showed that different proteomics measurements using different sample preparation and analysis techniques identify significantly different sets of proteins, and that a comprehensive plasma proteome can be compiled only by combining data from many different experiments. Applying advanced computational methods developed for the analysis and integration of very large and diverse data sets generated by tandem MS measurements of tryptic peptides, we have now compiled a high-confidence human plasma proteome reference set with well over twice the identified proteins of previous high-confidence sets. It includes a hierarchy of protein identifications at different levels of redundancy following a clearly defined scheme, which we propose as a standard that can be applied to any proteomics data set to facilitate cross-proteome analyses. Further, to aid in development of blood-based diagnostics using techniques such as selected reaction monitoring, we provide a rough estimate of protein concentrations using spectral counting. We identified 20,433 distinct peptides, from which we inferred a highly nonredundant set of 1929 protein sequences at a false discovery rate of 1%. We have made this resource available via PeptideAtlas, a large, multiorganism, publicly accessible compendium of peptides identified in tandem MS experiments conducted by laboratories around the world. Molecular & Cellular Proteomics 10: 10.1074/mcp.M110.006353, 1-14, 2011.
引用
收藏
页数:14
相关论文
共 69 条
[1]
Data management and preliminary data analysis in the pilot phase of the HUPO Plasma Proteome Project [J].
Adamski, M ;
Blackwell, T ;
Menon, R ;
Martens, L ;
Hermjakob, H ;
Taylor, C ;
Omenn, GS ;
States, DJ .
PROTEOMICS, 2005, 5 (13) :3246-3261
[2]
Toward a human blood serum proteome - Analysis by multidimensional separation coupled with mass spectrometry [J].
Adkins, JN ;
Varnum, SM ;
Auberry, KJ ;
Moore, RJ ;
Angell, NH ;
Smith, RD ;
Springer, DL ;
Pounds, JG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (12) :947-955
[3]
The human plasma proteome - History, character, and diagnostic prospects [J].
Anderson, NL ;
Anderson, NG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (11) :845-867
[4]
The human plasma proteome - A nonredundant list developed by combination of four separate sources [J].
Anderson, NL ;
Polanski, M ;
Pieper, R ;
Gatlin, T ;
Tirumalai, RS ;
Conrads, TP ;
Veenstra, TD ;
Adkins, JN ;
Pounds, JG ;
Fagan, R ;
Lobley, A .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (04) :311-326
[5]
Armandola Elena A, 2003, MedGenMed, V5, P18
[6]
Simple modification of a protein database for mass spectral identification of N-linked glycopeptides [J].
Atwood, JA ;
Sahoo, SS ;
Alvarez-Manilla, G ;
Weatherly, DB ;
Kolli, K ;
Orlando, R ;
York, WS .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2005, 19 (21) :3002-3006
[7]
The Universal Protein Resource (UniProt) 2009 [J].
Bairoch, Amos ;
Consortium, UniProt ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Argoud-Puy, Ghislaine ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bolleman, Jerven ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Ciapina, Luciane ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Delbard, Gwennaelle ;
Dornevil, Dolnide ;
Roggli, Paula Duek ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
James, Janet ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Kappler, Thomas ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D169-D174
[8]
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[9]
Boutet Emmanuel, 2007, V406, P89
[10]
Characterization of the Human Plasma Phosphoproteome Using Linear Ion Trap Mass Spectrometry and Multiple Search Engines [J].
Carrascal, Montserrat ;
Gay, Marina ;
Ovelleiro, David ;
Casas, Vanessa ;
Gelpi, Emilio ;
Abian, Joaquin .
JOURNAL OF PROTEOME RESEARCH, 2010, 9 (02) :876-884