A common open representation of mass spectrometry data and its application to proteomics research

被引:575
作者
Pedrioli, PGA
Eng, JK
Hubley, R
Vogelzang, M
Deutsch, EW
Raught, B
Pratt, B
Nilsson, E
Angeletti, RH
Apweiler, R
Cheung, K
Costello, CE
Hermjakob, H
Huang, S
Julian, RK
Kapp, E
McComb, ME
Oliver, SG
Omenn, G
Paton, NW
Simpson, R
Smith, R
Taylor, CF
Zhu, WM
Aebersold, R
机构
[1] Inst Syst Biol, Seattle, WA 98103 USA
[2] Insilicos LLC, Seattle, WA 98103 USA
[3] Albert Einstein Coll Med, Bronx, NY 10461 USA
[4] EMBL Outstn European Bioinformat Inst, Cambridge, England
[5] Yale Univ, Sch Med, Dept Anesthesiol, Ctr Med Informat, New Haven, CT 06520 USA
[6] Boston Univ, Sch Med, Boston, MA 02118 USA
[7] Lilly Res Labs, Indianapolis, IN 46285 USA
[8] Royal Melbourne Hosp, Ludwig Inst Canc Res, Joint Proteom Lab, Parkville, Vic 3050, Australia
[9] Royal Melbourne Hosp, Walter & Eliza Hall Inst Med Res, Parkville, Vic 3050, Australia
[10] Univ Manchester, Sch Biol Sci, Manchester M13 9PT, Lancs, England
[11] Univ Michigan, Sch Med, Ann Arbor, MI 48109 USA
[12] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England
[13] Pacific NW Natl Lab, Div Biol Sci, Richland, WA 99352 USA
[14] Pacific NW Natl Lab, Environm Mol Sci Lab, Richland, WA 99352 USA
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1038/nbt1031
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
A broad range of mass spectrometers are used in mass spectrometry (MS)-based proteomics research. Each type of instrument possesses a unique design, data system and performance specifications, resulting in strengths and weaknesses for different types of experiments. Unfortunately, the native binary data formats produced by each type of mass spectrometer also differ and are usually proprietary. The diverse, nontransparent nature of the data structure complicates the integration of new instruments into preexisting infrastructure, impedes the analysis, exchange, comparison and publication of results from different experiments and laboratories, and prevents the bioinformatics community from accessing data sets required for software development. Here, we introduce the 'mzXML' format, an open, generic XML (extensible markup language) representation of MS data. We have also developed an accompanying suite of supporting programs. We expect that this format will facilitate data management, interpretation and dissemination in proteomics research.
引用
收藏
页码:1459 / 1466
页数:8
相关论文
共 17 条
[1]   Protein identification by mass spectrometry - Issues to be considered [J].
Baldwin, MA .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (01) :1-9
[2]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[3]   Quantitative analysis of complex protein mixtures using isotope-coded affinity tags [J].
Gygi, SP ;
Rist, B ;
Gerber, SA ;
Turecek, F ;
Gelb, MH ;
Aebersold, R .
NATURE BIOTECHNOLOGY, 1999, 17 (10) :994-999
[4]   Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry [J].
Han, DK ;
Eng, J ;
Zhou, HL ;
Aebersold, R .
NATURE BIOTECHNOLOGY, 2001, 19 (10) :946-951
[5]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392
[6]   A tool to visualize and evaluate data obtained by liquid chromatography-electrospray ionization-mass spectrometry [J].
Li, XJ ;
Pedrioli, PGA ;
Eng, J ;
Martin, D ;
Yi, EC ;
Lee, H ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2004, 76 (13) :3856-3860
[7]   Automated statistical analysis of protein abundance ratios from data generated by stable-isotope dilution and tandem mass spectrometry [J].
Li, XJ ;
Zhang, H ;
Ranish, JA ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2003, 75 (23) :6648-6657
[8]   A statistical model for identifying proteins by tandem mass spectrometry [J].
Nesvizhskii, AI ;
Keller, A ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2003, 75 (17) :4646-4658
[9]   Proteomics: the first decade and beyond [J].
Patterson, SD ;
Aebersold, RH .
NATURE GENETICS, 2003, 33 (Suppl 3) :311-323
[10]  
Perkins DN, 1999, ELECTROPHORESIS, V20, P3551, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO