Statistical and computational methods for comparative proteomic profiling using liquid chromatography-tandem mass spectrometry

被引:200
作者
Listgarten, J
Emili, A
机构
[1] Univ Toronto, Charles H Best Inst, Banting & Best Dept Med Res, Toronto, ON M5G 1L6, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
[3] Univ Toronto, Program Prote & Bioinformat, Toronto, ON M5G 1L6, Canada
关键词
D O I
10.1074/mcp.R500005-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The combined method of LC-MS/MS is increasingly being used to explore differences in the proteomic composition of complex biological systems. The reliability and utility of such comparative protein expression profiling studies is critically dependent on an accurate and rigorous assessment of quantitative changes in the relative abundance of the myriad of proteins typically present in a biological sample such as blood or tissue. In this review, we provide an overview of key statistical and computational issues relevant to bottom-up shotgun global proteomic analysis, with an emphasis on methods that can be applied to improve the dependability of biological inferences drawn from large proteomic datasets. Focusing on a start-to-finish approach, we address the following topics: 1) low-level data processing steps, such as formation of a data matrix, filtering, and baseline subtraction to minimize noise, 2) mid-level processing steps, such as data normalization, alignment in time, peak detection, peak quantification, peak matching, and error models, to facilitate profile comparisons; and, 3) high-level processing steps such as sample classification and biomarker discovery, and related topics such as significance testing, multiple testing, and choice of feature space. We report on approaches that have recently been developed for these steps, discussing their merits and limitations, and propose areas deserving of further research.
引用
收藏
页码:419 / 434
页数:16
相关论文
共 52 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   Quantifying reproducibility for differential proteomics: noise analysis for protein liquid chromatography-mass spectrometry of human serum [J].
Anderle, M ;
Roy, S ;
Lin, H ;
Becker, C ;
Joho, K .
BIOINFORMATICS, 2004, 20 (18) :3575-3582
[3]  
[Anonymous], P GENS
[4]   A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples [J].
Baggerly, KA ;
Morris, JS ;
Wang, J ;
Gold, D ;
Xiao, LC ;
Coombes, KR .
PROTEOMICS, 2003, 3 (09) :1667-1672
[5]   Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data [J].
Bylund, D ;
Danielsson, R ;
Malmquist, G ;
Markides, KE .
JOURNAL OF CHROMATOGRAPHY A, 2002, 961 (02) :237-244
[6]  
BYLUND D, 2001, CHEMOMETRIC TOOLS EN
[7]   Listening to silence and understanding nonsense: Exonic mutations that affect splicing [J].
Cartegni, L ;
Chew, SL ;
Krainer, AR .
NATURE REVIEWS GENETICS, 2002, 3 (04) :285-298
[8]  
Chau F.T., 2004, CHEMOMETRICS BASICS
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   Point - Proteomic patterns in biological fluids: Do they represent the future of cancer diagnostics? [J].
Diamandis, EP .
CLINICAL CHEMISTRY, 2003, 49 (08) :1272-1275