Significance Analysis of Spectral Count Data in Label-free Shotgun Proteomics
被引:283
作者:
Choi, Hyungwon
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USAUniv Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
Choi, Hyungwon
[1
,2
]
Fermin, Damian
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USAUniv Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
Fermin, Damian
[1
]
Nesvizhskii, Alexey I.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
Univ Michigan, Ctr Computat Med & Biol, Ann Arbor, MI 48109 USAUniv Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
Nesvizhskii, Alexey I.
[1
,3
]
机构:
[1] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Ctr Computat Med & Biol, Ann Arbor, MI 48109 USA
Spectral counting has become a commonly used approach for measuring protein abundance in label-free shotgun proteomics. At the same time, the development of data analysis methods has lagged behind. Currently most studies utilizing spectral counts rely on simple data transforms and posthoc corrections of conventional signal-to-noise ratio statistics. However, these adjustments can neither handle the bias toward high abundance proteins nor deal with the drawbacks due to the limited number of replicates. We present a novel statistical framework (QSpec) for the significance analysis of differential expression with extensions to a variety of experimental design factors and adjustments for protein properties. Using synthetic and real experimental data sets, we show that the proposed method outperforms conventional statistical methods that search for differential expression for individual proteins. We illustrate the flexibility of the model by analyzing a data set with a complicated experimental design involving cellular localization and time course. Molecular & Cellular Proteomics 7:2373-2385, 2008.