Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach

被引:58
作者
Lu, J [1 ]
Tomfohr, JK [1 ]
Kepler, TB [1 ]
机构
[1] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27708 USA
关键词
D O I
10.1186/1471-2105-6-165
中图分类号
Q5 [生物化学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Background: In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t(w) test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made. Results: In this article, we introduce an overdispersed log-linear model approach to analyzing SAGE; we evaluate and compare its performance with three other tests: the two-sample t test, t(w) test and another based on overdispersed logistic linear regression. Analysis of simulated and real datasets show that both the log-linear and logistic overdispersion methods generally perform better than the t and t(w) tests; the log-linear method is further found to have better performance than the logistic method, showing equal or higher statistical power over a range of parameter values and with different data distributions. Conclusion: Overdispersed log-linear models provide an attractive and reliable framework for analyzing SAGE experiments involving multiple libraries. For convenience, the implementation of this method is available through a user-friendly web-interface available at http://www.cbcb.duke.edu/sage.
引用
收藏
页数:14
相关论文
共 34 条
[1]
[Anonymous], 2011, Categorical data analysis
[2]
[Anonymous], 2021, Bayesian Data Analysis
[3]
The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995
[4]
Overdispersed logistic regression for SAGE: Modelling multiple groups and covariates [J].
Baggerly, KA ;
Deng, L ;
Morris, JS ;
Aldaz, CM .
BMC BIOINFORMATICS, 2004, 5 (1)
[5]
Differential expression in SAGE: accounting for normal between-library variation [J].
Baggerly, KA ;
Deng, L ;
Morris, JS ;
Aldaz, CM .
BIOINFORMATICS, 2003, 19 (12) :1477-1483
[6]
MicroSAGE is highly representative and reproducible but reveals major differences in gene expression among samples obtained from similar tissues [J].
Blackshaw, S ;
Kuo, WP ;
Park, PJ ;
Tsujikawa, M ;
Gunnersen, JM ;
Scott, HS ;
Boon, WM ;
Tan, SS ;
Cepko, CL .
GENOME BIOLOGY, 2003, 4 (03)
[7]
An anatomy of normal and malignant gene expression [J].
Boon, K ;
Osório, EC ;
Greenhut, SF ;
Schaefer, CF ;
Shoemaker, J ;
Polyak, K ;
Morin, PJ ;
Buetow, KH ;
Strausberg, RL ;
de Souza, SJ ;
Riggins, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (17) :11287-11292
[8]
BRESLOW NE, 1984, J R STAT SOC C-APPL, V33, P38
[9]
BYU B, 2002, CANCER RES, V62, P819
[10]
CASELLA G, 2002, STAT INFERENCES