Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies

被引:123
作者
Zhong, Hua [1 ]
Prentice, Ross L. [2 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98105 USA
[2] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
关键词
bias-reduced estimator; genome-wide association study; odds ratio; selection adjusted confidence interval; selection bias;
D O I
10.1093/biostatistics/kxn001
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies (GWAS) provide an important approach to identifying common genetic variants that predispose to human disease. A typical GWAS may genotype hundreds of thousands of single nucleotide polymorphisms (SNPs) located throughout the human genome in a set of cases and controls. Logistic regression is often used to test for association between a SNP genotype and case versus control status, with corresponding odds ratios (ORs) typically reported only for those SNPs meeting selection criteria. However, when these estimates are based on the original data used to detect the variant, the results are affected by a selection bias sometimes referred to the "winner's curse" (Capen and others, 1971). The actual genetic association is typically overestimated. We show that such selection bias may be severe in the sense that the conditional expectation of the standard OR estimator may be quite far away from the underlying parameter. Also standard confidence intervals (CIs) may have far from the desired coverage rate for the selected ORs. We propose and evaluate 3 bias-reduced estimators, and also corresponding weighted estimators that combine corrected and uncorrected estimators, to reduce selection bias. Their corresponding CIs are also proposed. We study the performance of these estimators using simulated data sets and show that they reduce the bias and give CI coverage close to the desired level under various scenarios, even for associations having only small statistical power.
引用
收藏
页码:621 / 634
页数:14
相关论文
共 20 条
[1]  
Agresti A., 1990, Analysis of categorical data
[2]   TESTS FOR LINEAR TRENDS IN PROPORTIONS AND FREQUENCIES [J].
ARMITAGE, P .
BIOMETRICS, 1955, 11 (03) :375-386
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[5]   COMPETITIVE BIDDING IN HIGH-RISK SITUATIONS [J].
CAPEN, EC ;
CLAPP, RV ;
CAMPBELL, WM .
JOURNAL OF PETROLEUM TECHNOLOGY, 1971, 23 (JUN) :641-&
[6]   SOME METHODS FOR STRENGTHENING THE COMMON X2 TESTS [J].
COCHRAN, WG .
BIOMETRICS, 1954, 10 (04) :417-451
[7]   Genome-wide association study identifies novel breast cancer susceptibility loci [J].
Easton, Douglas F. ;
Pooley, Karen A. ;
Dunning, Alison M. ;
Pharoah, Paul D. P. ;
Thompson, Deborah ;
Ballinger, Dennis G. ;
Struewing, Jeffery P. ;
Morrison, Jonathan ;
Field, Helen ;
Luben, Robert ;
Wareham, Nicholas ;
Ahmed, Shahana ;
Healey, Catherine S. ;
Bowman, Richard ;
Meyer, Kerstin B. ;
Haiman, Christopher A. ;
Kolonel, Laurence K. ;
Henderson, Brian E. ;
Le Marchand, Loic ;
Brennan, Paul ;
Sangrajrang, Suleeporn ;
Gaborieau, Valerie ;
Odefrey, Fabrice ;
Shen, Chen-Yang ;
Wu, Pei-Ei ;
Wang, Hui-Chun ;
Eccles, Diana ;
Evans, D. Gareth ;
Peto, Julian ;
Fletcher, Olivia ;
Johnson, Nichola ;
Seal, Sheila ;
Stratton, Michael R. ;
Rahman, Nazneen ;
Chenevix-Trench, Georgia ;
Bojesen, Stig E. ;
Nordestgaard, Borge G. ;
Axelsson, Christen K. ;
Garcia-Closas, Montserrat ;
Brinton, Louise ;
Chanock, Stephen ;
Lissowska, Jolanta ;
Peplonska, Beata ;
Nevanlinna, Heli ;
Fagerholm, Rainer ;
Eerola, Hannaleena ;
Kang, Daehee ;
Yoo, Keun-Young ;
Noh, Dong-Young ;
Ahn, Sei-Hyun .
NATURE, 2007, 447 (7148) :1087-U7
[8]  
Galton F., 1886, The Journal of the Anthropological Institute of Great Britain and Ireland, V15, P246, DOI DOI 10.2307/2841583
[9]   Upward bias in odds ratio estimates from genome-wide association studies [J].
Garner, Chad .
GENETIC EPIDEMIOLOGY, 2007, 31 (04) :288-295
[10]   A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer [J].
Hunter, David J. ;
Kraft, Peter ;
Jacobs, Kevin B. ;
Cox, David G. ;
Yeager, Meredith ;
Hankinson, Susan E. ;
Wacholder, Sholom ;
Wang, Zhaoming ;
Welch, Robert ;
Hutchinson, Amy ;
Wang, Junwen ;
Yu, Kai ;
Chatterjee, Nilanjan ;
Orr, Nick ;
Willett, Walter C. ;
Colditz, Graham A. ;
Ziegler, Regina G. ;
Berg, Christine D. ;
Buys, Saundra S. ;
McCarty, Catherine A. ;
Feigelson, Heather Spencer ;
Calle, Eugenia E. ;
Thun, Michael J. ;
Hayes, Richard B. ;
Tucker, Margaret ;
Gerhard, Daniela S. ;
Fraumeni, Joseph F., Jr. ;
Hoover, Robert N. ;
Thomas, Gilles ;
Chanock, Stephen J. .
NATURE GENETICS, 2007, 39 (07) :870-874