On inferring presence of an individual in a mixture: a Bayesian approach

被引:21
作者
Clayton, David [1 ,2 ]
机构
[1] Univ Cambridge, Wellcome Trust Juvenile Diabet Res Fdn, Addenbrookes Hosp, Diabet & Inflammat Lab, Cambridge CB2 0XY, England
[2] Univ Cambridge, Dept Med Genet, Addenbrookes Hosp, Cambridge Inst Med Res, Cambridge CB2 0XY, England
基金
英国惠康基金;
关键词
Bayesian analysis; Data confidentiality; Statistical genetics; GENOME-WIDE ASSOCIATION; SELECTION; LASSO;
D O I
10.1093/biostatistics/kxq035
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Homer and others (2008. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genetics 4, e1000167) recently showed that, given allele frequency data for a large number of single nucleotide polymorphisms in a sample together with corresponding population "reference" frequencies, by typing an individual's DNA sample at the same set of loci it can be inferred whether or not the individual was a member of the sample. This observation has been responsible for precautionary removal of large amounts of summary data from public access. This and further work on the problem has followed a frequentist approach. This paper sets out a Bayesian analysis of this problem which clarifies the role of the reference frequencies and allows incorporation of prior probabilities of the individual's membership in the sample.
引用
收藏
页码:661 / 673
页数:13
相关论文
共 10 条
[1]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[2]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[3]   Sparse inverse covariance estimation with the graphical lasso [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Robert .
BIOSTATISTICS, 2008, 9 (03) :432-441
[4]   The International HapMap Project [J].
Gibbs, RA ;
Belmont, JW ;
Hardenbol, P ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Ch'ang, LY ;
Huang, W ;
Liu, B ;
Shen, Y ;
Tam, PKH ;
Tsui, LC ;
Waye, MMY ;
Wong, JTF ;
Zeng, CQ ;
Zhang, QR ;
Chee, MS ;
Galver, LM ;
Kruglyak, S ;
Murray, SS ;
Oliphant, AR ;
Montpetit, A ;
Hudson, TJ ;
Chagnon, F ;
Ferretti, V ;
Leboeuf, M ;
Phillips, MS ;
Verner, A ;
Kwok, PY ;
Duan, SH ;
Lind, DL ;
Miller, RD ;
Rice, JP ;
Saccone, NL ;
Taillon-Miller, P ;
Xiao, M ;
Nakamura, Y ;
Sekine, A ;
Sorimachi, K ;
Tanaka, T ;
Tanaka, Y ;
Tsunoda, T ;
Yoshino, E ;
Bentley, DR ;
Deloukas, P ;
Hunt, S ;
Powell, D ;
Altshuler, D ;
Gabriel, SB ;
Qiu, RZ .
NATURE, 2003, 426 (6968) :789-796
[5]   Investigation of the fine structure of European populations with applications to disease association studies [J].
Heath, Simon C. ;
Gut, Ivo G. ;
Brennan, Paul ;
McKay, James D. ;
Bencko, Vladimir ;
Fabianova, Eleonora ;
Foretova, Lenka ;
Georges, Michel ;
Janout, Vladimir ;
Kabesch, Michael ;
Krokan, Hans E. ;
Elvestad, Maiken B. ;
Lissowska, Jolanta ;
Mates, Dana ;
Rudnai, Peter ;
Skorpen, Frank ;
Schreiber, Stefan ;
Soria, Jose M. ;
Syvanen, Ann-Christine ;
Meneton, Pierre ;
Hercberg, Serge ;
Galan, Pilar ;
Szeszenia-Dabrowska, Neonilia ;
Zaridze, David ;
Genin, Emmanuel ;
Cardon, Lon R. ;
Lathrop, Mark .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2008, 16 (12) :1413-1429
[6]   Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays [J].
Homer, Nils ;
Szelinger, Szabolcs ;
Redman, Margot ;
Duggan, David ;
Tembe, Waibhav ;
Muehling, Jill ;
Pearson, John V. ;
Stephan, Dietrich A. ;
Nelson, Stanley F. ;
Craig, David W. .
PLOS GENETICS, 2008, 4 (08)
[7]   A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies [J].
Jacobs, Kevin B. ;
Yeager, Meredith ;
Wacholder, Sholom ;
Craig, David ;
Kraft, Peter ;
Hunter, David J. ;
Paschal, Justin ;
Manolio, Teri A. ;
Tucker, Margaret ;
Hoover, Robert N. ;
Thomas, Gilles D. ;
Chanock, Stephen J. ;
Chatterjee, Nilanjan .
NATURE GENETICS, 2009, 41 (11) :1253-U126
[8]   High-dimensional graphs and variable selection with the Lasso [J].
Meinshausen, Nicolai ;
Buehlmann, Peter .
ANNALS OF STATISTICS, 2006, 34 (03) :1436-1462
[9]   A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics [J].
Schäfer, J ;
Strimmer, K .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2005, 4 :1-30
[10]   Model selection and estimation in the Gaussian graphical model [J].
Yuan, Ming ;
Lin, Yi .
BIOMETRIKA, 2007, 94 (01) :19-35