PLINK: A tool set for whole-genome association and population-based linkage analyses

被引:23903
作者
Purcell, Shaun
Neale, Benjamin
Todd-Brown, Kathe
Thomas, Lori
Ferreira, Manuel A. R.
Bender, David
Maller, Julian
Sklar, Pamela
de Bakker, Paul I. W.
Daly, Mark J.
Sham, Pak C.
机构
[1] Massachusetts Gen Hosp, Ctr Human Genet Res, Boston, MA 02114 USA
[2] Harvard & Massachusetts Inst Technol, Broad Inst, Cambridge, MA USA
[3] Univ London, Inst Psychiat, London, England
[4] Univ Hong Kong, Ctr Gene Res, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
10.1086/519795
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
引用
收藏
页码:559 / 575
页数:17
相关论文
共 39 条
  • [1] A general test of association for quantitative traits in nuclear families
    Abecasis, GR
    Cardon, LR
    Cookson, WOC
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (01) : 279 - 292
  • [2] Agresti A, 1990, CATEGORICAL DATA ANA, P100
  • [3] Haploview: analysis and visualization of LD and haplotype maps
    Barrett, JC
    Fry, B
    Maller, J
    Daly, MJ
    [J]. BIOINFORMATICS, 2005, 21 (02) : 263 - 265
  • [4] Haplotype sharing analysis using mantel statistics
    Beckmann, L
    Thomas, DC
    Fischer, C
    Chang-Claude, J
    [J]. HUMAN HEREDITY, 2005, 59 (02) : 67 - 78
  • [5] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [6] BESAG J, 1991, BIOMETRIKA, V78, P301
  • [7] Long homozygous chromosomal segments in reference families from the Centre d'Etude du Polymorphisme Humain
    Broman, KW
    Weber, JL
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (06) : 1493 - 1500
  • [8] Population structure, differential bias and genomic control in a large-scale, case-control association study
    Clayton, DG
    Walker, NM
    Smyth, DJ
    Pask, R
    Cooper, JD
    Maier, LM
    Smink, LJ
    Lam, AC
    Ovington, NR
    Stevens, HE
    Nutland, S
    Howson, JMM
    Faham, M
    Moorhead, M
    Jones, HB
    Falkowski, M
    Hardenbol, P
    Willis, TD
    Todd, JA
    [J]. NATURE GENETICS, 2005, 37 (11) : 1243 - 1246
  • [9] Genomic control for association studies
    Devlin, B
    Roeder, K
    [J]. BIOMETRICS, 1999, 55 (04) : 997 - 1004
  • [10] Doerge RW, 1996, GENETICS, V142, P285