An Arabidopsis example of association mapping in structured samples

被引:558
作者
Zhao, Keyan
Aranzana, Maria Jose
Kim, Sung
Lister, Clare
Shindo, Chikako
Tang, Chunlao
Toomajian, Christopher
Zheng, Honggang
Dean, Caroline
Marjoram, Paul
Nordborg, Magnus [1 ]
机构
[1] Univ So Calif, Los Angeles, CA 90089 USA
[2] John Innes Ctr Plant Sci Res, Dept Cell & Dev Biol, Norwich NR4 7UH, Norfolk, England
[3] Univ So Calif, Keck Sch Med, Dept Prevent Med, Los Angeles, CA 90089 USA
来源
PLOS GENETICS | 2007年 / 3卷 / 01期
基金
英国自然环境研究理事会; 英国生物技术与生命科学研究理事会;
关键词
D O I
10.1371/journal.pgen.0030004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a global sample of 95 Arabidopsis thaliana accessions, and that established methods for controlling for population structure are generally insufficient. Here, we use the same sample together with a number of flowering-related phenotypes and data-perturbation simulations to evaluate a wider range of methods for controlling for population structure. We find that, in terms of reducing the false-positive rate while maintaining statistical power, a recently introduced mixed-model approach that takes genome-wide differences in relatedness into account via estimated pairwise kinship coefficients generally performs best. By combining the association results with results from linkage mapping in F2 crosses, we identify one previously known true positive and several promising new associations, but also demonstrate the existence of both false positives and false negatives. Our results illustrate the potential of genome-wide association scans as a tool for dissecting the genetics of natural variation, while at the same time highlighting the pitfalls. The importance of study design is clear; our study is severely under-powered both in terms of sample size and marker density. Our results also provide a striking demonstration of confounding by population structure. While statistical methods can be used to ameliorate this problem, they cannot always be effective and are certainly not a substitute for independent evidence, such as that obtained via crosses or transgenic experiments. Ultimately, association mapping is a powerful tool for identifying a list of candidates that is short enough to permit further genetic study.
引用
收藏
页码:0071 / 0082
页数:12
相关论文
共 50 条
  • [1] A haplotype map of the human genome
    Altshuler, D
    Brooks, LD
    Chakravarti, A
    Collins, FS
    Daly, MJ
    Donnelly, P
    Gibbs, RA
    Belmont, JW
    Boudreau, A
    Leal, SM
    Hardenbol, P
    Pasternak, S
    Wheeler, DA
    Willis, TD
    Yu, FL
    Yang, HM
    Zeng, CQ
    Gao, Y
    Hu, HR
    Hu, WT
    Li, CH
    Lin, W
    Liu, SQ
    Pan, H
    Tang, XL
    Wang, J
    Wang, W
    Yu, J
    Zhang, B
    Zhang, QR
    Zhao, HB
    Zhao, H
    Zhou, J
    Gabriel, SB
    Barry, R
    Blumenstiel, B
    Camargo, A
    Defelice, M
    Faggart, M
    Goyette, M
    Gupta, S
    Moore, J
    Nguyen, H
    Onofrio, RC
    Parkin, M
    Roy, J
    Stahl, E
    Winchester, E
    Ziaugra, L
    Shen, Y
    [J]. NATURE, 2005, 437 (7063) : 1299 - 1320
  • [2] Genome-wide association mapping in Arabidopsis identifies previously known flowering time and pathogen resistance genes
    Aranzana, MJ
    Kim, S
    Zhao, KY
    Bakker, E
    Horton, M
    Jakob, K
    Lister, C
    Molitor, J
    Shindo, C
    Tang, CL
    Toomajian, C
    Traw, B
    Zheng, HG
    Bergelson, J
    Dean, C
    Marjoram, P
    Nordborg, M
    [J]. PLOS GENETICS, 2005, 1 (05): : 531 - 539
  • [3] Regulation of flowering time by FVE, a retinoblastoma-associated protein
    Ausín, I
    Alonso-Blanco, C
    Jarillo, JA
    Ruiz-García, L
    Martínez-Zapater, JM
    [J]. NATURE GENETICS, 2004, 36 (02) : 162 - 166
  • [4] The PHYTOCHROME C photoreceptor gene mediates natural variation in flowering and growth responses of Arabidopsis thaliana
    Balasubramanian, Sureshkumar
    Sureshkumar, Sridevi
    Agrawal, Mitesh
    Michael, Todd P.
    Wessinger, Carrie
    Maloof, Julin N.
    Clark, Richard
    Warthmann, Norman
    Chory, Joanne
    Weigel, Detlef
    [J]. NATURE GENETICS, 2006, 38 (06) : 711 - 715
  • [5] Epistatic interaction between Arabidopsis FRI and FLC flowering time genes generates a latitudinal cline in a life history trait
    Caicedo, AL
    Stinchcombe, JR
    Olsen, KM
    Schmitt, J
    Purugganan, MD
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (44) : 15670 - 15675
  • [6] Interaction of polycomb-group proteins controlling flowering in Arabidopsis
    Chanvivattana, Y
    Bishopp, A
    Schubert, D
    Stock, C
    Moon, YH
    Sung, ZR
    Goodrich, J
    [J]. DEVELOPMENT, 2004, 131 (21): : 5263 - 5276
  • [7] Population structure, differential bias and genomic control in a large-scale, case-control association study
    Clayton, DG
    Walker, NM
    Smyth, DJ
    Pask, R
    Cooper, JD
    Maier, LM
    Smink, LJ
    Lam, AC
    Ovington, NR
    Stevens, HE
    Nutland, S
    Howson, JMM
    Faham, M
    Moorhead, M
    Jones, HB
    Falkowski, M
    Hardenbol, P
    Willis, TD
    Todd, JA
    [J]. NATURE GENETICS, 2005, 37 (11) : 1243 - 1246
  • [8] Genomic control to the extreme
    Devlin, B
    Bacanu, SA
    Roeder, K
    [J]. NATURE GENETICS, 2004, 36 (11) : 1129 - 1130
  • [9] Genomic control for association studies
    Devlin, B
    Roeder, K
    [J]. BIOMETRICS, 1999, 55 (04) : 997 - 1004
  • [10] Falush D, 2003, GENETICS, V164, P1567