Design Considerations for Massively Parallel Sequencing Studies of Complex Human Disease

被引:23
作者
Feng, Bing-Jian [1 ]
Tavtigian, Sean V. [2 ,3 ]
Southey, Melissa C. [4 ]
Goldgar, David E. [1 ]
机构
[1] Univ Utah Sch Med, Dept Dermatol, Salt Lake City, UT 84112 USA
[2] Univ Utah, Huntsman Canc Inst, Salt Lake City, UT USA
[3] Univ Utah, Dept Oncol Sci, Salt Lake City, UT USA
[4] Univ Melbourne, Dept Pathol, Melbourne, Vic, Australia
基金
美国国家卫生研究院; 加拿大健康研究院;
关键词
GENOME-WIDE ASSOCIATION; RARE VARIANTS; LINKAGE ANALYSIS; STRATEGIES; HETEROGENEITY;
D O I
10.1371/journal.pone.0023221
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Massively Parallel Sequencing (MPS) allows sequencing of entire exomes and genomes to now be done at reasonable cost, and its utility for identifying genes responsible for rare Mendelian disorders has been demonstrated. However, for a complex disease, study designs need to accommodate substantial degrees of locus, allelic, and phenotypic heterogeneity, as well as complex relationships between genotype and phenotype. Such considerations include careful selection of samples for sequencing and a well-developed strategy for identifying the few "true" disease susceptibility genes from among the many irrelevant genes that will be found to harbor rare variants. To examine these issues we have performed simulation-based analyses in order to compare several strategies for MPS sequencing in complex disease. Factors examined include genetic architecture, sample size, number and relationship of individuals selected for sequencing, and a variety of filters based on variant type, multiple observations of genes and concordance of genetic variants within pedigrees. A two-stage design was assumed where genes from the MPS analysis of high-risk families are evaluated in a secondary screening phase of a larger set of probands with more modest family histories. Designs were evaluated using a cost function that assumes the cost of sequencing the whole exome is 400 times that of sequencing a single candidate gene. Results indicate that while requiring variants to be identified in multiple pedigrees and/or in multiple individuals in the same pedigree are effective strategies for reducing false positives, there is a danger of over-filtering so that most true susceptibility genes are missed. In most cases, sequencing more than two individuals per pedigree results in reduced power without any benefit in terms of reduced overall cost. Further, our results suggest that although no single strategy is optimal, simulations can provide important guidelines for study design.
引用
收藏
页数:11
相关论文
共 22 条
[1]
A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]
Statistical analysis strategies for association studies involving rare variants [J].
Bansal, Vikas ;
Libiger, Ondrej ;
Torkamani, Ali ;
Schork, Nicholas J. .
NATURE REVIEWS GENETICS, 2010, 11 (11) :773-785
[3]
Uncovering the roles of rare variants in common disease through whole-genome sequencing [J].
Cirulli, Elizabeth T. ;
Goldstein, David B. .
NATURE REVIEWS GENETICS, 2010, 11 (06) :415-425
[4]
Rare Variants Create Synthetic Genome-Wide Associations [J].
Dickson, Samuel P. ;
Wang, Kai ;
Krantz, Ian ;
Hakonarson, Hakon ;
Goldstein, David B. .
PLOS BIOLOGY, 2010, 8 (01)
[5]
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits [J].
Hindorff, Lucia A. ;
Sethupathy, Praveen ;
Junkins, Heather A. ;
Ramos, Erin M. ;
Mehta, Jayashri P. ;
Collins, Francis S. ;
Manolio, Teri A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) :9362-9367
[6]
Power of deep, all-exon resequencing for discovery of human trait genes [J].
Kryukov, Gregory V. ;
Shpunt, Alexander ;
Stamatoyannopoulos, John A. ;
Sunyaev, Shamil R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (10) :3871-3876
[7]
Unexpected Allelic Heterogeneity and Spectrum of Mutations in Fowler Syndrome Revealed by Next-Generation Exome Sequencing [J].
Lalonde, Emilie ;
Albrecht, Steffen ;
Ha, Kevin C. H. ;
Jacob, Karine ;
Bolduc, Nathalie ;
Polychronakos, Constantin ;
Dechelotte, Pierre ;
Majewski, Jacek ;
Jabado, Nada .
HUMAN MUTATION, 2010, 31 (08) :918-923
[8]
STRATEGIES FOR MULTILOCUS LINKAGE ANALYSIS IN HUMANS [J].
LATHROP, GM ;
LALOUEL, JM ;
JULIER, C ;
OTT, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1984, 81 (11) :3443-3446
[9]
LATHROP GM, 1984, AM J HUM GENET, V36, P460
[10]
Methods for detecting associations with rare variants for common diseases: Application to analysis of sequence data [J].
Li, Bingshan ;
Leal, Suzanne M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 83 (03) :311-321