Sample size considerations in genetic polymorphism studies

被引:29
作者
B-Rao, C [1 ]
机构
[1] CSIR, Ctr Biochem Technol, Funct Genom Unit, Delhi 110007, India
关键词
sample size; genetic polymorphisms; allele frequency distribution estimation; population studies;
D O I
10.1159/000053376
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Objectives: Molecular studies for genetic polymorphisms are being carried out for a number of different applications, such as genetic disorders in different populations, pharmacogenomics, genetic identification of ethnic groups for forensic and legal applications, genetic identification of breed/stock in animals and plants for commercial applications and conservation of germ plasm. In this paper, for a random sampling scheme, we address two questions: (A) What should be the minimum size of the sample so that, with a prespecified probability, all alleles at a given locus (or haplotypes at a given set of loci) are detected? (B) What should be the sample size so that the allele frequency distribution at a given locus (or haplotype frequency distribution at a given set of loci) is estimated reliably within permissible error limits? Methods: We have used combinatorial probabilistic arguments and Monte Carlo simulations to answer these questions. Results: We found that the minimum sample size required in case A depends mainly on the prespecified probability of detecting all alleles, while in case B, it varies greatly depending on the permissible error in estimation (which will vary with the application). We have obtained the minimum sample sizes for different degrees of polymorphism at a locus under high stringency, as well as a relaxed level of permissible error. We present a detailed sampling procedure for estimating allele frequencies at a given locus, which will be of use in practical applications. Conclusion: Since the sample size required for reliable estimation of allele frequency distribution increases with the number of alleles at the locus, there is a strong case for using biallelic markers (like single nucleotide polymorphisms) when the available sample size is about 800 or less. Copyright (C) 2001 S. Karger AG, Basel.
引用
收藏
页码:191 / 200
页数:10
相关论文
共 41 条
[1]  
Borecki IB, 1999, GENET EPIDEMIOL, V17, pS73
[2]   DETERMINATION OF EVOLUTIONARY RELATIONSHIPS AMONG SHEEP BREEDS USING MICROSATELLITES [J].
BUCHANAN, FC ;
ADAMS, LJ ;
LITTLEJOHN, RP ;
MADDOX, JF ;
CRAWFORD, AM .
GENOMICS, 1994, 22 (02) :397-403
[3]   Isolation of a novel potassium channel gene hSKCa3 containing a polymorphic CAG repeat: a candidate for schizophrenia and bipolar disorder? [J].
Chandy, KG ;
Fantino, E ;
Wittekindt, O ;
Kalman, K ;
Tong, LL ;
Ho, TH ;
Gutman, GA ;
Crocq, MA ;
Ganguli, R ;
Nimgaonkar, V ;
Morris-Rosendahl, DJ ;
Gargus, JJ .
MOLECULAR PSYCHIATRY, 1998, 3 (01) :32-37
[5]   INTRA-POPULATION AND INTER-POPULATION DIVERSITY AT SHORT TANDEM REPEAT LOCI IN DIVERSE POPULATIONS OF THE WORLD [J].
DEKA, R ;
SHRIVER, MD ;
YU, LM ;
FERRELL, RE ;
CHAKRABORTY, R .
ELECTROPHORESIS, 1995, 16 (09) :1659-1664
[6]  
FELLER W, 1983, INTRO PROBABILITY TH, V1
[7]  
Hayney M S, 1998, Int J Infect Dis, V2, P143, DOI 10.1016/S1201-9712(98)90116-3
[8]   Genetic variation at six STR loci (HUMTH01, HUMTPOX, HUMCSF1PO, HUMF13A01, HUMFES/FPS, HUMVWFA31) in Aragon (North Spain) [J].
Jarreta, BM ;
Roche, PD ;
Abecia, E .
FORENSIC SCIENCE INTERNATIONAL, 1999, 100 (1-2) :87-92
[9]  
JORDE LB, 1995, AM J HUM GENET, V57, P523
[10]   Genetic and immunological characteristics of Type I diabetes mellitus in an Indo-Aryan population [J].
Kelly, MA ;
Alvi, NS ;
Croft, NJ ;
Mijovic, CH ;
Bottazzo, GF ;
Barnett, AH .
DIABETOLOGIA, 2000, 43 (04) :450-456