Cubic exact solutions for the estimation of pairwise haplotype frequencies:: implications for linkage disequilibrium analyses and a web tool 'CubeX'

被引:248
作者
Gaunt, Tom R. [1 ,2 ]
Rodriguez, Santiago [1 ,2 ]
Day, Ian N. M. [1 ,2 ]
机构
[1] Univ Bristol, Dept Social Med, BGEL, Bristol BS8 2PR, Avon, England
[2] Univ Bristol, Dept Social Med, CAiTE, Bristol BS8 2PR, Avon, England
基金
英国医学研究理事会;
关键词
D O I
10.1186/1471-2105-8-428
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). Results: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r(2) and X(2) values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. Conclusion: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible vertical bar D'vertical bar value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (eg HapMap) and rarer alleles (eg paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.
引用
收藏
页数:9
相关论文
共 29 条
[1]   GOLD - Graphical Overview of Linkage Disequilibrium [J].
Abecasis, GR ;
Cookson, WOC .
BIOINFORMATICS, 2000, 16 (02) :182-183
[2]   Prevalence and functionality of paucimorphic and private MC4R mutations in a large, unselected European British population, scanned by meltMADGE [J].
Alharbi, Khalid K. ;
Spanakis, Emmanuel ;
Tan, Karen ;
Smith, Matt J. ;
Aldahmesh, Mohammed A. ;
O'Dell, Sandra D. ;
Sayer, Avan Aihie ;
Lawlor, Debbie A. ;
Ebrahim, Shah ;
Smith, George Davey ;
O'Rahilly, Stephen ;
Farooqi, Sadaf ;
Cooper, Cyrus ;
Phillips, David I. W. ;
Day, Ian N. M. .
HUMAN MUTATION, 2007, 28 (03) :294-302
[3]   Mutation scanning by meltMADGE:: Validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population [J].
Alharbi, KK ;
Aldahmesh, MA ;
Spanakis, E ;
Haddad, L ;
Whittall, RA ;
Chen, XH ;
Rassoulian, H ;
Smith, MJ ;
Sillibourne, J ;
Ball, NJ ;
Graham, NJ ;
Briggs, PJ ;
Simpson, IA ;
Phillips, DIW ;
Lawlor, DA ;
Ye, S ;
Humphries, SE ;
Cooper, C ;
Smith, GD ;
Ebrahim, S ;
Eccles, DM ;
Day, INM .
GENOME RESEARCH, 2005, 15 (07) :967-977
[4]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[5]   Patterns of linkage disequilibrium in the human genome [J].
Ardlie, KG ;
Kruglyak, L ;
Seielstad, M .
NATURE REVIEWS GENETICS, 2002, 3 (04) :299-309
[6]   Haploview: analysis and visualization of LD and haplotype maps [J].
Barrett, JC ;
Fry, B ;
Maller, J ;
Daly, MJ .
BIOINFORMATICS, 2005, 21 (02) :263-265
[7]   Paucimorphic alleles versus polymorphic alleles and rare mutations in disease causation:: Theory, observation and detection [J].
Day, INM ;
Alharbi, KK ;
Smith, M ;
Aldahmesh, MA ;
Chen, XH ;
Lotery, AJ ;
Pante-De-Sousa, G ;
Hou, GW ;
Ye, S ;
Eccles, D ;
Cross, NCP ;
Fox, KR ;
Rodriguez, S .
CURRENT GENOMICS, 2004, 5 (05) :431-438
[8]  
*FDN PS, 2006, PYTH PROGR LANG
[9]   MIDAS: software for analysis and visualisation of interallelic disequilibrium between multiallelic markers [J].
Gaunt, Tom R. ;
Rodriguez, Santiago ;
Zapata, Carlos ;
Day, Ian N. M. .
BMC BIOINFORMATICS, 2006, 7 (1)
[10]   The International HapMap Project [J].
Gibbs, RA ;
Belmont, JW ;
Hardenbol, P ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Ch'ang, LY ;
Huang, W ;
Liu, B ;
Shen, Y ;
Tam, PKH ;
Tsui, LC ;
Waye, MMY ;
Wong, JTF ;
Zeng, CQ ;
Zhang, QR ;
Chee, MS ;
Galver, LM ;
Kruglyak, S ;
Murray, SS ;
Oliphant, AR ;
Montpetit, A ;
Hudson, TJ ;
Chagnon, F ;
Ferretti, V ;
Leboeuf, M ;
Phillips, MS ;
Verner, A ;
Kwok, PY ;
Duan, SH ;
Lind, DL ;
Miller, RD ;
Rice, JP ;
Saccone, NL ;
Taillon-Miller, P ;
Xiao, M ;
Nakamura, Y ;
Sekine, A ;
Sorimachi, K ;
Tanaka, T ;
Tanaka, Y ;
Tsunoda, T ;
Yoshino, E ;
Bentley, DR ;
Deloukas, P ;
Hunt, S ;
Powell, D ;
Altshuler, D ;
Gabriel, SB ;
Qiu, RZ .
NATURE, 2003, 426 (6968) :789-796