A Confidence Interval for the Wallace Coefficient of Concordance and Its Application to Microbial Typing Methods

被引:65
作者
Pinto, Francisco R. [1 ,2 ]
Melo-Cristino, Jose [1 ]
Ramirez, Mario [1 ]
机构
[1] Fac Med, Inst Mol Med, Inst Microbiol, Lisbon, Portugal
[2] Univ Lisbon, Fac Ciencias, Dept Quim Bioquim, Ctr Quim Bioquim, P-1699 Lisbon, Portugal
来源
PLOS ONE | 2008年 / 3卷 / 11期
关键词
D O I
10.1371/journal.pone.0003696
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Very diverse research fields frequently deal with the analysis of multiple clustering results, which should imply an objective detection of overlaps and divergences between the formed groupings. The congruence between these multiple results can be quantified by clustering comparison measures such as the Wallace coefficient (W). Since the measured congruence is dependent on the particular sample taken from the population, there is variability in the estimated values relatively to those of the true population. In the present work we propose the use of a confidence interval (CI) to account for this variability when W is used. The CI analytical formula is derived assuming a Gaussian sampling distribution and recurring to the algebraic relationship between W and the Simpson's index of diversity. This relationship also allows the estimation of the expected Wallace value under the assumption of independence of classifications. We evaluated the CI performance using simulated and published microbial typing data sets. The simulations showed that the CI has the desired 95% coverage when the W is greater than 0.5. This behaviour is robust to changes in cluster number, cluster size distributions and sample size. The analysis of the published data sets demonstrated the usefulness of the new CI by objectively validating some of the previous interpretations, while showing that other conclusions lacked statistical support.
引用
收藏
页数:8
相关论文
共 14 条
[1]   Comparison of single and complete linkage clustering with the hierarchical factor classification of variables [J].
Camiz, S. ;
Pillar, V. D. .
COMMUNITY ECOLOGY, 2007, 8 (01) :25-30
[2]   Illustration of a common framework for relating multiple typing methods by application to macrolide-resistant Streptococcus pyogenes [J].
Carrico, J. A. ;
Silva-Costa, C. ;
Melo-Cristino, J. ;
Pinto, F. R. ;
de Lencastre, H. ;
Almeida, J. S. ;
Ramirez, M. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2006, 44 (07) :2524-2532
[3]   Antibiotic susceptibility patterns and clones of Pseudomonas aeruginosa in Swedish ICUs [J].
Erlandsson, Marcus ;
Gill, Hans ;
Nordlinder, David ;
Giske, Christian G. ;
Jonas, Daniel ;
Nilsson, Lennart E. ;
Walther, Sten ;
Hanberger, Hakan .
SCANDINAVIAN JOURNAL OF INFECTIOUS DISEASES, 2008, 40 (6-7) :487-494
[4]   Analysis of typing methods for epidemiological surveillance of both methicillin-resistant and methicillin-susceptible Staphylococcus aureus strains [J].
Faria, Nuno A. ;
Carrico, Joao A. ;
Oliveira, Duarte C. ;
Ramirez, Mario ;
de Lencastre, Herminia .
JOURNAL OF CLINICAL MICROBIOLOGY, 2008, 46 (01) :136-144
[5]   Nonoutbreak surveillance of group A streptococci causing invasive disease in Portugal identified internationally disseminated clones among members of a genetically heterogeneous population [J].
Friaes, A. ;
Ramirez, M. ;
Melo-Cristino, J. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2007, 45 (06) :2044-2047
[6]   Determining confidence intervals when measuring genetic diversity and the discriminatory abilities of typing methods for microorganisms [J].
Grundmann, H ;
Hori, S ;
Tanner, G .
JOURNAL OF CLINICAL MICROBIOLOGY, 2001, 39 (11) :4190-4192
[7]   Analysis of group B streptococcal isolates from infants and pregnant women in Portugal revealing two lineages with enhanced invasiveness [J].
Martins, E. R. ;
Pessanha, M. A. ;
Ramirez, M. ;
Melo-Cristino, J. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2007, 45 (10) :3224-3229
[8]   Comparing clusterings by the variation of information [J].
Meila, M .
LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 :173-187
[9]   Comparison of molecular typing methods for characterization of Staphylococcus epidermidis:: Proposal for clone definition [J].
Miragaia, M. ;
Carrico, J. A. ;
Thomas, J. C. ;
Couto, I. ;
Enright, M. C. ;
de Lencastre, H. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2008, 46 (01) :118-129
[10]   Ranked Adjusted Rand:: integrating distance and partition information in a measure of clustering agreement [J].
Pinto, Francisco R. ;
Carrico, Joao A. ;
Ramirez, Mario ;
Almeida, Jonas S. .
BMC BIOINFORMATICS, 2007, 8 (1)