MISCLASSIFICATION AMONG METHODS USED FOR MULTIPLE GROUP DISCRIMINATION - THE EFFECTS OF DISTRIBUTIONAL PROPERTIES

被引:21
作者
BARON, AE
机构
[1] Department of Preventive Medicine and Biometrics and National Center for American Indian and Alaska Native Mental Health Research, Department of Psychiatry, University of Colorado Health Sciences Center, Denver, Colorado, 80262
关键词
D O I
10.1002/sim.4780100511
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Methods of multiple group discriminant analysis have not been fully studied with respect to classification into more than two populations when the covariate distributions are normal or non-normal. The present study examines the classification performance of several multiple discrimination methods under a variety of simulated continuous normal and non-normal covariate distributions. The methods include polychotomous logistic regression, multiple group linear discriminant analysis, kernel density estimation, and rank transformations of the data as input into the linear function. The parameters of interest were distance among populations, configuration of population mean vectors (collinear or forming the vertices of a regular simplex), skewness, kurtosis and bimodality. Simulation of the last three parameters was by log-normal, sinh-1 normal and a two-component mixture of normal distributions, respectively. Results with three trivariate populations show that for all distributions, logistic discrimination classifies close to the optimal under Neyman-Pearson allocation. These results suggest that logistic discrimination is preferable to other widely-used methods for multiple group classification with non-normal data, and is comparable to classification by multiple linear discrimination with normal data.
引用
收藏
页码:757 / 766
页数:10
相关论文
共 55 条
[31]  
Terrell G.R., Scott D.W., Oversmoothed nonparametric density estimates, Journal of the American Statistical Association, 80, pp. 209-214, (1985)
[32]  
Marron J.S., Optimal rates of convergence to Bayes risk in nonparametric discrimination, Annals of Statistics, 11, pp. 1142-1155, (1983)
[33]  
Nezames D.D., (1980)
[34]  
Silverman B.W., Using kernel density estimates to investigate multimodality, Journal of the Royal Statistical Society, Series B, 43, pp. 97-99, (1981)
[35]  
Ashikaga T., Chang P.C., Robustness of Fisher's linear discriminant function under twocomponent mixed normal models, Journal of the American Statistical Association, 76, pp. 676-680, (1981)
[36]  
Balakrishna N., Kocherlakota S., Robustness to nonnormality of the linear discriminant function: Mixtures of normal distributions, Communications in Statistics, 14 A, pp. 465-478, (1985)
[37]  
Day N.E., Kerridge D.F., A general maximum likelihood discriminant, Biometrics, 23, pp. 313-323, (1967)
[38]  
Anderson J.A., Separate sample logistic discrimination, Biometrika, 59, pp. 19-36, (1972)
[39]  
Schlesselman J.J., Case‐Control Studies, (1982)
[40]  
Efron B., The efficiency of logistic regression compared to normal discriminant analysis, Journal of the American Statistical Association, 70, pp. 892-898, (1975)