Quantification of variability and uncertainty for censored data sets and application to air toxic emission factors

被引:29
作者
Zhao, YC [1 ]
Frey, HC [1 ]
机构
[1] N Carolina State Univ, Dept Civil Construct & Environm Engn, Raleigh, NC 27695 USA
关键词
Bootstrap simulation; censored data sets; Kaplan-Meier estimator; maximum likelihood estimation; Monte Carlo simulation; nondetects; urban air toxics;
D O I
10.1111/j.0272-4332.2004.00504.x
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Many environmental data sets, such as for air toxic emission factors, contain several values reported only as below detection limit. Such data sets are referred to as "censored." Typical approaches to dealing with the censored data sets include replacing censored values with arbitrary values of zero, one-half of the detection limit, or the detection limit. Here, an approach to quantification of the variability and uncertainty of censored data sets is demonstrated. Empirical bootstrap simulation is used to simulate censored bootstrap samples from the original data. Maximum likelihood estimation (MLE) is used to fit parametric probability distributions to each bootstrap sample, thereby specifying alternative estimates of the unknown population distribution of the censored data sets. Sampling distributions for uncertainty in statistics such as the mean, median, and percentile are calculated. The robustness of the method was tested by application to different degrees of censoring, sample sizes, coefficients of variation, and numbers of detection limits. Lognormal, gamma, and Weibull distributions were evaluated. The reliability of using this method to estimate the mean is evaluated by averaging the best estimated means of 20 cases for small sample size of 20. The confidence intervals for distribution percentiles estimated with bootstrap/MLE method compared favorably to results obtained with the nonparametric Kaplan-Meier method. The bootstrap/MLE method is illustrated via an application to an empirical air toxic emission factor data set.
引用
收藏
页码:1019 / 1034
页数:16
相关论文
共 22 条
[1]  
[Anonymous], 1986, ENV POL SUS DEV
[2]  
Casella G, 2001, STAT INFERENCE
[3]  
CLARK JU, 1994, INT C DREDG DREDG MA, V1, P747
[4]  
Cohen A.C., 1988, Parameter Estimation in Reliability and Life SPAN Models
[5]  
Cullen A.C., 1999, Probabilistic Techniques in Expose Assessment: A Handbook for Dealing with Variability and Uncertainty in Models and Inputs
[6]  
Efron B., 1994, INTRO BOOTSTRAP, DOI DOI 10.1201/9780429246593
[7]  
ELVIRA B, 1999, ENVIRON SCI TECHNOL, V33, P2273
[8]  
Frey H.C., 1999, QUANTITATIVE ANAL VA
[9]  
Frey H. C., 2002, TECHNICAL DOCUMENTAT
[10]   Characterizing, simulating, and analyzing variability and uncertainty: An illustration of methods using an air toxics emissions example [J].
Frey, HC ;
Rhodes, DS .
HUMAN AND ECOLOGICAL RISK ASSESSMENT, 1996, 2 (04) :762-797