Smoothing observational data: A philosophy and implementation for the health sciences

被引:13
作者
Greenland, S [1 ]
机构
[1] Univ Calif Los Angeles, Dept Epidemiol, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA
关键词
bias; empirical Bayes; epidemiologic methods; hierarchical regression; penalized likelihood; sensitivity analysis; smoothing;
D O I
10.1111/j.1751-5823.2006.tb00159.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standard statistical methods (such as regression analysis) presume the data are generated by an identifiable random process, and attempt to model that process in a parsimonious fashion. In contrast, observational data in the health sciences are generated by complex, nonidentified, and largely nonrandom mechanisms, and are analyzed to form inferences on latent structures. Despite this gap between the methods and reality, most observational data analysis comprises application of standard methods, followed by narrative discussion of the problems of entailed by doing so. Alternative approaches employ latent-structure models that include components for nonidentified mechanisms. Standard methods can still be useful, however, provided their modeling philosophy is modified to encourage preservation of structure, rather than achieving parsimonious description. With this modification they can be viewed as smoothing or filtering methods for separating noise from signal before the task of latent-structure modeling begins. I here give a detailed justification of this view, and a hierarchical-modeling implementation that can be carried out with popular software. Concepts are illustrated in the smoothing of a contingency table from an analysis of magnetic fields and childhood leukemia.
引用
收藏
页码:31 / 46
页数:16
相关论文
共 71 条
[1]  
[Anonymous], 2000, C&H TEXT STAT SCI
[2]  
[Anonymous], 2000, CAUSALITY
[3]   Bayesian binomial regression: Predicting survival at a trauma center [J].
Bedrick, EJ ;
Christensen, R ;
Johnson, W .
AMERICAN STATISTICIAN, 1997, 51 (03) :211-218
[4]   A new perspective on priors for generalized linear models [J].
Bedrick, EJ ;
Christensen, R ;
Johnson, W .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1450-1460
[5]  
Berk RA., 2004, Regression analysis: a constructive critique
[6]  
Bishop M.M., 1975, DISCRETE MULTIVARIAT
[7]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[8]  
Cheng KK, 1999, LANCET, V354, P1925, DOI 10.1016/S0140-6736(99)10074-6
[9]   MULTIPLE IMPUTATION OF INDUSTRY AND OCCUPATION CODES IN CENSUS PUBLIC-USE SAMPLES USING BAYESIAN LOGISTIC-REGRESSION [J].
CLOGG, CC ;
RUBIN, DB ;
SCHENKER, N ;
SCHULTZ, B ;
WEIDMAN, L .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1991, 86 (413) :68-78
[10]   Extra low frequency electric and magnetic fields in the bedplace of children diagnosed with leukaemia: A case-control study [J].
Coghill, RW ;
Steward, J ;
Philips, A .
EUROPEAN JOURNAL OF CANCER PREVENTION, 1996, 5 (03) :153-158