Can one assess whether missing data are missing at random in medical studies?

被引:98
作者
Potthoff, Richard F.
Tudor, Gail E.
Pieper, Karen S.
Hasselblad, Vic
机构
[1] Duke Univ, Med Ctr, Duke Clin Res Inst, Durham, NC 27715 USA
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
关键词
D O I
10.1191/0962280206sm448oa
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
For handling missing data, newer methods such as those based on multiple imputation are generally more accurate than older ones and entail weaker assumptions. Yet most do assume that data are missing at random (MAR). The issue of assessing whether the MAR assumption holds to begin with has been largely ignored. In fact, no way to directly test MAR is available. We propose an alternate assumption, MAR+, that can be tested. MAR+ always implies MAR, so inability to reject MAR+ bodes well for MAR. In contrast, MAR implies MAR+ not universally, but under certain conditions that are often plausible; thus, rejection of MAR+ can raise suspicions about MAR. Our approach is applicable mainly to studies that are not longitudinal. We present five illustrative medical examples, in most of which it turns out that MAR+ fails. There are limits to the ability of sophisticated statistical methods to correct for missing data. Efforts to try to prevent missing data in the first place should therefore receive more attention in medical studies than they have heretofore attracted. If MAR+ is found to fail for a study whose data have already been gathered, extra caution may need to be exercised in the interpretation of the results.
引用
收藏
页码:213 / 234
页数:22
相关论文
共 36 条
[1]  
Agresti A., 1990, CATEGORICAL DATA ANA
[2]  
BARNARD J, 1998, ENCY BIOSTATISTICS, P2772
[3]   Contemporary management of patients with left ventricular systolic dysfunction - Results from the Study of Patients Intolerant of Converting Enzyme Inhibitors (SPICE) Registry [J].
Bart, BA ;
Ertl, G ;
Held, P ;
Kuch, J ;
Maggioni, AP ;
McMurray, J ;
Michelson, EL ;
Rouleau, JL ;
Stevenson, LW ;
Swedberg, K ;
Young, JB ;
Yusuf, S ;
Sellers, MA ;
Granger, CB ;
Califf, RM ;
Pfeffer, MA .
EUROPEAN HEART JOURNAL, 1999, 20 (16) :1182-1190
[4]   ANALYSIS OF RANK MEASURES OF ASSOCIATION FOR ORDINAL DATA FROM LONGITUDINAL-STUDIES [J].
CARR, GJ ;
HAFNER, KB ;
KOCH, GG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (407) :797-804
[5]   A test of missing completely at random for generalised estimating equations with missing data [J].
Chen, HY ;
Little, R .
BIOMETRIKA, 1999, 86 (01) :1-13
[6]   A comparison of inclusive and restrictive strategies in modern missing data procedures [J].
Collins, LM ;
Schafer, JL ;
Kam, CM .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :330-351
[7]   MAXIMUM-LIKELIHOOD ESTIMATION AND MODEL SELECTION IN CONTINGENCY-TABLES WITH MISSING DATA [J].
FUCHS, C .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1982, 77 (378) :270-278
[8]  
Harrell FE Jr, 2001, REGRESSION MODELING
[9]   CONTEMPORARY REPERFUSION THERAPY FOR CARDIOGENIC-SHOCK - THE GUSTO-I TRIAL EXPERIENCE [J].
HOLMES, DR ;
BATES, ER ;
KLEIMAN, NS ;
SADOWSKI, Z ;
HORGAN, JHS ;
MORRIS, DC ;
CALIFF, RM ;
BERGER, PB ;
TOPOL, EJ .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 1995, 26 (03) :668-674
[10]   BIOSTATISTICAL IMPLICATIONS OF DESIGN, SAMPLING, AND MEASUREMENT TO HEALTH-SCIENCE DATA-ANALYSIS [J].
KOCH, GG ;
GILLINGS, DB ;
STOKES, ME .
ANNUAL REVIEW OF PUBLIC HEALTH, 1980, 1 :163-225