AUXILIARY OUTCOME DATA AND THE MEAN SCORE METHOD

被引:69
作者
PEPE, MS
REILLY, M
FLEMING, TR
机构
[1] FRED HUTCHINSON CANC RES CTR,SEATTLE,WA 98104
[2] DUBLIN INST TECHNOL,DEPT MATH & STAT,DUBLIN,IRELAND
[3] UNIV WASHINGTON,DEPT BIOSTAT,SEATTLE,WA 98195
基金
美国国家卫生研究院;
关键词
DOUBLE SAMPLING; PSEUDOLIKELIHOOD; SURROGATE END-POINTS; 2-STAGE DESIGNS; EM-ALGORITHM;
D O I
10.1016/0378-3758(94)90194-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In medical research outcomes of interest, Y, are often difficult to ascertain on a sufficiently large number of study subjects. Cost is frequently an issue for example. A more feasible approach might to be ascertain an easily measured but less accurate surrogate outcome variable, A, and to supplement the study with a validation sample of observations for whom both Y and A have been measured. In the context of a regression model P(beta)(Y\X) with X a covariate vector, we propose a method called mean score to make inference about beta using such data. This method does not require specification of the association between Y and A and is semiparametric in this sense. More-over, in contrast to previous work by Espeland and Odoroff (J. Amer. Statist. Assoc. 80 (1985), 663-670), and Buonaceorsi (J. Amer. Statist. Assoc. 85 (1990), 1075-1082), sampling of the true outcome can depend on both covariate and auxiliary data. Two illustrations in real medical contexts demonstrate that auxiliary data can substantially improve efficiency over standard statistical designs. Designs which incorporate auxiliary data may become increasingly useful as budgetary restrictions and health care management play a larger role in medical research. A third illustration demonstrates that the mean score method can be useful in the classical setting when observational datasets contain missing outcome data. Data need not be missing at random in the usual sense (Rubin (Multiple Imputation for Non-Response in Surveys (1987) Wiley, New York). Indeed the mean score method can adjust for biases induced by violation of the missing at random assumption in certain settings. We contend that the mean score method will be particularly useful in observational studies where it is possible, although perhaps inconvenient, to retrieve missing data.
引用
收藏
页码:137 / 160
页数:24
相关论文
共 23 条
[22]   DESIGNING A LOGISTIC-REGRESSION STUDY USING SURROGATE MEASURES FOR EXPOSURE AND OUTCOME [J].
TOSTESON, TD ;
WARE, JH .
BIOMETRIKA, 1990, 77 (01) :11-21
[23]   SURROGATE ENDPOINTS IN CLINICAL-TRIALS - CARDIOVASCULAR-DISEASES [J].
WITTES, J ;
LAKATOS, E ;
PROBSTFIELD, J .
STATISTICS IN MEDICINE, 1989, 8 (04) :415-425