DSM-5 Field Trials in the United States and Canada, Part II: Test-Retest Reliability of Selected Categorical Diagnoses

被引:684
作者
Regier, Darrel A. [1 ]
Narrow, William E.
Clarke, Diana E.
Kraemer, Helena C.
Kuramoto, S. Janet
Kuhl, Emily A.
Kupfer, David J.
机构
[1] Amer Psychiat Assoc, Div Res, Arlington, VA USA
关键词
COLLABORATIVE PROGRAM; BEHAVIORAL-DISORDERS; MENTAL-DISORDERS; MOOD DISORDERS; VALIDITY; CRITERIA; DEPRESSION; ANXIETY; CLASSIFICATION; PSYCHOBIOLOGY;
D O I
10.1176/appi.ajp.2012.12070999
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Objective: The DSM-5 Field Trials were designed to obtain precise (standard error < 0.1) estimates of the intraclass kappa as a measure of the degree to which two clinicians could independently agree on the presence or absence of selected DSM-5 diagnoses when the same patient was interviewed on separate occasions, in clinical settings, and evaluated with usual clinical interview methods. Method: Eleven academic centers in the United States and Canada were selected, and each was assigned several target diagnoses frequently treated in that setting. Consecutive patients visiting a site during the study were screened and stratified on the basis of DSM-IV diagnoses or symptomatic presentations. Patients were randomly assigned to two clinicians for a diagnostic interview; clinicians were blind to any previous diagnosis. All data were entered directly via an Internet-based software system to a secure central server. Detailed research design and statistical methods are presented in an accompanying article. Results: There were a total of 15 adult and eight child/adolescent diagnoses for which adequate sample sizes were obtained to report adequately precise estimates of the intraclass kappa. Overall, five diagnoses were in the very good range (kappa=0.60-0.79), nine in the good range (kappa=0.40-0.59), six in the questionable range (kappa=0.20-0.39), and three in the unacceptable range (kappa values < 0.20). Eight diagnoses had insufficient sample sizes to generate precise kappa estimates at any site. Conclusions: Most diagnoses adequately tested had good to very good reliability with these representative clinical populations assessed with usual clinical interview methods. Some diagnoses that were revised to encompass a broader spectrum of symptom expression or had a more dimensional approach tested in the good to very good range. (Am J Psychiatry 2013; 170:59-70)
引用
收藏
页码:59 / 70
页数:12
相关论文
共 45 条
[21]  
Kramer M., 1961, P 3 WORLD C PSYCHIAT, V3, P153
[22]   The PHQ-15: Validity of a new measure for evaluating the severity of somatic symptoms [J].
Kroenke, K ;
Spitzer, RL ;
Williams, JBW .
PSYCHOSOMATIC MEDICINE, 2002, 64 (02) :258-266
[23]   The PHQ-9 - Validity of a brief depression severity measure [J].
Kroenke, K ;
Spitzer, RL ;
Williams, JBW .
JOURNAL OF GENERAL INTERNAL MEDICINE, 2001, 16 (09) :606-613
[24]  
Kupfer D., 2002, RES AGENDA DSM 5
[25]  
LAHEY BB, 1994, AM J PSYCHIAT, V151, P1163
[26]   Severe Mood Dysregulation, Irritability, and the Diagnostic Boundaries of Bipolar Disorder in Youths [J].
Leibenluft, Ellen .
AMERICAN JOURNAL OF PSYCHIATRY, 2011, 168 (02) :129-142
[27]   Depression, anxiety and somatization in primary care:: syndrome overlap and functional impairment [J].
Loewe, Bernd ;
Spitzer, Robert L. ;
Williams, Janet B. W. ;
Mussell, Monika ;
Schellberg, Dieter ;
Kroenke, Kurt .
GENERAL HOSPITAL PSYCHIATRY, 2008, 30 (03) :191-199
[28]   BIOLOGICAL COMPONENT OF THE NIMH CLINICAL RESEARCH BRANCH COLLABORATIVE PROGRAM ON THE PSYCHOBIOLOGY OF DEPRESSION .1. BACKGROUND AND THEORETICAL CONSIDERATIONS [J].
MAAS, JW ;
KOSLOW, SH ;
DAVIS, JM ;
KATZ, MM ;
MENDELS, J ;
ROBINS, E ;
STOKES, PE ;
BOWDEN, CL .
PSYCHOLOGICAL MEDICINE, 1980, 10 (04) :759-776
[29]   The case for practical clinical trials in psychiatry [J].
March, JS ;
Silva, SG ;
Compton, S ;
Shapiro, M ;
Califf, R ;
Krishnan, R .
AMERICAN JOURNAL OF PSYCHIATRY, 2005, 162 (05) :836-846
[30]  
Marshall H., 1974, Themes and Variations in European Psychiatry: an Anthology, P7