A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screen for depression

被引:563
作者
Manea, Laura [1 ,2 ]
Gilbody, Simon
McMillan, Dean
机构
[1] Univ York, Hull York Med Sch, York YO10 5DD, N Yorkshire, England
[2] Univ York, Dept Hlth Sci, York YO10 5DD, N Yorkshire, England
关键词
Depression; Screening; Questionnaire; Psychometrics; Meta-analysis; PRIMARY-CARE; MAJOR DEPRESSION; SYSTEMATIC REVIEWS; MENTAL-DISORDERS; VALIDITY; VALIDATION; ANXIETY; HETEROGENEITY; ACCURACY; VERSION;
D O I
10.1016/j.genhosppsych.2014.09.009
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Background: The depression module of the Patient Health Questionnaire-9 (PHQ-9) is a widely used depression screening instrument in nonpsychiatric settings. The PHQ-9 can be scored using different methods, including an algorithm based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition criteria and a cut-off based on summed-item scores. The algorithm was the originally proposed scoring method to screen for depression. We summarized the diagnostic test accuracy of the PHQ-9 using the algorithm scoring method across a range of validation studies and compared the diagnostic properties of the PHQ-9 using the algorithm and summed scoring method at the proposed cut-off point of 10. Methods: We performed a systematic review of diagnostic accuracy studies of the PHQ-9 using the algorithm scoring method to detect major depressive disorder (MDD). We used meta-analytic methods to calculate summary sensitivity, specificity, likelihood ratios and diagnostic odds ratios for diagnosing MDD of the PHQ-9 using algorithm scoring method. In studies that reported both scoring methods (algorithm and summed-item scoring at proposed cut-off point of >= 10), we compared the diagnostic properties of the PHQ-9 using these methods. Results: We found 27 validation studies that validated the algorithm scoring method of the PHQ-9 in various settings. There was substantial heterogeneity across studies, which makes the pooled results difficult to interpret. In general, sensitivity was low whereas specificity was good. Thirteen studies reported the diagnostic properties of the PHQ-9 for both scoring methods. Pooled sensitivity for algorithm scoring method was lower while specificities were good for both scoring methods. Heterogeneity was consistently high; therefore, caution should be used when interpreting these results. Interpretation: This review shows that, if the algorithm scoring method is used, the PHQ-9 has a low sensitivity for detecting MDD. This could be due to the rating scale categories of the measure, higher specificity or other factors that warrant further research. The summed-item score method at proposed cut-off point of >= 10 has better diagnostic performance for screening purposes or where a high sensitivity is needed. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:67 / 75
页数:9
相关论文
共 49 条
  • [1] [Anonymous], 2009, Systematic reviews: CRD's guidancefor undertaking reviews in health care
  • [2] [Anonymous], 2009, CLIN GUID 91 DEPR AD
  • [3] [Anonymous], 2009, DEPR TREATM MAN DEPR
  • [4] [Anonymous], 1996, GUID CLIN PREV SERV
  • [5] Validation of PHQ-2 and PHQ-9 to Screen for Major Depression in the Primary Care Population
    Arroll, Bruce
    Goodyear-Smith, Felicity
    Crengle, Susan
    Gunn, Jane
    Kerse, Ngaire
    Fishman, Tana
    Falloon, Karen
    Hatcher, Simon
    [J]. ANNALS OF FAMILY MEDICINE, 2010, 8 (04) : 348 - 353
  • [6] 'Do you think you suffer from depression?' Reevaluating the use of a single item question for the screening of depression in older primary care patients
    Ayalon, Liat
    Goldfracht, Margalit
    Bech, Per
    [J]. INTERNATIONAL JOURNAL OF GERIATRIC PSYCHIATRY, 2010, 25 (05) : 497 - 502
  • [7] DEEKS J, 2000, SYSTEMATIC REV HLTH, P248
  • [8] Conducting systematic reviews of diagnostic studies: Didactic guidelines
    Devillé W.L.
    Buntinx F.
    Bouter L.M.
    Montori V.M.
    De Vet H.C.W.
    Van Der Windt D.A.W.M.
    Bezemer P.D.
    [J]. BMC Medical Research Methodology, 2 (1) : 1 - 13
  • [9] Validation and utility of the patient health questionnaire in diagnosing mental disorders in 1003 general hospital Spanish inpatients
    Diez-Quevedo, C
    Rangil, T
    Sanchez-Planell, L
    Kroenke, K
    Spitzer, RL
    [J]. PSYCHOSOMATIC MEDICINE, 2001, 63 (04): : 679 - 686
  • [10] Limitations of the Patient Health Questionnaire in identifying anxiety and depression in community mental health: Many cases are undetected
    Eack, Shaun M.
    Greeno, Catherine G.
    Lee, Bong-Jae
    [J]. RESEARCH ON SOCIAL WORK PRACTICE, 2006, 16 (06) : 625 - 631