Reliability of Health Information on the Internet: An Examination of Experts' Ratings

Cited by: 52
Authors
Craigie, Mark [1 ]
Loader, Brian [2 ]
Burrows, Roger [3 ]
Muncer, Steven [1 ]
Affiliations
[1] Univ Durham, Dept Appl Psychol, Thornaby TS17 6BH, Stockton-on-Tees, England
[2] Univ Teesside, Ctr Informat Res & Applicat, Middlesbrough, Cleveland, England
[3] Univ York, Ctr Housing Policy, York YO10 5DD, N Yorkshire, England
Funding
Economic and Social Research Council (ESRC), UK
Keywords
Newsgroup; Internet; rating information; reliability; reproducibility of results; statistics; quality control
DOI
10.2196/jmir.4.1.e2
Chinese Library Classification
R19 [Health Care Organization and Services (Health Service Management)]
Abstract
Background: The use of medical experts to rate the content of health-related sites on the Internet has flourished in recent years. In this research, it has been common practice to use a single medical expert to rate the content of the Web sites. In many cases, the expert has rated the Internet health information as poor, and even potentially dangerous. However, one problem with this approach is that there is no guarantee that other medical experts would rate the sites in a similar manner.
Objectives: The aim was to assess the reliability of medical experts' judgments of threads in an Internet newsgroup related to a common disease. A secondary aim was to show the limitations of commonly used statistics for measuring reliability (eg, kappa).
Method: The participants in this study were 5 medical doctors who worked in a specialist unit dedicated to the treatment of the disease. They each rated the information contained in newsgroup threads using a 6-point scale designed by the experts themselves. Their ratings were analyzed for reliability using a number of statistics: Cohen's kappa, gamma, Kendall's W, and Cronbach's alpha.
Results: Reliability was absent for ratings of questions and low for ratings of responses. The various measures of reliability gave conflicting results, and no measure produced high reliability.
Conclusions: The medical experts showed low agreement when rating the postings from the newsgroup. Hence, it is important to test inter-rater reliability in research assessing the accuracy and quality of health-related information on the Internet. A discussion of the different measures of agreement reveals that the choice of statistic can be problematic, so the assumptions underlying a measure of reliability should be considered before using it. Often, more than one measure will be needed for "triangulation" purposes.
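As a rough illustration of the agreement statistics named in the abstract, the short Python sketch below computes mean pairwise Cohen's kappa, Kendall's W, and Cronbach's alpha for a set of hypothetical ratings (5 raters scoring threads on a 6-point scale). This is not the authors' code: the ratings matrix is invented, gamma is omitted because SciPy has no built-in for it, and averaging kappa over rater pairs is just one common way to extend it beyond two raters.

import numpy as np
from itertools import combinations
from scipy.stats import rankdata
from sklearn.metrics import cohen_kappa_score

# Hypothetical data: rows = newsgroup threads, columns = raters,
# scores on a 1-6 scale (invented for illustration).
ratings = np.array([
    [4, 3, 5, 4, 2],
    [2, 2, 3, 1, 2],
    [6, 5, 6, 5, 4],
    [3, 4, 2, 3, 3],
    [5, 5, 4, 6, 5],
    [1, 2, 1, 2, 1],
])
n_threads, n_raters = ratings.shape

# Cohen's kappa is defined for two raters, so average it over all pairs.
pair_kappas = [cohen_kappa_score(ratings[:, i], ratings[:, j])
               for i, j in combinations(range(n_raters), 2)]
print("Mean pairwise Cohen's kappa:", np.mean(pair_kappas))

# Kendall's W: rank the threads within each rater, then compare rank sums.
ranks = np.apply_along_axis(rankdata, 0, ratings)  # ranks per rater column
rank_sums = ranks.sum(axis=1)                      # one rank sum per thread
s = ((rank_sums - rank_sums.mean()) ** 2).sum()
w = 12 * s / (n_raters**2 * (n_threads**3 - n_threads))
print("Kendall's W:", w)

# Cronbach's alpha, treating the raters as "items".
item_vars = ratings.var(axis=0, ddof=1).sum()      # sum of per-rater variances
total_var = ratings.sum(axis=1).var(ddof=1)        # variance of thread totals
alpha = (n_raters / (n_raters - 1)) * (1 - item_vars / total_var)
print("Cronbach's alpha:", alpha)

Note that the Kendall's W formula used here omits the correction for tied ranks, which matters when many threads receive the same score; with real data, a tie-corrected implementation would be preferable.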
Pages: 17-27