Predictive data mining in clinical medicine: Current issues and guidelines

Cited by: 467
Authors
Bellazzi, Riccardo [1]
Zupan, Blaz [2, 3]
Affiliations
[1] Univ Pavia, Dipartimento Informat & Sistemist, I-27100 Pavia, Italy
[2] Univ Ljubljana, Fac Comp Sci, Ljubljana 61000, Slovenia
[3] Baylor Coll Med, Dept Human & Mol Genet, Houston, TX 77030 USA
Keywords
data mining; predictive models; clinical medicine; data mining process; data analysis
DOI
10.1016/j.ijmedinf.2006.11.006
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: The widespread availability of new computational methods and tools for data analysis and predictive modeling requires medical informatics researchers and practitioners to systematically select the most appropriate strategy to cope with clinical prediction problems. In particular, the collection of methods known as 'data mining' offers methodological and technical solutions for the analysis of medical data and the construction of prediction models. The large variety of these methods calls for general and simple guidelines that may help practitioners in the appropriate selection of data mining tools, in the construction and validation of predictive models, and in the dissemination of predictive models within clinical environments.
Purpose: The goal of this review is to discuss the extent and role of the research area of predictive data mining and to propose a framework to cope with the problems of constructing, assessing and exploiting data mining models in clinical medicine.
Methods: We review the recent relevant work published in the area of predictive data mining in clinical medicine, highlighting critical issues and summarizing the approaches in a set of learned lessons.
Results: The paper provides a comprehensive review of the state of the art of predictive data mining in clinical medicine and gives guidelines to carry out data mining studies in this field.
Conclusions: Predictive data mining is becoming an essential instrument for researchers and clinical practitioners in medicine. Understanding the main issues underlying these methods and applying agreed and standardized procedures is mandatory for their deployment and for the dissemination of results. Thanks to the integration of molecular and clinical data taking place within genomic medicine, the area has recently gained not only a fresh impulse but also a new set of complex problems it needs to address.
(c) 2006 Elsevier Ireland Ltd. All rights reserved.
Pages: 81 - 97
Number of pages: 17
Related papers
110 in total
  • [1] Knowledge management in healthcare: towards 'knowledge-driven' decision-support services
    Abidi, SSR
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2001, 63 (1-2) : 5 - 18
  • [2] Adam BL, 2002, CANCER RES, V62, P3609
  • [3] MEDICAL EXPERT SYSTEMS BASED ON CAUSAL PROBABILISTIC NETWORKS
    ANDREASSEN, S
    JENSEN, FV
    OLESEN, KG
    [J]. INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1991, 28 (1-2) : 1 - 30
  • [4] Predicting recovery in patients suffering from traumatic brain injury by using admission variables and physiological data: a comparison between decision tree analysis and logistic regression
    Andrews, PJD
    Sleeman, DH
    Statham, PFX
    McQuatt, A
    Corruble, V
    Jones, PA
    Howells, TP
    Macmillan, CSA
    [J]. JOURNAL OF NEUROSURGERY, 2002, 97 (02) : 326 - 336
  • [5] [Anonymous], C4 5 PROGR MACHINE L
  • [6] Aronson AR, 2001, J AM MED INFORM ASSN, P17
  • [7] Barbarini N, 2006, AMIA Annu Symp Proc, P26
  • [8] BECK JR, 1986, ARCH PATHOL LAB MED, V110, P13
  • [9] Bellazzi R, 2001, METHOD INFORM MED, V40, P362
  • [10] BELLAZZI R, 1998, WORKSH INT DAT AN ME, P2