A Method for Mining Infrequent Causal Associations and Its Application in Finding Adverse Drug Reaction Signal Pairs

被引:31
作者
Ji, Yanqing [1 ]
Ying, Hao [2 ]
Tran, John [3 ]
Dews, Peter [4 ]
Mansour, Ayman [2 ]
Massanari, R. Michael [5 ]
机构
[1] Gonzaga Univ, Dept Elect & Comp Engn, Spokane, WA 99258 USA
[2] Wayne State Univ, Dept Elect & Comp Engn, Detroit, MI 48202 USA
[3] Spokane Mental Hlth, Spokane, WA 99202 USA
[4] St Mary Mercy Hosp Trinity Hlth, Dept Med, Livonia, MI 48154 USA
[5] Crit Junctures Inst, Bellingham, WA 98225 USA
基金
美国国家卫生研究院;
关键词
Adverse drug reactions; association rules; data mining algorithms; interestingness measure; recognition primed decision model; ALGORITHM; RULES; PHARMACOVIGILANCE; GENERATION; MODELS;
D O I
10.1109/TKDE.2012.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many real-world applications, it is important to mine causal relationships where an event or event pattern causes certain outcomes with low probability. Discovering this kind of causal relationships can help us prevent or correct negative outcomes caused by their antecedents. In this paper, we propose an innovative data mining framework and apply it to mine potential causal associations in electronic patient data sets where the drug-related events of interest occur infrequently. Specifically, we created a novel interestingness measure, exclusive causal-leverage, based on a computational, fuzzy recognition-primed decision (RPD) model that we previously developed. On the basis of this new measure, a data mining algorithm was developed to mine the causal relationship between drugs and their associated adverse drug reactions (ADRs). The algorithm was tested on real patient data retrieved from the Veterans Affairs Medical Center in Detroit, Michigan. The retrieved data included 16,206 patients (15,605 male, 601 female). The exclusive causal-leverage was employed to rank the potential causal associations between each of the three selected drugs (i.e., enalapril, pravastatin, and rosuvastatin) and 3,954 recorded symptoms, each of which corresponded to a potential ADR. The top 10 drug-symptom pairs for each drug were evaluated by the physicians on our project team. The numbers of symptoms considered as likely real ADRs for enalapril, pravastatin, and rosuvastatin were 8, 7, and 6, respectively. These preliminary results indicate the usefulness of our method in finding potential ADR signal pairs for further analysis (e. g., epidemiology study) and investigation (e. g., case review) by drug safety professionals.
引用
收藏
页码:721 / 733
页数:13
相关论文
共 56 条
[1]   Rare itemset mining [J].
Adda, Mehdi ;
Wu, Lei ;
Feng, Yi .
ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, :73-+
[2]  
Agrawal R, 2000, P 20 INT C VER LARG
[3]  
[Anonymous], 2009, CAUSALITY MODELS REA
[4]   FUZZY PATTERN-MATCHING [J].
CAYROL, M ;
FARRENY, H ;
PRADE, H .
KYBERNETES, 1982, 11 (02) :103-116
[5]   A simple constraint-based algorithm for efficiently mining observational databases for causal relationships [J].
Cooper, GF .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (02) :203-224
[6]  
DuMouchel W, 1999, AM STAT, V53, P177, DOI 10.2307/2686093
[7]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[8]   Adverse drug reactions: definitions, diagnosis, and management [J].
Edwards, IR ;
Aronson, JK .
LANCET, 2000, 356 (9237) :1255-1259
[9]   Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports [J].
Evans, SJW ;
Waller, PC ;
Davis, S .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2001, 10 (06) :483-486
[10]  
Fink G.A., 2010, MARKOV MODELS PATTER