Contrasting temporal trend discovery for large healthcare databases

被引:17
作者
Hrovat, Goran [1 ]
Stiglic, Gregor [1 ,2 ]
Kokol, Peter [1 ,2 ]
Ojstersek, Milan [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Maribor 2000, Slovenia
[2] Univ Maribor, Fac Hlth Sci, Maribor 2000, Slovenia
关键词
Data mining; Decision support; Trend discovery; ASSOCIATION RULES; TESTS;
D O I
10.1016/j.cmpb.2013.09.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the increased acceptance of electronic health records, we can observe the increasing interest in the application of data mining approaches within this field. This study introduces a novel approach for exploring and comparing temporal trends within different in-patient subgroups, which is based on associated rule mining using Apriori algorithm and linear model-based recursive partitioning. The Nationwide Inpatient Sample (NIS), Healthcare Cost and Utilization Project (HCUP), Agency for Healthcare Research and Quality was used to evaluate the proposed approach. This study presents a novel approach where visual analytics on big data is used for trend discovery in form of a regression tree with scatter plots in the leaves of the tree. The trend lines are used for directly comparing linear trends within a specified time frame. Our results demonstrate the existence of opposite trends in relation to age and sex based subgroups that would be impossible to discover using traditional trend-tracking techniques. Such an approach can be employed regarding decision support applications for policy makers when organizing campaigns or by hospital management for observing trends that cannot be directly discovered using traditional analytical techniques. (C) 2013 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:251 / 257
页数:7
相关论文
共 30 条
[1]   PRIM versus CART in subgroup discovery: When patience is harmful [J].
Abu-Hanna, Ameen ;
Nannings, Barry ;
Dongelmans, Dave ;
Hasman, Arie .
JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (05) :701-708
[2]   C-TREND: Temporal cluster graphs for identifying and visualizing trends in multiattribute transactional data [J].
Adomavicius, Gediminas ;
Bockstedt, Jesse .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (06) :721-735
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]  
Agrawal R., P 20 INT C VERY LARG
[5]   TESTS FOR PARAMETER INSTABILITY AND STRUCTURAL-CHANGE WITH UNKNOWN CHANGE-POINT [J].
ANDREWS, DWK .
ECONOMETRICA, 1993, 61 (04) :821-856
[6]  
[Anonymous], 2004, R LANG ENV STAT COMP
[7]  
Bayardo R.J., 1999, Proc. of the Fifth ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, P145, DOI [DOI 10.1145/312129.312219, 10.1145/3121312219]
[8]  
Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
[9]   A COMMUNITY-WIDE PERSPECTIVE OF GENDER DIFFERENCES AND TEMPORAL TRENDS IN THE USE OF DIAGNOSTIC AND REVASCULARIZATION PROCEDURES FOR ACUTE MYOCARDIAL-INFARCTION [J].
CHIRIBOGA, DE ;
YARZEBSKI, J ;
GOLDBERG, RJ ;
CHEN, ZY ;
GURWITZ, J ;
GORE, JM ;
ALPERT, JS ;
DALEN, JE .
AMERICAN JOURNAL OF CARDIOLOGY, 1993, 71 (04) :268-273
[10]  
Frederick P.D., 2012, JOURNAL OF THE AMERI, V307, P813