Random-forests-based network intrusion detection systems

被引:338
作者
Zhang, Jiong [1 ]
Zulkernine, Mohammad [1 ]
Haque, Anwar [2 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
[2] Bell Canada, Network Planning Div, Hamilton, ON L8P 4S6, Canada
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2008年 / 38卷 / 05期
基金
加拿大自然科学与工程研究理事会;
关键词
computer network security; data mining; intrusion detection; random forests;
D O I
10.1109/TSMCC.2008.923876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Prevention of security breaches completely using the existing security technologies is unrealistic. As a result, intrusion detection is an important component in network security. However, many current intrusion detection systems (IDSs) are rule-based systems, which have limitations to detect novel intrusions. Moreover, encoding rules is time-consuming and highly depends on the knowledge of known intrusions. Therefore, we propose new systematic frameworks that apply a data mining algorithm called random forests in misuse, anomaly, and hybrid-network-based IDSs. In misuse detection, patterns of intrusions are built automatically by the random forests algorithm over training data. After that, intrusions are detected by matching network activities against the In anomaly detection, novel intrusions are detected by the patterns. outlier detection mechanism of the random forests algorithm. After building the patterns of network services by the random forests algorithm, outliers related to the patterns are determined by the outlier detection algorithm. The hybrid detection system improves the detection performance by combining the advantages of the misuse and anomaly detection. We evaluate our approaches over the Knowledge Discovery and Data Mining 1999 (KDD'99) dataset. The experimental results demonstrate that the performance provided by the proposed misuse approach is better than the best KDD'99 result; compared to other reported unsupervised anomaly detection approaches, our anomaly detection approach achieves higher detection rate when the false positive rate is low; and the presented hybrid system can improve the overall performance of the aforementioned IDSs.
引用
收藏
页码:649 / 659
页数:11
相关论文
共 39 条
[1]
Abraham T., 2001, DSTOGD0286 EL SURV R
[2]
ANDERSON D, 1995, SRICSL9507
[3]
[Anonymous], P IEEE WORKSH INF AS
[4]
[Anonymous], 2004, INTRUSION DETECTION
[5]
[Anonymous], P 3 ANN C PRIV SEC T
[6]
[Anonymous], 2000, PROC 23 NAT INF SYST
[7]
Barbara D, 2002, APPL DATA MINING COM
[8]
Barnett V., 1994, Outliers in Statistical Data, V3rd
[9]
BIVENS A, 2002, P ARTIF NEURAL NETW, V12, P527
[10]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32