Toward the accurate identification of network applications

被引:342
作者
Moore, AW [1 ]
Papagiannaki, K [1 ]
机构
[1] Univ Cambridge, Cambridge CB2 1TN, England
来源
PASSIVE AND ACTIVE NETWORK MEASUREMENT, PROCEEDINGS | 2005年 / 3431卷
关键词
D O I
10.1007/978-3-540-31966-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Well-known port numbers can no longer be used to reliably identify network applications. There is a variety of new Internet applications that either do not use well-known port numbers or use other protocols, such as HTTP, as wrappers in order to go through firewalls without being blocked. One consequence of this is that a simple inspection of the port numbers used by flows may lead to the inaccurate classification of network traffic. In this work, we look at these inaccuracies in detail. Using a full payload packet trace collected from an Internet site we attempt to identify the types of errors that may result from port-based classification and quantify them for the specific trace under study. To address this question we devise a classification methodology that relies on the full packet payload. We describe the building blocks of this methodology and elaborate on the complications that arise in that context. A classification technique approaching 100% accuracy proves to be a labor-intensive process that needs to test flow-characteristics against multiple classification criteria in order to gain sufficient confidence in the nature of the causal application. Nevertheless, the benefits gained from a content-based classification approach are evident. We are capable of accurately classifying what would be otherwise classified as unknown as well as identifying traffic flows that could otherwise be classified incorrectly. Our work opens up multiple research issues that we intend to address in future work.
引用
收藏
页码:41 / 54
页数:14
相关论文
共 8 条
[1]  
[Anonymous], P LISA 2001 15 SYST
[2]  
[Anonymous], PASS ACT MEAS WORKSH
[3]  
CHOI T, 2004, IEEE IFIP NETW OP MA
[4]   Packet-level traffic measurements from the Sprint IP backbone [J].
Fraleigh, C ;
Moon, S ;
Lyles, B ;
Cotton, C ;
Khan, M ;
Moll, D ;
Rockell, R ;
Seely, T ;
Diot, C .
IEEE NETWORK, 2003, 17 (06) :6-16
[5]  
Logg C., 2003, Characterization of the traffic between SLAC and the internet
[6]  
MOORE A, 2005, DISCRETE CONTENT BAS
[7]  
Orebaugh A.D., 2004, ETHEREAL PACKET SNIF
[8]  
ROESCH M, 1999, USENIX 13 SYST ADM C