Robust traffic classification with mislabelled training samples


Autoria(s): Wang, Binfeng; Zhang, Jun; Zhang, Zili; Luo, Wei; Xia, Dawen
Contribuinte(s)

[Unknown]

Data(s)

01/01/2015

Resumo

Traffic classification plays the significant role in the network security and management. However, accurate classification is challenging if the training data is contaminated with unclean traffic. Recent researches often assume clean training data, and hence performance reduced on real-time network traffic. To meet this challenge, in this paper, we propose a robust method, Unclean Traffic Classification (UTC), which incorporates noise elimination and suspected noise reweighting. Firstly, UTC eliminates strong noisy training data identified by a consensus filtering with multiple classifiers. Furthermore, UTC estimates the relevance of remaining training data and learns a robust traffic classifier. Through a number of experiments on a real-world traffic dataset, we show that the new method outperforms existing state-of-the-art traffic classification methods, under the extremely difficult circumstance with unclean training data.

Identificador

http://hdl.handle.net/10536/DRO/DU:30084259

Idioma(s)

eng

Publicador

IEEE

Relação

http://dro.deakin.edu.au/eserv/DU:30084259/zhang-robusttraffic-2015.pdf

http://dro.deakin.edu.au/eserv/DU:30084259/zhang-robusttraffic-evid-2015.pdf

http://www.dx.doi.org/10.1109/ICPADS.2015.49

Direitos

2015, IEEE

Palavras-Chave #traffic classification #unclean internet data #machine learning #random forest #network security
Tipo

Conference Paper