Host-based P2P flow identification and use in real-time


Autoria(s): Garcia-Palacios, Emi; Hurley, John; Sezer, Sakir
Data(s)

01/05/2011

Resumo

Data identification is a key task for any Internet Service Provider (ISP) or network administrator. As port fluctuation and encryption become more common in P2P traffic wishing to avoid identification, new strategies must be developed to detect and classify such flows. This paper introduces a new method of separating P2P and standard web traffic that can be applied as part of a data mining process, based on the activity of the hosts on the network. Unlike other research, our method is aimed at classifying individual flows rather than just identifying P2P hosts or ports. Heuristics are analysed and a classification system proposed. The accuracy of the system is then tested using real network traffic from a core internet router showing over 99% accuracy in some cases. We expand on this proposed strategy to investigate its application to real-time, early classification problems. New proposals are made and the results of real-time experiments compared to those obtained in the data mining research. To the best of our knowledge this is the first research to use host based flow identification to determine a flows application within the early stages of the connection.

Identificador

http://pure.qub.ac.uk/portal/en/publications/hostbased-p2p-flow-identification-and-use-in-realtime(6772d1e2-6685-44e0-8519-5c6939f5b4a1).html

http://dx.doi.org/10.1145/1961659.1961661

http://www.scopus.com/inward/record.url?scp=80052074576&partnerID=8YFLogxK

Idioma(s)

eng

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Garcia-Palacios , E , Hurley , J & Sezer , S 2011 , ' Host-based P2P flow identification and use in real-time ' ACM Transactions on the Web , vol 5 , no. 2 , 7 . DOI: 10.1145/1961659.1961661

Palavras-Chave #/dk/atira/pure/subjectarea/asjc/1700/1705 #Computer Networks and Communications
Tipo

article