SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection


Autoria(s): O'Kane, Philip; Sezer, Sakir; McLaughlin, Kieran; Gyu Im, Eul
Data(s)

01/03/2013

Resumo

N-gram analysis is an approach that investigates the structure of a program using bytes, characters, or text strings. A key issue with N-gram analysis is feature selection amidst the explosion of features that occurs when N is increased. The experiments within this paper represent programs as operational code (opcode) density histograms gained through dynamic analysis. A support vector machine is used to create a reference model, which is used to evaluate two methods of feature reduction, which are 'area of intersect' and 'subspace analysis using eigenvectors.' The findings show that the relationships between features are complex and simple statistics filtering approaches do not provide a viable approach. However, eigenvector subspace analysis produces a suitable filter.

Formato

application/pdf

Identificador

http://pure.qub.ac.uk/portal/en/publications/svm-training-phase-reduction-using-dataset-feature-filtering-for-malware-detection(f382a397-cfa6-439a-b2e5-f20a41bfafe3).html

http://dx.doi.org/10.1109/TIFS.2013.2242890

http://pure.qub.ac.uk/ws/files/1744242/IEEE_Transactions_on_Information_Forensics_and_Security_30July2012.pdf

Idioma(s)

eng

Direitos

info:eu-repo/semantics/openAccess

Fonte

O'Kane , P , Sezer , S , McLaughlin , K & Gyu Im , E 2013 , ' SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection ' IEEE Transactions on Information Forensics and Security , vol 8 , no. 3 , pp. 500-509 . DOI: 10.1109/TIFS.2013.2242890

Palavras-Chave #Obfuscation, Packers, Polymorphism, Metamorphism Malware, KNN, SVM
Tipo

article