Aiding classification of gene expression data with feature selection: a comparative study


Autoria(s): Shen, Qiang; Shang, Changjing
Contribuinte(s)

Department of Computer Science

Advanced Reasoning Group

Data(s)

24/01/2008

24/01/2008

2006

Resumo

C. Shang and Q. Shen. Aiding classification of gene expression data with feature selection: a comparative study. Computational Intelligence Research, 1(1):68-76.

This paper presents an application of supervised machine learning approaches to the classification of the yeast S. cerevisiae gene expression data. Established feature selection techniques based on information gain ranking and principal component analysis are, for the first time, applied to this data set to support learning and classification. Different classifiers are implemented to investigate the impact of combining feature selection and classification methods. Learning classifiers implemented include K-Nearest Neighbours (KNN), Naive Bayes and Decision Trees. Results of comparative studies are provided, demonstrating that effective feature selection is essential to the development of classifiers intended for use in highdimension domains. In particular, amongst a large corpus of systematic experiments carried out, best classification performance is achieved using a subset of features chosen via information gain ranking for KNN and Naive Bayes classifiers. Naive Bayes may also perform accurately with a relatively small set of linearly transformed principal features in classifying this difficult data set. This research also shows that feature selection helps increase computational efficiency while improving classification accuracy.

Peer reviewed

Formato

9

Identificador

Shen , Q & Shang , C 2006 , ' Aiding classification of gene expression data with feature selection: a comparative study ' Journal of Computational Intelligence Research (IJCIR) , pp. 68-76 .

0973-1873

PURE: 74345

PURE UUID: 05f662d0-2ed2-48b9-a57a-56a5859e4332

dspace: 2160/472

http://hdl.handle.net/2160/472

Idioma(s)

eng

Relação

Journal of Computational Intelligence Research (IJCIR)

Tipo

/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/article

Direitos