Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches


Autoria(s): Jensen, Richard; Shen, Qiang
Contribuinte(s)

Department of Computer Science

Advanced Reasoning Group

Data(s)

15/11/2007

15/11/2007

2004

Resumo

R. Jensen and Q. Shen. Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough Based Approaches. IEEE Transactions on Knowledge and Data Engineering, 16(12): 1457-1471. 2004.

Semantics-preserving dimensionality reduction refers to the problem of selecting those input features that are most predictive of a given outcome; a problem encountered in many areas such as machine learning, pattern recognition, and signal processing. This has found successful application in tasks that involve data sets containing huge numbers of features (in the order of tens of thousands), which would be impossible to process further. Recent examples include text processing and Web content classification. One of the many successful applications of rough set theory has been to this feature selection area. This paper reviews those techniques that preserve the underlying semantics of the data, using crisp and fuzzy rough set-based methodologies. Several approaches to feature selection based on rough set theory are experimentally compared. Additionally, a new area in feature selection, feature grouping, is highlighted and a rough set-based feature grouping technique is detailed.

Peer reviewed

Formato

15

Identificador

Jensen , R & Shen , Q 2004 , ' Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches ' IEEE Transactions on Knowledge and Data Engineering , vol 16 , no. 12 , pp. 1457-1471 . DOI: 10.1109/TKDE.2004.96

1041-4347

PURE: 73154

PURE UUID: 4ed9d299-942e-46b3-a331-318eff0ab5dc

dspace: 2160/358

http://hdl.handle.net/2160/358

http://dx.doi.org/10.1109/TKDE.2004.96

Idioma(s)

eng

Relação

IEEE Transactions on Knowledge and Data Engineering

Tipo

/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/article

Direitos