Privacy aware K-means clustering with high utility


Autoria(s): Nguyen, Thanh Dai; Gupta, Sunil; Rana, Santu; Venkatesh, Svetha
Contribuinte(s)

Bailey, James

Khan, Latifur

Washio, Takashi

Dobbie, Gillian

Huang, Joshua Zhexue

Wang, Ruili

Data(s)

01/01/2016

Resumo

Privacy-preserving data mining aims to keep data safe, yet useful. But algorithms providing strong guarantees often end up with low utility. We propose a novel privacy preserving framework that thwarts an adversary from inferring an unknown data point by ensuring that the estimation error is almost invariant to the inclusion/exclusion of the data point. By focusing directly on the estimation error of the data point, our framework is able to significantly lower the perturbation required. We use this framework to propose a new privacy aware K-means clustering algorithm. Using both synthetic and real datasets, we demonstrate that the utility of this algorithm is almost equal to that of the unperturbed K-means, and at strict privacy levels, almost twice as good as compared to the differential privacy counterpart.

Identificador

http://hdl.handle.net/10536/DRO/DU:30083253

Idioma(s)

eng

Publicador

Springer

Relação

http://dro.deakin.edu.au/eserv/DU:30083253/gupta-privacyaware-evid-2016.pdf

http://www.dx.doi.org/10.1007/978-3-319-31750-2_31

Direitos

2016, Springer

Tipo

Book Chapter