spacer
School of Computer Science The University of Adelaide Australia
Computer Science Home
About the School
News
Current Students
Future Students
International Students
Business & Industry
Visitors
Staff
Programs
Courses
Research
Facilities
Seminars
Occupational Health & Safety
Staff Only
text zoom: S | M | L

School of Computer Science
Plaza Building
THE UNIVERSITY OF ADELAIDE
SA 5005
AUSTRALIA
Email

Telephone: +61 8 8303 5586
Facsimile: +61 8 8303 4366


You are here: Computer Science > Seminars

Computer Science Seminars

 

Equi-width Data Swapping for Private Data Publication

Yidong LI, Computer Science, University of Adelaide

Wed, 4 Nov, 2009, 10:10

Lecture Room 2060, Computer Science, Plaza Building

Abstract

Data Swapping is a popular value-invariant data perturbation technique. The quality of a data swapping method is measured by how well it preserves data privacy and data utility. As swapping data globally is computationally impractical, to guarantee its performance in these metrics appropriate, localization schemes are often conducted in advance. Equi-depth partitioning is preferred by most of the existing data perturbation techniques as it provides uniform privacy protection for each data tuple. However, this method performs ineffectively for two types of applications: one is to maintain statistics based on equi-width partitioning, such as the multivariate histogram with equal bin width, and the other is to preserve parametric statistics, such as covariance, in the context of sparse data with non-uniform distribution.

As a natural solution for the above application, this paper explores the possibility of using data swapping with equi-width partitioning for private data publication, which has been little used in data perturbation due to the difficulty of preserving data privacy. With extensive theoretical analysis and experimental results, we show that, Equi-Width Swapping (EWS) can achieve a similar performance in privacy preservation to that of Equi-Depth Swapping (EDS) if the number of partitions is sufficiently large (e.g. >=N, where N is the size of dataset). Our experimental results in both synthetic and real-world data validate our theoretical analysis.


[ Seminars Home | List seminars for: '01 '02 '03 '04 '05 '06 '07 '08 '09 | Email Notification | lockAdminister Future Seminars ]