data anonymization

SATYAM AGARWALA DEVELOPER

Upload: satyam-agarwala

Post on 24-May-2015


TRANSCRIPT

Page 1: Data anonymization


SATYAM AGARWALA

DEVELOPER

Page 2: Data anonymization

DATA ANONYMIZATION

Page 3: Data anonymization

Why do we need data?

Page 4: Data anonymization

What is data anonymization?

Page 5: Data anonymization

Why anonymize data?

Page 6: Data anonymization
Page 7: Data anonymization
Page 8: Data anonymization

How do we anonymize data?

Page 9: Data anonymization

https://github.com/sunitparekh/data-anonymization

Sunit Parekh, Satyam Agarwala

Page 10: Data anonymization

BLACKLIST

You choose which attributes to anonymize.

first name : last name : address : zipcode : handphone : birth date
Satyam : Agarwala : 87B Amoy Street : 069906 : 8765 4321 : 01/01/1945
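A minimal sketch of the blacklist idea in plain Python, using the sample record from this slide. It is not the API of the data-anonymization gem: the `scramble` and `anonymize_blacklist` helpers are hypothetical names, and the choice of blacklisted fields is an assumption made for illustration.

```python
import random
import string

def scramble(value, rng):
    """Replace letters and digits with random ones; keep length and punctuation."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append(rng.choice(string.digits))
        elif ch.isalpha():
            out.append(rng.choice(string.ascii_letters))
        else:
            out.append(ch)
    return "".join(out)

def anonymize_blacklist(record, blacklist, rng):
    """Return a copy of record in which ONLY the blacklisted fields are scrambled."""
    return {
        field: scramble(value, rng) if field in blacklist else value
        for field, value in record.items()
    }

record = {
    "first name": "Satyam",
    "last name": "Agarwala",
    "address": "87B Amoy Street",
    "zipcode": "069906",
    "handphone": "8765 4321",
    "birth date": "01/01/1945",
}

# Assumption: these three fields are the sensitive ones.
blacklist = {"last name", "address", "zipcode"}
anonymized = anonymize_blacklist(record, blacklist, random.Random(0))
```

Every field that is not named in the blacklist passes through untouched, so a newly added sensitive column stays exposed until someone remembers to list it.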

Page 11: Data anonymization

WHITELIST

You choose which attributes NOT to anonymize.

first name : last name : address : zipcode : handphone : birth date
Satyam : Woodward : 10 Downing Street : 123456 : 8765 4321 : 01/01/1945
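A minimal sketch of the whitelist idea in plain Python. It is not the API of the data-anonymization gem: `scramble_value` and `anonymize_whitelist` are hypothetical names, and the whitelist is inferred from the fields that appear unchanged in the row above.

```python
import random
import string

def scramble_value(value, rng):
    """Replace letters and digits with random ones; keep length and punctuation."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append(rng.choice(string.digits))
        elif ch.isalpha():
            out.append(rng.choice(string.ascii_letters))
        else:
            out.append(ch)
    return "".join(out)

def anonymize_whitelist(record, whitelist, rng):
    """Return a copy of record in which every field NOT whitelisted is scrambled."""
    return {
        field: value if field in whitelist else scramble_value(value, rng)
        for field, value in record.items()
    }

original = {
    "first name": "Satyam",
    "last name": "Agarwala",
    "address": "87B Amoy Street",
    "zipcode": "069906",
    "handphone": "8765 4321",
    "birth date": "01/01/1945",
}

# Assumption: these are the fields left untouched in the slide's example row.
whitelist = {"first name", "handphone", "birth date"}
whitelisted = anonymize_whitelist(original, whitelist, random.Random(0))
```

Here the default is reversed: anything not explicitly whitelisted is anonymized, so a newly added column is protected without any change to the script.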

Page 12: Data anonymization

Show me!

Page 13: Data anonymization

SUMMARY

Script (DSL, strategies, parallelization)

ORM (RDBMS, NoSQL)

source DB -> destination DB
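The source-to-destination flow can be sketched with Python's built-in `sqlite3` module. This is an illustration only, not how the tool is implemented: the `users` table, its columns, the `copy_anonymized` helper, and the `column_rownumber` replacement scheme are all assumptions.

```python
import sqlite3

def copy_anonymized(source, destination, table, columns, keep):
    """Copy rows from source to destination; columns not in `keep` are replaced."""
    col_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    rows = source.execute(f"SELECT {col_list} FROM {table}")
    for row_number, row in enumerate(rows, start=1):
        # Whitelisted columns are copied as-is; the rest get a synthetic value.
        masked = [
            value if col in keep else f"{col}_{row_number}"
            for col, value in zip(columns, row)
        ]
        destination.execute(
            f"INSERT INTO {table} ({col_list}) VALUES ({placeholders})", masked
        )
    destination.commit()

# Two in-memory databases stand in for the source DB and the destination DB.
source = sqlite3.connect(":memory:")
destination = sqlite3.connect(":memory:")
for db in (source, destination):
    db.execute(
        "CREATE TABLE users (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
    )
source.execute("INSERT INTO users VALUES (1, 'Satyam', 'Agarwala')")
source.commit()

copy_anonymized(
    source, destination, "users",
    ["id", "first_name", "last_name"],
    keep={"id", "first_name"},
)
result = destination.execute(
    "SELECT id, first_name, last_name FROM users"
).fetchall()
```

The source database is only read, so the original data stays intact and the destination never holds the sensitive values.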

Page 14: Data anonymization

GOTCHAS

FK CONSTRAINTS: Disable foreign key checks globally before running the script.

UNIQUE CONSTRAINTS: Whitelist attributes that must be unique, or use a sequential, non-random strategy for them.
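A minimal sketch of a sequential, non-random strategy for a column under a unique constraint. The `sequential_emails` generator, the `email` column, and the sample rows are hypothetical; the point is only that a counter cannot produce two identical values, whereas a random generator eventually can.

```python
import itertools

def sequential_emails(prefix="user", domain="example.com"):
    """Yield user1@example.com, user2@example.com, ... without ever repeating."""
    for n in itertools.count(1):
        yield f"{prefix}{n}@{domain}"

# Hypothetical rows with a unique `email` column.
rows = [
    {"id": 1, "email": "a@real.org"},
    {"id": 2, "email": "b@real.org"},
    {"id": 3, "email": "c@real.org"},
]

emails = sequential_emails()
unique_rows = [{**row, "email": next(emails)} for row in rows]
```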

Page 15: Data anonymization

Are there other ways to anonymize data?

Page 16: Data anonymization

FORMAL APPROACH

k-anonymity

l-diversity

t-closeness

δ-presence
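To make the first two terms concrete: a table is k-anonymous when every combination of quasi-identifier values is shared by at least k records, and (in its simplest "distinct" form) l-diverse when every such group contains at least l different sensitive values. The sketch below measures both on a small made-up table; the records, column names, and function names are illustrative assumptions.

```python
from collections import Counter, defaultdict

def k_anonymity(records, quasi_identifiers):
    """Size of the smallest group of records sharing the same quasi-identifiers.

    Assumes `records` is non-empty.
    """
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return min(groups.values())

def l_diversity(records, quasi_identifiers, sensitive):
    """Smallest number of distinct sensitive values found in any group.

    Assumes `records` is non-empty.
    """
    groups = defaultdict(set)
    for r in records:
        groups[tuple(r[q] for q in quasi_identifiers)].add(r[sensitive])
    return min(len(values) for values in groups.values())

# Made-up, already generalized records (illustration only).
records = [
    {"zipcode": "0699**", "birth year": "194*", "condition": "flu"},
    {"zipcode": "0699**", "birth year": "194*", "condition": "asthma"},
    {"zipcode": "1234**", "birth year": "195*", "condition": "flu"},
    {"zipcode": "1234**", "birth year": "195*", "condition": "flu"},
]

k = k_anonymity(records, ["zipcode", "birth year"])
l = l_diversity(records, ["zipcode", "birth year"], "condition")
```

In this table k is 2 but l is 1: the second group is indistinguishable on its quasi-identifiers, yet everyone in it has the same condition, so the sensitive value still leaks.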

Page 17: Data anonymization

ALTERNATIVE TOOLS

ARX: https://github.com/arx-deidentifier/arx

Page 18: Data anonymization

THANK YOU!