30
Mondrian Mul+dimensional K‐ Anonymity Kristen Lefevre, David J. DeWi<, and Raghu Ramakrishnan George W. Boulos [email protected] October 21 2009

Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

MondrianMul+dimensionalK‐Anonymity

KristenLefevre,DavidJ.DeWi<,andRaghuRamakrishnan

[email protected]

Page 2: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

TableLinking

Page 3: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Overview

•  Mo+va+on&contribu+ons•  Terminology

•  QualityMetrics

•  Mul+dimensionalK‐Anonymiza+on

•  GreedyPar++oningAlgorithm

•  Performanceexperiments

Page 4: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Mo+va+on

•  Protectthedataownersprivacyusingk‐anonymoustables.

•  Achievehigher‐qualityofanonymzeddata.

•  Provideanalgorithmforanonymizingtables.

Theprimarygoalofk‐anonymiza3onistoprotecttheprivacyoftheindividualstowhomthedatapertains.However,subjecttothisconstraint,itis

importantthatthereleaseddataremainas“useful”aspossible.

Page 5: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Contribu+ons

•  Introducingmul+dimensionalk‐anymiza+on.•  IntroducingagreedyalgorithmforK‐anonymiza+on:

•  moreefficientthanproposedop0malk‐anonymiza0onalgorithmsforsingle‐dimensionalmodels;complexityO(nlogn),comparedtoexponen0al.•  Thegreedymul0dimensionalalgorithmoAenproduceshigher‐qualityresultsthanop0malsingledimensionalalgorithms.

•  Moretargetedno0onofqualitymeasurement.

Page 6: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Terminology

•  QuasiIdenAfier:Minimalsetofa<ributesX1,…XdintableTthatcanbejoinedwithexternalinforma+ontore‐iden+fyindividualrecords.

•  Equivalenceclass:thesetofalltuplesinTcontainingiden0calvalues(x1…xd)forX1…Xd.

•  K‐AnonymityProperty:TableTisk‐anonymouswithrespecttoa<ributesX1…Xdifeveryuniquetuple(x1…xd)inthe(mul0set)projec0onofTonX1…Xdoccursatleastk0mes.

•  K‐AnonymizaAon:AviewVofrela0onTissaidtobeak‐anonymiza0oniftheviewmodifiesorgeneralizesthedataofTaccordingtosomemodelsuchthatVisk‐anonymouswithrespecttothequasi‐iden+fier.

Page 7: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

GeneralQualityMetrics

•  DiscernabilityMetric:

•  NormalizedAverageEquivalence:

Page 8: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

K‐anonymiza+on

•  globalrecoding:achievesanonymitybymappingthedomainsofthequasi‐iden+fiera<ributestogeneralizedoralteredvalues.

Page 9: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

SingleVS.Mul+dimensionalK‐Anonymiza+on

•  Single‐dimensional:Asingle‐dimensionalpar++oningdefines,foreachXi,asetofnon‐overlappingsingle‐dimensionalintervalsthatcoverDxi.øimapseachxЄDxitosumsummarysta0s0c.

•  Mul0‐dimensional:

Aglobalrecodingachievesanonymitybymappingthedomainsofthequasi‐iden+fiera<ributestogeneralizedoralteredvalues.Øi:Dxix…xDxn→D’

Page 10: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

SingleVS.Mul+dimensionalK‐Anonymiza+on(Cont.)

Page 11: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Single‐dimensionalPar++oning

•  Asingle‐dimensionalpar++oningdefines,foreachXi,asetofnon‐overlappingsingledimensionalintervalsthatcoverDxi.ФimapseachxЄDxitosomesummarysta0s0cfortheintervalinwhichitiscontained.

Page 12: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

StrictMul+dimensionalPar++oning

•  Astrictmul+dimensionalpar++oningdefinesasetofnon‐overlappingmul+dimensionalregionsthatcoverDX1…DXd.Ømapseachtuple(x1…xd)2DX1…DXdtoasummarysta+s+cfortheregioninwhichitiscontained.

•  Proposi3on1:Everysingle‐dimensionalpar00oningforquasi‐iden0fieraWributesX1…Xdcanbeexpressedasastrictmul0dimensionalpar00oning.

Page 13: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

StrictMul+dimensionalPar++oning(Cont.)

NP‐Hard

Page 14: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Single‐dimensionalpar++oningvs.mul+dimensional

•  Proposi+on1:Everysingle‐dimensionalpar00oningforquasi‐iden0fieraWributesX1…Xdcanbeexpressedasastrictmul0dimensionalpar00oning.However,whend>=2andforalli,|Dxi|>=2,thereexistsastrictmul0dimensionalpar00oningthatcannotbeexpressedasasingledimensionalpar00oning.

Page 15: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

DecisionalK‐AnonymousMul+dimensionalPar++oning

•  GivenasetPofunique(point,count)pairs,withpointsind‐dimensionalspace,foreveryresul0ngmul+dimensionalregionRi:–  OR– 

NP‐Complete

Page 16: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

AllowableCut

•  Mul+dimensional:AcutperpendiculartoaxisXiatxiisallowableifandonlyifCount(P.Xi>xi)>=kandCount(P.Xi<xi)>=k.

•  Single‐Dimensional:Asingle‐dimensionalcutperpendiculartoXiatxiisallowable,givenS,if

Page 17: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

MinimalPar++oning

•  MinimalStrictMul+dimensionalPar++oning:•  LetR1…Rndenoteasetofregionsinducedbyastrictmul0dimensionalpar++oning,andleteachregionRicontainmul+setPiofpoints.Thismul0dimensionalpar00oningisminimalifandthereexistsnoallowablemul+dimensionalcutforPi.

•  MinimalSingle‐DimensionalPar00oning:•  AsetSofallowablesingle‐dimensionalcutsisaminimalsingle‐dimensionalpar++oningformul+setPofpointsiftheredoesnotexistanallowablesingle‐dimensionalcutforPgivenS.

Page 18: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

BoundsonPar++onsizeinMul+dimensionalK‐Anonymiza+on

Page 19: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

BoundsonPar++onsizeinSingle‐DimensionalK‐anonymiza+on

<=2k‐1

Page 20: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

RelaxedMul+dimensionalPar++oning

•  Arelaxedmul+dimensionalpar++oningforrela+onTdefinesasetof(poten+allyoverlapping)dis0nctmul0dimensionalregionsthatcoverDX1…DXd.Localrecodingfunc0onФ’mapseachtuple(x1…xd)ЄTtoasummarysta0s0cforoneoftheregionsinwhichitiscontained.

•  Proposi0on2:Everystrictmul0dimensionalpar00oningcanbeexpressedasarelaxedmul0dimensionalpar00oning.However,ifthereareatleasttwotuplesintableThavingthesamevectorofquasi‐iden0fiervalues,thereexistsarelaxedmul0dimensionalpar00oningthatcannotbeexpressedasastrictmul0dimensionalpar00oning.

Page 21: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

GreedyPar++oningAlgorithm

Choosethedimensionwiththewidestrangeofvalues

Page 22: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

BoundsonQuality

Page 23: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

ScalabilityProblem

Tablemaybetoolargetofitintheavailablememory

Calculatethefrequencysetofa<ributesandloadonlythefrequencysetInmemory.

Page 24: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

WorkloadDrivenQuality

•  RangeSta+s+cs:–  SelectAvg(Age)FromPa+entswheresex=‘male’

•  MeanSta+s+cs–  Selectcount(*)FromPa+entswheresex=‘male’andage<=26

Itisimpossibletoanswerthesecondquerypreciselyusingthesingle‐dimensionalrecoding.

Page 25: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

ExperimentalEvalua+on

•  Usedasynthe+cdatageneratortoproducetwodiscretejointdistribu+ons:discreteuniformanddiscretenormal.

•  Alsotestedonadultsdatabase.

Page 26: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

ExperimentalEvalua+onforSynthe+cdata

Page 27: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

ExperimentalEvalua+onforAdultsDatabase

Page 28: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Op+malsingle‐dimensionalvs.Greedystrictmul+dimensionalpar++oning

Page 29: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Strengthsvs.Weaknesses

•  Definestheprocessofk‐anonymityinalargerandmoreaccurateconcept.

•  Mul+dimensionalapproachmakesuretoincludeminimalpointsinapar++onsotheoutputdataisbe<er.

AnyWeaknesses?

Page 30: Kristen Lefevre, David J. DeWi, and Raghu Ramakrishnanadamlee/courses/cs3525/2009fa/lectures/mondrian.pdfMondrian Muldimensional K‐ Anonymity Kristen Lefevre, David J. DeWi, and

Q&A

Thankyou