33
Hide in Plain Sight: Enabling Mobile Applications and Data Analytics with Local Differential Privacy Li Xiong Department of Computer Science Department of Biomedical Informatics Emory University IEEE International Conference on Mobile Data Management Workshops/PhD Forum, 06/10/2019 * Work supported by National Science Foundation and Google Research Award

Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Hide in Plain Sight: Enabling Mobile

Applications and Data Analytics with

Local Differential Privacy

Li Xiong

Department of Computer Science

Department of Biomedical Informatics

Emory University

IEEE International Conference on Mobile Data Management

Workshops/PhD Forum, 06/10/2019

* Work supported by National Science Foundation and Google Research Award

Page 2: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

2Location data collected from individual devices

(Source: New York Times 12/2018)

Page 3: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Over 235 million locations captured from more than 1.2

million unique devices during a three-day period in 2017

(Source: New York Times 12/2018)

Page 4: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Who Knows What About Me? A Survey of Behind the Scenes Personal Data Sharing to Third Parties by Mobile Apps,

2015-10-30 https://techscience.org/a/2015103001/

33%/47% of Android/

iOS apps shared GPS

coordinates with third

parties

Location data sharing by iOS apps (left) to domains (right)

Page 5: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

5

Page 6: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

The Mobile Data Economy

Page 7: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Data Analytics with Centralized

Differential Privacy

Page 8: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enable Mobile Apps and Analytics with Local

Differential Privacy

Page 9: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Mobile Apps and Analytics with Local

Differential Privacy

• Background

• Local differential privacy

• Geo-indistinguishability (local d-privacy)

• Extended privacy notions

• Protecting dynamic locations (CCS15, VLDB17 demo)

• Protecting spatiotemporal events (ICDE19)

• New mobile applications

• Spatial crowdsourcing with geo-indistinguishability

(ICDE18)

• New mechanisms

• Supporting both analytics and mobile applications

(CNS19)

Page 10: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

• Privacy definition

• Any two locations produce “similar”

distributions (bounded by 𝜖)

• Mechanism

• Randomized response (with

encoding)

• Applications

• Simple analytics (e.g. frequency

estimation)

• Google, Apple, Microsoft

• Limitations

• Output not useful for mobile apps

Local Differential Privacy

Page 11: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

• Privacy Definition

• Any two locations at distance at

most 𝑟 produce “similar”

distributions proportional to the

distance (bounded by 𝜖 𝑟)

• Mechanism:

• Planar Laplace mechanism

• Applications

• Mobile apps/location sharing

• Limitations:

• Temporal correlations of dynamic

locations not considered

• Not optimal for analytics

Geo-Indistinguishability (Local d-privacy)

e 𝜖 d(x1,x2)

Page 12: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

• Generating random point z (from actual point

x ∈ X) according to planar Laplace distribution

Geo-Indistinguishability: Planar Laplace Mechanism

𝝐= log 6𝑟=1 km

Better privacy: 𝝐= log 2𝑟=1 km

Page 13: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Mobile Apps and Analytics with Local

Differential Privacy

• Background

• Local differential privacy

• Geo-indistinguishability (local d-privacy)

• Extended privacy notions

• Protecting dynamic locations (CCS15, VLDB17 demo)

• Protecting spatiotemporal events (ICDE19)

• New mobile applications

• Spatial crowdsourcing with geo-indistinguishability

(ICDE18)

• New mechanisms

• Supporting both analytics and mobile applications

(CNS19)

Page 14: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Location Privacy: Temporal Correlations

• Temporal correlations (adversary knowledge): moving patterns and

previously released perturbed locations

Page 15: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Differential Privacy with δ-location set

• δ-location set differential privacy

• Any two locations in the probable location set produce “similar”

distributions proportional to the distance (bounded by 𝜖)

• Probable location set determined by hidden Markov Model

Y. Xiao, L. Xiong. Protecting Locations with Differential Privacy under Temporal Correlations. CCS 2015

Y. Xiao, L. Xiong, S. Zhang, Y. Cao. LocLok: Location Cloaking with Differential

Privacy via Hidden Markov Model. VLDB demo, 2017

· · ·z1 z2 z3 ztobservable:

x1 x3unobservable: x2 x t· · ·

Page 16: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Optimal perturbation mechanism

• Minimize expected distance between perturbed location z and true

location x

• While satisfying constraint of differential privacy – any pair of

locations x1 and x2 are indistinguishable

• Exponential mechanism and Laplace

mechanism are not optimal

x

Page 17: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Planar Isotropic Mechanism

• Based on sensitivity hull K of δ-location set which

determines the lower bound error

• An improved K-norm mechanism based on Isotropic

transformation

• Achieves optimality while achieving differential privacy

Page 18: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Results: Perturbed Trace Illustration

Page 19: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Results: k-Nearest Neighbor Queries

Page 20: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

From Location Privacy to Spatiotemporal

Privacy

• Location privacy mechanisms protect location at a time

point

• May not protect spatiotemporal activities?

• Staying in hospital for 2 hours

• From home to office every morning

• Need formal notions and mechanisms

Yang Cao, Yonghui Xiao, Li Xiong, Liquan Bai. PriSTE: From Location Privacy to

Spatiotemporal Event Privacy (short paper). ICDE 2019

Page 21: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Spatiotemporal events

• Boolean expression for spatiotemporal event

• Location at a time point (ut = si)

Page 22: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

From Location Privacy to Spatiotemporal Event

Privacy

• Location privacy

• Two locations produce “similar” distributions/observations

• Spatiotemporal event privacy

• A true event and a negative event produce “similar”

location traces

Page 23: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Spatiotemporal Privacy Framework

• LPPM: Existing location privacy mechanism, e.g. Planar Laplace

Mechanism for geo-indistinguishibility

• PrivacyCheck: check spatiotemporal event privacy and calibrate

privacy budget

Page 24: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Results

• Strong LPPM may satisfy spatiotemporal privacy already

• Weak LPPM need to reduce privacy budget significantly (less utility) to

achieve same level of spatiotemporal privacy

• Stronger spatiotemporal privacy, less utility of the locations

Page 25: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Mobile Apps and Analytics with Local

Differential Privacy

• Background

• Local differential privacy

• Geo-indistinguishability (local d-privacy)

• Extended privacy notions

• Protecting dynamic locations (CCS15, VLDB17 demo)

• Protecting spatiotemporal events (ICDE19)

• New mobile applications

• Spatial crowdsourcing with geo-indistinguishability

(ICDE18)

• New mechanisms

• Supporting both analytics and mobile applications

(CNS19)

Page 26: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

ONLINE TASK ASSIGNMENT IN SPATIAL

CROWDSOURCING

Page 27: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Privacy preserving online task assignment in

spatial crowdsourcing

• Both requester and worker locations are perturbed using geo-

indistinguishability

• Three-stage framework for task assignment using uncertain locations

Hien To, Cyrus Shahabi, Li Xiong. Privacy-Preserving Online Task

Assignment in Spatial Crowdsourcing with Untrusted Server. ICDE 2018

Page 28: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Mobile Apps and Analytics with Local

Differential Privacy

• Background

• Local differential privacy

• Geo-indistinguishability (local d-privacy)

• Extended privacy notions

• Protecting dynamic locations (CCS15, VLDB17 demo)

• Protecting spatiotemporal events (ICDE19)

• New mobile applications

• Spatial crowdsourcing with geo-indistinguishability

(ICDE18)

• New mechanisms

• Supporting both analytics and mobile applications

(CNS19)

Page 29: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Supporting both range queries and frequency

estimation

• Existing

• Local differential privacy with randomized response –

frequency estimation

• Geo-indistinguishability (local d-privacy) with planar

Laplace mechanism – range queries

• Goal

• Optimize for both frequency estimation and range

queries while ensuring local d-privacy

• Basic idea

• Assign different perturbation probabilities for different

input/output pairs in a way related to the distance

X. Gu, M. Li, Y. Cao and L. Xiong, Privacy-Preserving Range Queries and Frequency

Estimation with Geo-indistinguishability. IEEE Conference on Communications and

Network Security (CNS), 2019

Page 30: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Results: Comparison

30

RR: Randomized Response

OU: Optimized with Unary Encoding

PL: Planar Laplace mechanism

EM: Exponential mechanism

LE: Linear equation mechanism

Gowalla dataset

Page 31: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Enabling Mobile Apps and Analytics with Local

Differential Privacy

• Extended privacy notions

• Protecting dynamic locations (CCS15, VLDB17 demo)

• Protecting spatiotemporal events (ICDE19)

• New mobile applications

• Spatial crowdsourcing with geo-indistinguishability

(ICDE18)

• New mechanisms

• Supporting both analytics and mobile applications

(CNS19)

• Open challenges

• Privacy/utility tradeoff

• User empowerment

Page 32: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Assured Information Management and Sharing

(AIMS)

+ +

Page 33: Hide in Plain Sight: Enabling Mobile Applications and Data ... · • δ-location set differential privacy • Any two locations in the probable location set produce “similar”

Assured Information Management and Sharing

(AIMS)

+ +

http://www.cs.emory.edu/site/aims