
Fast graph discovering in anonymous networks

Damiano Varagnolo

KTH Royal Institute of Technology

October 19, 2012

1

Thanks to. . .

2

This talk

the big picture (& motivations)

a particular approach (& usages)

future works (& visions)

3

Smart objects. . .

4

. . . for a distributed world? (cf. Metcalfe's law)

5

An example: the transportation system

6

Distributed systems: requirements

easy implementability

sufficiently fast convergence

contained bandwidth requirements

robustness w.r.t. failures

robustness w.r.t. churn

scalability

. . .

7

Topology Matters

8

Examples of (high-level) applications

knowledge of the topology allows one to. . .

optimize, infer, reconfigure, detect

9

Graph discovering literature – some dichotomies

static networks vs dynamic networks
identity-based vs privacy-aware algorithms
information-aggregation vs information-propagation algorithms

Examples:
construction of graph views
random walks
capture-recapture

10


Big question:

what can we estimate?

without any constraint ⇒ infer the whole graph perfectly
with anonymity constraints ⇒ infer the whole graph w.h.p.

11


Today’s niche

time & consensus & scalability constraints
(i.e., highly dynamic networks where convergence speed is crucial)

⇒ analyze max-consensus

12

A building block: size estimation with max-consensus

i.i.d. local generation: each agent i draws y_i ∼ U[0, 1]

max consensus: every agent computes y_max = max_i {y_i}

−log(y_max) ∼ Exp(S), where S is the network size

⇒ estimate S with statistical inference

13


Performance characterization (under no quantization issues)

Generalizations:
perform M independent trials in parallel
y_{i,m} ∼ F(·) (an absolutely continuous distribution)

⇒ ML estimator: Ŝ = ( −(1/M) Σ_{m=1}^{M} log F(y_{max,m}) )^{−1}

⇒ Ŝ/(S M) ∼ Inv-Gamma(M, 1)  ⇒  E[Ŝ/S] = M/(M−1)  ⇒  var((Ŝ−S)/S) ≈ 1/M

⇒ the ML estimator of S^{−1} equals Ŝ^{−1}, and Ŝ^{−1} is MVUE for S^{−1}
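The estimator above is easy to sanity-check numerically. A minimal sketch (plain Python, F(y) = y as in the uniform case, all names mine): simulate many runs of the protocol and verify that the mean of Ŝ/S approaches M/(M−1).

```python
import math
import random

def ml_size_estimate(S, M, rng):
    """One protocol run: S agents each draw M i.i.d. U[0,1] values and
    max-consensus leaves everyone with the M network-wide maxima.
    With F(y) = y the ML estimator is S_hat = -M / sum_m log(y_max,m)."""
    y_max = [max(rng.random() for _ in range(S)) for _ in range(M)]
    return -M / sum(math.log(y) for y in y_max)

rng = random.Random(0)
S, M, runs = 50, 50, 1000
ratios = [ml_size_estimate(S, M, rng) / S for _ in range(runs)]
mean_ratio = sum(ratios) / runs   # should approach M/(M-1), i.e. about 1.02
```

Note the mild positive bias M/(M−1): the estimator of S is biased, while the corresponding estimator of S^{−1} is unbiased.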

14

Extension 1 – neighborhoods size estimation

Assumption: synchronous communications
time is divided in epochs
everybody communicates once per epoch

⇒ induces well-defined k-step neighborhoods

[figure: four agents a, b, c, d exchange messages once per epoch; the table of y_i(t) shows each agent's local maximum after t = 0, 1, 2, 3 epochs]
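A sketch of the k-step variant (helper names are mine; epochs simulated as synchronous rounds over an adjacency list): stopping the max-consensus after exactly k epochs leaves each node with maxima taken over its k-hop neighborhood, so the same ML estimator, applied locally, returns an estimate of |N_k(i)|.

```python
import math
import random

def k_step_neighborhood_sizes(adj, k, M, rng):
    """Run M parallel max-consensus fields for exactly k synchronous
    epochs; node i then holds the maxima over its k-hop neighborhood,
    and the ML size estimator yields an estimate of |N_k(i)|."""
    y = {i: [rng.random() for _ in range(M)] for i in adj}
    for _ in range(k):
        y = {i: [max(y[j][m] for j in [i] + adj[i]) for m in range(M)]
             for i in adj}
    return {i: -M / sum(math.log(v) for v in ys) for i, ys in y.items()}

# path graph 0-1-2-3-4: |N_1(0)| = 2, |N_1(2)| = 3
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
sizes = k_step_neighborhood_sizes(adj, k=1, M=500, rng=random.Random(1))
```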

15

Extension 2 – number of links estimation

Assumption: every agent knows its in- and out-degrees

⇒ each agent can emulate the behavior of an equivalent number of virtual agents

Remarks:
the procedure estimates twice the number of links
it can also estimate the number of links between k-step neighbors
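The emulation trick can be sketched as follows (my formulation, undirected graph assumed): agent i stands in for deg(i) virtual agents by drawing, in each trial, the maximum of deg(i) uniforms; the network-wide maxima then behave as if Σ_i deg(i) = 2|E| agents were present.

```python
import math
import random

def estimate_twice_links(degrees, M, rng):
    """Each agent i emulates deg(i) virtual agents: its local value in
    trial m is the max of deg(i) i.i.d. U[0,1] draws.  The network-wide
    maxima are then maxima over sum_i deg(i) = 2|E| uniforms, so the
    standard ML size estimator estimates twice the number of links."""
    y_max = [max(max(rng.random() for _ in range(d)) for d in degrees)
             for _ in range(M)]
    return -M / sum(math.log(y) for y in y_max)

# ring with 6 nodes: every degree is 2, |E| = 6, hence 2|E| = 12
two_E = estimate_twice_links([2] * 6, M=800, rng=random.Random(2))
links = two_E / 2
```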

16


Extension 3 – estimation of eccentricities

Definition

e(i) := max_{j∈V} dist(i, j)

(i.e., the length of the longest shortest path starting from i)

Assumption: synchronous communications

[figure: four agents a, b, c, d run max-consensus; the table of y_i(t) for t = 0, . . . , 6 shows the local values changing up to epoch 3 and staying constant afterwards]

⇒ e(i) = 3

remark: the statistical properties may depend on the actual graph

17
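The table above suggests an estimator, sketched here under the slide's assumptions (`adj` and all names are mine): in each max-consensus field, node i's value last changes at epoch dist(i, argmax), because the field's global maximum is almost surely unique and sits at a uniformly random node. Taking the maximum of that settling time over M independent fields lower-bounds e(i) and approaches it as M grows.

```python
import random

def eccentricity_estimate(adj, i, M, rng):
    """In one field, node i's value settles exactly when the field's
    global maximum reaches it, i.e. after dist(i, argmax) epochs.
    The max of that settling time over M fields estimates e(i) from
    below (and equals it w.h.p. for moderate M)."""
    n, est = len(adj), 0
    for _ in range(M):
        y = {j: rng.random() for j in adj}
        cur, last = y[i], 0
        for t in range(1, n):            # n - 1 epochs always suffice
            y = {j: max(y[l] for l in [j] + adj[j]) for j in adj}
            if y[i] != cur:
                cur, last = y[i], t
        est = max(est, last)
    return est

# path graph 0-1-2-3: e(0) = 3, e(1) = 2
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
rng = random.Random(3)
e0 = eccentricity_estimate(adj, 0, M=60, rng=rng)
e1 = eccentricity_estimate(adj, 1, M=60, rng=rng)
```

This matches the slide's remark: the probability of underestimating e(i) depends on how many nodes sit at maximal distance, i.e. on the actual graph.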


Extension 4 – estimation of radii and diameters

Definitions

r := min_{i∈V} e(i)        d := max_{i∈V} e(i)

⇒ under the synchronous communications assumption one can distributedly estimate r and d through

r̂ = min_{i∈V} ê(i)        d̂ = max_{i∈V} ê(i)

18

Application 1 - change detection
(implemented on the INRIA SensLab testbed, Strasbourg)

[figure: snapshots of the deployed network at t* − 1, t* + 1, and t* + 2; after the change at time t* the nodes' values switch from 0 to 2]

GLR approach:

H0 : Ŝ(i) ≥ S for all i ∈ {t − T, . . . , t}
H1 : there exists i ∈ {t − T, . . . , t} s.t. Ŝ(i) < S

⇒ accept H0 or H1 depending on the relative likelihood
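A GLR test of this kind can be sketched as follows (my formulation, not necessarily the implemented one): each epoch's sufficient statistic z(t) = Σ_m −log F(y_{max,m}(t)) is Gamma(M, rate = current size), which follows from the building block above; for every candidate change epoch we fit the post-change size by ML, keep it only if it is a size drop, and compare likelihoods.

```python
import math
import random

def glr_statistic(z, M, S):
    """Windowed GLR: for each candidate change epoch i, fit the
    post-change size by ML over z[i:], keep it only if it is a drop
    (H1: size < S), and return the best log-likelihood gain over H0
    (rate S throughout).  The Gamma(M, rate s) log-density in z is
    M*log(s) - s*z up to terms that cancel in the ratio."""
    best = 0.0
    for i in range(len(z)):
        tail = z[i:]
        s_ml = len(tail) * M / sum(tail)      # ML post-change size
        if s_ml >= S:
            continue                          # not a size drop
        gain = sum((M * math.log(s_ml) - s_ml * zj)
                   - (M * math.log(S) - S * zj) for zj in tail)
        best = max(best, gain)
    return best

def epoch_statistic(size, M, rng):
    """Sufficient statistic of one epoch for a network of given size."""
    return sum(-math.log(max(rng.random() for _ in range(size)))
               for _ in range(M))

rng = random.Random(4)
M, S = 50, 60
quiet = [epoch_statistic(S, M, rng) for _ in range(10)]          # no change
faulty = quiet[:5] + [epoch_statistic(S // 2, M, rng) for _ in range(5)]
g_quiet = glr_statistic(quiet, M, S)
g_faulty = glr_statistic(faulty, M, S)
```

Thresholding the statistic then decides between H0 and H1; the simulated drop from 60 to 30 agents produces a much larger statistic than the quiet window.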

19

Application 2 - transportation systems

[figure: a multi-lane road carrying cars in both directions]

Idea: estimate
how many cars per meter, per road segment and lane
the traffic behavior per segment (average speed / acceleration, variances, . . . )

Uses:
speed control
early warning
. . .

20

Big question:

what can we estimate using max consensus?

statistical strategies require statistical identifiability characterizations

21


Statistical identifiability

Notation

Notation

local values: x_1 ∼ p_{x_1}(·), . . . , x_S ∼ p_{x_S}(·)
graph-dependent map: y = f_G(x_1, . . . , x_S) = f_G(x)
parameter of interest: θ

Definition
θ is said to be statistically identifiable if, for all G_1, G_2 s.t. θ_1 ≠ θ_2,

P[ x ∈ f_{G_1}^{−1}(y) ] ≠ P[ x ∈ f_{G_2}^{−1}(y) ]  for some y

22

A first characterization

Theorem
Hypotheses:

p_{x_i} = p_x (i.e., the agents are identical)
θ is statistically not identifiable from p_x
f(x) is anonymously computable and independent of G
f(x) is a consensus

Thesis:

the unique statistically identifiable θ is the network size

Implication:

under the theorem's assumptions and with "at most d communications", one can infer just the network size

23


Induced (and unanswered) questions

Assumptions (bounded memory / network size):
memory / network size enough to have time counters
memory / network size not enough to share graph views

what can be computed with "max-consensus + time counters"?
can we prove that "max-consensus + time counters" is the fastest?
is it also "unique"?

24


An even more basic question

Why should we do max-consensus on reals?

I.e., is it better to use discrete or “continuous” r.v.s?

25

Two simple and open problems

Assumptions:
memory = 50 bits (example)
∃ an upper bound on the number of agents
metric: statistical estimation performance

Scheme A: do as before (quantize the real values)
divide the 50 bits into M scalars (how?)
quantize opportunely (how?)

Scheme B: each bit = a Bernoulli r.v.
compute the ML estimator (how?)
set the success probability optimally (how?)
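For Scheme B, one natural formulation (an assumed sketch, not the slide's answer to its open questions) is OR-consensus on the bits: each agent sets each of its bits to 1 with success probability p, the network ORs them (max on {0, 1}), and a bit survives as 0 with probability (1 − p)^S; inverting the observed fraction of ones gives a closed-form ML estimate of S.

```python
import math
import random

def scheme_b_estimate(S, bits, p, rng):
    """Bernoulli-bit sketch: after OR-consensus each shared bit is 1
    with probability 1 - (1-p)^S.  Counting the ones (a Binomial) and
    inverting that map is the ML estimate of S; the estimate is
    undefined when all bits are 0 or all are 1."""
    ones = sum(1 for _ in range(bits)
               if any(rng.random() < p for _ in range(S)))
    if ones in (0, bits):
        return None
    return math.log(1 - ones / bits) / math.log(1 - p)

rng = random.Random(5)
S, bits, p = 40, 50, 0.02   # p chosen so the fraction of ones is mid-range
ests = [scheme_b_estimate(S, bits, p, rng) for _ in range(300)]
ests = [e for e in ests if e is not None]
mean_est = sum(ests) / len(ests)
```

How to set p optimally (and what M-scalar split beats it in Scheme A) is exactly the open question stated on the slide.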

26


Summary: what do we know

statistical anonymous graph discovering:

using max consensus: y_{i,m} continuous / y_{i,m} discrete
using average consensus: y_{i,m} continuous / y_{i,m} discrete
using other consensus: ? / ?

27


Summarizing . . .

graph discovering is useful

max consensus is a "lower bound"

many questions are still open

28

Bibliography

Varagnolo, Pillonetto, Schenato (20??). Distributed size estimation in anonymous networks. IEEE Transactions on Automatic Control.

Garin, Varagnolo, Johansson (2012). Distributed estimation of diameter, radius and eccentricities in anonymous networks. NecSys.

29


damiano@kth.se

entirely written in LaTeX 2ε using Beamer and TikZ

30