32
ptimization in very large graph László Lovász Eötvös Loránd University, Budapest December 2012 1

Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Embed Size (px)

Citation preview

Page 1: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Optimization in very large graphs

László Lovász

Eötvös Loránd University, Budapest

December 2012 1

Page 2: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

2December 2012

How is the graph given?

- Graph is HUGE.

- Not known explicitly, not even the number of nodes.

Idealize: define minimum amount of info.

Page 3: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 3

Dense case: cn2 edges.

- We can sample a uniform random node a bounded number of times, and see edges

between sampled nodes.

Bounded degree (d)

- We can sample a uniform random node a bounded number of times, and explore its neighborhood to a bounded depth.

How is the graph given?

„Property testing”: Goldreich-Goldwasser-Ron,

Arora-Karger-Karpinski, Rubinfeld-Sudan,

Alon-Fischer-Krivelevich-Szegedy, Fischer,

Frieze-Kannan, Alon-Shapira

Page 4: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 4

Algorithms for very large graphs

Parameter estimation: edge density, triangle density, maximum cut

Property testing: is the graph bipartite? triangle-free? perfect?

Computing a structure: find a maximum cut, regularity partition,...

Page 5: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

5December 2012

The maximum cut problem

maximize

Applications: optimization, statistical mechanics…

NP-hard, even with 6% error HastadPolynomial-time computable with 13% error Goemans-Williamson

Page 6: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

6December 2012

Max cut in dense graphs How to estimate the density?cut with many

edges

Sampling O(-4) nodes the density of max cut in the induced subgraph

is closer than to the density of the max cut in the whole

graphwith high probability.

Alon-Fernandez de la Vega-Kannan-Karpinski

Page 7: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

A graph parameter f can be estimated from samples iff

(a) f(G(k)) is convergent as k

(b) If V(Gn)=V(Hn)=[n] and then f(Gn)-f(Hn)0

(c) If |V(Gn)|, vV(Gn), then f(Gn)-f(Gn\v)0

Density of maximum cut is estimable.

Estimable graph parameters

, 2

1max ( , ) ( , ) 0S T nn HGe S T e S T

n- ®

December 2012 7

blow up every node

cut distance

Borgs-Chayes-L-Sós-Vesztergombi

Page 8: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Nondeterministic parameter estimation

Divine help: coloring nodes and edges, orienting edges

L: directed, (edge)-colored graph

L’: forget orientation, delete some colors, forget coloring; “shadow of G”

December 2012 8

f’(G) = max {f(L): L’ = G} ; “shadow of f”

f: parameter defined on colored oriented graphs

If f is estimable, then f’ is estimable.

L.-Vesztergombi

Page 9: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Testable graph properties

P testable: for every ε > 0 there is a k0 ≥ 1 and a

there is a „test property” P’, such that

(a) for every graph G ∈ P and every k ≥ k0,

G(k,G) ∈ P′ with probability at least 2/3, and

(b) for every graph G with d1(G,P) > ε

and every k ≥ k0 we have G(k,G) ∈ P′

with probability at most 1/3.

P: graph property

December 2012 9

Page 10: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Testable graph properties

Example: triangle-free

Removal Lemma:

’ if t(,G)<’, then we can delete n2 edges

to get a triangle-free graph.

Ruzsa - Szemerédi

G’: sampled induced subgraph

G’ not triangle-free G not triangle free

G’ triangle-free with high probability, G has few triangles

December 2012 10

Page 11: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Testable graph properties

Every hereditary graph property is testable

Alon-Shapira

inherited by induced subgraphs

December 2012 11

Page 12: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Nondeterministically testable graph properties

Q: property of directed, colored graphs

Q’={G’: GQ}; “shadow of Q”

P nondeterministically testable: P=Q’, where Q is a testable property of colored directed graphs.

December 2012 12

Divine help: coloring nodes and edges, orienting edges

L: directed, (edge)-colored graph

L’: forget orientation, delete some colors, forget coloring; “shadow of L”

Page 13: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

Nondeterministically testable graph properties

Examples: maximum cut contains at least n2/100 edges

contains a spanning subgraph with a testable property P

we can delete n2/100 edges to get a graph with a testable property P

December 2012 13

Page 14: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

N=NP for denseproperty testing

Nondeterministically testable graph properties

Every nondeterministically testable graph property is testable.

L-Vesztergombi

Proof via graph limit theory:pure existence proof

of an algorithm...

December 2012 14

Page 15: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 15

Computing a structure: representative set

Representative set of nodes: bounded size, (almost) every node is “similar” to one of the nodes in the set

When are two nodes similar? Neighbors? Same neighborhood?

Page 16: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 16

sim( , ) : E E ( ) E ( )v u su vu w wvtwa a ad t as = -

This is a metric, computable in the sampling model

Similarity distance of nodes

st

v

wu

Representative set U:

for any two nodes in U, dsim(s,t) >

for most nodes s, dsim(U,s)

Page 17: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 17

Every graph contains a representative set

with at most nodes. 2(1/ )2O e

Strong representative set U:

for any two nodes in U, dsim(s,t) >

for every node, dsim(U,v)

Every graph contains a strong representative set

with at most nodes. 2(log(1/ / ))2O e e

Alon

Similarity distance of nodes

Page 18: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 18

Voronoi diagram= weak regularity

partition

Representative set and regularity partition

Page 19: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

19December 2012

Max cut in dense graphs What answer to expect?

- Cannot list all nodes on one side

For any given node, we want to tell on which side of the cut it lies

(after some preprocessing)

Page 20: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 20

Construct representative set U

How to compute a (weak) regularity partition?

Each node is in same class as closest representative.

Page 21: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 21

- Construct representative set U

- Compute pij = density between Voronoi cells Vi and Vj

(use sampling)

- Compute max cut (U1,U2) in complete graph on U

with edge-weights pij

How to compute the maximum cut?

(Different algorithm implicit by Frieze-Kannan.)

Each node is on same side as closest representative.

Page 22: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 22

Bounded degree (d)

- We can sample a uniform random node a bounded number of times, and explore its neighborhood to a bounded depth.

Algorithms for bounded degree graphs

Page 23: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012

Bad examples

Gn: random 3-regular graph

Fn: random 3-regular bipartite graph

Hn: GnGn

Large girth graphs

(except for a small part)

Expandergraphs

(except for a small part)

23

Page 24: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 24

Algorithms for bounded degree graphs

Maximum cut cannot be estimated in this model

(random d-regular graph vs. random bipartite d-regular graph)

PNP in this model

(random d-regular graph vs. union of two random d-regular graphs)

Page 25: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 25

Algorithms for bounded degree graphs

Cellular automata (1970’s): network of finite automata

Distributed computing (1980’s): agent at every node, bounded time

Constant time algorithms (2000’s): bounded number of nodes sampled, explored at bounded depth

Page 26: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 26

Distributed computing model

Page 27: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 27

Algorithms for bounded degree graphs

(Almost) maximum matching can be computedin bounded time.

Nguyen-Onak

(Almost) maximum flow and (almost) minimum cut can be computed in bounded time.

Csóka

Page 28: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 28

Distributed computing model

Page 29: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 29

Algorithms for bounded degree graphs

(Almost) maximum matching can be computedin bounded time.

Nguyen-Onak

(Almost) maximum flow and (almost) minimum cut can be computed in bounded time.

Csóka

need local random bits

no random bits need global random

bits

Page 30: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 30

Algorithms for bounded degree graphs

Suppose you tell the agents any information aboutthe graph.

In the presence of global random bits, this does not help! Csóka

Page 31: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 31

Open problems

● Can an (almost) maximum independent node set be

computed in a random d-regular graph in this model?

● Can an (almost) maximum cut be

computed in a random d-regular graph in this model?

(Gn)/n tends to a limit with probability 1

MaxCut(Gn)/n tends to a limit with probability 1

Bayati-Gamarnik-Tetali 2010

Page 32: Optimization in very large graphs László Lovász Eötvös Loránd University, Budapest December 20121

December 2012 32

Thanks, that’s all!